‘A digital DPU inside a GPU’: Might intelligent {hardware} hack be behind DeepSeek’s groundbreaking AI effectivity?



  • A brand new strategy known as DualPipe appears to be the important thing to DeekSeek’s success
  • One professional describes it as an on-GPU digital DPU that maximizes bandwidth effectivity
  • Whereas DeepSeek has used Nvidia GPUs solely, one wonders how AMD’s Intuition would fare

China’s DeepSeek AI chatbot has shocked the tech trade, representing a reputable various to OpenAI’s ChatGPT at a fraction of the fee.

A current paper revealed DeepSeek V3 was educated on a cluster of two,048 Nvidia H800 GPUs – crippled variations of the H100 (we are able to solely think about how way more highly effective it might be working on AMD Intuition accelerators!). It reportedly required 2.79 million GPU-hours for pretraining, fine-tuning on 14.8 trillion tokens, and value – based on calculations made by The Subsequent Platform – a mere $5.58 million.



Supply hyperlink

Leave a Reply

Your email address will not be published. Required fields are marked *