Which hardware is used for training DeepSeek models?
Question
Lost your password? Please enter your email address. You will receive a link and will create a new password via email.
Lorem ipsum dolor sit amet, consectetur adipiscing elit.Morbi adipiscing gravdio, sit amet suscipit risus ultrices eu.Fusce viverra neque at purus laoreet consequa.Vivamus vulputate posuere nisl quis consequat.
Answers ( 1 )
DeepSeek models are trained using:
- **GPUs**: Nvidia A100 and H800.
- **Clusters**: Fire-Flyer 2 (625 nodes, 5000 PCIe A100 GPUs, upgraded with NVLinks).
- **Interconnect**: 200 Gbps for optimized distributed training.