What are the main features of DeepSeek-V2?
Question
Answers ( 1 )
The main features of DeepSeek-V2 include:
- Cost-efficient training: a sparse Mixture-of-Experts (MoE) architecture, DeepSeekMoE, activates only a subset of the parameters for each token (21B of 236B total).
- Efficient inference: Multi-head Latent Attention (MLA) compresses keys and values into a latent vector, which reduces KV cache requirements.
- Large-scale pre-training on 8.1 trillion tokens, with an increased amount of Chinese data.
- Long context support: YaRN extends the context window to 128K tokens.
- Human preference alignment: reinforcement learning with the Group Relative Policy Optimization (GRPO) algorithm.
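
To make the "sparse computation" point in the first bullet concrete, here is a minimal sketch of generic top-k MoE routing in plain Python. It is not DeepSeek-V2's actual implementation: the names `moe_forward`, `router_logits`, and the toy scalar experts are hypothetical, the gate weights are renormalised over the selected experts (a common choice in top-k MoE layers, assumed here), and DeepSeekMoE details such as shared experts are left out.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def moe_forward(x, router_logits, experts, k=2):
    """Route one token to its top-k experts and mix their outputs.

    Only the k selected experts are evaluated, which is where the
    compute saving of a sparse MoE layer comes from.
    """
    probs = softmax(router_logits)
    # Indices of the k experts with the highest gate probability.
    top = sorted(range(len(experts)), key=lambda i: probs[i], reverse=True)[:k]
    # Assumption: renormalise the gate weights over the selected experts.
    norm = sum(probs[i] for i in top)
    out = 0.0
    for i in top:
        out += (probs[i] / norm) * experts[i](x)
    return out, top

# Hypothetical toy experts: each one just scales its input.
experts = [lambda v, s=s: s * v for s in (1.0, 2.0, 3.0, 4.0)]

# Hypothetical router scores for one token; experts 1 and 2 score highest.
y, used = moe_forward(1.0, [0.0, 5.0, 5.0, -1.0], experts, k=2)
print(sorted(used), y)
```

With four experts and `k=2`, only half of the expert functions run for this token. The same idea at scale is how an MoE model can hold far more parameters than it activates per token.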