What are the main features of DeepSeek-V2?

Question

Answers ( 1 )

    2025-03-28T02:39:01+00:00

    The main features of DeepSeek-V2 include:
    - Cost-efficient training through a sparse Mixture-of-Experts (MoE) architecture, which activates only a fraction of the parameters for each token (21B of 236B total).
    - Efficient inference via Multi-head Latent Attention (MLA), which compresses keys and values into a latent vector and so reduces KV cache requirements.
    - Large-scale pre-training on 8.1 trillion tokens, with an increased amount of Chinese data.
    - Long-context support using YaRN, which extends the context window to 128K tokens.
    - Human preference alignment through reinforcement learning with the GRPO (Group Relative Policy Optimization) algorithm.
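
To make the GRPO point above a bit more concrete: the core idea is that each sampled response is scored relative to the other responses drawn for the same prompt, so no separate value model is needed. The snippet below is only a minimal sketch of that group-relative normalization, not DeepSeek's implementation. The function name `group_relative_advantages`, the `eps` term, and the choice of population standard deviation are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Score each reward relative to its own sampling group.

    `rewards` holds one scalar reward per response sampled for a single
    prompt. Each reward is centered on the group mean and divided by the
    group standard deviation. `eps` (an assumption, not from the source)
    avoids division by zero when every response gets the same reward.
    """
    if not rewards:
        return []
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; the exact estimator is an assumption
    return [(r - mu) / (sigma + eps) for r in rewards]


if __name__ == "__main__":
    # Four hypothetical responses to one prompt, scored by a reward model.
    rewards = [1.0, 0.0, 0.0, 1.0]
    print(group_relative_advantages(rewards))
```

Responses that beat their group's average get a positive advantage and are reinforced; those below it get a negative one. In a full training loop these values would feed a clipped policy-gradient objective with a KL penalty, which is left out here.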
