What are the technical details of DeepSeek-V3?


Answers ( 1 )


    DeepSeek-V3 is a 671B-parameter Mixture-of-Experts (MoE) language model that activates only 37B parameters per token. It combines Multi-head Latent Attention (MLA) with the DeepSeekMoE architecture, adopts an auxiliary-loss-free load-balancing strategy and a multi-token prediction training objective, and was pre-trained on 14.8 trillion tokens using FP8 mixed precision. The full training run consumed 2.788 million H800 GPU hours, and the model outperforms other open-source models on most benchmarks.
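    To illustrate the sparse-activation idea (running only a small subset of parameters per token), below is a minimal sketch of generic top-k expert routing in NumPy. The softmax gate, the dimensions, and all names here are illustrative assumptions, not DeepSeek-V3's actual configuration, which routes each token to 8 of 256 routed experts plus a shared expert using sigmoid affinity scores.

        # Minimal sketch of a top-k MoE layer (illustrative, not DeepSeek-V3's config)
        import numpy as np

        rng = np.random.default_rng(0)
        d_model, n_experts, top_k = 16, 8, 2  # toy sizes for demonstration

        # One feed-forward "expert" weight matrix per slot; only top_k run per token
        experts = [rng.standard_normal((d_model, d_model)) * 0.02
                   for _ in range(n_experts)]
        router = rng.standard_normal((d_model, n_experts)) * 0.02  # gating projection

        def moe_layer(x: np.ndarray) -> np.ndarray:
            """Route each token to its top_k experts and mix their outputs."""
            logits = x @ router                            # (tokens, n_experts)
            top = np.argsort(logits, axis=-1)[:, -top_k:]  # chosen expert indices
            out = np.zeros_like(x)
            for t in range(x.shape[0]):
                chosen = logits[t, top[t]]
                weights = np.exp(chosen - chosen.max())
                weights /= weights.sum()                   # softmax over selected experts
                for w, e in zip(weights, top[t]):
                    out[t] += w * (x[t] @ experts[e])      # only top_k experts compute
            return out

        tokens = rng.standard_normal((4, d_model))
        print(moe_layer(tokens).shape)  # (4, 16): same shape, sparse compute

    Only top_k of n_experts matrices are multiplied per token, which is why a 671B-parameter model can cost roughly as much per token as a 37B dense model.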
