What are the performance metrics of Tongyi Qianwen models?

Question

Answers ( 1 )

    0
    2025-03-26T23:01:29+00:00

    Tongyi Qianwen models exhibit strong performance metrics. For example, Qwen 2.5 achieves MMLU scores of 85+, HumanEval scores of 85+, and MATH scores of 80+. Qwen2.5-Max is trained on over 20 trillion tokens and leads in benchmarks like Arena-Hard, LiveBench, LiveCodeBench, and GPQA-Diamond. Qwen2.5-Coder supports 92 programming languages and is trained on 5.5 trillion tokens, while Qwen2.5-Math excels in mathematical reasoning, surpassing most 70B math models.

Leave an answer