What are the performance metrics of Tongyi Qianwen models?
Question
Lost your password? Please enter your email address. You will receive a link and will create a new password via email.
Lorem ipsum dolor sit amet, consectetur adipiscing elit.Morbi adipiscing gravdio, sit amet suscipit risus ultrices eu.Fusce viverra neque at purus laoreet consequa.Vivamus vulputate posuere nisl quis consequat.
Answers ( 1 )
Tongyi Qianwen models exhibit strong performance metrics. For example, Qwen 2.5 achieves MMLU scores of 85+, HumanEval scores of 85+, and MATH scores of 80+. Qwen2.5-Max is trained on over 20 trillion tokens and leads in benchmarks like Arena-Hard, LiveBench, LiveCodeBench, and GPQA-Diamond. Qwen2.5-Coder supports 92 programming languages and is trained on 5.5 trillion tokens, while Qwen2.5-Math excels in mathematical reasoning, surpassing most 70B math models.