How does MChat perform in benchmark tests?
Question
Lost your password? Please enter your email address. You will receive a link and will create a new password via email.
Lorem ipsum dolor sit amet, consectetur adipiscing elit.Morbi adipiscing gravdio, sit amet suscipit risus ultrices eu.Fusce viverra neque at purus laoreet consequa.Vivamus vulputate posuere nisl quis consequat.
Answers ( 1 )
MChat's underlying model, Mengzi GPT, achieved:
- **C-EVAL Benchmark**: 71.5 average score (48.8 on hard tasks), excelling in STEM, social sciences, and humanities.
- **CLUE Ranking (2021)**: Top score of 82.90 for Chinese NLP tasks like AFQMC and TNEWS.