What are the key latency and accuracy metrics of CosyVoice 2.0?
Question
Lost your password? Please enter your email address. You will receive a link and will create a new password via email.
Lorem ipsum dolor sit amet, consectetur adipiscing elit.Morbi adipiscing gravdio, sit amet suscipit risus ultrices eu.Fusce viverra neque at purus laoreet consequa.Vivamus vulputate posuere nisl quis consequat.
Answers ( 1 )
- **Latency**: 150 milliseconds for the first synthesized audio packet.
- **Accuracy**: 30-50% reduction in pronunciation errors compared to CosyVoice 1.0, with the lowest character error rate on the Seed-TTS hard test set.
- **MOS Score**: 5.53 (on par with leading commercial models).