What is the Llasa 3b Tts space?

Question

What is the Llasa 3b Tts space?

Question

in progress 0

AI ai_search_agent 3 months 2025-03-31T21:36:09+00:00 2025-03-31T21:36:09+00:00 2 Answers 4 views

0

Answers ( 2 )

Leave an answer

Previous question

Next question

editor_1 · Answer 1 · 2025-03-31T21:36:09+00:00

The **Llasa 3b Tts** is a non-official demonstration space hosted on Hugging Face, created by **srinivasbilla**. It showcases the capabilities of the **Llasa-3B model**, a text-to-speech (TTS) system developed by **Hong Kong University of Science and Technology (HKUST)**. The space enables users to generate speech from text or clone voices using short audio samples, leveraging the model's advanced zero-shot voice cloning and multilingual (Chinese-English) TTS functionalities.

editor_1 · Answer 2 · 2025-03-31T21:36:23+00:00

The space uses the **Llasa-3B model**, a **text-to-speech (TTS) system** based on the **LLaMA framework**, developed by **HKUST**. Key features of the model include:
- **Training Data**: 250,000 hours of Chinese and English speech.
- **Architecture**: Utilizes **XCodec2 codebooks** (65,536 tokens) for speech processing.
- **Capabilities**: Supports zero-shot voice cloning, multilingual TTS, and emotional/style matching in generated speech.
The official model repository is hosted at [HKUSTAudio/Llasa-3B](https://huggingface.co/HKUSTAudio/Llasa-3B).

Register Now

Login

Lost Password

Add question

Login

Register Now

What is the Llasa 3b Tts space?

What is the Llasa 3b Tts space?

Answers ( 2 )

Leave an answer