Register Now

Login

Lost Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Captcha Click on image to update the captcha .

Add question

You must login to ask a question.

Login

Register Now

Lorem ipsum dolor sit amet, consectetur adipiscing elit.Morbi adipiscing gravdio, sit amet suscipit risus ultrices eu.Fusce viverra neque at purus laoreet consequa.Vivamus vulputate posuere nisl quis consequat.

SeACo-Paraformer - A non-autoregressive automatic speech recognition system with hotword customization for enhanced Chinese speech recognition.

## Overview of SeACo-Paraformer SeACo-Paraformer is a non-autoregressive end-to-end automatic speech recognition (ASR) system designed specifically for Chinese speech recognition. It integrates hotword customization to improve the accuracy of recognizing specific entity words such as names and places. ## Key Features of SeACo-Paraformer The key features of SeACo-Paraformer include: 1. Non-autoregressive (NAR) design: This architecture provides higher efficiency during the inference phase compared to traditional autoregressive models. 2. Hotword customization: Users can specify hotwords (e.g., names, entity words) to improve their recognition accuracy. 3. Combination of accuracy and efficiency: It combines the accuracy of attention encoder-decoder (AED) models with the efficiency of NAR models, ensuring a balance between performance and speed. 4. Explicit customization capacity: The system handles hotword customization in an explicit manner, providing superior customization performance. ## Functions of SeACo-Paraformer The main functions of SeACo-Paraformer include: - Automatic Speech Recognition (ASR): Converts speech signals into text, suitable for general Chinese scenarios. - Hotword customization: Allows users to specify hotwords during the ASR process to enhance the recognition accuracy of specific words. ## Access and Usage of SeACo-Paraformer SeACo-Paraformer can be accessed and used via the FunASR toolkit, an open-source speech recognition toolkit. Users can: - Download pre-trained models from platforms like ModelScope or Hugging Face. - Use the FunASR API for model inference, especially in scenarios requiring hotword customization. Detailed usage instructions can be found in the FunASR GitHub repository and related documentation. ## Datasets Used for Testing SeACo-Paraformer SeACo-Paraformer was tested on a large-scale industrial dataset consisting of 50,000 hours of data. It demonstrated superior performance in both hotword customization and general ASR tasks. ## Additional Resources for SeACo-Paraformer More information about SeACo-Paraformer can be found at the following URLs: - [ModelScope Page](https://modelscope.cn/models/iic/speech_seaco_paraformer_large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/summary) - [SeACo-Paraformer GitHub Repository](https://github.com/R1ckShi/SeACo-Paraformer) - [FunASR GitHub Repository](https://github.com/modelscope/FunASR) - [SeACo-Paraformer Paper on arXiv](https://arxiv.org/abs/2308.03266) ### Citation sources: - [SeACo-Paraformer](https://modelscope.cn/models/iic/speech_seaco_paraformer_large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/summary) - Official URL Updated: 2025-03-26