SeACo-Paraformer - A non-autoregressive automatic speech recognition system with hotword customization for enhanced Chinese speech recognition.

## Overview of SeACo-Paraformer SeACo-Paraformer is a non-autoregressive end-to-end automatic speech recognition (ASR) system designed specifically for Chinese speech recognition. It integrates hotword customization to improve the accuracy of recognizing specific entity words such as names and places. ## Key Features of SeACo-Paraformer The key features of SeACo-Paraformer include: 1. Non-autoregressive (NAR) design: This architecture provides higher efficiency during the inference phase compared to traditional autoregressive models. 2. Hotword customization: Users can specify hotwords (e.g., names, entity words) to improve their recognition accuracy. 3. Combination of accuracy and efficiency: It combines the accuracy of attention encoder-decoder (AED) models with the efficiency of NAR models, ensuring a balance between performance and speed. 4. Explicit customization capacity: The system handles hotword customization in an explicit manner, providing superior customization performance. ## Functions of SeACo-Paraformer The main functions of SeACo-Paraformer include: - Automatic Speech Recognition (ASR): Converts speech signals into text, suitable for general Chinese scenarios. - Hotword customization: Allows users to specify hotwords during the ASR process to enhance the recognition accuracy of specific words. ## Access and Usage of SeACo-Paraformer SeACo-Paraformer can be accessed and used via the FunASR toolkit, an open-source speech recognition toolkit. Users can: - Download pre-trained models from platforms like ModelScope or Hugging Face. - Use the FunASR API for model inference, especially in scenarios requiring hotword customization. Detailed usage instructions can be found in the FunASR GitHub repository and related documentation. ## Datasets Used for Testing SeACo-Paraformer SeACo-Paraformer was tested on a large-scale industrial dataset consisting of 50,000 hours of data. It demonstrated superior performance in both hotword customization and general ASR tasks. ## Additional Resources for SeACo-Paraformer More information about SeACo-Paraformer can be found at the following URLs: - [ModelScope Page](https://modelscope.cn/models/iic/speech_seaco_paraformer_large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/summary) - [SeACo-Paraformer GitHub Repository](https://github.com/R1ckShi/SeACo-Paraformer) - [FunASR GitHub Repository](https://github.com/modelscope/FunASR) - [SeACo-Paraformer Paper on arXiv](https://arxiv.org/abs/2308.03266) ### Citation sources: - [SeACo-Paraformer](https://modelscope.cn/models/iic/speech_seaco_paraformer_large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/summary) - Official URL Updated: 2025-03-26

Register Now

Login

Lost Password

Add question

Login

Register Now

SeACo-Paraformer - A non-autoregressive automatic speech recognition system with hotword customization for enhanced Chinese speech recognition.

SeACo-Paraformer - A non-autoregressive automatic speech recognition system with hotword customization for enhanced Chinese speech recognition.