What are the key modality categories supported by Zidong Taichu 2.0?

Answers ( 1 )

    2025-04-01T09:08:03+00:00

    The system supports seven primary modalities:
    - **Image**: Description, generation, and retrieval (e.g. "Draw a cherry blossom scene")
    - **Language**: Q&A, translation, poetry creation (e.g. ancient Chinese quatrains)
    - **Video**: Content description and retrieval
    - **Music**: Generation and analysis (e.g. Chinese-style chime tunes)
    - **Audio**: Authentication and event classification (11 sound types)
    - **3D**: Scene understanding from point clouds
    - **Signal**: Radar signal recognition
