What are the key features of GOT-OCR2.0?

Question

Answers ( 1 )

    0
    2025-03-26T19:40:04+00:00

    The key features of GOT-OCR2.0 include:
    - **Architecture**: Unified end-to-end design with a high-compression encoder and long-context decoder.
    - **Multi-language support**: Capable of processing text in multiple languages.
    - **Multi-modal recognition**: Supports recognition of text, mathematical formulas, molecular formulas, charts, musical scores, and geometric shapes.
    - **OCR types**: Provides plain text OCR, formatted text OCR, and fine-grained OCR (with ocr_box and ocr_color options).
    - **Multi-crop functionality**: Supports multi-crop OCR for enhanced processing of complex images.
    - **Rendering capability**: Can render formatted OCR results, such as saving them as HTML files.

Leave an answer