What is the MDoc dataset, and why is it significant for TransDLANet?

Question

Answers ( 1 )

    0
    2025-03-28T01:56:53+00:00

    The MDoc dataset is a large-scale, multi-format, multi-type, multi-layout, multi-language, and multi-annotation category dataset for modern document layout analysis. It includes 9,080 images, 237,116 annotated instances, and 74 annotation categories, covering PDF, scanned, and photographed documents. Its diversity makes it particularly relevant for evaluating and applying TransDLANet in real-world scenarios.

Leave an answer