What datasets are used in the LLaVA-OneVision project?

Question

Answers (1)


    The LLaVA-OneVision project uses a large training dataset comprising 3.2M single-image samples and 1.6M multi-image and video samples, supplemented with high-quality synthetic data (e.g., roughly 4M high-quality knowledge samples). Sources include COCO118K, BLIP558K, and CC3M, along with 92K Chinese caption samples and 143K Evo-Instruct samples.
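    If you want to inspect the data yourself, a minimal sketch is below. It assumes the mixture is published on the Hugging Face Hub under a repo such as "lmms-lab/LLaVA-OneVision-Data" and that subsets are exposed as named configs; the repo id and the config name used here are illustrative, not stated in the answer above, so check the dataset card for the actual names.

    ```python
    from datasets import load_dataset

    # Each config corresponds to one source in the mixture (captions,
    # knowledge data, instruction data, ...). The repo id and config
    # name below are assumptions for illustration only.
    subset = load_dataset(
        "lmms-lab/LLaVA-OneVision-Data",  # assumed repo id
        "evol_instruct",                  # assumed config name
        split="train",
    )

    print(len(subset))        # number of samples in this subset
    print(subset[0].keys())   # typical fields, e.g. id, image, conversations
    ```

    Inspecting subsets one at a time like this keeps memory usage manageable, since the full mixture contains several million samples.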
