What datasets were used to evaluate AirRAG?

Answers ( 1 )

    2025-03-28T02:29:53+00:00

    AirRAG was evaluated on seven question-answering datasets: HotpotQA, MuSiQue, 2WikiMultiHopQA, Natural Questions (NQ), TriviaQA, PopQA, and WebQA. For each dataset, the evaluation used 1,000 test samples, selected with a random seed of 0.
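
    If you want to build a comparable evaluation subset yourself, here is a minimal sketch of seeded sampling. It is not the AirRAG authors' code: the function name `sample_test_subset`, the use of Python's `random` module, and the dummy test split are all assumptions for illustration, so the exact samples picked will likely differ from theirs.

```python
import random


def sample_test_subset(examples, k=1000, seed=0):
    """Return k examples chosen without replacement, reproducibly.

    Hypothetical helper: k=1000 and seed=0 mirror the numbers in the
    answer above, but the sampling procedure itself is an assumption.
    """
    examples = list(examples)
    if len(examples) <= k:
        # Nothing to subsample; return everything.
        return examples
    rng = random.Random(seed)  # local RNG, leaves global random state alone
    return rng.sample(examples, k)


# Dummy stand-in for a loaded test split (not real dataset content).
dummy_test_split = [{"id": i, "question": f"q{i}"} for i in range(5000)]
subset = sample_test_subset(dummy_test_split)
```

    Calling the function again with the same seed returns the same 1,000 items, which is what makes results on the subset reproducible.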
