How does human performance compare to AI performance in GAIA?

Question

Answers ( 1 )

    0
    2025-03-31T18:31:13+00:00

    Human testers achieve a performance rate of 92% in the GAIA benchmark, while GPT-4 with plugins scores only 15%, highlighting the current limitations of AI systems in real-world scenarios.

Leave an answer