How does human performance compare to AI performance in GAIA?
Question
Answers (1)
Human testers answer 92% of questions correctly on the GAIA benchmark, while GPT-4 with plugins scores only 15%. That gap of 77 percentage points highlights the current limitations of AI systems in real-world scenarios.