ON
← Back to feed
AI beaten by humans in a difficult math test
Italy🔬 Science18 days ago

AI beaten by humans in a difficult math test

In a rigorous mathematical test, four AI models, including ChatGPT 5.5 Pro, were evaluated against human performance. None of the models correctly answered all 10 questions. The best-performing model was developed by ETH Zurich, solving six out of ten problems. The test, part of the independent project First Proof, aimed to assess AI capabilities in mathematical research. Questions were previously unpublished to prevent models from relying on prior training data. A group of 30 mathematicians verified the responses. Only publicly available models participated, which limited involvement to OpenA

Go to the primary sources (1)

The official sources this coverage is built on. Read them directly to bypass framing.

1 reports

ANSA logoANSAIndependentCenter18 days ago
AI beaten by humans in a difficult math test

In a rigorous mathematical test, four AI models, including ChatGPT 5.5 Pro, were evaluated against human performance. None of the models correctly answered all 10 questions. The best-performing model was developed by ETH Zurich, solving six out of ten problems. The test, part of the independent project First Proof, aimed to assess AI capabilities in mathematical research. Questions were previously unpublished to prevent models from relying on prior training data. A group of 30 mathematicians verified the responses. Only publicly available models participated, which limited involvement to OpenA

Bias read (Center): The article presents factual results of an AI benchmarking test without overtly favoring any side. It describes the methodology, participants, and outcomes neutrally.

Keep the news honest.

ObjectiveNews is reader-funded and ad-free — we show you the bias instead of hiding it. Support independent journalism for €5/month.

Become a Supporter

Related stories