ON
← Back to feed
Humans outperform AI at this highly rigorous mathematics test
United Kingdom🔬 Science22 days ago

Humans outperform AI at this highly rigorous mathematics test

In the First Proof project, four AI systems were tested on ten research-level mathematics problems. None of the AI models performed as well as top mathematicians, scoring only 6 out of 10 on average. The test was designed to meet three criteria: using research-level math problems, avoiding problems present in the AI's training data, and being formally graded by human mathematicians. The results were published on the First Proof website on 10 June. This follows recent advancements in AI, such as a chatbot solving an 80-year-old math problem.

Go to the primary sources (1)

The official sources this coverage is built on. Read them directly to bypass framing.

2 reports

Nature News logoNature NewsIndependentCenter22 days ago
Humans outperform AI at this highly rigorous mathematics test

In the First Proof project, four AI systems were tested on ten research-level mathematics problems. None of the AI models performed as well as top mathematicians, scoring only 6 out of 10 on average. The test was designed to meet three criteria: using research-level math problems, avoiding problems present in the AI's training data, and being formally graded by human mathematicians. The results were published on the First Proof website on 10 June. This follows recent advancements in AI, such as a chatbot solving an 80-year-old math problem.

Bias read (Center): The article presents factual information about an AI performance test without taking a stance on the implications or outcomes. It reports on the results objectively, mentioning both the limitations of AI and recent advancements without biased language or emphasis.

Nature News logoNature NewsIndependentCenter26 days ago
How AI is reshaping discovery in maths and physics

The article discusses how artificial intelligence is transforming mathematical and theoretical physics research. AI tools are being used to verify proofs, identify counterexamples, and suggest intermediate steps in complex arguments. In experimental fields, AI is automating parts of the scientific process, though it faces limitations due to physical constraints. In contrast, mathematics and theoretical physics benefit from AI's ability to handle digital 'experiments' efficiently.

Bias read (Center): The article presents a balanced view of AI's role in mathematics and physics without taking a clear ideological stance. It highlights both opportunities and limitations of AI in these fields, citing examples from research without emphasizing any particular political perspective.

Keep the news honest.

ObjectiveNews is reader-funded and ad-free — we show you the bias instead of hiding it. Support independent journalism for €5/month.

Become a Supporter

Related stories