United Kingdom · 🔬 Science · 22 days ago

Humans outperform AI at this highly rigorous mathematics test

In the First Proof project, four AI systems were tested on ten research-level mathematics problems. None of the AI models performed as well as top mathematicians; on average, they scored only 6 out of 10. The test was designed to meet three criteria: it uses research-level mathematics problems, avoids problems present in the AI systems' training data, and is formally graded by human mathematicians. The results were published on the First Proof website on 10 June. The test follows recent advances in AI, such as a chatbot solving an 80-year-old mathematics problem.

Go to the primary sources (1)

The official sources this coverage is built on. Read them directly to bypass framing.

Source document: First Proof Project Website

2 reports

Nature News · Independent · Center · 22 days ago

Humans outperform AI at this highly rigorous mathematics test

Bias read (Center): The article presents factual information about an AI performance test without taking a stance on the implications or outcomes. It reports on the results objectively, mentioning both the limitations of AI and recent advancements without biased language or emphasis.

Nature News · Independent · Center · 26 days ago

How AI is reshaping discovery in maths and physics

The article discusses how artificial intelligence is transforming mathematical and theoretical physics research. AI tools are being used to verify proofs, identify counterexamples, and suggest intermediate steps in complex arguments. In experimental fields, AI is automating parts of the scientific process, though it faces limitations due to physical constraints. In contrast, mathematics and theoretical physics benefit from AI's ability to handle digital 'experiments' efficiently.

Bias read (Center): The article presents a balanced view of AI's role in mathematics and physics without taking a clear ideological stance. It highlights both opportunities and limitations of AI in these fields, citing examples from research without emphasizing any particular political perspective.

