#mathematical-reasoning

[ follow ]
fromNature
1 week ago

DeepSeek's self-correcting AI model aces tough maths proofs

The model, DeepSeekMath-V2, scored 118 out of 120 points on questions from the 2024 William Lowell Putnam Mathematical Competition, beating the top human score of 90. The model also performed at the level of gold-medal winners in the International Mathematical Olympiad (IMO) 2025 and the 2024 China Mathematical Olympiad. The results are described in a preprint posted on arXiv on 27 November.
Artificial intelligence
Artificial intelligence
fromNature
1 week ago

DeepSeek's self-correcting AI model aces tough maths proofs

DeepSeekMath-V2 scored 118/120 on the 2024 Putnam, surpassing top humans and using self-verifiable reasoning to detect and correct its own errors.
Artificial intelligence
fromArs Technica
3 weeks ago

DeepMind's latest: An AI for handling mathematical proofs

AlphaProof achieved International Mathematical Olympiad silver-level performance and nearly gold on the Putnam, demonstrating substantial advances in automated mathematical reasoning.
[ Load more ]