Amateur mathematicians solve long-standing maths problems with AI
AI tools are helping to decipher long-standing maths problems andresr/Getty Images Amateur mathematicians are using artificial intelligence chatbots to
AI tools are helping to decipher long-standing maths problems
andresr/Getty Images
Amateur mathematicians are using artificial intelligence chatbots to solve long-standing problems, in a move that has taken professionals by surprise. While the problems in question aren’t the most advanced in the mathematical canon, the success of AI models in tackling them shows that their mathematical performance has passed a significant threshold, say researchers, and could fundamentally change the way we do mathematics.
The questions being solved by AI originate from Hungarian mathematician Paul Erdős, who was famous for his ability to pose useful but difficult questions during a career that spanned over six decades. “The questions tended to be very simple, but very hard,” says Thomas Bloom at the University of Manchester, UK.
By his death in 1996, there were more than 1000 of these unsolved Erdős problems, spanning a wide range of mathematical disciplines, from combinatorics (the study of combinations) to number theory. Today, they are seen as signposts for progress in these fields, says Bloom, who runs a website that catalogues the problems and tracks mathematicians’ progress in solving them.
Because Erdős problems are often simple to state, mathematicians began experimenting with feeding them to AI tools like ChatGPT. Bloom says that in October last year, he began seeing people use AI models to find relevant references in the mathematical literature that helped with their solutions.
Soon after, AI tools began finding partial improvements to results, some of which had been found in past papers, while others appeared new.
“I was surprised then,” says Bloom. “Before, when I tried ChatGPT, it just made up papers, completely hallucinating, and so I had given up using it. But clearly, there was some sort of change around October. I actually found genuine papers because it had read them all, and often in a non-trivial way.”
Inspired by this progress, Kevin Barreto, an undergraduate mathematics student at Cambridge University, and Liam Price, an amateur mathematician, began looking for simple and understudied Erdős problems that they might solve with AI. After finding one such problem, number 728, a conjecture in number theory, they fed it to ChatGPT-5.2 Pro to solve it.
“I looked at the statement, and thought, ‘This one might be able to get solved by ChatGPT, so let’s try it,’” says Barreto. “Sure enough, it comes back with an argument that’s quite nice and that a lot of people would actually agree was rather sophisticated.”
After ChatGPT produced a proof, Barreto and Price used another AI tool called Aristotle, created by the AI company Harmonic, to verify their work. Aristotle converts the conventional language proof into one written in Lean, a mathematical programming language. It can then be instantly checked by a computer for correctness. This is an important step, says Bloom, as it saves the limited time that researchers have to check whether a result is correct or not.
As of mid-January, six Erdős problems have been fully solved by AI tools, though subsequent scrutiny by professional mathematicians revealed that five of these problems had previously been solved in the mathematical literature. Only one problem, number 205, has been fully solved by Barreto and Price with no pre-existing solution. AI tools have also enabled small improvements and partial solutions to seven other problems that don’t appear to be pre-existing in the literature.
As a result, there is an ongoing debate about whether these tools are really proving new ideas, or merely digging out old and forgotten solutions. Bloom points out that the AI models often have to translate the problems into new forms, and are discovering papers that make no mention of Erdős. “A lot of these papers, I wouldn’t have found, and maybe nobody would have found for a lot longer without this sort of [use of] the AI tool,” he says.
Another question is just how far this approach can go. All of these problems aren’t the most demanding in mathematics, and could perhaps be accomplished by a first-year PhD student, but that is still impressive, says Bloom. “To me, it’s incredible that AI is capable of that, because this takes non-trivial effort.”
Barreto also says that the problems being solved are relatively straightforward, even when compared with more difficult Erdős problems, which current AI models fall short of solving. “Once [AI] gets through the low-hanging fruit problems, a lot of them are going to need more capable models,” he says. Some of the hardest problems have prize money set aside for anyone who can solve them, but Barreto thinks that is unlikely to happen soon: “Some people are trying to do bounty problems, and to me that’s kind of nuts. I don’t think the models are there yet.”
Solving Erdős problems using AI is promising progress, says Kevin Buzzard at Imperial College London, but because most of the problems it is solving are either relatively straightforward or have had little attention, it makes it hard to gauge whether it is a significant achievement – or something that should concern professionals. “That is progress, but mathematicians aren’t going to be looking over their shoulders just yet,” says Buzzard. “It’s green shoots.”
But even if the models’ capability stays static, their ability to handle relatively complex mathematics could fundamentally change how researchers research and write proofs, says Bloom, because it will allow mathematicians who have limited knowledge of areas outside their particular discipline to draw on other fields.
“Almost nobody knows every part of math, and that means that we’re quite limited in the sets of tools that we can use,” says Bloom. “The fact that you can just get an answer instantly, without having to bother another human, without having to waste months learning potentially useless knowledge, opens up so many connections. That’s going to be a huge change that we’ll see, just increasing the breadth of research that’s done.”
This could also allow mathematicians to practice an entirely new way of working, says Terence Tao at the University of California, Los Angeles, who has helped validate some of the AI-assisted Erdős problem solutions.
Mathematicians often focus on a small number of difficult problems because of limited time, while many less difficult but still important problems don’t get much attention. If AI tools can be applied to them all at once, it could lead to a more empirical, scientific way of doing mathematics, says Tao, where different ways of solving a problem could be tested on a large scale.
“We are just so resource-limited by how much expert attention we have, that we don’t look at 99 per cent of all the problems that we could be studying,” says Tao. “So we don’t do things like survey hundreds of problems, trying to find one or two really interesting ones, or do statistical studies like, we have two different methods, which one is better?
“This is a type of mathematics that just isn’t done,” he says. “We don’t do large-scale mathematics because we don’t have the intellectual resources, but AI is showing that you can.”
Topics:



