An Amateur Just Solved a 60-Year-Old Math Problem - by Asking AI (scientificamerican.com) 27

Posted by EditorDavid on Saturday May 02, 2026 @02:34PM from the do-the-math dept.

Slashdot reader joshuark writes: Scientific American reports that a ChatGPT AI has proved a conjecture with a method no human had developed. A 23-year-old student Liam Price just cracked a 60-year-old problem that world-class mathematicians have tried and failed to solve.

The new solution that Price got in response to a single prompt to GPT-5.4 Pro was posted on www.erdosproblems.com, a website devoted to the Erds problems. The question Price solved — or prompted ChatGPT to solve—concerns special sets of whole numbers, where no number in the set can be evenly divided by any other...

Price sent it to his occasional collaborator Kevin Barreto, a second-year undergraduate in mathematics at the University of Cambridge. The duo had jump-started the AI-for-Erds craze late last year by prompting a free version of ChatGPT with open problems chosen at random from the Erds problems website. Reviewing Price's message, Barreto realized what they had was special, and experts whom he notified quickly took notice.

An Amateur Just Solved a 60-Year-Old Math Problem - by Asking AI

Post Load All Comments

Search 27 Comments Log In/Create an Account

Comments Filter:

The new wave (Score:2, Insightful)

by Anonymous Coward writes:

This is what a lot of people get wrong about "AI". The AI itself isn't the special thing on its own. It may not be real AGI but honestly you don't want that because then it wouldn't be a tool. It's the way it can wrap huge volumes of information in a way that is easily manipulated by a human. It's a search engine connected directly to your brain. With a little creativity and an AI you would be astounded as to what's possible with careful use.
It's not that different from any other work, just more advanced. L
- Re: (Score:2)
  
  by Cthefuture ( 665326 ) writes:
  
  Futures generations will look back at these times and how "simple" and "awesome" they were. You're part of something bigger and you have a front row seat.
  - - Re: (Score:1)
      
      by dfghjk ( 711126 ) writes:
      
      Correct, those in charge seek to turn us all into mulch, they view people as property that they haven't yet stolen. They seek to replace you, deny you an ability to earn a wage, take everything you own, enslave you and have you starve to death homeless, voteless, and without rights or healthcare. This is capitalism in its purest form, only with a few people capable of taking everything with the aid of machines that we collectively pay for. And they are in league with religious freaks that seek to bring ab
      - That's not quite right (Score:1)
        
        by rsilvergun ( 571051 ) writes:
        
        They view you as a dependency that they would like to break. They don't even want to have to use you as mulch they want to transcend that.
        
        When somebody asks who is going to buy their products the Epstein class is completely aware of that dependency and they are working to use automation in order to break the dependency.
        
        Right now some of us have a little bargaining power for wages and the ruling class doesn't want to pay wages at all. They don't want to have to do anything with filthy humans that are
- Levels of abstraction matter (Score:3)
  
  by shanen ( 462549 ) writes:
  
  [Overlooking the nameless BF.]
  Most relevant recent citation is Stolen Focus by Johann Hari, but my use of "level of abstraction" goes back many years and the more modern label is probably "reference frame". Also related to contextual meaning.
  Fundamental problem is the accumulation of too much information, so we have to attack problems by reframing them at the right level and by using the correct tools to manipulate them within that appropriate frame. Not at all surprised that someone who hasn't mastered a
- Re: (Score:1)
  
  by dfghjk ( 711126 ) writes:
  
  "This is what a lot of people get wrong about "AI""
  What is "this"? And who gets it wrong?
  "The AI itself isn't the special thing on its own."
  Which AI is "the AI"? It's not one thing, but it is a "special thing on its own".
  "...because then it wouldn't be a tool."
  Is your brain a tool? Are workers a tool? What makes AGI not a tool? AGI is merely a nebulous goalpost. a talking point to get money.
  "It's a search engine connected directly to your brain"
  So is your brain. And different AI works differently. Tra
Erds (Score:2)

by 602 ( 652745 ) writes:

lol
- Re: (Score:2)
  
  by shanen ( 462549 ) writes:
  
  Yeah, it's funny on its face, but when I asked a generative AI about "AI for Erds" and it hallucinated an answer involving ye olde Entity Relationship Diagrams.
  Is there a new job category for people who are good at asking questions the generative AIs can't answer properly? I'm pretty sure my batting average against Gemini is way over .300. Combination of nasty questions and wording questions in ways that suggest I might be expecting a particular wrong answer. I'm also considering the possibility that the AI
Just means none of the experts cared enough (Score:1)

by gweihir ( 88907 ) writes:

Likely because they have other things to do that are more important. Seriously, digging though old stuff will occasionally lead to some discovery. That is all that happened here.
- Re: (Score:3)
  
  by jd ( 1658 ) writes:
  
  Whilst you're almost certainly correct (AI would be unlikely to conquer a problem requiring any meaningful original thinking, even with help), this gives the aforementioned student an Erdos number (which is not quite as exciting as a Fields medal, but nothing to sneeze at either) and it's entirely possible that the conjecture will turn out to actually be useful in some area.
  - Re: (Score:2)
    
    by dfghjk ( 711126 ) writes:
    
    "AI would be unlikely to conquer a problem requiring any meaningful original thinking, even with help"
    You have no reason to believe that, and I don't think it's true. The human brain does not work on magic, "original thinking" may be beyond us currently but there's no reason to think it will remain unsolved.
    - Re: (Score:2)
      
      by gweihir ( 88907 ) writes:
      
      Ah, yes, the deranged claim that we know how the human mind works and it is purely mechanistic. You just excluded yourself from rational discussion by pushing a quasi-religious dogma with no supporting scientifically sound evidence.
      Seriously, you "AI believers" are not one bit smarter than the Jesus-freaks.
  - Re: (Score:2)
    
    by JoshuaZ ( 1134087 ) writes:
    
    Aside from them being almost certainly *not* correct here, given that there are a whole bunch of prior papers about this specific problem, you are confusing two different things. There's having solved an Erdos problem which is different than having an Erdos number. An Erdos number https://en.wikipedia.org/wiki/Erd%C5%91s_number [wikipedia.org] comes from having a chain of collaborators going back to Erdos. Erdos has Erdos number 0. Anyone who wrote a paper with Erdos has Erdos number 1. If someone else then writes a paper
- Re:Just means none of the experts cared enough (Score:4, Informative)
  
  by JoshuaZ ( 1134087 ) writes: on Saturday May 02, 2026 @04:45PM (#66124492) Homepage
  
  Mathematician here, and in the same area of research (number theory). This is not a problem where no one one cared. While there are some Erdos problems in this category, this problem is one which was well known enough that I was already familiar with. This is also a problem where multiple people, including Jared Lichtman, who is an up and coming well respected young number theorist, have thought about. And if you go to the page for problem 1196 on the general Erdos Problem data base, you'll see three references all of which include references to further papers which thought about this problem. https://www.erdosproblems.com/1196 [erdosproblems.com].
  
  Reply to This Parent Share
  Flag as Inappropriate
  - Re: (Score:2)
    
    by gweihir ( 88907 ) writes:
    
    If you are a mathematician, you should be able to see the difference between "nobody cared enough" (my claim) and "no one cared" (your gross mis-statement of my claim).
Ahem (Score:1)

by fubarrr ( 884157 ) writes:

Since all it can output is whatever it managed to Google in the internet, then it mens some already had a speculative solution to the problem, which he, or she never properly pushed into the scholarly society.
- Re: (Score:2)
  
  by Tablizer ( 95088 ) writes:
  
  How easy is it to back-track through the model to find the main source(s) of the answer?
  - No, that's not how gen AI works (Score:1)
    
    by shanen ( 462549 ) writes:
    
    But I don't have the math skills to explain properly, so I'll resort to a kind of metaphor. The real "talent" of AIs is in sounding plausible based on lots of examples of "what sounds good". In this case, it turned out that Occam's Razor worked, and an explanation that sounded plausible turned out to be a valid proof.
    But I speculate there were many failures, some of them hilarious, though the human "author" only published the good guess.
  - Re: (Score:2)
    
    by dfghjk ( 711126 ) writes:
    
    There are no "main source(s) of the answer", so not easy at all. However, perhaps you could ask the AI how the answer was generated.
    The human mind does not arrive at answers purely by applying previously learned answers, neither does generative AI. AI makes predictions based on COLLECTIVE previous experience, not with boolean comparisons to collections of "main sources". AI remembers no "main sources" at all, it can, however, predict what they were with remarkable accuracy when asked to do so. This is w
- Re: (Score:3)
  
  by JoshuaZ ( 1134087 ) writes:
  
  Mathematician here. This is highly unlikely to be the case. This was a moderately well known Erdos problem (not one of the famous ones but well known enough that I had seen it before this). If there were a solution on the internet we would likely have already found it, especially because there's been a concerted effort to track down what is happening with all the Erdos problems in the last few years. Moreover, even after this problem was solved, people then went and tried hard to find a copy of the solution
- Re: (Score:2)
  
  by dfghjk ( 711126 ) writes:
  
  Except that is false.
We know (Score:2)

by nospam007 ( 722110 ) * writes:

Somebody already told us.
Unfortunately we still don't have an anti-dupe AI.
"Do you like apples?" (Score:1)

by realmojo ( 62898 ) writes:

"I solved it. How you like them apples?"
- Re: (Score:2)
  
  by EnsilZah ( 575600 ) writes:
  
  If they write a script for searching and trying to solve mathematical problems, they should call it Math Daemon.

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

An Amateur Just Solved a 60-Year-Old Math Problem - by Asking AI (scientificamerican.com) 27

An Amateur Just Solved a 60-Year-Old Math Problem - by Asking AI More | Reply Login

An Amateur Just Solved a 60-Year-Old Math Problem - by Asking AI

The new wave (Score:2, Insightful)

Re: (Score:2)

Re: (Score:1)

That's not quite right (Score:1)

Levels of abstraction matter (Score:3)

Re: (Score:1)

Erds (Score:2)

Re: (Score:2)

Just means none of the experts cared enough (Score:1)

Re: (Score:3)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re:Just means none of the experts cared enough (Score:4, Informative)

Re: (Score:2)

Ahem (Score:1)

Re: (Score:2)

No, that's not how gen AI works (Score:1)

Re: (Score:2)

Re: (Score:3)

Re: (Score:2)

We know (Score:2)

"Do you like apples?" (Score:1)

Re: (Score:2)

Related Links Top of the: day, week, month.

Slashdot Top Deals

Slashdot