CoinsValue.net logo CoinsValue.net logo
Bitcoin World 2026-05-20 20:55:13

OpenAI says its reasoning model solved an 80-year-old math problem — this time with proof

BitcoinWorld OpenAI says its reasoning model solved an 80-year-old math problem — this time with proof OpenAI has announced that its latest reasoning model produced an original mathematical proof disproving a long-standing conjecture in geometry — a problem first posed by mathematician Paul Erdős in 1946. Unlike a similar claim made seven months ago that was later retracted, the company says this achievement has been independently verified by leading mathematicians. A claim with stronger backing this time The announcement comes after a notable misstep. In October 2024, OpenAI’s former vice president Kevil Weil posted on X that GPT-5 had found solutions to 10 previously unsolved Erdős problems and made progress on 11 others. That claim was quickly debunked: the model had only rediscovered existing solutions already documented in mathematical literature. Critics including Yann LeCun and Google DeepMind CEO Demis Hassabis publicly questioned the statement, and Weil deleted the post shortly after. This time, OpenAI appears to have taken greater care. Alongside the announcement, the company published supporting remarks from mathematicians Noga Alon, Melanie Wood, and Thomas Bloom, who maintains the Erdős Problems website. Bloom had previously called Weil’s earlier post “a dramatic misrepresentation.” What the model actually did For nearly 80 years, mathematicians believed the best possible solutions to the geometry problem resembled square grids. OpenAI’s model discovered a new family of constructions that outperforms those traditional approaches, effectively disproving that long-held assumption. OpenAI described the result as “the first time AI has autonomously solved a prominent open problem central to a field of mathematics.” The company emphasized that the proof came from a general-purpose reasoning model, not a system specifically designed for mathematics or for solving this particular problem. Why this matters beyond mathematics The significance, according to OpenAI, lies in the model’s ability to maintain long, complex chains of reasoning and to connect ideas across different fields — capabilities that could extend to biology, physics, engineering, and medicine. Thomas Bloom reflected on the broader implications: “AI is helping us to more fully explore the cathedral of mathematics we have built over the centuries. What other unseen wonders are waiting in the wings?” Conclusion While the field of AI-driven scientific discovery has seen its share of overhyped claims, this verified result marks a genuine step forward. It demonstrates that general-purpose reasoning models can now contribute original insights to longstanding open problems — a milestone that may reshape how researchers approach complex questions across multiple disciplines. FAQs Q1: What problem did OpenAI’s model solve? The model produced a proof disproving a geometry conjecture first posed by mathematician Paul Erdős in 1946, which had remained unsolved for nearly 80 years. Q2: How is this different from OpenAI’s previous claim about solving Erdős problems? Seven months ago, a former OpenAI executive claimed GPT-5 solved multiple Erdős problems, but those turned out to be rediscoveries of existing solutions. This time, the proof is original and has been verified by independent mathematicians. Q3: Could this AI capability be used outside of mathematics? Yes. OpenAI says the reasoning model is general-purpose, meaning its ability to hold long chains of reasoning and connect ideas across fields could be applied to biology, physics, engineering, and medicine. This post OpenAI says its reasoning model solved an 80-year-old math problem — this time with proof first appeared on BitcoinWorld .

阅读免责声明 : 此处提供的所有内容我们的网站,超链接网站,相关应用程序,论坛,博客,社交媒体帐户和其他平台(“网站”)仅供您提供一般信息,从第三方采购。 我们不对与我们的内容有任何形式的保证,包括但不限于准确性和更新性。 我们提供的内容中没有任何内容构成财务建议,法律建议或任何其他形式的建议,以满足您对任何目的的特定依赖。 任何使用或依赖我们的内容完全由您自行承担风险和自由裁量权。 在依赖它们之前,您应该进行自己的研究,审查,分析和验证我们的内容。 交易是一项高风险的活动,可能导致重大损失,因此请在做出任何决定之前咨询您的财务顾问。 我们网站上的任何内容均不构成招揽或要约