News
A few months before the 2025 International Mathematical Olympiad (IMO) in July, a three-person team at OpenAI made a long bet ...
OpenAI researchers reveal how their experimental model, devoid of any external aids, powered through hours-long proofs to earn a gold-medal score at the International Math Olympiad—and they discuss th ...
And yet, recently, Gemini 2.5 Pro and OpenAI’s o3 scored 86.7% and 88.9%, respectively, on the American Invitational Mathematics Examination (AIME), a key math benchmark for AI models.
New secret math benchmark stumps AI models and PhDs alike. FrontierMath's difficult questions remain unpublished so that AI companies can't train against it.
To measure the problem-solving ability of large and general-purpose language models, the researchers created a dataset called MATH, which consists of 12,500 problems taken from high school math ...
Mathematical model reveals how collapsing matter and expanding voids shape universe's evolution
A University of Queensland researcher has developed a new mathematical model to explain the evolution of the universe which, ...
Microsoft enhances the capabilities of small language models (SLMs) with rStar-Math. The technique allows SLMs to match or even surpass the math reasoning ...
Chinese AI lab DeepSeek has quietly updated Prover, its AI system that's designed to solve math-related proofs and theorems.
The reason for this is fundamental: ChatGPT, and many models like it, can't actually do math. They rely on sophisticated pattern recognition and statistical memory, not true mathematical computation.
While the number of ED1 projects is poised to grow this year, a growing number of developers and advocates question the math behind them.