Mathematical Operations Reasoning

National Academies of Sciences%2c Engineering%2c and Medicine

AI to Assist Mathematical Reasoning: A Workshop

The National Academies of Sciences, Engineering, and Medicine are private, nonprofit institutions that provide expert advice on some of the most pressing challenges facing the nation and world. Our ...

VentureBeat

AI’s math problem: FrontierMath benchmark shows how far technology still has to go

Artificial intelligence systems may be good at generating text, recognizing images, and even solving basic math problems—but when it comes to advanced mathematical reasoning, they are hitting a wall.

Ars Technica

New study shows why simulated reasoning AI models don’t yet live up to their billing

There’s a curious contradiction at the heart of today’s most capable AI models that purport to “reason”: They can solve routine math problems with accuracy, yet when faced with formulating deeper ...

EdSource

A greater role in math education for parents: mathematical reasoning at home

EdSource · This California Teacher of the Year embraces her dwarfism as a strength While policymakers, researchers and educators decide how our children learn math, parents don’t seem to be anywhere ...

GIGAZINE

DeepSeek releases AI model 'DeepSeek-Math-V2' specialized for mathematical reasoning, achieving a gold medal-level accuracy rate at the International Mathematical Olympiad

DeepSeek released DeepSeek-Math-V2, an AI model specialized for mathematical reasoning, on November 27, 2025. DeepSeek-Math-V2 focuses on theorem proving and self-verification capabilities, and ...

Business Today

Apple researchers find Large Language Models lack robust mathematical reasoning abilities; here's why

A team of Apple researchers has released a paper scrutinising the mathematical reasoning capabilities of large language models (LLMs), suggesting that while these models can exhibit abstract reasoning ...

InfoQ

Microsoft Research Unveils rStar-Math: Advancing Mathematical Reasoning in Small Language Models

A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...

EurekAlert!

MathEval: a comprehensive benchmark for evaluating large language models on mathematical reasoning capabilities

This study introduces MathEval, a comprehensive benchmarking framework designed to systematically evaluate the mathematical reasoning capabilities of large language models (LLMs). Addressing key ...

National Academies of Sciences%2c Engineering%2c and Medicine

AI to Assist Mathematical Reasoning: A Workshop

A National Academies of Sciences, Engineering, and Medicine-appointed ad hoc committee will plan and organize a workshop that will bring together academic, industry, and government stakeholders to ...

Results that may be inaccessible to you are currently showing.

Hide inaccessible results