Benchmarks Math - Search News

10h

Nvidia's Nemotron-Cascade 2 wins math and coding gold medals with 3B active parameters — and its post-training recipe is now open-source

Nvidia's Nemotron-Cascade 2 is a 30B MoE model that activates only 3B parameters at inference time, yet achieved gold ...

Decrypt

Forget AGI—Top AI Models Still Struggle With Math

New benchmark study results show leading AI models, including ChatGPT, Claude, and Gemini, still lag humans in visual math ...

WFAE 90.7

CMS’ uphill climb on Math I — and what it says about teacher retention

Math I was the focus of last week’s CMS school board meeting, giving us the latest look at progress toward the goal.

Scientific American

As AI keeps improving, mathematicians struggle to foretell their own future

First Proof is an effort to see whether LLMs can contribute meaningfully to pure mathematics research. The dust has settled ...

News and Sentinel

Wood County BOE hears presentations on federal funding and i-Ready data

Amber Hardman, director of federal programs, gave the Wood County Board of Education an in-depth overview of what federal ...

Marietta Times

Wood BOE gives info on federal funding and i-Ready data

Amber Hardman, director of federal programs, gave the Wood County Board of Education an in-depth overview of what federal education dollars can and cannot fund Tuesday, and “how the district plans to ...

13d

Leon Schools FAST test data shows student gains

The Leon County school district's FAST testing data shows small increases in student reading and math proficiency across ...

Hacker

The “Benchmark Trap”: Why AI’s Math Scores Don’t Prove It Can Reason

Among other things, launching AIModels.fyi ... Find the right AI model for your project - https://aimodels.fyi ...

Mashable

Google releases Gemini 3.1 Pro: Benchmark performance, how to try it

Google released its latest core reasoning model, Gemini 3.1 Pro, on Thursday. Google says that Gemini 3.1 Pro achieved twice the verified performance of 3 Pro on ARC-AGI-2, a popular benchmark that ...

TechCrunch

Jack Altman joins Benchmark as GP

Jack Altman and Benchmark announced today that he would be joining the firm as a general partner. This news is a big deal, especially since Altman has been running his own VC firm, Alt Capital, since ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results