MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

By Jiacheng Chen, Xinyu Zhang, Shunkai Zhang, Yanmohan Wang, Lin Li

HF Daily Papers · Jun 11, 2026

MiniMax reports gold-medal-level performance on IMO and USAMO proof tasks for its M3 proof model combined with MaxProof test-time scaling.

Categories: Model Releases, Research

Excerpt

We present MaxProof, a population-level test-time scaling framework for competition-level mathematical proof in the MiniMax-M3 series. M3 first trains three proof-oriented capabilities -- proof generation, proof verification, and critique-conditioned proof repair -- using a defense-in-depth generative verifier engineered for low false-positive rate. These capabilities are merged into a single released M3 model. At test time, MaxProof treats the model as a generator, verifier, refiner, and ranker, searches over a population of candidate proofs, and returns one final proof through tournament selection. With MaxProof test-time scaling, the M3 model reaches 35/42 on IMO 2025 and 36/42 on USAMO 2026, exceeding the human gold-medal threshold on both.
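The excerpt gives only a high-level outline of the test-time loop: generate a population, verify, refine, rank, and select by tournament. The Python sketch below shows one way such a loop could be organized. It is not the paper's implementation. Every name in it (`population_search`, `Candidate`, the `toy_*` functions) is hypothetical, the single-elimination bracket is an assumed form of tournament selection, and the toy functions stand in for calls to one model acting in its four roles. Here a "proof" is just a count of remaining gaps, so the example can run without a model.

```python
import random
from dataclasses import dataclass
from typing import Any


@dataclass
class Candidate:
    proof: Any          # a candidate proof (text, in a real system)
    score: float = 0.0  # latest verifier score in [0, 1]


def population_search(problem, generate, verify, refine, compare,
                      pop_size=8, rounds=2, seed=0):
    """Hypothetical population-level search; assumes pop_size >= 1."""
    rng = random.Random(seed)
    # 1. Generator role: sample an initial population of candidate proofs.
    population = [Candidate(generate(problem, rng)) for _ in range(pop_size)]

    for _ in range(rounds):
        # 2. Verifier role: score every candidate.
        for cand in population:
            cand.score = verify(problem, cand.proof)
        # 3. Refiner role: repair candidates the verifier did not fully accept.
        population = [
            cand if cand.score >= 1.0
            else Candidate(refine(problem, cand.proof))
            for cand in population
        ]

    # Re-score so refined candidates carry an up-to-date score.
    for cand in population:
        cand.score = verify(problem, cand.proof)

    # 4. Ranker role: single-elimination tournament over the population.
    pool = list(population)
    while len(pool) > 1:
        winners = [compare(problem, pool[i], pool[i + 1])
                   for i in range(0, len(pool) - 1, 2)]
        if len(pool) % 2 == 1:
            winners.append(pool[-1])  # an unpaired candidate gets a bye
        pool = winners
    return pool[0]


# Toy stand-ins (assumptions for illustration only).
def toy_generate(problem, rng):
    return rng.randint(1, 5)          # number of gaps in a fresh draft


def toy_verify(problem, gaps):
    return 1.0 / (1 + gaps)           # 1.0 only when no gaps remain


def toy_refine(problem, gaps):
    return max(0, gaps - 1)           # each repair pass closes one gap


def toy_compare(problem, a, b):
    return a if a.score >= b.score else b


best = population_search("toy problem", toy_generate, toy_verify,
                         toy_refine, toy_compare, pop_size=8, rounds=5)
print(best)
```

In a real system the pairwise `compare` step would presumably be a model judgment rather than a score comparison, which is what makes a tournament useful when verifier scores tie.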

Read at source: https://arxiv.org/abs/2606.13473

Discussions

hn · 78 points · 7 comments
hn · 89 points · 7 comments
hn · 92 points · 7 comments
hn · 94 points · 8 comments
hn · 100 points · 8 comments
hn · 107 points · 8 comments
hn · 108 points · 8 comments
hn · 110 points · 8 comments
hn · 115 points · 9 comments
hn · 118 points · 9 comments
hn · 119 points · 10 comments
hn · 121 points · 10 comments
hn · 123 points · 10 comments
hn · 127 points · 12 comments
hn · 128 points · 12 comments
hn · 128 points · 13 comments
hn · 130 points · 13 comments
hn · 131 points · 13 comments
hn · 132 points · 13 comments
hn · 134 points · 13 comments