🤖 AI Summary
Large reasoning models (LRMs) face significant bottlenecks in frontier mathematical research due to the high complexity of open problems and the stringent rigor required in formal proof construction.
Method: We propose an exploratory long-path solving mechanism coupled with a pessimistic rational verification method, integrating heuristic search, formal verification guidance, multi-step logical chain construction, and a domain-adaptive reflection module. This forms the first LRM-based intelligent agent framework explicitly designed for authentic mathematical research.
Contribution/Results: Evaluated on multiple open mathematical problems, the system autonomously completes core proofs, discovers nontrivial lemmas, and generates structured mathematical insights. It achieves, for the first time, systematic, reliable, and creative synergy of LRMs in research-level mathematics. Our results empirically validate the feasibility and acceleration potential of LRM-driven original mathematical discovery.
📝 Abstract
Large Reasoning Models (LRMs) have made significant progress in mathematical capabilities in recent times. However, these successes have been primarily confined to competition-level problems. In this work, we propose AI Mathematician (AIM) framework, which harnesses the reasoning strength of LRMs to support frontier mathematical research. We have identified two critical challenges of mathematical research compared to competition, {it the intrinsic complexity of research problems} and {it the requirement of procedural rigor}. To address these challenges, AIM incorporates two core strategies: an exploration mechanism to foster longer solution paths, and the pessimistic reasonable verification method to ensure reliability. This early version of AIM already exhibits strong capability in tackling research-level tasks. We conducted extensive experiments across several real-world mathematical topics and obtained promising results. AIM is able to autonomously construct substantial portions of proofs and uncover non-trivial insights within each research area. These findings highlight the potential of LRMs in mathematical discovery and suggest that LRM-based agent systems could significantly accelerate mathematical research in the future.