HUNYUANPROVER: A Scalable Data Synthesis Framework and Guided Tree Search for Automated Theorem Proving

📅 2024-12-30

📈 Citations: 0

✨ Influential: 0

career value

207K/year

🤖 AI Summary

High-quality formalized data is scarce, and search efficiency remains low in interactive theorem proving. Method: We propose HunyuanProver—a Lean 4–specific theorem prover built upon Hunyuan-7B—featuring a scalable, low-cost iterative data synthesis framework for automated formalization and synthetic data generation, coupled with a guided tree search algorithm that emulates “System 2” reasoning to enhance proof discovery. Contribution/Results: HunyuanProver achieves a 68.4% pass rate on miniF2F-test, establishing a new state-of-the-art. It is the first model to formally verify four actual International Mathematical Olympiad (IMO) problems on miniF2F-test. Concurrently, we release a high-quality synthetic dataset comprising 30,000 natural-language problems, their formal statements, and complete Lean 4 proofs—significantly alleviating the formalization data bottleneck.

Technology Category

Application Category

📝 Abstract

We introduce HunyuanProver, an language model finetuned from the Hunyuan 7B for interactive automatic theorem proving with LEAN4. To alleviate the data sparsity issue, we design a scalable framework to iterative synthesize data with low cost. Besides, guided tree search algorithms are designed to enable effective ``system 2 thinking`` of the prover. HunyuanProver achieves state-of-the-art (SOTA) performances on major benchmarks. Specifically, it achieves a pass of 68.4% on the miniF2F-test compared to 65.9%, the current SOTA results. It proves 4 IMO statements (imo_1960_p2, imo_1962_p2}, imo_1964_p2 and imo_1983_p6) in miniF2F-test. To benefit the community, we will open-source a dataset of 30k synthesized instances, where each instance contains the original question in natural language, the converted statement by autoformalization, and the proof by HunyuanProver.

Problem

Research questions and friction points this paper is trying to address.

Automated Theorem Proving

Mathematical Proof

Data Scarcity

Innovation

Methods, ideas, or system contributions that make the work stand out.

HunyuanProver

automated theorem proving

efficient search algorithm

🔎 Similar Papers

FGeo-HyperGNet: Geometry Problem Solving Integrating Formal Symbolic System and Hypergraph Neural Network