Enhancing the Geometric Problem-Solving Ability of Multimodal LLMs via Symbolic-Neural Integration

📅 2025-04-17
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Multimodal large language models (MLLMs) exhibit unreliable geometric problem solving (GPS) due to scarcity of fine-grained annotated data and severe reasoning hallucinations. Method: This paper proposes GeoLogic, a neuro-symbolic collaborative framework. It introduces (1) GeoGen, the first automated method for generating high-quality synthetic data—including geometric diagrams and stepwise reasoning traces—and (2) a bidirectional natural language–symbolic verification mechanism integrating a procedural geometric theorem engine, multi-stage data synthesis, supervised fine-tuning, and runtime symbolic validation. Results: Evaluated on multiple geometric reasoning benchmarks, GeoLogic significantly outperforms state-of-the-art MLLMs, effectively mitigating hallucinations while improving answer accuracy, logical rigor, and interpretability of reasoning steps.

Technology Category

Application Category

📝 Abstract
Recent advances in Multimodal Large Language Models (MLLMs) have achieved remarkable progress in general domains and demonstrated promise in multimodal mathematical reasoning. However, applying MLLMs to geometry problem solving (GPS) remains challenging due to lack of accurate step-by-step solution data and severe hallucinations during reasoning. In this paper, we propose GeoGen, a pipeline that can automatically generates step-wise reasoning paths for geometry diagrams. By leveraging the precise symbolic reasoning, extbf{GeoGen} produces large-scale, high-quality question-answer pairs. To further enhance the logical reasoning ability of MLLMs, we train extbf{GeoLogic}, a Large Language Model (LLM) using synthetic data generated by GeoGen. Serving as a bridge between natural language and symbolic systems, GeoLogic enables symbolic tools to help verifying MLLM outputs, making the reasoning process more rigorous and alleviating hallucinations. Experimental results show that our approach consistently improves the performance of MLLMs, achieving remarkable results on benchmarks for geometric reasoning tasks. This improvement stems from our integration of the strengths of LLMs and symbolic systems, which enables a more reliable and interpretable approach for the GPS task. Codes are available at https://github.com/ycpNotFound/GeoGen.
Problem

Research questions and friction points this paper is trying to address.

Improving MLLMs' geometric problem-solving via symbolic-neural integration
Generating high-quality geometry QA pairs to reduce hallucinations
Enhancing logical reasoning by combining LLMs with symbolic tools
Innovation

Methods, ideas, or system contributions that make the work stand out.

Automatically generates step-wise reasoning paths
Integrates symbolic reasoning for high-quality data
Trains LLM with synthetic data for verification
🔎 Similar Papers
No similar papers found.
Y
Yicheng Pan
University of Science and Technology of China
Z
Zhenrong Zhang
University of Science and Technology of China
P
Pengfei Hu
University of Science and Technology of China
Jiefeng Ma
Jiefeng Ma
USTC
NLP、Language Modelling、Document Intelligence
J
Jun Du
University of Science and Technology of China
J
Jianshu Zhang
iFLYTEK Research
Q
Quan Liu
iFLYTEK Research
J
Jianqing Gao
iFLYTEK Research
F
Feng Ma
iFLYTEK Research