Learning First Integrals via Backward-Generated Data and Guided Reinforcement Learning

📅 2026-05-20
📈 Citations: 0
✨ Influential: 0

🤖 AI Summary
The discovery of first integrals in dynamical systems has long been hindered by the scarcity of high-quality data and reliance on mathematical intuition. This work proposes FISolver, which introduces a novel inverse generation algorithm to construct a large-scale dataset of differential equation–first integral pairs. By integrating supervised fine-tuning with reinforcement learning guided by Levenshtein distance, FISolver optimizes a small mathematical language model. Through reward shaping and synthetic data strategies, it achieves efficient generalization from sparse examples to complex families of problems. Experiments demonstrate that FISolver significantly outperforms both large mathematical language models and commercial solvers such as Mathematica on challenging benchmarks, while substantially reducing computational costs.
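To make the idea of a reward guided by Levenshtein distance concrete, here is a minimal sketch. The summary does not give the actual reward formula, so the normalization, the 0.5 cap on partial credit, and the function names `levenshtein` and `shaped_reward` are all assumptions made for illustration. It also compares raw strings, whereas a real pipeline would presumably check candidates symbolically.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance between two strings (insertions, deletions, substitutions)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def shaped_reward(prediction: str, reference: str) -> float:
    """Hypothetical shaped reward: 1.0 for an exact match, otherwise partial
    credit that grows as the prediction gets closer to the reference."""
    if prediction == reference:
        return 1.0
    longest = max(len(prediction), len(reference))
    similarity = 1.0 - levenshtein(prediction, reference) / longest
    # Assumed scaling: near misses earn at most half the exact-match reward.
    return 0.5 * similarity


print(shaped_reward("x**2 + y**2", "x**2 + y**2"))
print(shaped_reward("x**2 + y**3", "x**2 + y**2"))
```

The point of shaping is that a near-miss expression still receives a nonzero learning signal, rather than the all-or-nothing signal of a pure correctness check.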
📝 Abstract
The discovery of first integrals is of fundamental scientific importance for understanding conservation laws in dynamical systems. However, existing symbolic computation tools and Large Language Models (LLMs) remain limited on this task because high-quality training data are scarce and successful solutions often depend on mathematical intuition. This paper presents FISolver, an LLM-based solver developed to address this challenge. First, we introduce a "Backward Generation" algorithm that systematically builds large-scale datasets of (differential equation, first integral) pairs by deriving differential equations from sampled integrals, thereby alleviating the data scarcity bottleneck. Second, we apply supervised fine-tuning to a compact mathematical model and further improve its performance through reinforcement learning with a Levenshtein Distance-based shaped reward. In addition, we design data synthesis and blending strategies that support effective adaptation to difficult problem families from sparse examples. Experiments show that FISolver, while requiring substantially lower computational cost, significantly outperforms larger mathematical LLMs and commercial solvers such as Mathematica on challenging benchmarks, indicating a new data-driven route for automated discovery of first integrals.
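The backward direction described above (sample an integral first, then derive an equation that conserves it) can be illustrated with a small pure-Python sketch. The abstract does not specify how FISolver derives the equations, so this uses one simple, well-known construction as a stand-in: for a sampled polynomial H(x, y), the planar system dx/dt = ∂H/∂y, dy/dt = −∂H/∂x has H as a first integral. The polynomial representation and all function names are hypothetical.

```python
import random
from collections import defaultdict

# A polynomial in (x, y) is a dict mapping exponent pairs (i, j) to integer
# coefficients, e.g. {(2, 0): 1, (0, 2): 1} stands for x**2 + y**2.

def diff(poly, var):
    """Partial derivative; var is 0 for x and 1 for y."""
    out = defaultdict(int)
    for exps, c in poly.items():
        if exps[var] > 0:
            new = list(exps)
            new[var] -= 1
            out[tuple(new)] += c * exps[var]
    return {e: c for e, c in out.items() if c != 0}

def mul(p, q):
    out = defaultdict(int)
    for (a, b), c in p.items():
        for (d, e), k in q.items():
            out[(a + d, b + e)] += c * k
    return {e: c for e, c in out.items() if c != 0}

def add(p, q):
    out = defaultdict(int, p)
    for e, c in q.items():
        out[e] += c
    return {e: c for e, c in out.items() if c != 0}

def neg(p):
    return {e: -c for e, c in p.items()}

def sample_integral(rng, max_degree=3, n_terms=3):
    """Sample a random non-constant polynomial H(x, y)."""
    poly = {}
    while len(poly) < n_terms:
        i = rng.randint(0, max_degree)
        j = rng.randint(0, max_degree - i)
        if i + j > 0:  # skip the constant term
            poly[(i, j)] = rng.choice([-3, -2, -1, 1, 2, 3])
    return poly

def backward_generate(rng):
    """Return ((f, g), H), where dx/dt = f and dy/dt = g conserve H.
    Assumed construction: f = dH/dy, g = -dH/dx."""
    H = sample_integral(rng)
    return (diff(H, 1), neg(diff(H, 0))), H

def lie_derivative(H, system):
    """dH/dt along trajectories: H_x * f + H_y * g. Zero iff H is conserved."""
    f, g = system
    return add(mul(diff(H, 0), f), mul(diff(H, 1), g))

rng = random.Random(0)
system, H = backward_generate(rng)
assert lie_derivative(H, system) == {}  # the sampled H is a first integral
```

Generating pairs this way is cheap because verifying a first integral only needs differentiation, while the forward problem of finding one from the equations is the hard task the model is trained on.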
Problem

Research questions and friction points this paper is trying to address.

first integrals
dynamical systems
conservation laws
symbolic computation
data scarcity
Innovation

Methods, ideas, or system contributions that make the work stand out.

Backward Generation
First Integrals
Reinforcement Learning
Levenshtein Distance
Data Synthesis