RECON: An LLM-Enhanced Backward Constraint Analysis Framework

📅 2026-06-08

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

Traditional symbolic execution struggles in complex systems like Android due to path explosion, difficulties in function modeling, and semantic loss, hindering efficient extraction of precise constraints. This work proposes RECON, a novel framework that integrates large language models (LLMs) into backward constraint analysis: starting from a target method and tracing back to entry points, it extracts method-level control-flow constraints and leverages an LLM to translate bytecode conditions into interpretable semantic specifications. While preserving logical equivalence, this approach substantially enhances both analysis efficiency and interpretability. Experimental results demonstrate that RECON achieves 100% success across 78 Android scenarios, operating 5.8× faster than conventional symbolic execution, and generates semantic constraints triggering dangerous APIs with 84% success on 100 malicious samples.

📝 Abstract

While traditional techniques, such as symbolic execution, provide a principled foundation for precise constraint reasoning in program analysis, they struggle to scale to modern software systems mainly due to path explosion, the need for function modeling, and the loss of semantic intent at low-level program representations. In complex execution environments such as Android, characterized by extensive framework interactions and event-driven behavior, these limitations are even more amplified. Thus, in this paper, we present a novel large language model (LLM)-enhanced backward constraint analysis framework that combines the precision of static program analysis with LLM's semantic understanding to extract precise execution constraints from Android bytecode. Our approach, titled RECON, performs backward path discovery from target method(s) to the application entry point(s), discovers method-level control-flow constraints, and leverages LLM reasoning to transform bytecode conditions into interpretable specifications. We evaluated RECON using five LLMs across 78 Android constraint-extraction scenarios and compared it with traditional symbolic execution on real-world applications. Results demonstrate that our approach operates 5.8X faster than traditional symbolic execution, with a 100% success rate, while maintaining logical equivalence and providing significantly more precise and interpretable output. We further evaluated RECON for malware analysis on 100 samples. The results indicate an 84% success rate in generating semantic constraints that lead to the execution of dangerous API behaviors and in detecting complex constraints across multiple execution paths.

Problem

Research questions and friction points this paper is trying to address.

symbolic execution

path explosion

constraint analysis

Android

semantic intent

Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM-enhanced program analysis

backward constraint analysis

symbolic execution