Fast, Fine-Grained Equivalence Checking for Neural Decompilers

📅 2025-01-08

📈 Citations: 0

✨ Influential: 0

career value

186K/year

🤖 AI Summary

Current neural decompiler evaluation methods rely on coarse-grained similarity scores and lack instruction-level equivalence verification, hindering precise error localization and interpretability of prediction quality. To address this, we propose CodeAlign—the first instruction-level equivalence alignment framework grounded in formal semantic modeling. It rigorously defines equivalence relations between intra-function code segments and variable names. Leveraging symbolic execution for auxiliary verification, CodeAlign enables fine-grained, interpretable correctness assessment, outputting structured equivalence mappings rather than a single dimensionless score. Experiments demonstrate that CodeAlign significantly improves evaluation accuracy and debuggability, establishing a novel paradigm for trustworthy verification of neural decompilers.

Technology Category

Application Category

📝 Abstract

Neural decompilers are machine learning models that reconstruct the source code from an executable program. Critical to the lifecycle of any machine learning model is an evaluation of its effectiveness. However, existing techniques for evaluating neural decompilation models have substantial weaknesses, especially when it comes to showing the correctness of the neural decompiler's predictions. To address this, we introduce codealign, a novel instruction-level code equivalence technique designed for neural decompilers. We provide a formal definition of a relation between equivalent instructions, which we term an equivalence alignment. We show how codealign generates equivalence alignments, then evaluate codealign by comparing it with symbolic execution. Finally, we show how the information codealign provides-which parts of the functions are equivalent and how well the variable names match-is substantially more detailed than existing state-of-the-art evaluation metrics, which report unitless numbers measuring similarity.

Problem

Research questions and friction points this paper is trying to address.

Neural Decoder Verification

Prediction Accuracy

Instruction-Level Code Comparison

Innovation

Methods, ideas, or system contributions that make the work stand out.

codealign

neural decompiler

instruction-level comparison

🔎 Similar Papers

No similar papers found.