Fast, Fine-Grained Equivalence Checking for Neural Decompilers

๐Ÿ“… 2025-01-08
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Current neural decompiler evaluation methods rely on coarse-grained similarity scores and lack instruction-level equivalence verification, hindering precise error localization and interpretability of prediction quality. To address this, we propose CodeAlignโ€”the first instruction-level equivalence alignment framework grounded in formal semantic modeling. It rigorously defines equivalence relations between intra-function code segments and variable names. Leveraging symbolic execution for auxiliary verification, CodeAlign enables fine-grained, interpretable correctness assessment, outputting structured equivalence mappings rather than a single dimensionless score. Experiments demonstrate that CodeAlign significantly improves evaluation accuracy and debuggability, establishing a novel paradigm for trustworthy verification of neural decompilers.

Technology Category

Application Category

๐Ÿ“ Abstract
Neural decompilers are machine learning models that reconstruct the source code from an executable program. Critical to the lifecycle of any machine learning model is an evaluation of its effectiveness. However, existing techniques for evaluating neural decompilation models have substantial weaknesses, especially when it comes to showing the correctness of the neural decompiler's predictions. To address this, we introduce codealign, a novel instruction-level code equivalence technique designed for neural decompilers. We provide a formal definition of a relation between equivalent instructions, which we term an equivalence alignment. We show how codealign generates equivalence alignments, then evaluate codealign by comparing it with symbolic execution. Finally, we show how the information codealign provides-which parts of the functions are equivalent and how well the variable names match-is substantially more detailed than existing state-of-the-art evaluation metrics, which report unitless numbers measuring similarity.
Problem

Research questions and friction points this paper is trying to address.

Neural Decoder Verification
Prediction Accuracy
Instruction-Level Code Comparison
Innovation

Methods, ideas, or system contributions that make the work stand out.

codealign
neural decompiler
instruction-level comparison
๐Ÿ”Ž Similar Papers
No similar papers found.