Technical Challenges in Maintaining Tax Prep Software with Large Language Models

📅 2025-04-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Frequent tax code revisions impose high maintenance costs and error-prone updates on tax compliance software. Method: This paper proposes an LLM-based automated code diff generation and secure integration framework. It employs rule-guided prompt engineering and structured parsing of IRS official publications to systematically model and address six domain-specific challenges in tax-law-to-code translation—including legal ambiguity, implicit logical dependencies, and cross-year provision coupling. Contribution/Results: We introduce the first LLM-driven code diff paradigm tailored for tax software. Evaluated on ChatGPT and Llama, our approach demonstrates feasibility in accurately parsing statutory text and generating executable, semantically faithful code changes. Experiments show significant improvements in maintenance efficiency and correctness, establishing a reusable technical foundation for sustainable evolution of tax compliance systems.

Technology Category

Application Category

📝 Abstract
As the US tax law evolves to adapt to ever-changing politico-economic realities, tax preparation software plays a significant role in helping taxpayers navigate these complexities. The dynamic nature of tax regulations poses a significant challenge to accurately and timely maintaining tax software artifacts. The state-of-the-art in maintaining tax prep software is time-consuming and error-prone as it involves manual code analysis combined with an expert interpretation of tax law amendments. We posit that the rigor and formality of tax amendment language, as expressed in IRS publications, makes it amenable to automatic translation to executable specifications (code). Our research efforts focus on identifying, understanding, and tackling technical challenges in leveraging Large Language Models (LLMs), such as ChatGPT and Llama, to faithfully extract code differentials from IRS publications and automatically integrate them with the prior version of the code to automate tax prep software maintenance.
Problem

Research questions and friction points this paper is trying to address.

Automating tax software updates from IRS publications using LLMs
Reducing manual errors in tax law amendment interpretations
Translating tax regulation changes to executable code efficiently
Innovation

Methods, ideas, or system contributions that make the work stand out.

Automate tax code updates using Large Language Models
Translate IRS publications to executable specifications
Integrate code differentials automatically for maintenance
🔎 Similar Papers
No similar papers found.