NEMO: Execution-Aware Optimization Modeling via Autonomous Coding Agents

📅 2026-01-29
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of automatically translating natural language descriptions of decision problems into executable optimization models by proposing an execution-aware modeling framework based on Autonomous Coding Agents (ACA). The approach ensures code executability through a sandboxed environment and introduces novel coordination mechanisms—including asymmetric verification loops, external memory reuse, minimum Bayes risk decoding, and self-consistency—to significantly enhance modeling robustness and accuracy. The system supports both interactive and fully autonomous operation, achieving state-of-the-art performance across nine standard optimization benchmarks and substantially outperforming existing methods on multiple datasets. These results validate the effectiveness of an architecture that treats ACA as a first-class abstraction for automated optimization modeling.

Technology Category

Application Category

📝 Abstract
In this paper, we present NEMO, a system that translates Natural-language descriptions of decision problems into formal Executable Mathematical Optimization implementations, operating collaboratively with users or autonomously. Existing approaches typically rely on specialized large language models (LLMs) or bespoke, task-specific agents. Such methods are often brittle, complex and frequently generating syntactically invalid or non-executable code. NEMO instead centers on remote interaction with autonomous coding agents (ACAs), treated as a first-class abstraction analogous to API-based interaction with LLMs. This design enables the construction of higher-level systems around ACAs that structure, consolidate, and iteratively refine task specifications. Because ACAs execute within sandboxed environments, code produced by NEMO is executable by construction, allowing automated validation and repair. Building on this, we introduce novel coordination patterns with and across ACAs, including asymmetric validation loops between independently generated optimizer and simulator implementations (serving as a high-level validation mechanism), external memory for experience reuse, and robustness enhancements via minimum Bayes risk (MBR) decoding and self-consistency. We evaluate NEMO on nine established optimization benchmarks. As depicted in Figure 1, it achieves state-of-the-art performance on the majority of tasks, with substantial margins on several datasets, demonstrating the power of execution-aware agentic architectures for automated optimization modeling.
Problem

Research questions and friction points this paper is trying to address.

Natural-language to optimization
Executable code generation
Mathematical optimization
Autonomous coding agents
Decision problem modeling
Innovation

Methods, ideas, or system contributions that make the work stand out.

Autonomous Coding Agents
Executable Optimization
Execution-Aware Modeling
Asymmetric Validation
Minimum Bayes Risk Decoding
Yang Song
Yang Song
C3 AI
signal processingmachine learning
A
Anoushka Vyas
C3 AI, 1400 Seaport Blvd, Redwood City, CA 94063
Z
Zirui Wei
C3 AI, 1400 Seaport Blvd, Redwood City, CA 94063
S
Sina Pakazad
C3 AI, 1400 Seaport Blvd, Redwood City, CA 94063
H
Henrik Ohlsson
C3 AI, 1400 Seaport Blvd, Redwood City, CA 94063
Graham Neubig
Graham Neubig
Carnegie Mellon University, All Hands AI
Natural Language ProcessingMachine LearningArtificial Intelligence