π€ AI Summary
This work addresses the limitations of existing vision-language-action (VLA) models, which employ fixed intervention coefficients for neural steering at test timeβa form of open-loop control that fails to adapt to dynamic shifts in task states and conceptual errors, often leading to over-correction, oscillation, and reduced success rates. To overcome this, the authors propose CTRL-STEER, a novel framework that introduces closed-loop control into VLA neural intervention without modifying or retraining the underlying model. By decoupling representation and modulation, CTRL-STEER dynamically adjusts intervention strength along residual directions aligned with motion. The framework integrates PID and reinforcement learning controllers to enable temporally adaptive concept steering. Experiments across four LIBERO benchmark suites demonstrate that CTRL-STEER significantly improves the trade-off between steering stability and task success compared to fixed-coefficient baselines.
π Abstract
Vision-Language-Action (VLA) models can be steered at test time by intervening on semantically meaningful internal directions, but existing methods use a fixed steering coefficient, effectively operating in open loop. This is poorly suited to embodied control, where task state and concept error evolve over time, often causing overcorrection, oscillation, and reduced task success, especially for temporal behaviors such as speed and smoothness. We propose CTRL-STEER, a closed-loop framework that replaces static intervention strength with adaptive, time-varying control signals. The key idea is to decouple representation from regulation: rather than assuming temporal concepts are directly controlled by individual neurons, we steer along motion-aligned residual directions while a feedback controller adjusts intervention magnitude online. We instantiate this framework with both PID and reinforcement learning based controllers. Experiments with a fine-tuned OpenVLA policy on four LIBERO task suites show that CTRL-STEER achieves more stable concept regulation and a better steering-task success trade-off than fixed-coefficient baselines, without modifying or retraining the base model.