UMDATrack: Unified Multi-Domain Adaptive Tracking Under Adverse Weather Conditions

πŸ“… 2025-07-01
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Visual object tracking suffers significant performance degradation under adverse weather conditions (e.g., nighttime, fog) due to domain shift. To address this, we propose a unified training-free domain adaptation framework. Our method comprises three key components: (1) a controllable scene generator guided by text prompts to synthesize a small set of unlabeled multi-weather videos, alleviating real-data scarcity; (2) a lightweight Domain-Customized Adapter (DCA) enabling plug-and-play, rapid domain transfer without modifying the backbone; and (3) a Target-aware Confidence Alignment (TCA) module leveraging optimal transport theory to enhance cross-domain localization consistency. Crucially, our approach requires no fine-tuning of the backbone network or retraining. Evaluated on multiple adverse-weather tracking benchmarks, it substantially outperforms state-of-the-art methods, establishing new performance benchmarks and demonstrating strong generalization capability and engineering practicality.

Technology Category

Application Category

πŸ“ Abstract
Visual object tracking has gained promising progress in past decades. Most of the existing approaches focus on learning target representation in well-conditioned daytime data, while for the unconstrained real-world scenarios with adverse weather conditions, e.g. nighttime or foggy environment, the tremendous domain shift leads to significant performance degradation. In this paper, we propose UMDATrack, which is capable of maintaining high-quality target state prediction under various adverse weather conditions within a unified domain adaptation framework. Specifically, we first use a controllable scenario generator to synthesize a small amount of unlabeled videos (less than 2% frames in source daytime datasets) in multiple weather conditions under the guidance of different text prompts. Afterwards, we design a simple yet effective domain-customized adapter (DCA), allowing the target objects' representation to rapidly adapt to various weather conditions without redundant model updating. Furthermore, to enhance the localization consistency between source and target domains, we propose a target-aware confidence alignment module (TCA) following optimal transport theorem. Extensive experiments demonstrate that UMDATrack can surpass existing advanced visual trackers and lead new state-of-the-art performance by a significant margin. Our code is available at https://github.com/Z-Z188/UMDATrack.
Problem

Research questions and friction points this paper is trying to address.

Adapting visual tracking to adverse weather conditions
Reducing domain shift in multi-weather scenarios
Maintaining target prediction accuracy inζΆεŠ£ε€©ζ°”
Innovation

Methods, ideas, or system contributions that make the work stand out.

Controllable scenario generator synthesizes adverse weather videos
Domain-customized adapter enables rapid weather adaptation
Target-aware confidence alignment enhances localization consistency
πŸ”Ž Similar Papers
No similar papers found.
Siyuan Yao
Siyuan Yao
University of Notre Dame
VisualizationComputer GraphicsComputer Vision
R
Rui Zhu
Beijing University of Posts and Telecommunications
Z
Ziqi Wang
Beijing University of Posts and Telecommunications
W
Wenqi Ren
Sun Yat-sen University
Y
Yanyang Yan
University of Chinese Academy of Sciences
Xiaochun Cao
Xiaochun Cao
Sun Yat-sen University
Computer VisionArtificial IntelligenceMultimediaMachine Learning