TranStable: Towards Robust Pixel-level Online Video Stabilization by Jointing Transformer and CNN

📅 2025-01-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address severe geometric distortion and excessive cropping in video stabilization, this paper proposes TranStable, an end-to-end framework. Methodologically, it introduces a novel TransformerUNet (TUNet) generator and a Stability Discrimination Module (SDM), integrated via a Hierarchical Adaptive Fusion Mechanism (HAFM) to jointly model global motion consistency and preserve local structural fidelity. Furthermore, it employs pixel-wise warping map regression coupled with adversarial training under realism-aware supervision. Evaluated on NUS, DeepStab, and Selfie benchmarks, TranStable achieves state-of-the-art performance: it significantly suppresses jitter artifacts, enhances field-of-view retention and visual fidelity, and effectively alleviates the distortion–cropping trade-off inherent in conventional approaches.

Technology Category

Application Category

📝 Abstract
Video stabilization often struggles with distortion and excessive cropping. This paper proposes a novel end-to-end framework, named TranStable, to address these challenges, comprising a genera tor and a discriminator. We establish TransformerUNet (TUNet) as the generator to utilize the Hierarchical Adaptive Fusion Module (HAFM), integrating Transformer and CNN to leverage both global and local features across multiple visual cues. By modeling frame-wise relationships, it generates robust pixel-level warping maps for stable geometric transformations. Furthermore, we design the Stability Discriminator Module (SDM), which provides pixel-wise supervision for authenticity and consistency in training period, ensuring more complete field-of-view while minimizing jitter artifacts and enhancing visual fidelity. Extensive experiments on NUS, DeepStab, and Selfie benchmarks demonstrate state-of-the-art performance.
Problem

Research questions and friction points this paper is trying to address.

Video Stabilization
Distortion
Cropping
Innovation

Methods, ideas, or system contributions that make the work stand out.

TransformerUNet
Stability Inspection Module
Video Stabilization
🔎 Similar Papers
No similar papers found.
Zhizhen Li
Zhizhen Li
NCSU CS student
Y
Yifei Cao
J
Jizhe Yu
Y
Yu Liu