FTIO: Frequent Temporally Integrated Objects

📅 2025-08-22
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Addressing key challenges in unsupervised video object segmentation (UVOS)—including difficulty in discovering the initial salient object, frequent omission of small objects, and temporal inconsistency induced by motion deformation—this paper proposes a two-stage post-processing framework. Methodologically, the first stage introduces a frequency-based criterion to refine salient object selection, mitigating uncertainty in initial proposals. The second stage employs a three-phase temporal mask completion mechanism that jointly integrates saliency detection, temporal ensemble, and region completion to explicitly enforce inter-frame consistency. Our core contribution lies in unifying frequency-aware modeling with structured temporal completion, substantially enhancing segmentation robustness under complex motion and for small objects. Evaluated on mainstream UVOS benchmarks, the method achieves state-of-the-art performance, outperforming existing approaches in both segmentation accuracy and temporal stability.

Technology Category

Application Category

📝 Abstract
Predicting and tracking objects in real-world scenarios is a critical challenge in Video Object Segmentation (VOS) tasks. Unsupervised VOS (UVOS) has the additional challenge of finding an initial segmentation of salient objects, which affects the entire process and keeps a permanent uncertainty about the object proposals. Moreover, deformation and fast motion can lead to temporal inconsistencies. To address these problems, we propose Frequent Temporally Integrated Objects (FTIO), a post-processing framework with two key components. First, we introduce a combined criterion to improve object selection, mitigating failures common in UVOS--particularly when objects are small or structurally complex--by extracting frequently appearing salient objects. Second, we present a three-stage method to correct temporal inconsistencies by integrating missing object mask regions. Experimental results demonstrate that FTIO achieves state-of-the-art performance in multi-object UVOS. Code is available at: https://github.com/MohammadMohammadzadehKalati/FTIO
Problem

Research questions and friction points this paper is trying to address.

Improving object selection in unsupervised video segmentation
Correcting temporal inconsistencies from deformation and motion
Enhancing multi-object tracking accuracy in UVOS tasks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Combined criterion for object selection
Three-stage temporal inconsistency correction
Extracts frequently appearing salient objects
🔎 Similar Papers
No similar papers found.
M
Mohammad Mohammadzadeh Kalati
Department of Computer Science, University of Saskatchewan
Farhad Maleki
Farhad Maleki
Assistant Professor, University of Calgary
AIMachine LearningDeep LearningData ScienceBioinformatics
I
Ian McQuillan
Department of Computer Science, University of Saskatchewan