The 1st Solution for MOSEv2 Challenge 2025: Long-term and Concept-aware Video Segmentation via SeC

📅 2025-09-23

📈 Citations: 0

✨ Influential: 0

career value

214K/year

🤖 AI Summary

To address temporal discontinuities caused by long-term occlusion and target reappearance, as well as distractor interference in complex semi-supervised video object segmentation, this paper proposes the SeC framework. SeC builds upon SAM-2 to establish a synergistic mechanism integrating long-term memory and concept-aware reasoning, explicitly modeling cross-frame long-range dependencies while incorporating semantic priors. This enables robust re-identification of occluded targets and effective suppression of distractors. By preserving efficient dynamic tracking, SeC simultaneously enhances semantic consistency across frames, significantly improving segmentation robustness in challenging scenarios. Evaluated on the MOSEv2 Challenge test set, SeC achieves a J&F score of 39.89%, ranking first—demonstrating the effectiveness and state-of-the-art capability of its temporal modeling and semantic guidance mechanisms.

Technology Category

Application Category

📝 Abstract

This technical report explores the MOSEv2 track of the LSVOS Challenge, which targets complex semi-supervised video object segmentation. By analysing and adapting SeC, an enhanced SAM-2 framework, we conduct a detailed study of its long-term memory and concept-aware memory, showing that long-term memory preserves temporal continuity under occlusion and reappearance, while concept-aware memory supplies semantic priors that suppress distractors; together, these traits directly benefit several MOSEv2's core challenges. Our solution achieves a JF score of 39.89% on the test set, ranking 1st in the MOSEv2 track of the LSVOS Challenge.

Problem

Research questions and friction points this paper is trying to address.

Addressing long-term video object segmentation under occlusion

Enhancing semantic prior integration to suppress distractors

Improving temporal continuity in semi-supervised video segmentation

Innovation

Methods, ideas, or system contributions that make the work stand out.

Enhanced SAM-2 framework with SeC adaptation

Long-term memory for temporal continuity preservation

Concept-aware memory for semantic prior integration

🔎 Similar Papers

No similar papers found.