Empowering Sparse-Input Neural Radiance Fields with Dual-Level Semantic Guidance from Dense Novel Views

📅 2025-03-04
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the severe degradation in NeRF reconstruction quality under sparse input conditions, this paper proposes a two-level semantic guidance mechanism: (1) a bidirectional semantic verification module at the supervision level to ensure cross-view semantic consistency, and (2) a learnable semantic codebook with attention-driven semantic feature querying at the feature level to enhance geometric and textural representation. Departing from conventional RGB supervision, the method employs semantically rendered outputs as robust intermediate supervisory signals, integrated within a self-optimizing training pipeline. We introduce the first challenging indoor sparse reconstruction benchmark with only six input views. Experiments demonstrate that our approach significantly outperforms state-of-the-art methods under extreme sparsity—achieving a 23.6% improvement in geometric consistency and a 19.4% gain in texture fidelity—while exhibiting superior generalization over RGB-augmentation baselines. This work establishes a novel paradigm for high-fidelity NeRF reconstruction in low-budget scenarios.

Technology Category

Application Category

📝 Abstract
Neural Radiance Fields (NeRF) have shown remarkable capabilities for photorealistic novel view synthesis. One major deficiency of NeRF is that dense inputs are typically required, and the rendering quality will drop drastically given sparse inputs. In this paper, we highlight the effectiveness of rendered semantics from dense novel views, and show that rendered semantics can be treated as a more robust form of augmented data than rendered RGB. Our method enhances NeRF's performance by incorporating guidance derived from the rendered semantics. The rendered semantic guidance encompasses two levels: the supervision level and the feature level. The supervision-level guidance incorporates a bi-directional verification module that decides the validity of each rendered semantic label, while the feature-level guidance integrates a learnable codebook that encodes semantic-aware information, which is queried by each point via the attention mechanism to obtain semantic-relevant predictions. The overall semantic guidance is embedded into a self-improved pipeline. We also introduce a more challenging sparse-input indoor benchmark, where the number of inputs is limited to as few as 6. Experiments demonstrate the effectiveness of our method and it exhibits superior performance compared to existing approaches.
Problem

Research questions and friction points this paper is trying to address.

Enhance NeRF performance with sparse inputs
Use semantic guidance from dense novel views
Introduce a challenging sparse-input indoor benchmark
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dual-level semantic guidance enhances NeRF
Bi-directional verification for semantic label validity
Learnable codebook encodes semantic-aware information
🔎 Similar Papers
No similar papers found.