Category-level Meta-learned NeRF Priors for Efficient Object Mapping

📅 2025-03-03
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address challenges in real-time category-level 3D mapping from single-view RGB inputs—including difficulty integrating category-level priors with object-level NeRFs, inaccurate canonical pose estimation, and low reconstruction efficiency—this paper proposes PRENOM. Methodologically, PRENOM introduces three key innovations: (1) a meta-learning-based category-level NeRF prior enabling cross-instance generalization; (2) a multi-objective genetic algorithm for automatic, category-specific NeRF architecture search; and (3) a prior-driven probabilistic ray sampling strategy to enhance rendering efficiency and geometric fidelity. Evaluated on low-power GPUs, PRENOM achieves a 21% reduction in Chamfer distance, a 13% improvement in overall metrics under realistic noise conditions, and a 5× speedup in training time—significantly outperforming DeepSDF and standard NeRF baselines.

Technology Category

Application Category

📝 Abstract
In 3D object mapping, category-level priors enable efficient object reconstruction and canonical pose estimation, requiring only a single prior per semantic category (e.g., chair, book, laptop). Recently, DeepSDF has predominantly been used as a category-level shape prior, but it struggles to reconstruct sharp geometry and is computationally expensive. In contrast, NeRFs capture fine details but have yet to be effectively integrated with category-level priors in a real-time multi-object mapping framework. To bridge this gap, we introduce PRENOM, a Prior-based Efficient Neural Object Mapper that integrates category-level priors with object-level NeRFs to enhance reconstruction efficiency while enabling canonical object pose estimation. PRENOM gets to know objects on a first-name basis by meta-learning on synthetic reconstruction tasks generated from open-source shape datasets. To account for object category variations, it employs a multi-objective genetic algorithm to optimize the NeRF architecture for each category, balancing reconstruction quality and training time. Additionally, prior-based probabilistic ray sampling directs sampling toward expected object regions, accelerating convergence and improving reconstruction quality under constrained resources. Experimental results on a low-end GPU highlight the ability of PRENOM to achieve high-quality reconstructions while maintaining computational feasibility. Specifically, comparisons with prior-free NeRF-based approaches on a synthetic dataset show a 21% lower Chamfer distance, demonstrating better reconstruction quality. Furthermore, evaluations against other approaches using shape priors on a noisy real-world dataset indicate a 13% improvement averaged across all reconstruction metrics, and comparable pose and size estimation accuracy, while being trained for 5x less time.
Problem

Research questions and friction points this paper is trying to address.

Efficient 3D object reconstruction using category-level priors.
Integration of NeRFs with category-level priors for real-time mapping.
Optimization of NeRF architecture for improved reconstruction quality and speed.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Integrates category-level priors with object-level NeRFs
Uses meta-learning on synthetic reconstruction tasks
Employs multi-objective genetic algorithm for NeRF optimization
🔎 Similar Papers
No similar papers found.
S
Saad Ejaz
Automation and Robotics Research Group, Interdisciplinary Centre for Security, Reliability, and Trust (SnT), University of Luxembourg, Luxembourg
Hriday Bavle
Hriday Bavle
Computer Vision and SLAM Specialist at GAMMA-AR
SLAMSituational AwarenessState EstimationMobile RoboticsAerial Robotics
Laura Ribeiro
Laura Ribeiro
PhD at SnT - University of Luxembourg
Holger Voos
Holger Voos
University of Luxembourg, SnT Automation & Robotics Research Group
Control EngineeringAutomationMobile Robotics
J
J. L. Sánchez-López
Automation and Robotics Research Group, Interdisciplinary Centre for Security, Reliability, and Trust (SnT), University of Luxembourg, Luxembourg