RelMap: Enhancing Online Map Construction with Class-Aware Spatial Relation and Semantic Priors

📅 2025-07-29
📈 Citations: 0
Influential: 0
🤖 AI Summary
Existing Transformer-based methods for online high-definition (HD) map construction often overlook the intrinsic spatial and semantic relationships among map elements, which limits their accuracy and generalization. This paper proposes RelMap, the first approach to introduce a class-aware spatial relation encoder that explicitly models geometric constraints between elements of different categories. It also introduces a semantic-aware Mixture-of-Experts (MoE) decoder that enables fine-grained, class-adaptive feature decoding. RelMap supports both single-frame and multi-frame temporal inputs and integrates with mainstream Transformer backbones. Evaluated on nuScenes and Argoverse 2, it achieves state-of-the-art performance, improving detection accuracy and the robustness of topological reconstruction for HD map elements.

📝 Abstract
Online high-definition (HD) map construction plays an increasingly important role in scaling autonomous driving systems. Transformer-based methods have become prevalent in online HD map construction; however, existing approaches often neglect the inherent spatial and semantic relationships among map elements, which limits their accuracy and generalization. To address this, we propose RelMap, an end-to-end framework that enhances online map construction by incorporating spatial relations and semantic priors. We introduce a Class-aware Spatial Relation Prior, which explicitly encodes relative positional dependencies between map elements using a learnable class-aware relation encoder. Additionally, we propose a Mixture-of-Experts (MoE)-based Semantic Prior, which routes features to class-specific experts based on predicted class probabilities, refining instance feature decoding. Our method is compatible with both single-frame and temporal perception backbones, achieving state-of-the-art performance on both the nuScenes and Argoverse 2 datasets.
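The routing idea behind the MoE-based Semantic Prior can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes soft routing (every expert contributes, weighted by the softmax of the class logits), uses random matrices as stand-ins for trained experts, and the class names, feature size, and function names are all hypothetical.

```python
import math
import random

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def make_linear(dim, rng):
    """Random dim x dim weight matrix; a stand-in for a trained expert."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(dim)] for _ in range(dim)]

def apply_linear(weight, feature):
    """Matrix-vector product using plain lists."""
    return [sum(w * x for w, x in zip(row, feature)) for row in weight]

def moe_refine(feature, class_logits, experts):
    """Blend per-class expert outputs, weighted by predicted class probabilities.

    Assumption: soft routing. The paper may route differently (e.g. top-1).
    """
    probs = softmax(class_logits)
    outputs = [apply_linear(w, feature) for w in experts]
    refined = [
        sum(p * out[i] for p, out in zip(probs, outputs))
        for i in range(len(feature))
    ]
    return refined, probs

rng = random.Random(0)
CLASSES = ["divider", "ped_crossing", "boundary"]  # hypothetical label set
experts = [make_linear(4, rng) for _ in CLASSES]   # one expert per class
feature = [0.5, -1.0, 0.25, 2.0]                   # toy instance feature
refined, probs = moe_refine(feature, [2.0, 0.1, -1.0], experts)
```

When the class prediction is confident, the blend collapses to the output of a single class-specific expert, which is the behavior the abstract describes as routing features to class-specific experts.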
Problem

Research questions and friction points this paper is trying to address.

Enhances online HD map construction accuracy
Incorporates spatial relations among map elements
Improves semantic understanding with class-specific features
Innovation

Methods, ideas, or system contributions that make the work stand out.

Class-aware relation encoder for spatial dependencies
MoE-based Semantic Prior for feature routing
Compatible with single-frame and temporal backbones
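To make the notion of a class-aware spatial relation concrete, the toy sketch below builds a pairwise bias matrix from element positions and class labels. It is only an illustration under stated assumptions: the paper uses a learnable relation encoder, whereas here a fixed class-pair table and a Euclidean distance penalty stand in for it. The function name, the `scale` parameter, and the example values are hypothetical.

```python
import math

def relation_bias(centers, classes, class_pair_bias, scale=1.0):
    """Return an N x N matrix of additive relation biases.

    bias[i][j] = class_pair_bias[c_i][c_j] - scale * distance(i, j)

    Assumption: a hand-written stand-in for a learned class-aware encoder.
    """
    n = len(centers)
    bias = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            dx = centers[j][0] - centers[i][0]
            dy = centers[j][1] - centers[i][1]
            dist = math.hypot(dx, dy)
            bias[i][j] = class_pair_bias[classes[i]][classes[j]] - scale * dist
    return bias

# Three toy map elements: (x, y) centers in meters and integer class ids.
centers = [(0.0, 0.0), (3.0, 4.0), (0.0, 1.0)]
classes = [0, 1, 0]
# Hypothetical class-pair table; in a trained model these would be learned.
class_pair_bias = [[0.5, -0.2],
                   [-0.2, 0.3]]
bias = relation_bias(centers, classes, class_pair_bias)
```

A matrix like this could be added to attention logits between element queries, so that each pair's interaction depends on both the relative position and the two categories involved.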
Tianhui Cai
University of California, Los Angeles
Yun Zhang
University of California, Los Angeles
Zewei Zhou
University of California, Los Angeles
Deep learning, Computer Vision, Autonomous Driving, Robotics
Zhiyu Huang
Postdoctoral Scholar, University of California, Los Angeles
Machine Learning, Autonomous Driving, Robotics, Embodied AI
Jiaqi Ma
University of California, Los Angeles