🤖 AI Summary
This work addresses the challenge of evaluating perception performance under the extreme dynamic conditions of high-speed autonomous racing, where large relative velocities and significant domain shifts render existing benchmarks inadequate. The authors propose the first unified LiDAR-based multi-task benchmark tailored to high-speed racing, integrating real-world racecar data, simulator-generated sequences, and urban datasets under standardized evaluation protocols for 3D object detection and trajectory prediction. Their analysis shows that pretraining on urban data improves detection over training from scratch (NDS 0.72 vs. 0.69), while intermediate pretraining on real racecar data achieves the best overall result on the A2RL benchmark (NDS 0.726). Moreover, in trajectory prediction on A2RL, models trained on IndyCar data significantly outperform in-domain training, yielding a notably lower final displacement error (FDE 0.947 vs. 1.250) and establishing a new paradigm for perception in extreme driving scenarios.
📝 Abstract
High-speed autonomous racing presents extreme perception challenges, including large relative velocities and substantial domain shifts from conventional urban-driving datasets. Existing benchmarks do not adequately capture these high-dynamic conditions. We introduce EagleVision, a unified LiDAR-based multi-task benchmark for 3D detection and trajectory prediction in high-speed racing, providing newly annotated 3D bounding boxes for the Indy Autonomous Challenge dataset (14,893 frames) and the A2RL Real competition dataset (1,163 frames), together with 12,000 simulator-generated annotated frames, all standardized under a common evaluation protocol. Using a dataset-centric transfer framework, we quantify cross-domain generalization across urban, simulator, and real racing domains. Urban pretraining improves detection over scratch training (NDS 0.72 vs. 0.69), while intermediate pretraining on real racing data achieves the best transfer to A2RL (NDS 0.726), outperforming simulator-only adaptation. For trajectory prediction, Indy-trained models surpass in-domain A2RL training on A2RL test sequences (FDE 0.947 vs. 1.250), highlighting the role of motion-distribution coverage in cross-domain forecasting. EagleVision enables systematic study of perception generalization under extreme high-speed dynamics. The dataset and benchmark are publicly available at https://avlab.io/EagleVision
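The FDE figures quoted above measure the end-point error of a forecast trajectory. A minimal sketch of the metric, assuming 2D waypoints in meters (the function name and array shapes are illustrative, not the benchmark's actual code):

```python
import numpy as np

def final_displacement_error(pred, gt):
    """Final displacement error (FDE): Euclidean distance between the
    predicted and ground-truth positions at the last forecast step.

    pred, gt: arrays of shape (T, 2) holding x/y waypoints in meters.
    Illustrative sketch; the benchmark's exact protocol may differ.
    """
    return float(np.linalg.norm(np.asarray(pred)[-1] - np.asarray(gt)[-1]))

# Example: the prediction ends 3 m ahead and 4 m to the side of the
# ground-truth end point, so FDE = sqrt(3^2 + 4^2) = 5.0
pred = np.array([[0.0, 0.0], [10.0, 0.0], [23.0, 4.0]])
gt = np.array([[0.0, 0.0], [10.0, 0.0], [20.0, 0.0]])
print(final_displacement_error(pred, gt))  # → 5.0
```

Only the final waypoint enters the score, which is why FDE is sensitive to how well a model covers the motion distribution over the full horizon.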