Spatially-Aware Evaluation Framework for Aerial LiDAR Point Cloud Semantic Segmentation: Distance-Based Metrics on Challenging Regions

📅 2026-03-23

📈 Citations: 0

✨ Influential: 0

career value

214K/year

🤖 AI Summary

This work addresses the limitations of conventional evaluation metrics—such as mean Intersection over Union (mIoU) and Overall Accuracy (OA)—in airborne LiDAR-based 3D point cloud semantic segmentation, which are often dominated by easily classified points and fail to account for spatial contextual errors, thereby obscuring model performance in challenging regions. To overcome this, the authors propose a novel evaluation framework that emphasizes geometric error patterns in spatially sensitive tasks by integrating distance-weighted error metrics and a hard subset derived from points consistently misclassified across multiple models. Experiments on three real-world airborne LiDAR datasets demonstrate that the proposed approach effectively uncovers performance discrepancies masked by traditional metrics, offering a more reliable basis for model selection in geospatial applications such as digital terrain modeling.

Technology Category

Application Category

📝 Abstract

Semantic segmentation metrics for 3D point clouds, such as mean Intersection over Union (mIoU) and Overall Accuracy (OA), present two key limitations in the context of aerial LiDAR data. First, they treat all misclassifications equally regardless of their spatial context, overlooking cases where the geometric severity of errors directly impacts the quality of derived geospatial products such as Digital Terrain Models. Second, they are often dominated by the large proportion of easily classified points, which can mask meaningful differences between models and under-represent performance in challenging regions. To address these limitations, we propose a novel evaluation framework for comparing semantic segmentation models through two complementary approaches. First, we introduce distance-based metrics that account for the spatial deviation between each misclassified point and the nearest ground-truth point of the predicted class, capturing the geometric severity of errors. Second, we propose a focused evaluation on a common subset of hard points, defined as the points misclassified by at least one of the evaluated models, thereby reducing the bias introduced by easily classified points and better revealing differences in model performance in challenging regions. We validate our framework by comparing three state-of-the-art deep learning models on three aerial LiDAR datasets. Results demonstrate that the proposed metrics provide complementary information to traditional measures, revealing spatial error patterns that are critical for Earth Observation applications but invisible to conventional evaluation approaches. The proposed framework enables more informed model selection for scenarios where spatial consistency is critical.

Problem

Research questions and friction points this paper is trying to address.

semantic segmentation

aerial LiDAR

evaluation metrics

spatial awareness

challenging regions

Innovation

Methods, ideas, or system contributions that make the work stand out.

distance-based metrics

challenging regions

spatially-aware evaluation