🤖 AI Summary
To address the challenge of accurately distinguishing ground from non-ground objects in large-scale outdoor point clouds, this paper proposes an elevation-guided semantic segmentation method. It computes non-local relative elevation features derived from a Digital Terrain Model (DTM) to explicitly model long-range terrain contextual dependencies—a cue systematically validated here for the first time as critical for urban point cloud segmentation and demonstrably superior to conventional local geometric features. The method integrates multi-source features—including relative elevation, planarity, surface normals, and 2D projection—and embeds them into the RandLA-Net architecture. Evaluated on three heterogeneous outdoor datasets, it achieves consistent improvements: notably, an average F1-score of 76.01% (+3.7 percentage points) on the Hessigheim dataset. These results confirm both the effectiveness and generalizability of the proposed approach.
📝 Abstract
This paper presents an analysis of utilizing elevation data to aid outdoor point cloud semantic segmentation through existing machine-learning networks in remote sensing, specifically in urban, built-up areas. In dense outdoor point clouds, the receptive field of a machine learning model may be too small to accurately determine the surroundings and context of a point. By computing Digital Terrain Models (DTMs) from the point clouds, we extract the relative elevation feature, which is the vertical distance from the terrain to a point. RandLA-Net is employed for efficient semantic segmentation of large-scale point clouds. We assess its performance across three diverse outdoor datasets captured with varying sensor technologies and sensor locations. Integration of relative elevation data leads to consistent performance improvements across all three datasets, most notably in the Hessigheim dataset, with an increase of 3.7 percentage points in average F1 score from 72.35% to 76.01%, by establishing long-range dependencies between ground and objects. We also explore additional local features such as planarity, normal vectors, and 2D features, but their efficacy varied based on the characteristics of the point cloud. Ultimately, this study underscores the important role of the non-local relative elevation feature for semantic segmentation of point clouds in remote sensing applications.