Transparent Machine Learning: Training and Refining an Explainable Boosting Machine to Identify Overshooting Tops in Satellite Imagery

📅 2025-07-02
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Automated detection of Overshooting Tops (OTs) in satellite imagery lacks interpretability, hindering trust and operational adoption in high-stakes meteorological applications. Method: This paper proposes a domain-informed, interpretable machine learning framework leveraging GOES-16 visible/infrared imagery. It extracts cloud-texture features via Gray-Level Co-occurrence Matrices (GLCMs) and trains an Explainable Boosting Machine (EBM) using Multi-Radar/Multi-Sensor (MRMS) system–derived OT labels. Crucially, it introduces a human-in-the-loop modeling paradigm: meteorological expert knowledge guides EBM feature selection and structural constraints to enforce physical plausibility and transparency. Contribution/Results: The resulting model achieves high detection performance while providing both global and local interpretability—enabling traceable, physics-aligned decision reasoning. It significantly enhances model credibility and offers a reusable methodology and practical exemplar for deploying trustworthy AI in critical weather forecasting and severe storm monitoring.

Technology Category

Application Category

📝 Abstract
An Explainable Boosting Machine (EBM) is an interpretable machine learning (ML) algorithm that has benefits in high risk applications but has not yet found much use in atmospheric science. The overall goal of this work is twofold: (1) explore the use of EBMs, in combination with feature engineering, to obtain interpretable, physics-based machine learning algorithms for meteorological applications; (2) illustrate these methods for the detection of overshooting top (OTs) in satellite imagery. Specifically, we seek to simplify the process of OT detection by first using mathematical methods to extract key features, such as cloud texture using Gray-Level Co-occurrence Matrices, followed by applying an EBM. Our EBM focuses on the classification task of predicting OT regions, utilizing Channel 2 (visible imagery) and Channel 13 (infrared imagery) of the Advanced Baseline Imager sensor of the Geostationary Operational Environmental Satellite 16. Multi-Radar/Multi-Sensor system convection flags are used as labels to train the EBM model. Note, however, that detecting convection, while related, is different from detecting OTs. Once trained, the EBM was examined and minimally altered to more closely match strategies used by domain scientists to identify OTs. The result of our efforts is a fully interpretable ML algorithm that was developed in a human-machine collaboration. While the final model does not reach the accuracy of more complex approaches, it performs well and represents a significant step toward building fully interpretable ML algorithms for this and other meteorological applications.
Problem

Research questions and friction points this paper is trying to address.

Develop interpretable ML for meteorological applications using EBMs
Simplify overshooting top detection via feature engineering and EBMs
Enhance human-machine collaboration in weather satellite image analysis
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses Explainable Boosting Machine for interpretable ML
Combines feature engineering with physics-based algorithms
Simplifies OT detection via human-machine collaboration
🔎 Similar Papers
No similar papers found.
N
Nathan Mitchell
Department of Statistics, Colorado State University, Fort Collins, Colorado
L
Lander Ver Hoef
Cooperative Institute for Research in the Atmosphere, Colorado State University, Fort Collins, Colorado
Imme Ebert-Uphoff
Imme Ebert-Uphoff
Colorado State University
Causalitymachine learningearth sciences.
K
Kristina Moen
Department of Mathematics, Colorado State University, Fort Collins, Colorado
K
Kyle Hilburn
Cooperative Institute for Research in the Atmosphere, Colorado State University, Fort Collins, Colorado
Yoonjin Lee
Yoonjin Lee
Ewha Womans University, Department of Mathematics
Number TheoryCoding Theory
Emily J. King
Emily J. King
Colorado State University
mathematical data scienceframe theoryalgebraic combinatoricssignal & image pr