Hierarchical Transformer for Electrocardiogram Diagnosis

📅 2024-11-01
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Lightweight and interpretable Transformer models for multi-lead ECG diagnosis remain scarce. Method: This paper proposes a hierarchical Transformer architecture with adaptive embedding scaling. It eliminates manual downsampling and complex attention designs by partitioning the embedding space along the temporal dimension for multi-stage processing; introduces cross-scale classification tokens for inter-stage information aggregation; and integrates a deep convolutional encoder (6 layers) with an attention gating mechanism to explicitly model inter-lead dependencies and enhance feature interpretability. Contribution/Results: The model supports dynamic input lengths and configurable embeddings, achieving significant improvements in diagnostic accuracy and clinical trustworthiness while maintaining lightweight design—establishing a novel paradigm for interpretable AI-assisted ECG analysis.

Technology Category

Application Category

📝 Abstract
Transformers, originally prominent in NLP and computer vision, are now being adapted for ECG signal analysis. This paper introduces a novel hierarchical transformer architecture that segments the model into multiple stages by assessing the spatial size of the embeddings, thus eliminating the need for additional downsampling strategies or complex attention designs. A classification token aggregates information across feature scales, facilitating interactions between different stages of the transformer. By utilizing depth-wise convolutions in a six-layer convolutional encoder, our approach preserves the relationships between different ECG leads. Moreover, an attention gate mechanism learns associations among the leads prior to classification. This model adapts flexibly to various embedding networks and input sizes while enhancing the interpretability of transformers in ECG signal analysis.
Problem

Research questions and friction points this paper is trying to address.

Develops a hierarchical Transformer for ECG diagnosis
Integrates multi-scale features and inter-lead relationships
Enhances interpretability with lightweight flexible design
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hierarchical Transformer with depth-wise convolutions
Multi-scale feature aggregation via CLS token
Attention-gated module for inter-lead relationships
🔎 Similar Papers
No similar papers found.
X
Xiaoya Tang
Scientific Computing and Imaging Institute, University of Utah, SLC, UT, USA
J
Jake Berquist
Scientific Computing and Imaging Institute, University of Utah, SLC, UT, USA; Department of Biomedical Engineering, University of Utah, SLC, UT, USA; Nora Eccles Harrison Cardiovascular Research and Training Institute, University of Utah, SLC, UT, USA
B
Benjamin A. Steinberg
University of Colorado Anschutz Medical Campus, Denver, CO, USA
Tolga Tasdizen
Tolga Tasdizen
University of Utah
Biomedical Image AnalysisMachine LearningImage ProcessingComputer Vision