🤖 AI Summary
Lightweight, interpretable Transformer models for multi-lead ECG diagnosis remain scarce. Method: This paper proposes a hierarchical Transformer architecture with adaptive embedding scaling. It eliminates manual downsampling and complex attention designs by partitioning the embedding space along the temporal dimension for multi-stage processing; introduces cross-scale classification tokens to aggregate information between stages; and integrates a six-layer depth-wise convolutional encoder with an attention-gating mechanism to explicitly model inter-lead dependencies and improve feature interpretability. Contribution/Results: The model supports dynamic input lengths and configurable embedding networks, achieving notable improvements in diagnostic accuracy and clinical trustworthiness while remaining lightweight, establishing a novel paradigm for interpretable AI-assisted ECG analysis.
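The multi-stage processing described above can be illustrated with a toy sketch. This is an illustrative assumption, not the authors' code: the embedded sequence is split along the temporal dimension into contiguous stages, and a shared classification (CLS) token is updated after each stage so that later stages see a summary of earlier ones. Simple mean pooling stands in for the real model's self-attention; the names `hierarchical_stages` and `mean_vec` are hypothetical.

```python
# Toy sketch (assumption, not the paper's implementation) of partitioning an
# embedded ECG sequence along the temporal dimension into stages, with a
# shared CLS token carrying information across stages.

def mean_vec(vectors):
    """Element-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(len(vectors[0]))]

def hierarchical_stages(tokens, n_stages):
    """Split `tokens` (a list of T embedding vectors) into `n_stages`
    contiguous temporal chunks; the CLS token is refreshed after each
    stage so later stages receive a summary of earlier ones."""
    dim = len(tokens[0])
    cls = [0.0] * dim                        # cross-scale classification token
    stage_len = len(tokens) // n_stages
    for s in range(n_stages):
        chunk = tokens[s * stage_len:(s + 1) * stage_len]
        # Stand-in for one transformer stage: pool the chunk together with
        # the incoming CLS token (the real model uses self-attention here).
        cls = mean_vec(chunk + [cls])
    return cls                               # fed to the classifier head

# Example: 8 time steps, 4-dim embeddings, 2 stages.
seq = [[float(t)] * 4 for t in range(8)]
print(hierarchical_stages(seq, 2))
```

Because the stages operate on successively summarized chunks rather than the full sequence at once, no separate downsampling step is needed between scales.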
📝 Abstract
Transformers, originally prominent in NLP and computer vision, are now being adapted for ECG signal analysis. This paper introduces a novel hierarchical transformer architecture that partitions the model into multiple stages according to the spatial size of the embeddings, eliminating the need for additional downsampling strategies or complex attention designs. A classification token aggregates information across feature scales, enabling interaction between the transformer's stages. A six-layer convolutional encoder built on depth-wise convolutions preserves the relationships between ECG leads, and an attention-gate mechanism learns associations among the leads prior to classification. The model adapts flexibly to various embedding networks and input sizes while improving the interpretability of transformers in ECG signal analysis.
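The lead-wise attention gate mentioned in the abstract can be sketched as follows. This is a minimal illustration under assumed details (the function `attention_gate`, the scoring weights, and the lead features are all hypothetical, not the paper's implementation): each lead's pooled feature is scored, squashed through a sigmoid, and used to re-weight that lead before classification, so the gate values double as a per-lead importance readout.

```python
# Minimal sketch (assumption, not the paper's code) of an attention gate
# over ECG leads: each lead's pooled feature vector is scored, passed
# through a sigmoid, and used to re-weight that lead before classification.
import math

def attention_gate(lead_features, weights):
    """lead_features: dict mapping lead name -> pooled feature vector.
    weights: per-dimension scoring weights (learned in the real model)."""
    gated, gates = {}, {}
    for lead, feat in lead_features.items():
        score = sum(w * f for w, f in zip(weights, feat))
        g = 1.0 / (1.0 + math.exp(-score))    # sigmoid gate in (0, 1)
        gates[lead] = g
        gated[lead] = [g * f for f in feat]   # down-weight uninformative leads
    return gated, gates

feats = {"I": [0.5, 1.0], "V2": [2.0, -0.5]}
gated, gates = attention_gate(feats, [1.0, 0.5])
print(gates)  # inspecting the gates shows which leads the model relies on
```

Because the gates are scalars in (0, 1) attached to named leads, they give clinicians a direct, inspectable signal of which leads drove a prediction, which is the interpretability benefit the abstract refers to.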