🤖 AI Summary
Automated detection of linear/exponential chirp patterns—key dynamic biomarkers in epileptic EEG spectrograms—remains underaddressed. Method: We introduce the first large-scale synthetic chirp spectrogram benchmark (100,000 samples) and propose a Vision Transformer (ViT)-based regression framework. Innovatively adapting Low-Rank Adaptation (LoRA) to ViT for regression, our method enables end-to-end prediction of three chirp parameters: onset time, onset frequency, and offset frequency. Realistic EEG characteristics are modeled via Gaussian noise injection and spectral smoothing; training employs MSE loss, AdamW optimization, and learning rate scheduling. Results: The model achieves a Pearson correlation coefficient of 0.9841 for onset time prediction, stable inference latency (137–140 s), unbiased error distribution, and significantly improved chirp localization accuracy and generalization—especially under low-data regimes.
📝 Abstract
Spectrograms are pivotal in time-frequency signal analysis, widely used in audio processing and computational neuroscience. Chirp-like patterns in electroencephalogram (EEG) spectrograms (marked by linear or exponential frequency sweep) are key biomarkers for seizure dynamics, but automated tools for their detection, localization, and feature extraction are lacking. This study bridges this gap by fine-tuning a Vision Transformer (ViT) model on synthetic spectrograms, augmented with Low-Rank Adaptation (LoRA) to boost adaptability. We generated 100000 synthetic spectrograms with chirp parameters, creating the first large-scale benchmark for chirp localization. These spectrograms mimic neural chirps using linear or exponential frequency sweep, Gaussian noise, and smoothing. A ViT model, adapted for regression, predicted chirp parameters. LoRA fine-tuned the attention layers, enabling efficient updates to the pre-trained backbone. Training used MSE loss and the AdamW optimizer, with a learning rate scheduler and early stopping to curb overfitting. Only three features were targeted: Chirp Start Time (Onset Time), Chirp Start Frequency (Onset Frequency), and Chirp End Frequency (Offset Frequency). Performance was evaluated via Pearson correlation between predicted and actual labels. Results showed strong alignment: 0.9841 correlation for chirp start time, with stable inference times (137 to 140s) and minimal bias in error distributions. This approach offers a tool for chirp analysis in EEG time-frequency representation, filling a critical methodological void.