VoxKnesset: A Large-Scale Longitudinal Hebrew Speech Dataset for Aging Speaker Modeling

๐Ÿ“… 2026-03-01
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This work addresses the performance degradation of speaker verification systems due to age-related vocal changes, a challenge exacerbated by the absence of large-scale longitudinal speech datasets. To bridge this gap, the authors introduce VoxKnessetโ€”the first large-scale Hebrew longitudinal speech corpus, comprising approximately 2,300 hours of parliamentary speeches from 393 speakers recorded over 15 years, accompanied by aligned transcripts and official demographic metadata. Benchmark evaluations using WavLM-Large, ECAPA-TDNN, and Wav2Vec2-XLSR-1B reveal that speaker verification equal error rates (EER) increase from 2.15% to 4.58% over the 15-year span. Furthermore, age regressors trained longitudinally successfully capture individual vocal aging patterns, whereas cross-sectional models fail to do so. VoxKnesset thus provides a critical resource for advancing research in longitudinal speaker verification and age modeling.

Technology Category

Application Category

๐Ÿ“ Abstract
Speech processing systems face a fundamental challenge: the human voice changes with age, yet few datasets support rigorous longitudinal evaluation. We introduce VoxKnesset, an open-access dataset of ~2,300 hours of Hebrew parliamentary speech spanning 2009-2025, comprising 393 speakers with recording spans of up to 15 years. Each segment includes aligned transcripts and verified demographic metadata from official parliamentary records. We benchmark modern speech embeddings (WavLM-Large, ECAPA-TDNN, Wav2Vec2-XLSR-1B) on age prediction and speaker verification under longitudinal conditions. Speaker verification EER rises from 2.15\% to 4.58\% over 15 years for the strongest model, and cross-sectionally trained age regressors fail to capture within-speaker aging, while longitudinally trained models recover a meaningful temporal signal. We publicly release the dataset and pipeline to support aging-robust speech systems and Hebrew speech processing.
Problem

Research questions and friction points this paper is trying to address.

aging speaker modeling
longitudinal speech dataset
voice variation with age
speaker verification
speech processing
Innovation

Methods, ideas, or system contributions that make the work stand out.

longitudinal speech dataset
aging speaker modeling
Hebrew speech processing
speaker verification
age prediction
๐Ÿ”Ž Similar Papers
No similar papers found.
Y
Yanir Marmor
Department of Computer Science and Applied Mathematics, Department of Molecular Cell Biology, Weizmann Institute of Science, Rehovot, Israel
A
Arad Zulti
Department of Computer Science and Applied Mathematics, Department of Molecular Cell Biology, Weizmann Institute of Science, Rehovot, Israel
D
David Krongauz
Department of Computer Science and Applied Mathematics, Department of Molecular Cell Biology, Weizmann Institute of Science, Rehovot, Israel
A
Adam Gabet
Department of Computer Science and Applied Mathematics, Department of Molecular Cell Biology, Weizmann Institute of Science, Rehovot, Israel
Y
Yoad Snapir
ivrit.ai
Y
Yair Lifshitz
ivrit.ai
Eran Segal
Eran Segal
Professor of Computer Science, Weizmann Institute of Science
Computational biology