RoboVox: A Single/Multi-channel Far-field Speaker Recognition Benchmark for a Mobile Robot

Mohammad Mohammadamini, Driss Matrouf, Michael Rouvier, Jean-Francois Bonastre, Romain Serizel, Theophile Gonos


Abstract
In this paper, we introduce a new far-field speaker recognition benchmark called RoboVox. RoboVox is a French corpus recorded by a mobile robot. The files are recorded from different distances under severe acoustical conditions with the presence of several types of noise and reverberation. In addition to noise and reverberation, the robot’s internal noise acts as an extra additive noise. RoboVox can be used for both single-channel and multi-channel speaker recognition. In the evaluation protocols, we are considering both cases. The obtained results demonstrate a significant decline in performance in far-filed speaker recognition and urge the community to further research in this domain
Anthology ID:
2024.lrec-main.1234
Volume:
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
Month:
May
Year:
2024
Address:
Torino, Italia
Editors:
Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue
Venues:
LREC | COLING
SIG:
Publisher:
ELRA and ICCL
Note:
Pages:
14152–14156
Language:
URL:
https://aclanthology.org/2024.lrec-main.1234
DOI:
Bibkey:
Cite (ACL):
Mohammad Mohammadamini, Driss Matrouf, Michael Rouvier, Jean-Francois Bonastre, Romain Serizel, and Theophile Gonos. 2024. RoboVox: A Single/Multi-channel Far-field Speaker Recognition Benchmark for a Mobile Robot. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 14152–14156, Torino, Italia. ELRA and ICCL.
Cite (Informal):
RoboVox: A Single/Multi-channel Far-field Speaker Recognition Benchmark for a Mobile Robot (Mohammadamini et al., LREC-COLING 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.lrec-main.1234.pdf