A Spoken Drug Prescription Dataset in French for Spoken Language Understanding

Ali Can Kocabiyikoglu, François Portet, Prudence Gibert, Hervé Blanchon, Jean-Marc Babouchkine, Gaëtan Gavazzi


Abstract
Spoken medical dialogue systems are increasingly attracting interest to enhance access to healthcare services and improve quality and traceability of patient care. In this paper, we focus on medical drug prescriptions acquired on smartphones through spoken dialogue. Such systems would facilitate the traceability of care and would free the clinicians’ time. However, there is a lack of speech corpora to develop such systems since most of the related corpora are in text form and in English. To facilitate the research and development of spoken medical dialogue systems, we present, to the best of our knowledge, the first spoken medical drug prescriptions corpus, named PxNLU. It contains 4 hours of transcribed and annotated dialogues of drug prescriptions in French acquired through an experiment with 55 participants experts and non-experts in prescriptions. We also present some experiments that demonstrate the interest of this corpus for the evaluation and development of medical dialogue systems.
Anthology ID:
2022.lrec-1.109
Volume:
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
1023–1031
Language:
URL:
https://aclanthology.org/2022.lrec-1.109
DOI:
Bibkey:
Cite (ACL):
Ali Can Kocabiyikoglu, François Portet, Prudence Gibert, Hervé Blanchon, Jean-Marc Babouchkine, and Gaëtan Gavazzi. 2022. A Spoken Drug Prescription Dataset in French for Spoken Language Understanding. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 1023–1031, Marseille, France. European Language Resources Association.
Cite (Informal):
A Spoken Drug Prescription Dataset in French for Spoken Language Understanding (Kocabiyikoglu et al., LREC 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.lrec-1.109.pdf