Generating Multiple-choice Questions for Medical Question Answering with Distractors and Cue-masking

Damien Sileo, Kanimozhi Uma, Marie-Francine Moens


Abstract
Medical multiple-choice question answering (MCQA) is a challenging evaluation for medical natural language processing and a helpful task in itself. Medical questions may describe patient symptoms and ask for the correct diagnosis, which requires domain knowledge and complex reasoning. Standard language modeling pretraining alone is not sufficient to achieve the best results with BERT-base size (Devlin et al., 2019) encoders. Jin et al. (2020) showed that focusing masked language modeling on disease name prediction when using medical encyclopedic paragraphs as input leads to considerable MCQA accuracy improvement. In this work, we show that (1) fine-tuning on generated MCQA dataset outperforms the masked language modeling based objective and (2) correctly masking the cues to the answers is critical for good performance. We release new pretraining datasets and achieve state-of-the-art results on 4 MCQA datasets, notably +5.7% with base-size model on MedQA-USMLE.
Anthology ID:
2024.lrec-main.675
Volume:
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
Month:
May
Year:
2024
Address:
Torino, Italia
Editors:
Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue
Venues:
LREC | COLING
SIG:
Publisher:
ELRA and ICCL
Note:
Pages:
7647–7653
Language:
URL:
https://aclanthology.org/2024.lrec-main.675
DOI:
Bibkey:
Cite (ACL):
Damien Sileo, Kanimozhi Uma, and Marie-Francine Moens. 2024. Generating Multiple-choice Questions for Medical Question Answering with Distractors and Cue-masking. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 7647–7653, Torino, Italia. ELRA and ICCL.
Cite (Informal):
Generating Multiple-choice Questions for Medical Question Answering with Distractors and Cue-masking (Sileo et al., LREC-COLING 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.lrec-main.675.pdf