Mind the User! Measures to More Accurately Evaluate the Practical Value of Active Learning Strategies

Julia Romberg


Abstract
One solution to limited annotation budgets is active learning (AL), a collaborative process in which human and machine strategically select a small but informative set of examples. While current measures optimize AL from a pure machine learning perspective, we argue that for a successful transfer into practice, additional criteria must target the second pillar of AL, the human annotator. In text classification, for example, where practitioners regularly encounter datasets with an increased number of imbalanced classes, measures like F1 fall short when finding all classes or identifying rare cases is required. We therefore introduce four measures that reflect class-related demands that users place on data acquisition. In a comprehensive comparison of uncertainty-based, diversity-based, and hybrid query strategies on six different datasets, we find that strong F1 performance is not necessarily associated with full class coverage. Uncertainty sampling outperforms diversity sampling in selecting minority classes and covering classes more efficiently, while diversity sampling excels in selecting less monotonous batches. Our empirical findings emphasize that a holistic view is essential when evaluating AL approaches to ensure their usefulness in practice, which is the actual but often overlooked goal of development. To this end, standard measures for assessing the performance of text classification need to be complemented by measures that more appropriately reflect user needs.
Anthology ID:
2023.ranlp-1.107
Volume:
Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing
Month:
September
Year:
2023
Address:
Varna, Bulgaria
Editors:
Ruslan Mitkov, Galia Angelova
Venue:
RANLP
Publisher:
INCOMA Ltd., Shoumen, Bulgaria
Pages:
996–1006
URL:
https://aclanthology.org/2023.ranlp-1.107
Cite (ACL):
Julia Romberg. 2023. Mind the User! Measures to More Accurately Evaluate the Practical Value of Active Learning Strategies. In Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing, pages 996–1006, Varna, Bulgaria. INCOMA Ltd., Shoumen, Bulgaria.
Cite (Informal):
Mind the User! Measures to More Accurately Evaluate the Practical Value of Active Learning Strategies (Romberg, RANLP 2023)
PDF:
https://aclanthology.org/2023.ranlp-1.107.pdf