Evaluating Modular Dialogue System for Form Filling Using Large Language Models

Sherzod Hakimov, Yan Weiser, David Schlangen


Abstract
This paper introduces a novel approach to form-filling and dialogue system evaluation by leveraging Large Language Models (LLMs). The proposed method establishes a setup wherein multiple modules collaborate on addressing the form-filling task. The dialogue system is constructed on top of LLMs, focusing on defining specific roles for individual modules. We show that using multiple independent sub-modules working cooperatively on this task can improve performance and handle the typical constraints of using LLMs, such as context limitations. The study involves testing the modular setup on four selected forms of varying topics and lengths, employing commercial and open-access LLMs. The experimental results demonstrate that the modular setup consistently outperforms the baseline, showcasing the effectiveness of this approach. Furthermore, our findings reveal that open-access models perform comparably to commercial models for the specified task.
Anthology ID:
2024.scichat-1.4
Volume:
Proceedings of the 1st Workshop on Simulating Conversational Intelligence in Chat (SCI-CHAT 2024)
Month:
March
Year:
2024
Address:
St. Julians, Malta
Editors:
Yvette Graham, Qun Liu, Gerasimos Lampouras, Ignacio Iacobacci, Sinead Madden, Haider Khalid, Rameez Qureshi
Venues:
SCI-CHAT | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
36–52
Language:
URL:
https://aclanthology.org/2024.scichat-1.4
DOI:
Bibkey:
Cite (ACL):
Sherzod Hakimov, Yan Weiser, and David Schlangen. 2024. Evaluating Modular Dialogue System for Form Filling Using Large Language Models. In Proceedings of the 1st Workshop on Simulating Conversational Intelligence in Chat (SCI-CHAT 2024), pages 36–52, St. Julians, Malta. Association for Computational Linguistics.
Cite (Informal):
Evaluating Modular Dialogue System for Form Filling Using Large Language Models (Hakimov et al., SCI-CHAT-WS 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.scichat-1.4.pdf
Video:
 https://aclanthology.org/2024.scichat-1.4.mp4