ProMISe: A Proactive Multi-turn Dialogue Dataset for Information-seeking Intent Resolution

Yash Butala, Siddhant Garg, Pratyay Banerjee, Amita Misra


Abstract
Users of AI-based virtual assistants and search systems encounter challenges in articulating their intents while seeking information on unfamiliar topics, possibly due to complexity of the user’s intent or the lack of meta-information on the topic. We posit that an iterative suggested question-answering (SQA) conversation can improve the trade-off between the satisfaction of the user’s intent while keeping the information exchange natural and cognitive load of the interaction minimal on the users. In this paper, we evaluate a novel setting ProMISe by means of a sequence of interactions between a user, having a predefined information-seeking intent, and an agent that generates a set of SQA pairs at each step to aid the user to get closer to their intent. We simulate this two-player setting to create a multi-turn conversational dataset of SQAs and user choices (1025 dialogues comprising 4453 turns and 17812 SQAs) using human-feedback, chain-of-thought prompting and web-retrieval augmented large language models. We evaluate the quality of the SQs in the dataset on attributes such as diversity, specificity, grounding, etc, and benchmark the performance of different language models for the task of replicating user behavior.
Anthology ID:
2024.findings-eacl.124
Volume:
Findings of the Association for Computational Linguistics: EACL 2024
Month:
March
Year:
2024
Address:
St. Julian’s, Malta
Editors:
Yvette Graham, Matthew Purver
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1774–1789
Language:
URL:
https://aclanthology.org/2024.findings-eacl.124
DOI:
Bibkey:
Cite (ACL):
Yash Butala, Siddhant Garg, Pratyay Banerjee, and Amita Misra. 2024. ProMISe: A Proactive Multi-turn Dialogue Dataset for Information-seeking Intent Resolution. In Findings of the Association for Computational Linguistics: EACL 2024, pages 1774–1789, St. Julian’s, Malta. Association for Computational Linguistics.
Cite (Informal):
ProMISe: A Proactive Multi-turn Dialogue Dataset for Information-seeking Intent Resolution (Butala et al., Findings 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.findings-eacl.124.pdf
Note:
 2024.findings-eacl.124.note.zip