Incomplete Utterance Rewriting with Editing Operation Guidance and Utterance Augmentation

Zhiyu Cao, Peifeng Li, Yaxin Fan, Qiaoming Zhu


Abstract
Although existing fashionable generation methods on Incomplete Utterance Rewriting (IUR) can generate coherent utterances, they often result in the inclusion of irrelevant and redundant tokens in rewritten utterances due to their inability to focus on critical tokens in dialogue context. Furthermore, the limited size of the training datasets also contributes to the insufficient training of the IUR model. To address the first issue, we propose a multi-task learning framework EO-IUR (Editing Operation-guided Incomplete Utterance Rewriting) that introduces the editing operation labels generated by sequence labeling module to guide generation model to focus on critical tokens. Furthermore, we introduce a token-level heterogeneous graph to represent dialogues. To address the second issue, we propose a two-dimensional utterance augmentation strategy, namely editing operation-based incomplete utterance augmentation and LLM-based historical utterance augmentation. The experimental results on three datasets demonstrate that our EO-IUR outperforms previous state-of-the-art (SOTA) baselines in both open-domain and task-oriented dialogue.
Anthology ID:
2024.emnlp-main.410
Volume:
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2024
Address:
Miami, Florida, USA
Editors:
Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
7225–7238
Language:
URL:
https://aclanthology.org/2024.emnlp-main.410/
DOI:
10.18653/v1/2024.emnlp-main.410
Bibkey:
Cite (ACL):
Zhiyu Cao, Peifeng Li, Yaxin Fan, and Qiaoming Zhu. 2024. Incomplete Utterance Rewriting with Editing Operation Guidance and Utterance Augmentation. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pages 7225–7238, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal):
Incomplete Utterance Rewriting with Editing Operation Guidance and Utterance Augmentation (Cao et al., EMNLP 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.emnlp-main.410.pdf