Non-Parametric Memory Guidance for Multi-Document Summarization

Florian Baud, Alex Aussem


Abstract
Multi-document summarization (MDS) is a difficult task in Natural Language Processing, aiming to summarize information from several documents. However, the source documents are often insufficient to obtain a qualitative summary. We propose a retriever-guided model combined with non-parametric memory for summary generation. This model retrieves relevant candidates from a database and then generates the summary considering the candidates with a copy mechanism and the source documents. The retriever is implemented with Approximate Nearest Neighbor Search (ANN) to search large databases. Our method is evaluated on the MultiXScience dataset which includes scientific articles. Finally, we discuss our results and possible directions for future work.
Anthology ID:
2023.ranlp-1.17
Volume:
Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing
Month:
September
Year:
2023
Address:
Varna, Bulgaria
Editors:
Ruslan Mitkov, Galia Angelova
Venue:
RANLP
SIG:
Publisher:
INCOMA Ltd., Shoumen, Bulgaria
Note:
Pages:
153–158
Language:
URL:
https://aclanthology.org/2023.ranlp-1.17
DOI:
Bibkey:
Cite (ACL):
Florian Baud and Alex Aussem. 2023. Non-Parametric Memory Guidance for Multi-Document Summarization. In Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing, pages 153–158, Varna, Bulgaria. INCOMA Ltd., Shoumen, Bulgaria.
Cite (Informal):
Non-Parametric Memory Guidance for Multi-Document Summarization (Baud & Aussem, RANLP 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.ranlp-1.17.pdf