Monolingual text summarization for Indic Languages using LLMs

Jothir Adithya T K, Nithish Kumar S, Felicia Lilian J, Mahalakshmi S


Abstract
We analyze the development of advanced text summarization methods that leverage LLMs for Indic languages. Text summarization transforms a longer text into a more concise version while preserving its most prominent information and key meanings. Our goal is to produce concise and accurate summaries from longer texts, with a focus on retaining detailed information and coherence. We use NLP techniques for text cleaning, keyword extraction, and summarization, and evaluate performance with ROUGE, BLEU, and BERTScore. The results demonstrate an incremental improvement in the quality of generated summaries, with a particular emphasis on enhancing informativeness while minimizing redundancy. This work also highlights the importance of parameter tuning and of leveraging advanced models to produce high-quality summaries across diverse domains for Indic languages.
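As an illustration of the ROUGE metric named in the abstract, the following is a minimal sketch of ROUGE-1 (unigram overlap) using only the Python standard library. It is not the authors' evaluation code: the function name `rouge_1`, the whitespace tokenization, and the Hindi example sentences are assumptions made for illustration, and a real evaluation would more likely rely on an established ROUGE package with tokenization suited to the target language.

```python
from collections import Counter


def rouge_1(reference: str, candidate: str) -> dict:
    """Compute ROUGE-1 precision, recall, and F1 from unigram overlap.

    Assumption: tokens are separated by whitespace, which is a rough
    approximation for Indic-language text.
    """
    ref_counts = Counter(reference.split())
    cand_counts = Counter(candidate.split())

    # Clipped overlap: a token counts only as often as it occurs in both texts.
    overlap = sum((ref_counts & cand_counts).values())

    # Guard against division by zero for empty inputs.
    recall = overlap / max(sum(ref_counts.values()), 1)
    precision = overlap / max(sum(cand_counts.values()), 1)
    f1 = 0.0 if overlap == 0 else 2 * precision * recall / (precision + recall)
    return {"precision": precision, "recall": recall, "f1": f1}


# Hypothetical reference and system summaries (invented for this example).
reference_summary = "भारत ने मैच जीत लिया"
system_summary = "भारत ने मैच जीता"
scores = rouge_1(reference_summary, system_summary)
print(scores)
```

Recall measures how much of the reference summary is covered by the generated summary, while precision penalizes extra tokens, so a redundant summary lowers precision without raising recall.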
Anthology ID:
2024.icon-1.11
Volume:
Proceedings of the 21st International Conference on Natural Language Processing (ICON)
Month:
December
Year:
2024
Address:
AU-KBC Research Centre, Chennai, India
Editors:
Sobha Lalitha Devi, Karunesh Arora
Venue:
ICON
Publisher:
NLP Association of India (NLPAI)
Pages:
94–101
URL:
https://aclanthology.org/2024.icon-1.11/
Cite (ACL):
Jothir Adithya T K, Nithish Kumar S, Felicia Lilian J, and Mahalakshmi S. 2024. Monolingual text summarization for Indic Languages using LLMs. In Proceedings of the 21st International Conference on Natural Language Processing (ICON), pages 94–101, AU-KBC Research Centre, Chennai, India. NLP Association of India (NLPAI).
Cite (Informal):
Monolingual text summarization for Indic Languages using LLMs (T K et al., ICON 2024)
PDF:
https://aclanthology.org/2024.icon-1.11.pdf