Compressing Context to Enhance Inference Efficiency of Large Language Models Yucheng Li author Bo Dong author Frank Guerin author Chenghua Lin author 2023-12 text Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing Houda Bouamor editor Juan Pino editor Kalika Bali editor Association for Computational Linguistics Singapore conference publication li-etal-2023-compressing 10.18653/v1/2023.emnlp-main.391 https://aclanthology.org/2023.emnlp-main.391/ 2023-12 6342 6353