Title: Multi-modal Summarization for Asynchronous Collection of Text, Image, Audio and Video
Authors: Haoran Li, Junnan Zhu, Cong Ma, Jiajun Zhang, Chengqing Zong
Date: 2017-09
Type: text (conference publication)
Published in: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
Editors: Martha Palmer, Rebecca Hwa, Sebastian Riedel
Publisher: Association for Computational Linguistics
Location: Copenhagen, Denmark
Pages: 1092-1102
Identifier: li-etal-2017-multi
DOI: 10.18653/v1/D17-1114
URL: https://aclanthology.org/D17-1114/