VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles Mingzhe Li author Xiuying Chen author Shen Gao author Zhangming Chan author Dongyan Zhao author Rui Yan author 2020-11 text Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) Bonnie Webber editor Trevor Cohn editor Yulan He editor Yang Liu editor Association for Computational Linguistics Online conference publication li-etal-2020-vmsmo 10.18653/v1/2020.emnlp-main.752 https://aclanthology.org/2020.emnlp-main.752/ 2020-11 9360 9369