Efficient Large Scale Language Modeling with Mixtures of Experts Mikel Artetxe author Shruti Bhosale author Naman Goyal author Todor Mihaylov author Myle Ott author Sam Shleifer author Xi Victoria Lin author Jingfei Du author Srinivasan Iyer author Ramakanth Pasunuru author Giridharan Anantharaman author Xian Li author Shuohui Chen author Halil Akin author Mandeep Baines author Louis Martin author Xing Zhou author Punit Singh Koura author Brian O’Horo author Jeffrey Wang author Luke Zettlemoyer author Mona Diab author Zornitsa Kozareva author Veselin Stoyanov author 2022-12 text Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing Yoav Goldberg editor Zornitsa Kozareva editor Yue Zhang editor Association for Computational Linguistics Abu Dhabi, United Arab Emirates conference publication artetxe-etal-2022-efficient 10.18653/v1/2022.emnlp-main.804 https://aclanthology.org/2022.emnlp-main.804/ 2022-12 11699 11732