An Empirical Study on Multi-Task Learning for Text Style Transfer and Paraphrase Generation

Pawel Bujnowski, Kseniia Ryzhova, Hyungtak Choi, Katarzyna Witkowska, Jaroslaw Piersa, Tymoteusz Krumholc, Katarzyna Beksa


Abstract
The topic of this paper is neural multi-task training for text style transfer. We present an efficient method for neutral-to-style transformation using the transformer framework. We demonstrate how to prepare a robust model utilizing large paraphrases corpora together with a small parallel style transfer corpus. We study how much style transfer data is needed for a model on the example of two transformations: neutral-to-cute on internal corpus and modern-to-antique on publicly available Bible corpora. Additionally, we propose a synthetic measure for the automatic evaluation of style transfer models. We hope our research is a step towards replacing common but limited rule-based style transfer systems by more flexible machine learning models for both public and commercial usage.
Anthology ID:
2020.coling-industry.6
Volume:
Proceedings of the 28th International Conference on Computational Linguistics: Industry Track
Month:
December
Year:
2020
Address:
Online
Editors:
Ann Clifton, Courtney Napoles
Venue:
COLING
SIG:
Publisher:
International Committee on Computational Linguistics
Note:
Pages:
50–63
Language:
URL:
https://aclanthology.org/2020.coling-industry.6
DOI:
10.18653/v1/2020.coling-industry.6
Bibkey:
Cite (ACL):
Pawel Bujnowski, Kseniia Ryzhova, Hyungtak Choi, Katarzyna Witkowska, Jaroslaw Piersa, Tymoteusz Krumholc, and Katarzyna Beksa. 2020. An Empirical Study on Multi-Task Learning for Text Style Transfer and Paraphrase Generation. In Proceedings of the 28th International Conference on Computational Linguistics: Industry Track, pages 50–63, Online. International Committee on Computational Linguistics.
Cite (Informal):
An Empirical Study on Multi-Task Learning for Text Style Transfer and Paraphrase Generation (Bujnowski et al., COLING 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.coling-industry.6.pdf
Data
MultiNLI