ISO 24617-2 on a cusp of languages

Krzysztof Hwaszcz, Marcin Oleksy, Aleksandra Domogała, Jan Wieczorek


Abstract
The article discusses the challenges of cross-linguistic dialogue act annotation, in which methods developed for one language are used to annotate conversations in another. It focuses on research on dialogue act annotation in Polish, based on the ISO standard developed for English. Drawing on selected examples from the DiaBiz.Kom corpus, it examines differences between Polish and English that affect dialogue act annotation: the use of honorifics in Polish, the use of inflection to convey meaning in Polish, the tendency to use complex sentence structures in Polish, and cultural differences that may play a role in the annotation of dialogue acts. The article also discusses the creation of DiaBiz.Kom, a Polish dialogue corpus based on the ISO 24617-2 standard applied to 1100 transcripts.
Anthology ID:
2023.isa-1.6
Volume:
Proceedings of the 19th Joint ACL-ISO Workshop on Interoperable Semantics (ISA-19)
Month:
June
Year:
2023
Address:
Nancy, France
Editor:
Harry Bunt
Venues:
ISA | WS
SIG:
SIGSEM
Publisher:
Association for Computational Linguistics
Pages:
40–46
URL:
https://aclanthology.org/2023.isa-1.6
DOI:
Bibkey:
Cite (ACL):
Krzysztof Hwaszcz, Marcin Oleksy, Aleksandra Domogała, and Jan Wieczorek. 2023. ISO 24617-2 on a cusp of languages. In Proceedings of the 19th Joint ACL-ISO Workshop on Interoperable Semantics (ISA-19), pages 40–46, Nancy, France. Association for Computational Linguistics.
Cite (Informal):
ISO 24617-2 on a cusp of languages (Hwaszcz et al., ISA-WS 2023)
PDF:
https://aclanthology.org/2023.isa-1.6.pdf