Shekhar, Ravi and Karan, Mladen and Purver, Matthew (2022) CoRAL: a Context-aware Croatian Abusive Language Dataset. In: 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing - Findings of the Association for Computational Linguistics: AACL-IJCNLP 2022, 2022-11-20 - 2022-11-23, Online only.
Abstract
In light of unprecedented increases in the popularity of the internet and social media, comment moderation has never been a more relevant task. Semi-automated comment moderation systems greatly aid human moderators by either automatically classifying the examples or allowing the moderators to prioritize which comments to consider first. However, the concept of inappropriate content is often subjective, and such content can be conveyed in many subtle and indirect ways. In this work, we propose CoRAL, a language and culturally aware Croatian Abusive dataset covering phenomena of implicitness and reliance on local and global context. We show experimentally that current models degrade when comments are not explicit and further degrade when language skill and context knowledge are required to interpret the comment.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Divisions: | Faculty of Science and Health; Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 12 Jan 2024 12:42 |
Last Modified: | 12 Jan 2024 12:43 |
URI: | http://repository.essex.ac.uk/id/eprint/35791 |
Available files
Filename: 2022.findings-aacl.21.pdf
Licence: Creative Commons: Attribution 4.0