Research Repository

A probabilistic annotation model for crowdsourcing coreference

Paun, S and Chamberlain, J and Kruschwitz, U and Yu, J and Poesio, M (2020) A probabilistic annotation model for crowdsourcing coreference. In: UNSPECIFIED, ? - ?.

[img]
Preview
Text
Paun2018Probabilistic.pdf - Published Version

Download (465kB) | Preview

Abstract

The availability of large scale annotated corpora for coreference is essential to the development of the field. However, creating resources at the required scale via expert annotation would be too expensive. Crowdsourcing has been proposed as an alternative; but this approach has not been widely used for coreference. This paper addresses one crucial hurdle on the way to make this possible, by introducing a new model of annotation for aggregating crowdsourced anaphoric annotations. The model is evaluated along three dimensions: the accuracy of the inferred mention pairs, the quality of the post-hoc constructed silver chains, and the viability of using the silver chains as an alternative to the expert-annotated chains in training a state of the art coreference system. The results suggest that our model can extract from crowdsourced annotations coreference chains of comparable quality to those obtained with expert annotation.

Item Type: Conference or Workshop Item (Paper)
Additional Information: Published proceedings: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Faculty of Science and Health > Computer Science and Electronic Engineering, School of
Depositing User: Elements
Date Deposited: 13 Feb 2019 10:19
Last Modified: 08 Apr 2021 01:15
URI: http://repository.essex.ac.uk/id/eprint/23421

Actions (login required)

View Item View Item