Using Mechanical Turk to Create a Corpus of Arabic Summaries

El-Haj, M and Kruschwitz, U and Fox, C (2010) 'Using Mechanical Turk to Create a Corpus of Arabic Summaries.' In: Proceedings of the International Conference on Language Resources and Evaluation. European Language Resources Association. ISBN 2951740867

Abstract

This paper describes the creation of a human-generated corpus of extractive Arabic summaries of a selection of Wikipedia and Arabic newspaper articles using Mechanical Turk, an online workforce. The purpose of this exercise was two-fold. First, it addresses a shortage of relevant data for Arabic natural language processing. Second, it demonstrates the application of Mechanical Turk to the problem of creating natural language resources. The paper also reports on a number of evaluations we have performed to compare the collected summaries against results obtained from a variety of automatic summarisation systems.

Item Type: Book Section
Subjects: P Language and Literature > P Philology. Linguistics
Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Faculty of Science and Health > Computer Science and Electronic Engineering, School of
Date Deposited: 03 Jul 2013 08:38
Last Modified: 17 Aug 2017 18:07
URI: http://repository.essex.ac.uk/id/eprint/4064
