Using Mechanical Turk to Create a Corpus of Arabic Summaries

El-Haj, M and Kruschwitz, U and Fox, C (2010) Using Mechanical Turk to Create a Corpus of Arabic Summaries. In: UNSPECIFIED, ? - ?.

Abstract

This paper describes the creation of a human-generated corpus of extractive Arabic summaries of a selection of Wikipedia and Arabic newspaper articles using Mechanical Turk?an online workforce. The purpose of this exercise was two-fold. First, it addresses a shortage of relevant data for Arabic natural language processing. Second, it demonstrates the application of Mechanical Turk to the problem of creating natural language resources. The paper also reports on a number of evaluations we have performed to compare the collected summaries against results obtained from a variety of automatic summarisation systems.

Item Metadata

Item Type:	Conference or Workshop Item (UNSPECIFIED)
Additional Information:	Published proceedings: _not provided_ - Notes:
Subjects:	P Language and Literature > P Philology. Linguistics Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:	Faculty of Science and Health Faculty of Science and Health > Computer Science and Electronic Engineering, School of
SWORD Depositor:	Unnamed user with email elements@essex.ac.uk
Depositing User:	Unnamed user with email elements@essex.ac.uk
Date Deposited:	03 Jul 2013 08:38
Last Modified:	16 May 2024 17:51
URI:	http://repository.essex.ac.uk/id/eprint/4064

Available files

UNSPECIFIED

Filename: LREC2010_MTurk.pdf

Download

Using Mechanical Turk to Create a Corpus of Arabic Summaries

Abstract

Item Metadata

Share and export

Available files

UNSPECIFIED

Statistics

Downloads