Research Repository

Using Mechanical Turk to Create a Corpus of Arabic Summaries

El-Haj, M and Kruschwitz, U and Fox, C (2010) Using Mechanical Turk to Create a Corpus of Arabic Summaries. In: UNSPECIFIED, ? - ?.


Download (33kB) | Preview


This paper describes the creation of a human-generated corpus of extractive Arabic summaries of a selection of Wikipedia and Arabic newspaper articles using Mechanical Turk?an online workforce. The purpose of this exercise was two-fold. First, it addresses a shortage of relevant data for Arabic natural language processing. Second, it demonstrates the application of Mechanical Turk to the problem of creating natural language resources. The paper also reports on a number of evaluations we have performed to compare the collected summaries against results obtained from a variety of automatic summarisation systems.

Item Type: Conference or Workshop Item (UNSPECIFIED)
Additional Information: Published proceedings: _not provided_ - Notes:
Subjects: P Language and Literature > P Philology. Linguistics
Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Faculty of Science and Health
Faculty of Science and Health > Computer Science and Electronic Engineering, School of
SWORD Depositor: Elements
Depositing User: Elements
Date Deposited: 03 Jul 2013 08:38
Last Modified: 15 Jan 2022 01:04

Actions (login required)

View Item View Item