Research Repository

Multi-document arabic text summarisation

El-Haj, M and Kruschwitz, U and Fox, C (2011) Multi-document arabic text summarisation. In: UNSPECIFIED, ? - ?.

Full text not available from this repository.

Abstract

In this paper we present our generic extractive Arabic and English multi-document summarisers. We also describe the use of machine translation for evaluating the generated Arabic multi-document summaries using English extractive gold standards. In this work we first address the lack of Arabic multi-document corpora for summarisation and the absence of automatic and manual Arabic gold-standard summaries. These are required to evaluate any automatic Arabic summarisers. Second, we demonstrate the use of Google Translate in creating an Arabic version of the DUC-2002 dataset. The parallel Arabic/English dataset is summarised using the Arabic and English summarisation systems. The automatically generated summaries are evaluated using the ROUGE metric, as well as precision and recall. The results we achieve are compared with the top five systems in the DUC-2002 multi-document summarisation task. © 2011 IEEE.

Item Type: Conference or Workshop Item (UNSPECIFIED)
Additional Information: Published proceedings: 2011 3rd Computer Science and Electronic Engineering Conference, CEEC'11
Subjects: P Language and Literature > P Philology. Linguistics
Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Faculty of Science and Health > Computer Science and Electronic Engineering, School of
Depositing User: Users 161 not found.
Date Deposited: 18 Oct 2012 21:35
Last Modified: 23 Jan 2019 01:15
URI: http://repository.essex.ac.uk/id/eprint/4065

Actions (login required)

View Item View Item