Research Repository

Multi-Document Arabic Text Summarisation

El-Haj, M and Kruschwitz, U and Fox, C (2011) Multi-Document Arabic Text Summarisation. In: 2011 3rd Computer Science and Electronic Engineering Conference (CEEC), 2011-07-13 - 2011-07-14.

Full text not available from this repository.


In this paper we present our generic extractive Arabic and English multi-document summarisers. We also describe the use of machine translation for evaluating the generated Arabic multi-document summaries using English extractive gold standards. In this work we first address the lack of Arabic multi-document corpora for summarisation and the absence of automatic and manual Arabic gold-standard summaries. These are required to evaluate any automatic Arabic summarisers. Second, we demonstrate the use of Google Translate in creating an Arabic version of the DUC-2002 dataset. The parallel Arabic/English dataset is summarised using the Arabic and English summarisation systems. The automatically generated summaries are evaluated using the ROUGE metric, as well as precision and recall. The results we achieve are compared with the top five systems in the DUC-2002 multi-document summarisation task.

Item Type: Conference or Workshop Item (UNSPECIFIED)
Additional Information: Notes: Proceedings of the 3rd Computer Science and Electronic Engineering Conference, 13-14 July 2011, Colchester, United Kingdom.
Subjects: P Language and Literature > P Philology. Linguistics
Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Faculty of Science and Health
Faculty of Science and Health > Computer Science and Electronic Engineering, School of
SWORD Depositor: Elements
Depositing User: Elements
Date Deposited: 18 Oct 2012 21:35
Last Modified: 15 Jan 2022 00:45

Actions (login required)

View Item View Item