Exploring Clustering for Multi-Document Arabic Summarisation

El-Haj, M and Kruschwitz, U and Fox, C (2011) Exploring Clustering for Multi-Document Arabic Summarisation. In: UNSPECIFIED, ? - ?.

Abstract

In this paper we explore clustering for multi-document Arabic summarisation. For our evaluation we use an Arabic version of the DUC-2002 dataset that we previously generated using Google Translate. We explore how clustering (at the sentence level) can be applied to multi-document summarisation as well as for redundancy elimination within this process. We use different parameter settings including the cluster size and the selection model applied in the extractive summarisation process. The automatically generated summaries are evaluated using the ROUGE metric, as well as precision and recall. The results we achieve are compared with the top five systems in the DUC-2002 multi-document summarisation task.

Item Metadata

Item Type:	Conference or Workshop Item (UNSPECIFIED)
Additional Information:	Notes: Proceedings of the 7th Asia Information Retrieval Societies Conference, AIRS 2011, Dubai, United Arab Emirates, December 18-20, 2011.
Subjects:	P Language and Literature > P Philology. Linguistics Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:	Faculty of Science and Health Faculty of Science and Health > Computer Science and Electronic Engineering, School of
SWORD Depositor:	Unnamed user with email elements@essex.ac.uk
Depositing User:	Unnamed user with email elements@essex.ac.uk
Date Deposited:	19 Oct 2012 10:02
Last Modified:	18 Jun 2025 00:28
URI:	http://repository.essex.ac.uk/id/eprint/3882

Exploring Clustering for Multi-Document Arabic Summarisation

Abstract

Item Metadata

Share and export

Available files

Statistics

Altmetrics

Downloads