Sikdar, UK and Ekbal, A and Saha, S and Uryupina, O and Poesio, M (2014) Differential evolution-based feature selection technique for anaphora resolution. Soft Computing, 19 (8). pp. 2149-2161. DOI https://doi.org/10.1007/s00500-014-1397-3
Sikdar, UK and Ekbal, A and Saha, S and Uryupina, O and Poesio, M (2014) Differential evolution-based feature selection technique for anaphora resolution. Soft Computing, 19 (8). pp. 2149-2161. DOI https://doi.org/10.1007/s00500-014-1397-3
Sikdar, UK and Ekbal, A and Saha, S and Uryupina, O and Poesio, M (2014) Differential evolution-based feature selection technique for anaphora resolution. Soft Computing, 19 (8). pp. 2149-2161. DOI https://doi.org/10.1007/s00500-014-1397-3
Abstract
In this paper a differential evolution (DE)-based feature selection technique is developed for anaphora resolution in a resource-poor language, namely Bengali. We discuss the issues of adapting a state-of-the-art English anaphora resolution system for a resource-poor language like Bengali. Performance of any anaphoric resolver greatly depends on the quality of a high accurate mention detector and the use of appropriate features for anaphora resolution. We develop a number of models for mention detection based on machine learning and heuristics. In anaphora resolution there is no globally accepted metric for measuring the performance, and each of them such as MUC, B3 , CEAF, Blanc exhibit significantly different behaviors. Our proposed feature selection technique determines the near-optimal feature set by optimizing each of these evaluation metrics. Experiments show how a language-dependent system (designed primarily for English) can attain reasonably good performance level when re-trained and tested on a new language with a proper subset of features. Evaluation results yield the F-measure values of 66.70, 59.47, 51.56, 33.08 and 72.75 % for MUC, B 3, CEAFM, CEAFE and BLANC, respectively
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Anaphora resolution; Differential evolution; Feature selection; CRF; BART; Bengali |
Subjects: | P Language and Literature > P Philology. Linguistics Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Faculty of Science and Health Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 04 Feb 2015 11:52 |
Last Modified: | 23 Oct 2024 05:42 |
URI: | http://repository.essex.ac.uk/id/eprint/12581 |