Research Repository

Anaphora resolution for Bengali: An experiment with domain adaptation

Sikdar, UK and Ekbal, A and Saha, S and Uryupina, O and Poesio, M (2013) 'Anaphora resolution for Bengali: An experiment with domain adaptation.' Computación y Sistemas, 17 (2). pp. 137-146. ISSN 1405-5546


Abstract

In this paper we present our first attempt at anaphora resolution for a resource-poor language, namely Bengali. We address the issue of adapting a state-of-the-art system, BART, which was originally developed for English. The overall performance of co-reference resolution depends greatly on highly accurate mention detectors. We develop a number of models based on the heuristics used as well as on the particular machine learning approach employed. We then perform a series of experiments to adapt BART to Bengali. Our evaluation shows that a language-dependent system (designed primarily for English) can achieve a good performance level when re-trained and tested on a new language with proper subsets of features. The system achieves recall, precision and F-measure values of 56.00%, 46.50% and 50.80%, respectively. The contribution of this work is two-fold: (i) an attempt to build a machine-learning-based anaphora resolution system for a resource-poor Indian language; and (ii) domain adaptation of a state-of-the-art English co-reference resolution system for Bengali, which has a completely different orthography and characteristics.
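
As a quick sanity check on the reported figures, the sketch below recomputes the F-measure from the stated recall and precision. It assumes the F-measure is the balanced harmonic mean (F1); the abstract does not say which variant or which co-reference scoring metric was used, so the function name `f_measure` and the `beta` parameter are illustrative only.

```python
def f_measure(precision: float, recall: float, beta: float = 1.0) -> float:
    """Weighted harmonic mean of precision and recall (F-beta).

    Assumption: beta = 1 (balanced F1); the abstract does not specify this.
    """
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Values as reported in the abstract, in percent.
recall = 56.00
precision = 46.50

f1 = f_measure(precision, recall)
print(f"{f1:.2f}")  # prints 50.81
```

Under this assumption the result agrees with the reported 50.80% to within rounding of the published precision and recall.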

Item Type: Article
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Faculty of Science and Health > Computer Science and Electronic Engineering, School of
Depositing User: Jim Jamieson
Date Deposited: 10 Jun 2014 07:51
Last Modified: 23 Jan 2019 06:15
URI: http://repository.essex.ac.uk/id/eprint/9682
