Louis, Annie P (2014) A Bayesian Method to Incorporate Background Knowledge during Automatic Text Summarization. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Short Papers), June 23-25 2014, Baltimore, Maryland, USA.
Louis, Annie P (2014) A Bayesian Method to Incorporate Background Knowledge during Automatic Text Summarization. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Short Papers), June 23-25 2014, Baltimore, Maryland, USA.
Louis, Annie P (2014) A Bayesian Method to Incorporate Background Knowledge during Automatic Text Summarization. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Short Papers), June 23-25 2014, Baltimore, Maryland, USA.
Abstract
In order to summarize a document, it is often useful to have a background set of documents from the domain to serve as a reference for determining new and important information in the input document. We present a model based on Bayesian surprise which provides an intuitive way to identify surprising information from a summarization input with respect to a background corpus. Specifically, the method quantifies the degree to which pieces of information in the input change one’s beliefs’ about the world represented in the background. We develop systems for generic and update summarization based on this idea. Our method provides competitive content selection performance with particular advantages in the update task where systems are given a small and topical background corpus.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Subjects: | P Language and Literature > P Philology. Linguistics Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
Depositing User: | Jim Jamieson |
Date Deposited: | 13 Dec 2016 16:27 |
Last Modified: | 13 Dec 2016 16:27 |
URI: | http://repository.essex.ac.uk/id/eprint/18543 |
Available files
Filename: P14-2055.pdf