Research Repository

Verbose, Laconic or Just Right: A Simple Computational Model of Content Appropriateness under Length Constraints

Louis, Annie P and Nenkova, Ani (2014) Verbose, Laconic or Just Right: A Simple Computational Model of Content Appropriateness under Length Constraints. In: Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, April 26-30 2014, Gothenburg.

[img]
Preview
Text
E14-1067.pdf

Download (146kB) | Preview

Abstract

Length constraints impose implicit requirements on the type of content that can be included in a text. Here we pro- pose the first model to computationally assess if a text deviates from these requirements. Specifically, our model predicts the appropriate length for texts based on content types present in a snippet of constant length. We consider a range of features to approximate content type, including syntactic phrasing, constituent compression probability, presence of named entities, sentence specificity and intersentence continuity. Weights for these features are learned using a corpus of summaries written by experts and on high quality journalistic writing. During test time, the difference between actual and predicted length allows us to quantify text verbosity. We use data from manual evaluation of summarization systems to assess the verbosity scores produced by our model. We show that the automatic verbosity scores are significantly negatively correlated with manual content quality scores given to the summaries.

Item Type: Conference or Workshop Item (Paper)
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Faculty of Science and Health > Computer Science and Electronic Engineering, School of
Depositing User: Jim Jamieson
Date Deposited: 13 Dec 2016 16:36
Last Modified: 13 Dec 2016 16:36
URI: http://repository.essex.ac.uk/id/eprint/18545

Actions (login required)

View Item View Item