Baroni, Marco and Murphy, Brian and Barbu, Eduard and Poesio, Massimo (2010) Strudel: A Corpus‐Based Semantic Model Based on Properties and Types. Cognitive Science, 34 (2). pp. 222-254. DOI https://doi.org/10.1111/j.1551-6709.2009.01068.x
Baroni, Marco and Murphy, Brian and Barbu, Eduard and Poesio, Massimo (2010) Strudel: A Corpus‐Based Semantic Model Based on Properties and Types. Cognitive Science, 34 (2). pp. 222-254. DOI https://doi.org/10.1111/j.1551-6709.2009.01068.x
Baroni, Marco and Murphy, Brian and Barbu, Eduard and Poesio, Massimo (2010) Strudel: A Corpus‐Based Semantic Model Based on Properties and Types. Cognitive Science, 34 (2). pp. 222-254. DOI https://doi.org/10.1111/j.1551-6709.2009.01068.x
Abstract
<jats:title>Abstract</jats:title><jats:p>Computational models of meaning trained on naturally occurring text successfully model human performance on tasks involving simple similarity measures, but they characterize meaning in terms of undifferentiated bags of words or topical dimensions. This has led some to question their psychological plausibility (<jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="#b48">Murphy, 2002;</jats:ext-link><jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="#b66">Schunn, 1999</jats:ext-link>). We present here a fully automatic method for extracting a structured and comprehensive set of concept descriptions directly from an English part‐of‐speech‐tagged corpus. Concepts are characterized by weighted properties, enriched with concept–property types that approximate classical relations such as hypernymy and function. Our model outperforms comparable algorithms in cognitive tasks pertaining not only to concept‐internal structures (discovering properties of concepts, grouping properties by property type) but also to inter‐concept relations (clustering into superordinates), suggesting the empirical validity of the property‐based approach.</jats:p>
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Corpus-based semantic models; Conceptual knowledge induction; Property-based concept representations |
Subjects: | P Language and Literature > P Philology. Linguistics Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Faculty of Science and Health Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 05 Mar 2013 16:13 |
Last Modified: | 30 Oct 2024 16:43 |
URI: | http://repository.essex.ac.uk/id/eprint/5547 |