Semantically-Based Patent Thicket Identification

Gatkowski, Mateusz and Dietl, Marek and Skrok, Lukasz and Whalen, Ryan and Rockett, Katharine (2020) Semantically-Based Patent Thicket Identification. Research Policy, 49 (2). p. 103925. DOI https://doi.org/10.1016/j.respol.2020.103925

Abstract

Patent thickets have been identified as a major stumbling block in the development of new technologies, creating the need to accurately identify thicket membership. Various citations-based methodologies (Graevenitz et al, 2011; Clarkson, 2005) have been proposed, which have relied on broad survey results (Cohen et al, 2000) for validation. Expert evaluation is an alternative direct method of judging thicket membership at the individual patent level. While this method potentially is robust to drafting and jurisdictional differences in patent design, it is also costly to use on a large scale. We employ a natural language processing technique, which does not carry these large costs, to proxy expert views closely. Furthermore, we investigate the relation between our semantic measure and citation based measures, finding them quite distinct. We then combine a variety of thicket indicators into a statistical model to assess the probability that a newly added patent belongs to a thicket. We also study the role each measure plays, as part of creating a prospective screening model that could improve efficiency of the patent system, in response to Lemley (2001).

Item Metadata

Item Type:	Article
Uncontrolled Keywords:	Patent thicket; Intellectual Property; Semantic Distance; Latent Semantic Analysis; Natural Language Processing; Complexity
Divisions:	Faculty of Social Sciences Faculty of Social Sciences > Economics, Department of
SWORD Depositor:	Unnamed user with email elements@essex.ac.uk
Depositing User:	Unnamed user with email elements@essex.ac.uk
Date Deposited:	31 Jan 2020 16:35
Last Modified:	16 Aug 2025 03:42
URI:	http://repository.essex.ac.uk/id/eprint/26505

Available files

Accepted Version

Filename: RIS version.pdf

Licence: Creative Commons: Attribution-Noncommercial-No Derivative Works 3.0

Download

Semantically-Based Patent Thicket Identification

Abstract

Item Metadata

Share and export

Available files

Accepted Version

Statistics

Altmetrics

Downloads