Text matching to measure patent similarity |
| |
Authors: | Sam Arts Bruno Cassiman Juan Carlos Gomez |
| |
Affiliation: | 1. Faculty of Economics and Business, KU Leuven, Antwerp, Belgium;2. IESE Business School, KU Leuven and CEPR, Barcelona, Spain;3. Department of Electronics Engineering, University of Guanajuato Campus Irapuato‐Salamanca, Salamanca, Mexico |
| |
Abstract: | Research Summary : We propose using text matching to measure the technological similarity between patents. Technology experts from different fields validate the new similarity measure and its improvement on measures based on the United States Patent Classification System, and identify its limitations. As an application, we replicate prior findings on the localization of knowledge spillovers by constructing a case–control group of text‐matched patents. We also provide open access to the code and data to calculate the similarity between any two utility patents granted by the United States Patent and Trademark Office between 1976 and 2013, or between any two patent portfolios. Managerial Summary : We propose using text matching to measure the technological similarity between patents. The method can be used by various practitioners such as inventors, attorneys, patent examiners, and managers to search for closely related prior art, to assess the novelty of a patent, to identify R&D opportunities in less crowded areas, to detect in‐ or out‐licensing opportunities, to map companies in technology space, and to find acquisition targets. We use an expert panel to validate the improvement of the new similarity measure on measures based on the United States Patent Classification System, and provide open access to the code and data to calculate the similarity between any two utility patents granted by the USPTO between 1976 and 2013, or between any two patent portfolios. |
| |
Keywords: | matching patent patent classification technological similarity text mining |
|
|