Match of PATSTAT data (2019 spring) and PatentsView (jan 2019) is discussed here, with focus on how this match can help to enrich PATSTAT data with information not contained in USPTO patents (and the other way round).
Advanced Machine Learning for Business Professionals
PATSTAT & Patentsview: complements or substitutes
1. PATSTAT & Patentsview:
complements or
substitutes?
Gianluca Tarasconi – Icrios res. Center / Bocconi University
IVth Summer School KUL – EPO, Vienna 25-27 September 2019
8. patstat patentsview
both applications and granted patents only granted patents
patents from 1790 patents published after 1975
designs only after 2001 Designs since 1975
plants after 2001 Plants since first filing
'statutory invention registration' missing
publishing patent applications on which they no longer felt they could get patents.
By publishing the patent applications, they helped ensure that the inventions were in the public domain
and no one else could subsequently get a patent on them
9. id type number country date abstract title kind num_claims filename withdrawn
10000000 'utility' '10000000' 'US' 2018-06-19
'A frequency modulated
(coherent) laser detection and
ranging system includes […]
'Coherent LADAR
using intra-pixel
quadrature detection' 'B2' '20' 'ipg180619.xml' 'NULL'
PAT_PUBLN_ID PUBLN_AUTH PUBLN_NR PUBLN_KIND publn_nr_original APPLN_ID PUBLN_DATE PUBLN_LG PUBLN_FIRST_GRANT PUBLN_CLAIMS
496002459 'US' '10000000' ‘ B2' '10000000' 469404924 '2018-06-19' 'en' 'Y' '20'
12. matched total perc.
# appln_id in patstat with at least on applicant
matched: 5.103.946 6.670.491 77%
# person_id in patstat with us appln_id matched: 594.366 5.728.768 10%
# person_id in all companies with us appln_id
matched: 594.366 1.344.980 44%
# person_id in all matched us appln_id matched: 594.366 3.315.186 18%
# person_id in all companies with matched us
appln_id matched: 594.366 786.982 76%
number assignee_id in PW matched: 418.701 504.293 83%
number appln_id in patstat with at least on inventor
matched: 3.298.009 6.670.491 49%
number person_id with us appln_id in patstat
matched: 1.372.498 9.767.430 14%
number person_id with matched us appln_id in
patstat matched: 1.372.498 6.047.985 23%
Get high recall
from PW to
PATSTAT since
PW tables are
disambiguated
13.
14. assignee_id organization # patents PERSON_NAME # appln_id
org_00EVa99OXRcLoEeKomIa
American Telephone and Telegraph
Company, AT&T Bell Laboratories 831American Telephone and Telegraph Company, AT&T Bell Laboratories 587
American Telephone and Telegraph Company AT&T Bell Laboratories 74
American Telephone and Telegraph Company, AT&T Technologies, Inc. 32
American Telephone & Telegraph Company, AT&T Bell Laboratories 17
AT&T Technologies, Inc. 13
American Telephone and Telegraph Company, AT&T Information Systems 10
American Telephone and Telegraph Company and AT&T Information Systems 7
American Telephone and Telegraph Company AT&T Technologies, Inc. 7
American Telephone and Telegraph Co., AT&T Bell Laboratories 6
American Telephone and Telegraph Company, AT&T Bell Labs 6
American Telephone and Telegraph Company, AT&T Laboratories 6
American Telephone and Telegraph Co., AT&T Bell Labs 5
American Telephone & Telegraph Co., AT&T Bell Laboratories 4
American Telephone and Telegraph Company, AT&T Technologies Inc. 3
AT&T Bell Laboratories 3
American Telephone & Telegraph Co., AT&T Bell Labs 2
American Telephone & Telegraph Company 2
American Telephone & Telegraph Company AT&T Bell Laboratories 2
American Telephone and Telegraph Co., AT&T Bell Labs. 2
American Telephone and Telegraph Company - AT&T Information Systems 2
American Telephone and Telegraph Company and AT&T Bell Laboratories 2
American Telephone and Telgraph Company, AT&T Bell Laboratories 2
+ 37 other spellings with one patent 37
First line matches from PATSTAT; the remaining spelling are missing; that’s why we
have high recall PATSTAT to PV, but not the other way round
15. Inventors
improve much
still room to do
better…
matched total perc.
# person_id in all companies with matched us appln_id
matched: 729.945 786.982 93%
# appln_id in patstat with at least on applicant matched: 5.624.070 6.670.491 84%
number assignee_id in PV matched: 425.761 504.293 84%
number appln_id in patstat with at least on inventor matched: 4.802.753 6.670.491 72%
number person_id with matched us appln_id in patstat
matched: 3.870.710 6.047.985 64%
number inventor_id in PV matched: 744.893 767.931 97%