HistPatent Table
Table Structure
        Table "public.histpatent"
     Column      |       Type        | Modifiers
-----------------+-------------------+-----------
 applicationid   | integer           |
 pubno           | character varying |
 patentnumber    | character varying |
 nber            | integer           |
 uspc            | character varying |
 uspc_sub        | character varying |
 applicationdate | date              |
 prioritydate    | date              |
 pubdate         | date              |
 displaydate     | date              |
 disptype        | character varying |
 exp_dt          | date              |
 exp_dt_max      | date              |
 pta             | integer           |
Example of table entries:
 applicationid | pubno           | patent   | nber | uspc | uspc_sub | applicationdate | prioritydate | pubdate    | dispdate   | disptype | exp_dt     | exp_dt_max | pta
---------------+-----------------+----------+------+------+----------+-----------------+--------------+------------+------------+----------+------------+------------+-----
      10035634 | US20020056924A1 | -357336  |   46 | 257  | 784      | 2001-10-26      | 2001-10-26   | 2002-05-16 | 2002-12-03 | ABN      |            |            |   0
      10035996 | US20020059513A1 | -75798   |   25 | 703  | 027      | 2001-11-09      | 2001-11-09   | 2002-05-16 | 2006-09-12 | ABN      |            |            |   0
      10000241 | US20020035749A1 | 6463595  |   65 | 4    | 213      | 2001-10-18      | 2000-01-14   | 2002-03-28 | 2002-10-15 | ISS      | 2020-01-13 |            |   0
      10005395 | US20020035760A1 | 6460212  |   69 | 14   | 71       | 2001-12-03      | 1993-10-04   | 2002-03-28 | 2002-10-08 | ISS      | 2014-10-08 |            |   0
      10002827 | US20020035761A1 | 6588042  |   69 | 15   | 22       | 2001-11-15      | 1998-07-30   | 2002-03-28 | 2003-07-08 | ISS      | 2018-07-29 |            |   0
      10000979 | US20020035795A1 | 6763614  |   63 | 36   | 50       | 2001-10-30      | 2000-06-27   | 2002-03-28 | 2004-07-20 | ISS      | 2020-06-26 |            |   0
      10003664 | US20020035797A1 | 6671983  |   64 | 37   | 222      | 2001-10-23      | 1998-08-14   | 2002-03-28 | 2004-01-06 | ISS      | 2019-04-02 |            | 232
      10002674 | US20020035805A1 | 6557299  |   59 | 49   | 42       | 2001-10-30      | 2001-10-30   | 2002-03-28 | 2003-05-06 | ISS      | 2021-10-29 |            |   0
Table Variables
applicationid (the application number) and pubno (the publication number) identify the application in question. patentnumber (shown as patent in the example output) is the patent number.

nber is the two-digit NBER code, which assigns a patent to one of the sub-categories of granted utility patents developed by the NBER. uspc is the U.S. Patent Classification (USPC) class, which groups patents by similar subject matter. uspc_sub is the USPC subclass, which subdivides patents within the same class based on structural, procedural, or functional differences. [1]

applicationdate is the date the application was filed. prioritydate is the date of the earliest application filing that is given priority in determining the length of enforceability for the patent. pubdate is the date the application was published; the abandoned applications in the example output also carry a pubdate. displaydate (dispdate in the example output) is the disposal date for the application, and disptype gives the application status: ABN stands for abandoned, ISS for issued, and PEN for pending.

exp_dt is the expiration date for the patent. pta stands for patent term adjustment, which extends the period during which a patent is in force; it is counted in days. [2]
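The example entries hint at how prioritydate, pta, and exp_dt relate. The sketch below tests one guess against three of those rows: exp_dt equals prioritydate plus 20 years, minus one day, plus pta days. This rule is an assumption read off the example output, not something the table documentation states, and the helper names add_years and estimated_expiration are made up for illustration.

```python
from datetime import date, timedelta


def add_years(d: date, years: int) -> date:
    """Shift a date by whole calendar years (Feb 29 falls back to Feb 28)."""
    try:
        return d.replace(year=d.year + years)
    except ValueError:
        return d.replace(year=d.year + years, day=28)


def estimated_expiration(prioritydate: date, pta_days: int, term_years: int = 20) -> date:
    """ASSUMED rule inferred from the example rows, not a documented derivation."""
    return add_years(prioritydate, term_years) - timedelta(days=1) + timedelta(days=pta_days)


# (prioritydate, pta, exp_dt) copied from the example table entries.
SAMPLE_ROWS = [
    (date(2000, 1, 14), 0, date(2020, 1, 13)),
    (date(1998, 8, 14), 232, date(2019, 4, 2)),
    (date(1993, 10, 4), 0, date(2014, 10, 8)),
]

for priority, pta, exp_dt in SAMPLE_ROWS:
    guess = estimated_expiration(priority, pta)
    status = "match" if guess == exp_dt else f"differs (estimate {guess})"
    print(priority, pta, exp_dt, status)
```

The estimate reproduces exp_dt for the first two rows but not for the row with priority date 1993-10-04 (application 10005395), so exp_dt evidently depends on more than this simple rule.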
Table Purpose
The Histpatent Table will primarily be used to match publication numbers with the Citation Table. The priority date may be used to determine first inventor to file.
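Matching on pubno usually requires both sides to use the same textual form. The sketch below splits a publication number of the shape seen in the example entries (US, a four-digit year, a seven-digit serial, and a kind code such as A1) into its parts. The Citation Table's format is not described here, so the match_key normalization, which drops the kind code, is purely hypothetical, as are the names PubNo, parse_pubno, and match_key.

```python
import re
from typing import NamedTuple, Optional

# Shape observed in the example entries, e.g. US20020056924A1.
PUBNO_RE = re.compile(r"^(US)(\d{4})(\d{7})([A-Z]\d)$")


class PubNo(NamedTuple):
    country: str
    year: int
    serial: str  # kept as text so leading zeros survive
    kind: str


def parse_pubno(pubno: str) -> Optional[PubNo]:
    """Return the parts of a publication number, or None if the shape differs."""
    m = PUBNO_RE.match(pubno.strip().upper())
    if m is None:
        return None
    country, year, serial, kind = m.groups()
    return PubNo(country, int(year), serial, kind)


def match_key(pubno: str) -> Optional[str]:
    """Hypothetical join key: the publication number without its kind code."""
    parsed = parse_pubno(pubno)
    if parsed is None:
        return None
    return f"{parsed.country}{parsed.year}{parsed.serial}"


print(parse_pubno("US20020056924A1"))
print(match_key(" us20020056924a1 "))
```

Keeping the serial as a string avoids the dropped-leading-zero problem that affects the integer columns.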
Current Problems
The USPTO bulk data contains negative patent numbers.
 applicationid | pubno | patent   | nber | uspc | uspc_sub | applicationdate | prioritydate | pubdate | displaydate | disptype | exp_dt     | exp_dt_max | pta
---------------+-------+----------+------+------+----------+-----------------+--------------+---------+-------------+----------+------------+------------+-----
       8466602 |       | -2037052 |   23 | 347  | 259      | 1995-06-06      | 1995-06-06   |         | 1997-06-19  | ABN      |            |            |   0
       8605804 |       | -2962913 |   70 | 395  | 500      | 1996-02-23      | 1996-02-23   |         | 1998-08-22  | ABN      |            |            |   0
               |       | 1332054  |   63 | 112  | 153      |                 |              |         | 1920-02-24  | ISS      | 1937-02-23 |            |   0
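One way to keep negative values out of downstream joins is to treat them as missing. The sketch below does that under a stated assumption: negative patent numbers are placeholders rather than real identifiers (the source does not say what they mean; in the rows shown they accompany abandoned applications). The function name clean_patent_number is hypothetical.

```python
from typing import Optional


def clean_patent_number(raw: Optional[str]) -> Optional[str]:
    """Return a usable patent number, or None for blank or non-positive values.

    ASSUMPTION: negative numbers are placeholders, not real patent numbers.
    """
    if raw is None:
        return None
    text = raw.strip()
    if not text:
        return None
    try:
        value = int(text)
    except ValueError:
        # Not a plain integer (for example, it contains letters): leave as-is.
        return text
    return str(value) if value > 0 else None


for raw in ["-2037052", "1332054", "", None]:
    print(repr(raw), "->", repr(clean_patent_number(raw)))
```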
Application numbers are 8 digits long (99/999999): the first two digits are the series code, and the last six are the serial number assigned by the USPTO. applicationid is an integer column, so leading zeros are not preserved, and just over 3 million rows in the histpatent table hold an application number with fewer than 8 digits. Leading zeros were also dropped for patent numbers.
patent=# SELECT COUNT(*) FROM histpatent WHERE applicationid > 9999999;
  count
---------
 4048050
(1 row)

patent=# SELECT COUNT(*) FROM histpatent WHERE applicationid <9999999;
  count
---------
 3028846
(1 row)
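Restoring the dropped zeros amounts to left-padding the integer to 8 digits and splitting it into series code and serial number. A minimal sketch, assuming the stored integers are otherwise correct; the function names are hypothetical.

```python
def lost_leading_zero(applicationid: int) -> bool:
    """True when the stored integer has fewer than 8 digits."""
    return applicationid < 10_000_000


def format_application_number(applicationid: int) -> str:
    """Render an integer applicationid in the 99/999999 form."""
    if not 0 < applicationid < 100_000_000:
        raise ValueError(f"not an 8-digit application number: {applicationid}")
    padded = f"{applicationid:08d}"  # restore dropped leading zeros
    return f"{padded[:2]}/{padded[2:]}"


# applicationid values taken from the tables on this page.
for appid in (8466602, 10035634):
    print(appid, lost_leading_zero(appid), format_application_number(appid))
```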
LexJudge
There are duplicate entries for certain judges that may be dropped once data on patent litigation has been added.
 name                   | court | count
------------------------+-------+-------
 Malcolm Jones Howard   | EDNC  |     2
 Richard Leroy Williams | EDVa  |     2
 Peter Jo Messitte      | DMd   |     2
 Andre Maurice Davis    | DMd   |     2
 Paula Xinis            | DMd   |     2
 Mary Hannah Lauck      | EDVa  |     2
 Julie E. Carnes        | NDGa  |     2
 Claude M. Hilton       | EDVa  |     2
 William D. Quarles Jr. | DMd   |     2
 James C. Dever III     | EDNC  |     2
 Catherine C. Blake     | DMd   |     2
 Gerald Bruce Lee       | EDVa  |     2
 Leonie M. Brinkema     | EDVa  |     2
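The listing above counts rows per (name, court) pair. The sketch below reproduces that count and shows one way to drop the repeats, assuming a judge is identified by name and court alone. The LexJudge table's other columns are not shown here, so this is an illustration and not the actual cleanup; the "Example Judge" row is invented to give the sample a non-duplicate.

```python
from collections import Counter
from typing import Dict, Iterable, List, Tuple

Row = Tuple[str, str]  # (name, court)


def find_duplicates(rows: Iterable[Row]) -> Dict[Row, int]:
    """Return each (name, court) pair that occurs more than once, with its count."""
    counts = Counter((name.strip(), court.strip()) for name, court in rows)
    return {key: n for key, n in counts.items() if n > 1}


def drop_duplicates(rows: Iterable[Row]) -> List[Row]:
    """Keep the first occurrence of each (name, court) pair, preserving order."""
    seen = set()
    unique = []
    for name, court in rows:
        key = (name.strip(), court.strip())
        if key not in seen:
            seen.add(key)
            unique.append(key)
    return unique


rows = [
    ("Paula Xinis", "DMd"),
    ("Claude M. Hilton", "EDVa"),
    ("Paula Xinis", "DMd"),
    ("Example Judge", "EDNC"),  # hypothetical row, not from the table
    ("Claude M. Hilton", "EDVa"),
]
print(find_duplicates(rows))
print(drop_duplicates(rows))
```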
PatentId
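Many values in the patentid table are not purely numeric. The first query below counts values that contain at least one uppercase letter. The second counts values that contain at least one character other than an uppercase letter, so it includes purely numeric values as well as values that mix letters and digits.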
SELECT COUNT(*) FROM patentid WHERE patentid ~ '[A-Z]';
 count
--------
 775982

SELECT COUNT(*) FROM patentid WHERE patentid ~ '[^A-Z]';
  count
----------
 27294418
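Both patterns above are unanchored, so a value that mixes letters and digits satisfies both of them. The sketch below mirrors the two tests with Python's re.search, which is also unanchored, and adds an anchored digits-only check for comparison. The sample identifiers other than 6463595 are invented for illustration, and the classify helper is hypothetical.

```python
import re

# 6463595 appears in histpatent above; the other values are made-up examples.
samples = ["6463595", "D345678", "ABC", "12-34"]

# Unanchored, like the two queries: "contains at least one such character".
contains_letter = [s for s in samples if re.search(r"[A-Z]", s)]
contains_non_letter = [s for s in samples if re.search(r"[^A-Z]", s)]


def classify(patentid: str) -> str:
    """Anchored classification: the whole value must fit the pattern."""
    if re.fullmatch(r"[0-9]+", patentid):
        return "digits_only"
    if re.fullmatch(r"[A-Z]+", patentid):
        return "letters_only"
    return "mixed_or_other"


print("contains a letter:    ", contains_letter)
print("contains a non-letter:", contains_non_letter)
print({s: classify(s) for s in samples})
```

An anchored pattern such as ^[0-9]+$ is the usual way to count values made up of digits only.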