Difference between revisions of "HistPatent Table"

From edegan.com
Jump to navigation Jump to search
Line 37: Line 37:
  
 
==Table Purpose==
 
==Table Purpose==
The Histpatent Table will be primarily used to match publication numbers with the Citation Table.  
+
The Histpatent Table will primarily be used to match publication numbers with the Citation Table.
  
 
==Current Problems==
 
==Current Problems==
 
{{#section:Patent_Data_Issues|histpatent}}
 
{{#section:Patent_Data_Issues|histpatent}}

Revision as of 11:36, 21 July 2016

Return to Patent Data Specifications.

Table Structure

              Table "public.histpatent"
    Column      |       Type        | Modifiers 
----------------+-------------------+-----------
applicationid   | integer           | 
pubno           | character varying | 
patentnumber    | character varying | 
nber            | integer           | 
uspc            | character varying | 
uspc_sub        | character varying | 
applicationdate | date              | 
prioritydate    | date              | 
pubdate         | date              | 
displaydate     | date              | 
disptype        | character varying | 
exp_dt          | date              | 
exp_dt_max      | date              | 
pta             | integer           | 

Example of table entries:

applicationid |      pubno      |  patent  | nber | uspc | uspc_sub | applicationdate | prioritydate |  pubdate   |   dispdate  | disptype |   exp_dt   | exp_dt_max | pta 
--------------+-----------------+----------+------+------+----------+-----------------+--------------+------------+-------------+----------+------------+------------+-----
     10035634 | US20020056924A1 | -357336  |   46 | 257  | 784      | 2001-10-26      | 2001-10-26   | 2002-05-16 | 2002-12-03  | ABN      |            |            |   0
     10035996 | US20020059513A1 | -75798   |   25 | 703  | 027      | 2001-11-09      | 2001-11-09   | 2002-05-16 | 2006-09-12  | ABN      |            |            |   0
     10000241 | US20020035749A1 | 6463595  |   65 | 4    | 213      | 2001-10-18      | 2000-01-14   | 2002-03-28 | 2002-10-15  | ISS      | 2020-01-13 |            |   0
     10005395 | US20020035760A1 | 6460212  |   69 | 14   | 71       | 2001-12-03      | 1993-10-04   | 2002-03-28 | 2002-10-08  | ISS      | 2014-10-08 |            |   0
     10002827 | US20020035761A1 | 6588042  |   69 | 15   | 22       | 2001-11-15      | 1998-07-30   | 2002-03-28 | 2003-07-08  | ISS      | 2018-07-29 |            |   0
     10000979 | US20020035795A1 | 6763614  |   63 | 36   | 50       | 2001-10-30      | 2000-06-27   | 2002-03-28 | 2004-07-20  | ISS      | 2020-06-26 |            |   0
     10003664 | US20020035797A1 | 6671983  |   64 | 37   | 222      | 2001-10-23      | 1998-08-14   | 2002-03-28 | 2004-01-06  | ISS      | 2019-04-02 |            | 232
     10002674 | US20020035805A1 | 6557299  |   59 | 49   | 42       | 2001-10-30      | 2001-10-30   | 2002-03-28 | 2003-05-06  | ISS      | 2021-10-29 |            |   0

Table Variables

The applicationid (also referred to as an application number) and pubno (patent publication number) correspond to the patent in question. Patent refers to patent number. The NBER two digit code assigns a patent to a sub-category of granted utility patents developed by the NBER. The USPC (U.S. Patent Classification) codes categorize patents based on similar subject matter. USPC_sub codes subdivide patents within the same USPC class into sub-categories based on structural, procedural, or functional differences. [1] The application date is the date the application was filed. THE pubdate is the date the patent was issued. The dispdate is the disposal date for the application and disptype gives the application status. ABN stands for abandoned. ISS stands for issued. PEN stands for pending. The priority date is the date of the earliest application filing that is given priority in determining the length of enforceability for the patent. Exp_date is the expiration date for the patent. Pta stands for patent term adjustment, which allows for a patent to be in force for a longer period of time, and extensions are counted in days. [2]

Table Purpose

The Histpatent Table will primarily be used to match publication numbers with the Citation Table.

Current Problems

The USPTO bulk data contains negative patent numbers.

applicationid | pubno |  patent  | nber | uspc | uspc_sub | applicationdate | prioritydate | pubdate | displaydate | disptype |   exp_dt   | exp_dt_max | pta  
--------------+-------+----------+------+------+----------+-----------------+--------------+---------+-------------+----------+------------+------------+------
      8466602 |       | -2037052 |   23 | 347  | 259      | 1995-06-06      | 1995-06-06   |         | 1997-06-19  | ABN      |            |            |    0
      8605804 |       | -2962913 |   70 | 395  | 500      | 1996-02-23      | 1996-02-23   |         | 1998-08-22  | ABN      |            |            |    0
              |       | 1332054  |   63 | 112  | 153      |                 |              |         | 1920-02-24  | ISS      | 1937-02-23 |            |    0

Application numbers are 8 digits long (99/999999). The first two digits are a series code and the last six represent a serial code assigned by the USPTO. The histpatent table has over 3 million application numbers with 7 digits, since leading zeros were dropped. Leading zeros were also dropped for patent numbers.

patent=# SELECT COUNT(*) FROM histpatent WHERE applicationid > 9999999; 
 count  
---------
4048050
(1 row)
patent=# SELECT COUNT(*) FROM histpatent WHERE applicationid <9999999; 
 count  
---------
3028846
(1 row)


LexJudge

There are duplicate entries for certain judges that may be dropped once data on patent litigation has been added.

            name             | court | count 
-----------------------------+-------+-------
Malcolm Jones Howard         | EDNC  |     2
Richard Leroy Williams       | EDVa  |     2
Peter Jo Messitte            | DMd   |     2
Andre Maurice Davis          | DMd   |     2
Paula Xinis                  | DMd   |     2
Mary Hannah Lauck            | EDVa  |     2
Julie E. Carnes              | NDGa  |     2
Claude M. Hilton             | EDVa  |     2
William D. Quarles Jr.       | DMd   |     2
James C. Dever III           | EDNC  |     2
Catherine C. Blake           | DMd   |     2
Gerald Bruce Lee             | EDVa  |     2
Leonie M. Brinkema           | EDVa  |     2


PatentId

The patentid table has many entries that do not consist of only integers.

SELECT COUNT(*) FROM patentid WHERE patentid ~ '[A-Z]'; 
 count  
-------
775982
SELECT COUNT(*) FROM patentid WHERE patentid ~ '[^A-Z]';
 count   
--------
27294418