HistPatent Table
Latest revision as of 16:35, 2 September 2016
Table Purpose
The HistPatent Table will primarily be used to match publication numbers with the Citation Table. The priority date may be used to determine the first inventor to file under the standard introduced by the America Invents Act. Additionally, the patent term adjustment data may provide insight into the effects of litigation on a patent. The USPC codes may be helpful when patents in a particular subject matter or technical field are of interest.
Table Structure
         Table "public.histpatent"
     Column      |       Type        | Modifiers
-----------------+-------------------+-----------
 applicationid   | integer           |
 pubno           | character varying |
 patentnumber    | character varying |
 nber            | integer           |
 uspc            | character varying |
 uspc_sub        | character varying |
 applicationdate | date              |
 prioritydate    | date              |
 pubdate         | date              |
 displaydate     | date              |
 disptype        | character varying |
 exp_dt          | date              |
 exp_dt_max      | date              |
 pta             | integer           |
Example of table entries:
 applicationid |      pubno      |  patent  | nber | uspc | uspc_sub | applicationdate | prioritydate |  pubdate   |  dispdate  | disptype |   exp_dt   | exp_dt_max | pta
---------------+-----------------+----------+------+------+----------+-----------------+--------------+------------+------------+----------+------------+------------+-----
      10035634 | US20020056924A1 | -357336  |   46 | 257  | 784      | 2001-10-26      | 2001-10-26   | 2002-05-16 | 2002-12-03 | ABN      |            |            |   0
      10035996 | US20020059513A1 | -75798   |   25 | 703  | 027      | 2001-11-09      | 2001-11-09   | 2002-05-16 | 2006-09-12 | ABN      |            |            |   0
      10000241 | US20020035749A1 | 6463595  |   65 | 4    | 213      | 2001-10-18      | 2000-01-14   | 2002-03-28 | 2002-10-15 | ISS      | 2020-01-13 |            |   0
      10005395 | US20020035760A1 | 6460212  |   69 | 14   | 71       | 2001-12-03      | 1993-10-04   | 2002-03-28 | 2002-10-08 | ISS      | 2014-10-08 |            |   0
      10002827 | US20020035761A1 | 6588042  |   69 | 15   | 22       | 2001-11-15      | 1998-07-30   | 2002-03-28 | 2003-07-08 | ISS      | 2018-07-29 |            |   0
      10000979 | US20020035795A1 | 6763614  |   63 | 36   | 50       | 2001-10-30      | 2000-06-27   | 2002-03-28 | 2004-07-20 | ISS      | 2020-06-26 |            |   0
      10003664 | US20020035797A1 | 6671983  |   64 | 37   | 222      | 2001-10-23      | 1998-08-14   | 2002-03-28 | 2004-01-06 | ISS      | 2019-04-02 |            | 232
      10002674 | US20020035805A1 | 6557299  |   59 | 49   | 42       | 2001-10-30      | 2001-10-30   | 2002-03-28 | 2003-05-06 | ISS      | 2021-10-29 |            |   0
Table Variables
The applicationid (also referred to as the application number) and pubno (patent publication number) identify the application in question. The patent column holds the patent number.

The nber column holds the two-digit NBER code, which assigns a patent to a sub-category of granted utility patents developed by the NBER. The uspc (U.S. Patent Classification) codes categorize patents based on similar subject matter. The uspc_sub codes subdivide patents within the same USPC class into sub-categories based on structural, procedural, or functional differences. [http://www.uspto.gov/sites/default/files/patents/resources/classification/overview.pdf]

The applicationdate is the date the application was filed. The pubdate is the date the application was published, that is, the date associated with pubno. The dispdate is the disposal date for the application, and disptype gives the application status: ABN (abandoned), ISS (issued), or PEN (pending). For rows with disptype ISS, the dispdate appears to be the issue date. The prioritydate is the date of the earliest application filing that is given priority in determining the length of enforceability for the patent.

The exp_dt is the expiration date for the patent. The pta is the patent term adjustment, counted in days, which extends the period for which a patent is in force. [http://www.uspto.gov/patent/laws-and-regulations/american-inventors-protection-act-1999/patent-term-guarantee-overview]
The applicationid, pubno, or patent may be used as unique identifiers for the table.
Current Problems
The USPTO bulk data contains negative patent numbers.
 applicationid | pubno |  patent  | nber | uspc | uspc_sub | applicationdate | prioritydate | pubdate | displaydate | disptype |   exp_dt   | exp_dt_max | pta
---------------+-------+----------+------+------+----------+-----------------+--------------+---------+-------------+----------+------------+------------+-----
       8466602 |       | -2037052 |   23 | 347  | 259      | 1995-06-06      | 1995-06-06   |         | 1997-06-19  | ABN      |            |            |   0
       8605804 |       | -2962913 |   70 | 395  | 500      | 1996-02-23      | 1996-02-23   |         | 1998-08-22  | ABN      |            |            |   0
               |       | 1332054  |   63 | 112  | 153      |                 |              |         | 1920-02-24  | ISS      | 1937-02-23 |            |   0
Application numbers are 8 digits long (99/999999). The first two digits are a series code and the last six are a serial number assigned by the USPTO. Because applicationid is stored as an integer, leading zeros were dropped, and the histpatent table has over 3 million application numbers with 7 or fewer digits. Leading zeros were also dropped for patent numbers.
patent=# SELECT COUNT(*) FROM histpatent WHERE applicationid > 9999999;
  count
---------
 4048050
(1 row)

patent=# SELECT COUNT(*) FROM histpatent WHERE applicationid <9999999;
  count
---------
 3028846
(1 row)
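The dropped leading zeros can be restored when application numbers are formatted for display or for matching against other sources. The following is a minimal Python sketch, not part of the original workflow. The function name `format_application_number` is hypothetical, and the sketch assumes the value is a non-negative integer of at most 8 digits.

```python
def format_application_number(applicationid: int) -> str:
    """Zero-pad an application number to 8 digits and render it as SS/NNNNNN.

    Assumption: the first two digits are the series code and the last six
    are the serial number, as described in the text.
    """
    if applicationid < 0 or applicationid > 99_999_999:
        raise ValueError(f"not a valid 8-digit application number: {applicationid}")
    padded = f"{applicationid:08d}"  # restore leading zeros lost in the integer column
    return f"{padded[:2]}/{padded[2:]}"  # series code / serial number


print(format_application_number(8466602))   # 08/466602
print(format_application_number(10035634))  # 10/035634
```

Padding at display time leaves the integer column unchanged, so existing joins on applicationid keep working.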
LexJudge
There are duplicate entries for certain judges that may be dropped once data on patent litigation has been added.
            name             | court | count
-----------------------------+-------+-------
 Malcolm Jones Howard        | EDNC  |     2
 Richard Leroy Williams      | EDVa  |     2
 Peter Jo Messitte           | DMd   |     2
 Andre Maurice Davis         | DMd   |     2
 Paula Xinis                 | DMd   |     2
 Mary Hannah Lauck           | EDVa  |     2
 Julie E. Carnes             | NDGa  |     2
 Claude M. Hilton            | EDVa  |     2
 William D. Quarles Jr.      | DMd   |     2
 James C. Dever III          | EDNC  |     2
 Catherine C. Blake          | DMd   |     2
 Gerald Bruce Lee            | EDVa  |     2
 Leonie M. Brinkema          | EDVa  |     2
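One way to find and drop such duplicates after exporting the rows is sketched below in Python. This is an illustration only: the full column layout of the LexJudge table is not shown here, so the sketch assumes each row has been reduced to a (name, court) pair, and the helper names `find_duplicates` and `drop_duplicates` are hypothetical.

```python
from collections import Counter


def find_duplicates(rows):
    """Return {(name, court): count} for pairs that occur more than once."""
    counts = Counter((name.strip(), court.strip()) for name, court in rows)
    return {key: n for key, n in counts.items() if n > 1}


def drop_duplicates(rows):
    """Keep the first occurrence of each (name, court) pair, preserving order."""
    seen = set()
    unique = []
    for name, court in rows:
        key = (name.strip(), court.strip())
        if key not in seen:
            seen.add(key)
            unique.append(key)
    return unique


# Small sample built from two of the names in the listing above.
rows = [
    ("Malcolm Jones Howard", "EDNC"),
    ("Paula Xinis", "DMd"),
    ("Malcolm Jones Howard", "EDNC"),
    ("Paula Xinis", "DMd"),
]
print(find_duplicates(rows))
print(drop_duplicates(rows))
```

Keeping the first occurrence is an arbitrary choice; if the duplicate rows differ in other columns, those differences should be reviewed before anything is dropped.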
PatentId
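The patentid table has many entries that contain non-numeric characters. Note that the second query below matches any entry containing at least one character outside A-Z, so it also counts entries that mix letters and digits, and the two counts can overlap.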
SELECT COUNT(*) FROM patentid WHERE patentid ~ '[A-Z]';
 count
--------
 775982

SELECT COUNT(*) FROM patentid WHERE patentid ~ '[^A-Z]';
  count
----------
 27294418
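To separate purely numeric entries from those containing letters, the pattern has to match the whole string rather than any single character. The Python sketch below illustrates the distinction. The function name `classify_patentid` and the sample values are hypothetical and are not taken from the patentid table.

```python
import re

DIGITS_ONLY = re.compile(r"[0-9]+")   # used with fullmatch: the entire string must be digits
HAS_UPPER = re.compile(r"[A-Z]")      # used with search: at least one uppercase letter


def classify_patentid(patentid: str) -> str:
    """Classify an identifier as 'numeric', 'contains_letters', or 'other'."""
    if DIGITS_ONLY.fullmatch(patentid):
        return "numeric"
    if HAS_UPPER.search(patentid):
        return "contains_letters"
    return "other"  # e.g. empty strings, signs, or punctuation


# Hypothetical sample values for illustration only.
for value in ["6463595", "D0123456", "RE12345", "-357336"]:
    print(value, classify_patentid(value))
```

In PostgreSQL the same whole-string test can be written with an anchored pattern such as `patentid ~ '^[0-9]+$'`.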