Difference between revisions of "USPTO Patent Assignment Dataset"
Revision as of 12:30, 15 November 2017
USPTO Patent Assignment Dataset | |
---|---|
Project Information | |
Project Title | USPTO Patent Assignment Dataset |
Owner | Ed Egan |
Start Date | |
Deadline | |
Keywords | Data |
Primary Billing | |
Notes | |
Has project status | Active |
Copyright © 2016 edegan.com. All Rights Reserved. |
This project describes the build-out and basic use of the USPTO Patent Assignment Dataset.
The data is described in a USPTO Economic Working Paper by Marco, Myers, Graham and others: https://www.uspto.gov/sites/default/files/documents/USPTO_Patents_Assignment_Dataset_WP.pdf
Pre-load checks
The data is large, and the main dbase server does not have room for it. Its postgres partition is 94% full, with only 15G free:
 df -h
 /dev/nvme1n1p2 235G 208G 15G 94% /var/postgresql
Note: To check dbase space usage on the dbase server see Posgres_Server_Configuration#Size.2C_Backup_.26_Restore.
The postgres dbase on the RDP, however, currently has more than 300GB free and is on a solid-state drive, so its performance should be acceptable.
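The free-space check above can be scripted so that it runs before a load starts. This is a minimal sketch, not part of the project's build: the helper names avail_kb and has_space are hypothetical, and the 100 GB threshold in the usage note is a placeholder, since the size of the dataset is not stated here. It relies only on the POSIX output format of df -Pk, where the fourth column of the second line is the available space in kilobytes.

```shell
#!/bin/sh
# Hypothetical helper: read `df -Pk` output on stdin and print the
# "Available" column (in KB) from the first filesystem line.
# POSIX columns: Filesystem 1024-blocks Used Available Capacity Mounted-on
avail_kb() {
    awk 'NR == 2 { print $4 }'
}

# Hypothetical helper: succeed (exit status 0) if the filesystem holding
# path $1 has at least $2 GB free, and fail otherwise.
has_space() {
    need_kb=$(( $2 * 1024 * 1024 ))       # convert GB to KB
    have_kb=$(df -Pk "$1" | avail_kb)     # free KB on that filesystem
    [ "$have_kb" -ge "$need_kb" ]
}
```

For example, has_space /var/postgresql 100 && echo "enough room" || echo "not enough room" would test the partition shown above against the placeholder threshold. Substitute the mount point of whichever postgres data directory is the load target.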