308 bytes added, 14:05, 22 March 2016
schema file, in this case "us-patent-application-v44-2014-04-03.dtd" to be able to extract each of the 4 object types from the Patents.
If an error occurs while parsing a file, that file is moved to a directory called "failed_files". A file that fails parsing is most likely not a utility patent.
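The failure handling described above can be sketched roughly as follows. This is only an illustration in Python, not the parser this page documents: the function name `parse_or_quarantine` is hypothetical, and the sketch checks only that the XML is well formed (it does not validate against the DTD schema file).

```python
import shutil
import xml.etree.ElementTree as ET
from pathlib import Path


def parse_or_quarantine(xml_path, failed_dir="failed_files"):
    """Parse one XML file; on a parse error, move it into failed_dir.

    Returns the root element on success, or None if the file was moved.
    Hypothetical helper: only well-formedness is checked, not the DTD.
    """
    xml_path = Path(xml_path)
    try:
        return ET.parse(xml_path).getroot()
    except ET.ParseError:
        # Keep the bad file for later inspection instead of deleting it.
        failed = Path(failed_dir)
        failed.mkdir(parents=True, exist_ok=True)
        shutil.move(str(xml_path), str(failed / xml_path.name))
        return None
```

Moving the file rather than deleting it keeps it available for checking afterwards whether it really was something other than a utility patent.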
 
====About the Harvard Dataverse====
The patents from 1975-2010, loaded as .sqlite3 and .csv files, can be found at
 
[https://dataverse.harvard.edu/dataset.xhtml?persistentId=hdl:1902.1/15705 Harvard Dataverse]
 
I have also downloaded all of them onto the database server, where they can be found at
cd /bulk/patent
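
The table names inside the .sqlite3 files are not listed on this page, so a reasonable first step is to ask a file which tables it contains. The following is a minimal Python sketch using the standard `sqlite3` module; the function name `list_tables` is hypothetical, and the path passed in must be the actual name of one of the downloaded files.

```python
import sqlite3
from contextlib import closing
from pathlib import Path


def list_tables(db_path):
    """Return the names of all tables in a SQLite file, sorted by name.

    The file is opened read-only, so a mistyped path raises an error
    instead of silently creating an empty database.
    """
    uri = Path(db_path).resolve().as_uri() + "?mode=ro"
    with closing(sqlite3.connect(uri, uri=True)) as conn:
        rows = conn.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        ).fetchall()
    return [name for (name,) in rows]
```

Once the table names are known, individual tables can be inspected with ordinary SELECT queries.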