Changes
Jump to navigation
Jump to search
← Older edit
USPTO Patent Assignment Dataset
(view source)
Revision as of 13:41, 21 September 2020
831 bytes added
,
13:41, 21 September 2020
no edit summary
{{
Project
|Has project output=Data
|Has sponsor=
McNair
Projects
Center
|Has title=USPTO Patent Assignment Dataset
|Has owner=Ed Egan,
The load script is:
LoadUSPTOPAD.sql
To get the data into ASCII or ASCII, move it to the dbase server then:
*Check its encoding using:
file -i Car.java
*Convert it to UTF-8 using (the TRANSLIT option approximates characters that can't be directly encoded)
iconv -f oldformat -t UTF-8//TRANSLIT file -o outfile
**The sc options forces iconv to ignore bad chars and move on:
iconv -sc -f oldformat -t UTF-8//TRANSLIT file -o outfile
*Bash scripts to do all of the csvs is in Z:\USPTO_assigneesdata; make them executable and then run whichever you need
chmod +x encoding.sh
./encoding.sh
*Note that the final source encoding was Win1252 and the final target encoding was ASCII
*All bar three of the files had to be manually fixed to remove errors. Final files are in E:\McNair\Projects\USPTO Patent Assignment Dataset
Ed
Bureaucrats
,
Interface administrators
,
Administrators (Semantic MediaWiki)
,
Administrators
7,649
edits
Navigation menu
Personal tools
Log in
Request account
Namespaces
Page
Discussion
Variants
Views
Read
View source
View history
More
Search
Navigation
Sites
Wiki
Articles
Sections
Projects
Papers in Development
Paper Reviews
Team Members
Legislation
Research Computing
Organizations
Incubator Project
McNair Center
Berkeley's BPP Group
NBER Patent Data
Help
General help
Team help
Administration
Batch Upload Files
Tools
Special pages
Printable version