Difference between revisions of "Oliver Chang (Work Log)"

Revision as of 15:07, 27 June 2017

Oliver Chang Work Logs (logpage)

Current Primary Project: Reproducible Patent Data

Day-by-Day (in reverse chronological order)

June 2017

June 27: write SQL to replicate assignees, extract postcodes for ongoing projects
June 26: speedup code, abstract in-memory file splitters to avoid repetition and some weird edge cases
June 25: create mappings for APS, assignment properties, XML 2.5 for data import; run data imports for granted data
June 23: cleanup hacky models with a better set of abstractions; cleanup IDE warnings; redefine patent-address mapping
June 22: create postcode<->patent table
June 21: document granted patent queries and equivalencies
June 20: sketch out APS driver; discuss patent id problem; further document with evidence the zipcode data validity
June 19: skim address regular expressions; cursory investigation of patent table
June 16: create method of getting all data into the database, whether it likes it or not; copy over assignments, granted data using new scheme
June 15: add more robust error reporting, fix race conditions; build out assignment driver; build out fee event driver; add error logging
June 14: migrate bulk inserts to copy command; refresh on address data and start in on that; convert processor to multi-threaded application
June 13: spot check SQL tables; fix broken final case endlessly looping; investigate smarter insert methods
June 12: add XML printer, use it to inspect applications; extend BaseScraper to fetch patent application data; add applications documentation to my project page; add CREATE of other tables
June 8: add foreign key inserts; create pretty printer for XML analysis
June 7: finalize DB abstraction layer; migrate code to bulk inserts; upgrade webserver software and do optimization on RDP postgres with Ed
June 6: add jdbc; create basic schema; add db interaction; schedule meeting for later in the week
June 5: look into postgresql; refersh on postgis; add some notes to the Enclosing Circle Algorithm page
June 1: add RDP git remote; add more documentation to wiki page; refactor downloader scripts; start creation of tooling for interacting with data

May 2017

May 31: finish copy-pasting attributes into the wiki page; retroactively fill out work log; meet with Ed to discuss next steps
May 30: update documentation on wiki, restructure large binary files to have more hierarchy instead of a flat listing at the root
May 29: expand to APS; expand to raw assignment data
May 27: expand to maintenance fee data
May 26: create models, translate xmlparser*.pl file into Java; start using builder pattern
May 25: sketch out OO design of project; download bulk data
May 24: move wiki pages around; start git repository for project
May 21: discuss technical details of previous work with Ed
May 8: cleanup dead links on wiki and start reading about previous work; discuss current project status with Ed
May 4: setup wiki account, rdp account, database training

@@ Line 10: / Line 10: @@
 === June 2017 ===
+* June 27: write SQL to replicate assignees, extract postcodes for ongoing projects
+* June 26: speedup code, abstract in-memory file splitters to avoid repetition and some weird edge cases
+* June 25: create mappings for APS, assignment properties, XML 2.5 for data import; run data imports for granted data
 * June 23: cleanup hacky models with a better set of abstractions; cleanup IDE warnings; redefine patent-address mapping
 * June 22: create postcode<->patent table

Difference between revisions of "Oliver Chang (Work Log)"

Revision as of 15:07, 27 June 2017

Day-by-Day (in reverse chronological order)

June 2017

May 2017

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Sites

Sections

Organizations

Help

Tools