Difference between revisions of "Oliver Chang (Work Log)"

From edegan.com

Revision as of 09:59, 15 June 2017

Oliver Chang Work Logs (logpage)


Current Primary Project: Reproducible Patent Data


Day-by-Day (in reverse chronological order)

June 2017

  • June 16: join given assignment data, join with patents, create innovation district table; ...
  • June 15: add more robust error reporting, fix race conditions; build out assignment driver; build out applications driver; ...
  • June 14: migrate bulk inserts to COPY command; refresh on address data and start in on that; convert processor to multi-threaded application
  • June 13: spot check SQL tables; fix broken final case endlessly looping; investigate smarter insert methods
  • June 12: add XML printer, use it to inspect applications; extend BaseScraper to fetch patent application data; add applications documentation to my project page; add CREATE of other tables
  • June 8: add foreign key inserts; create pretty printer for XML analysis
  • June 7: finalize DB abstraction layer; migrate code to bulk inserts; upgrade webserver software and do optimization on RDP postgres with Ed
  • June 6: add JDBC; create basic schema; add DB interaction; schedule meeting for later in the week
  • June 5: look into PostgreSQL; refresh on PostGIS; add some notes to the Enclosing Circle Algorithm page
  • June 1: add RDP git remote; add more documentation to wiki page; refactor downloader scripts; start creation of tooling for interacting with data

May 2017

  • May 31: finish copy-pasting attributes into the wiki page; retroactively fill out work log; meet with Ed to discuss next steps
  • May 30: update documentation on wiki, restructure large binary files to have more hierarchy instead of a flat listing at the root
  • May 29: expand to APS; expand to raw assignment data
  • May 27: expand to maintenance fee data
  • May 26: create models, translate xmlparser*.pl file into Java; start using builder pattern
  • May 25: sketch out OO design of project; download bulk data
  • May 24: move wiki pages around; start git repository for project
  • May 21: discuss technical details of previous work with Ed
  • May 8: clean up dead links on wiki and start reading about previous work; discuss current project status with Ed
  • May 4: set up wiki account, RDP account, database training