Shelby Bice (Work Log)
Spring 2017 Work
2/14/2017 10:00 am - 12:00 pm Set up personal wiki, set up work log
2/16/2017 10:30 am - 12:00 pm Researched past work on databases, discussed project with Ed
2/21/2017 9:00 am - 12:00 pm Set up work page, reviewed SQL, researched designing database, continued going through wiki
2/23/2017 9:30 am - 12:00 pm Reviewed Perl, read about database design, set up project page for redesigning database, started documenting process
3/2/2017 9:15 am - 12:15 pm Started excel spreadsheet to document current schema design and improvements to be made, updated project pages
3/7/2017 9:30 am - 12:00 pm Continued working on spreadsheet, added relevant page links to project page, took notes on what I want documentation to look like in the future
3/9/2017 9:00 am - 12:00 pm Finished first draft of spreadsheet describing the current schema (and possible changes) to the Patent database
3/21/2017 9:30 am - 12:00 pm - Worked on determining "core" tables for new patent database
3/22/2017 5:30 pm to 6:30 pm - Patent Data meeting
3/23/2017 9:15 am - 12:00 pm - Narrowed down core tables and fields
4/4/2017 9:00 am - 11:45 pm - Worked on updating documentation, found documentation on pulling data/making tables and databases, started looking through DTDs to find extra fields to pull
4/6/2017 9:30 am - 11:30 pm - Kept looking through DTDs, kept updating documentation
4/11/2017 9:15 am - 12:00 pm - Worked on trying to update patent data through 2016
4/13/2017 9:30 am - 12:00 pm - Continued working on trying to update patent data through 2016, specifically parsing the data, worked with Ed to update perl scripts
4/18/2017 9:45 am - 12:30 pm - Cleaned up documentation more, kept working through the process of parsing the data
4/20/2017 10:00 am - 11:30 pm - wrote copy statements for copying data from RDP to database, continued working on documentation.
4/25/2017 10:00 am - 12:00 pm - worked on documentation, tried to determine how to clean up the USPTO Assignee Data
4/27/2017 1:00 pm - 3:00 pm - worked on documentation more, tried to figure out how to clean citation data
Fall 2017 Work
9/15/2017 2:00 pm - 5:00 pm - introduced to new patent database projected, reviewed and took notes on USPTO Assignment data (notes can be found under McNair/Projects/Redesigning Patent Database/New Patent Database Project as Notes on USPTO Assignment Data Paper)
9/22/2017 8:30 am - 10:30 am - continued looking at paper on USPTO assignment data and adding to the notes on what the design of that database should look like, specifically on what I need for different tables and what I don't know yet about the design. Had to set up connection to RDP again due to technical issues.
9/23/2017 2:00 pm - 4:00 pm - continued working on design of Assignment database and how it will connect to Patent database by writing out what will be in each table in Assignment and questions about different possible structures of tables that we will have to address before finalizing the design - the notes can be found under McNair/Projects/Redesigning Patent Database/New Patent Database Project as Notes on USPTO Assignment Data Paper. Questions are highlighted in yellow throughout the document
9/26/2017 8:45 am - 9:45 am - continued worked on design of Assignment database by checking my design against the work done last semester on the assignment data restructure to make sure I didn't miss anything major. Began going over my patent database design from last semester to tweak it. Will need to sync up with Joe Reilly to see if there are any new fields that we are pulling from the data.
The main takeaway from looking over Patent Assignment Data Restructure is that, after assembling the table according to my design (which doesn't seem to have any contradictions with the Patent Assignment Data Restructure) that there will by multiple steps for cleaning the data, specifically the fields relating to location and address in the assignment table. While the Patent Assignment Data Restructure mentions connecting to the Patent database, it is not clear from the page what field would be used to connect to the Patent database.