Difference between revisions of "Shelby Bice (Research Plan)"

From edegan.com
Jump to navigation Jump to search
 
(20 intermediate revisions by one other user not shown)
Line 1: Line 1:
 +
== Overview ==
 +
 
'''Overall goals:'''  
 
'''Overall goals:'''  
  
Create better database that includes all the patent data to which the McNair Center has access.
+
* Create better database that includes all the patent data to which the McNair Center has access.
More importantly, create documentation of process so it can improved upon/replicated in the future.
+
* More importantly, create documentation of process so it can improved upon/replicated in the future.
  
 
'''General Outline - updated 2/21/2017'''
 
'''General Outline - updated 2/21/2017'''
  
Familiarize myself with SQL, Perl, and database design
+
* Familiarize myself with SQL, Perl, and database design
Familiarize myself with existing scripts and schema for existing database
+
* Familiarize myself with existing scripts and schema for existing database
Design a better representation for database
+
* Design a better representation for database
Fix scripts if necessary
+
* Fix scripts if necessary
Start moving data into new database by querying existing databases (using SQL)
+
* Start moving data into new database by querying existing databases (using SQL)
Use scripts to query new data
+
* Use scripts to query new data
 +
* Test database
 +
* Remove extraneous information from database (copies, patents that we're not interested in, etc.)
  
 
'''Documentation I need to include:'''
 
'''Documentation I need to include:'''
  
Schema of new database (with justification of design), would like to include a visual representation
+
* Schema of new database (with justification of design), would like to include a visual representation
SQL commands that were used to fill database with explanation of what they do
+
* SQL commands that were used to fill database with explanation of what they do
Clear instructions on where to find scripts in bulk drive and an explanation of what each script does
+
* Clear instructions on where to find scripts in bulk drive and an explanation of what each script does
 +
* Visual representation of example table entries that isn't just copied and pasted from a CSV file
 +
 
 +
'''Project Pages:'''
 +
[[Redesigning Patent Database]]
  
2/16/2017 - Talked over project with Ed, began reading existing wiki pages related to patent data and databases
+
== Log ==
 +
[[Category:Work Log]]

Latest revision as of 16:17, 21 March 2017

Overview

Overall goals:

  • Create better database that includes all the patent data to which the McNair Center has access.
  • More importantly, create documentation of process so it can improved upon/replicated in the future.

General Outline - updated 2/21/2017

  • Familiarize myself with SQL, Perl, and database design
  • Familiarize myself with existing scripts and schema for existing database
  • Design a better representation for database
  • Fix scripts if necessary
  • Start moving data into new database by querying existing databases (using SQL)
  • Use scripts to query new data
  • Test database
  • Remove extraneous information from database (copies, patents that we're not interested in, etc.)

Documentation I need to include:

  • Schema of new database (with justification of design), would like to include a visual representation
  • SQL commands that were used to fill database with explanation of what they do
  • Clear instructions on where to find scripts in bulk drive and an explanation of what each script does
  • Visual representation of example table entries that isn't just copied and pasted from a CSV file

Project Pages: Redesigning Patent Database

Log