# Create XPath queries for reissue, design patents (only utility right now)
# Data Cleanup (reference [[Patent_Assignment_Data_Restructure|Marcela and Sonia's work]])
# Setup pipeline script to complete all of these steps in series
# Investigate parallel speedup (e.g. multithread, mmap)
# Data Source Merger (''only USPTO granted, maintfee, assignment'' not USPTO applications or Harvard Dataverse or Lex Machina currently)
# Setup pipeline script to complete all of these steps in series
== Directory Layout ==