7/12 - Looked at input data for Industry Classifier. When I tried to build test data from our final list compiled yesterday, I discovered some duplicate issues. Connor and I figured out which true duplicates to remove and have a final spreadsheet made called 'The File to Rule Them All'.
7/13 - Pulled descriptions and industry tags from crunchbase to match with the UUIDs we already have. Results of these tables are in the Industry Classifier update folder of Accelerators\Summer 2018. This morning, I read Christy and Yang's wiki project pages. To start, I tried to figure out how to best build a new coding system for the industry flags that are given in crunchbase. They are very detailed and more complex than those from the previously used venture capital data.