7/10 - Tried to join crunchbase companies with the companies in the ‘cohort list new’ tab of the master list, most of them do match into crunchbase. I also learned that when pulling company names from crunchbase, there can be formatting issues. I talked with Connor and Dylan about building a master spreadsheet of all our company data. I created a table of companies, their UUIDs, and a count of the number of times the company appears in crunchbase. Then I was able to join this with a rough list of companies from our list that Connor gave me. There were about 350 companies where we needed to manually look up whether or not these companies exist in the “Duplicate Companies” spreadsheet; Connor and I completed this task together.
7/11 - Matched crunchbase companies to our companies and imported all UUIDs into a final list. I talked with Dylan and Connor to determine how to filter the results appropriately and only keep those that are the most accurate. All files and file locations are described in [[Crunchbase Data]].