==Moving Forward==
This project will begin by working with [[Grace Tan]] and [[Maxine Tao]] to connect accelerators to their founders and cohort companies using Crunchbase and LinkedIn crawlers. Grace and Maxine will go through [http://crunchbase.com Crunchbase] and find the UUID for companies and their founders. Connect them using SQL and feed the names of founders into our LinkedIn crawler (headed by [[Grace Tan]]). Accelerators linking to cohort companies is slightly more difficult. Here, we will focus only on accelerators which take equity from their cohort companies (found in Ed’s updated spreadsheet). We will find investments of a given accelerator, and assumes (or checks if that is possible) the company is taking equity in the company it invests in, and the date they invested is the year of the cohort for that company. '''Acquiring the necessary data to complete the Accelerator Master Variable List and the Cohort List will require the following (not necessarily in this order):'''
===Step One: LinkedIn Founders Data===
Find the names of accelerator This project will begin by working with [[Grace Tan]] and [[Maxine Tao]] to connect accelerators to their founders and cohort companies using Crunchbase and LinkedIn crawlers. Grace and Maxine will go through [http://crunchbase.com Crunchbase] and find the UUID for companies and their founders (reference [[Crunchbase Data]], [[Crunchbase Accelerator Founders]], [[Crunchbase Accelerator Equity]]). This will require data from Connect them using SQL and feed the names of founders into our LinkedIn crawler (headed by [[Grace Tan]] and [[Maxine Tao]]).
The list of founders for accelerators can be found at
The '''Unfound Founders''' file codes a 0 for all companies '''''not listed''''' within the LinkedIn Founders file, and a 1 for those that do have founders listed.
'''Given the founders' names, we will then be able to use the [[LinkedIn Crawler (Python)]] to find the relevant details of an accelerator founder (education, work experience, etc.) This data on founders will help us solve the horse, jockey, racetrack question to detect what variables affect a startup's success (the accelerator, the founders, the environment/city).'''
===Step Two: Linking Accelerators to Cohorts Using Investments on Crunchbase===
We then look at the accelerators investments (or companies and the entities which invested in them), cross-reference the list of companies/accelerators, and once we find a match, we know that a company went through an accelerator and during which year they went through a cohort.
'''From this, we get the following data:'''
*Accelerator a given company went through
*Year said company went through a cohort/Specific cohort company went through