Incubator Seed Data

From edegan.com
Revision as of 11:25, 25 March 2019 by AnneFreeman (talk | contribs)
Jump to navigation Jump to search


Project
Incubator Seed Data
Project logo 02.png
Project Information
Has title Incubator Seed Data
Has start date
Has deadline date
Has project status Active
Is dependent on Crunchbase Database
Subsumed by: Ecosystem Organization Classifier
Copyright © 2019 edegan.com. All Rights Reserved.


Goal

We will evaluate data sources based on the number of incubators they have data on and the type of information they supply on these incubators. We will also record whether or not these data sources collect information on any other types of entrepreneurship organizations. Ideally these data sources would provide some or all of the variables that were identified as most important for identifying incubators (Formulate_baseline_attributes). However, it is unlikely that one data source will contain all of the baseline attributes identified, therefore if the data source can provide links to a large quantity of incubators or in-depth descriptions, they could still be viable.


Evaluation of Sources from Specific Google Searches

Source: https://www.whartoneclub.com/resources/entrepreneurship/incubators/

  1. Opened source link
  2. Copied results from "U.S. Based Incubators" into excel spreadsheet
  3. Returned list of 21 US Based Incubators
  4. Data
  1. Name
  2. Url to home Page
  3. City, State (In parentheses next to name)

Review

  • Provides links to the home page url of incubator, may not be able to get very specific information about the incubator with a web crawler
  • Some of the incubators may not fall under our definition of an incubator (e.g. Y Combinator)



Potentially Viable sources from Accelerator_Seed_List_(Data)#Sources

Source: http://www.acceleratorinfo.com/see-all.html

  1. Opened source link
  2. Copied links from first column (“All Startup Support Programs”) into excel and returned 215 results
  3. Copied links from second column (“All University Programs”) into excel and returned 249 results)
  4. Each link on parent list leads to individual home page url of organization

Review

  • Provides only links, does not separate between incubator and accelerator, some of the university supported programs may not be considered either an incubator or an accelerator
  • Does not provide very specific information for incubators




Sources from Accelerator_Seed_List_(Data)#Sources that are not viable

Source: http://www.seed-db.com/accelerators

  • This data source is not appropriate for finding information on incubators as the links provided are for accelerators

Source: https://www.f6s.com/programs?type

  1. On the webpage, set "Type" to "Accelerator/Program", set "Location" to "North America", and set "Invest in Country" to "United States" to return results
  2. Search engine said it returned 173 results
  3. Along with the name of the program/accelerator, the data included:
  1. Dollar value per team
  2. Equity
  3. Link to another site within f6s to learn more and apply
  4. The location within the US

Review

  • Data is cluttered and messy, even with search filters in place. Could be useful if crawler could clean up the search results and target incubators (having equity of 0%).
  • The link provided for each incubator takes user to another site within f6s, not the incubator's home page, so it would be challenging to collect all the information required.

Source: http://gust.com/usa-canada-accelerator-report-2015/

  1. Selected region of US and Canada
  2. Scrolled down to the section labeled "Top 20 Active Accelerators" and selected "see the full list" near the bottom of the listed accelerators

Review

  • This data source is not appropriate for finding information on incubators as the links provided are for accelerators