Accelerator Seed List (Data)
Copyright © 2016 edegan.com. All Rights Reserved.
This project will be used to determine which accelerators are the most effective at producing successful startups, as well as what characteristics these accelerators exhibit. First, we need to gather as much data as we can about as many accelerators as we can in order to look at the factors that differentiate successful from unsuccessful ventures. Next, we need to create a web crawling program that will gather information about accelerators across the world by accessing their websites and extracting information. The overall goal of this research project is to gain insight into the methods of successful accelerators, and to find out what exactly differentiates very successful accelerators from dead accelerators.
Helpful Links: http://seedrankings.com/ http://www.forbes.com/sites/briansolomon/2016/03/11/the-best-startup-accelerators-of-2016/#38b2114624f2
Contents
- 1 Pre-existing Data
- 2 Sources
- 3 Source Evaluations
- 3.1 Source: http://www.acceleratorinfo.com/see-all.html
- 3.2 Source: http://www.seed-db.com/accelerators/all
- 3.3 Source: http://www.seed-db.com/accelerators
- 3.4 Source: https://www.f6s.com/programs?type
- 3.5 Source: http://gust.com/usa-canada-accelerator-report-2015/
- 3.6 Source: https://bostonstartupsguide.com/guide/every-boston-startup-accelerator-incubator/
- 3.7 Source: https://www.corporate-accelerators.net/database/
- 3.8 Source: https://github.com/florianheinemann/www-corporate-accelerators-net/blob/master/_data/Accelerators.json
- 3.9 Source: https://www.quora.com/Where-can-I-find-a-comprehensive-list-of-startup-incubators-and-accelerators-in-the-US
- 4 List of Sources Obtained from Various Google Searches
- 5 Individual Accelerator Evaluations
- 5.1 Accelerators Chosen (Format = Name (source))
- 5.2 Accelerator: Blue Startups (http://bluestartups.com/)
- 5.3 Accelerator: Launchpad LA (http://launchpad.la/)
- 5.4 Accelerator: Y Combinator (http://www.ycombinator.com)
- 5.5 Accelerator: Flashpoint (http://flashpoint.gatech.edu/)
- 5.6 Accelerator: Prosper Women Entrepreneurs (http://www.prosperstl.com)
- 5.7 Accelerator: Axel Springer Plug and Play (http://www.axelspringerplugandplay.com/)
- 5.8 Accelerator: Bolt (http://bolt.io/)
- 5.9 Accelerator: AIA (http://www.aia-accelerator.com/)
- 5.10 Accelerator: Capital Factory (https://capitalfactory.com/accelerate/)
- 5.11 Accelerator: OwlSpark (http://entrepreneurship.rice.edu/accelerator/)
- 5.12 List of Promising Variables
Pre-existing Data
Sources
Summary: These are sources obtained from List of Accelerators and other Google searches. We will evaluate these sources by looking at the number of accelerators they supply (as most of them are lists) and at the type of information they provide about each accelerator. Key data points are cohort-related data, startup-related data, and logistics of the accelerator. Better sources supply more information than the URL alone.
(Obtained from List of Accelerators)
- http://www.acceleratorinfo.com/see-all.html
- http://www.seed-db.com/accelerators, http://www.seed-db.com/accelerators/all
- https://www.f6s.com/programs?type
- http://gust.com/usa-canada-accelerator-report-2015/?utm_content=35401577&utm_medium=social&utm_source=twitter
- https://bostonstartupsguide.com/guide/every-boston-startup-accelerator-incubator/
(Obtained from Google search: "Accelerator Database")
- seed-db is the first result that pops up
- https://www.corporate-accelerators.net/database/
- https://github.com/florianheinemann/www-corporate-accelerators-net/blob/master/_data/Accelerators.json
- Quora: https://www.quora.com/Where-can-I-find-a-comprehensive-list-of-startup-incubators-and-accelerators-in-the-US
- By the 5th or 6th search result, the utility diminished greatly
Other ways used to find Accelerators (listed below "List of Sources Obtained from Various Google Searches"):
- Type in generic location + "accelerators" (e.g. Houston Accelerators)
- Looked at roughly the first 20 results
- Used three locations as examples of accelerators that pop up
Source Evaluations
Summary: These evaluations couple with each of the sources above. The evaluations provide instructions for obtaining the information listed, as well as a general review of how useful the data seems. The review serves to determine whether a crawler would be suitable for obtaining information from the source autonomously.
Source: http://www.acceleratorinfo.com/see-all.html
- Opened source website
- Copied Information under "All Accelerator Programs" to TextPad, already sorted. Returned 190 results
- Each link on parent list leads to individual home page url of accelerator
- Used sample size of 20 links, determined 16 to be accelerators, 2 to be incubators, 2 to be inactive or broken links
- Many accelerators do not include founding date, most recent accelerators from around 2013-2014 (as determined from home page)
Review
- Reliable source for specific URLs to older accelerators, not very helpful for more specific information.
- Web crawling the source itself seems unpromising because it offers little beyond URLs. Staff or contact information could potentially be mined from the "about" page under each accelerator's home URL
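The manual copy step above could be approximated by collecting every link on the list page. This is a minimal sketch using only Python's standard library; `SAMPLE_HTML` is a hypothetical stand-in for the "All Accelerator Programs" markup, whose real structure has not been checked here.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect (text, href) pairs for every <a> tag that has an href."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            # href is None when the anchor has no link target.
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append(("".join(self._text).strip(), self._href))
            self._href = None


# Hypothetical markup; the real page layout is an assumption.
SAMPLE_HTML = """
<ul>
  <li><a href="http://bluestartups.com/">Blue Startups</a></li>
  <li><a href="http://launchpad.la/">Launchpad LA</a></li>
  <li><a>No link here</a></li>
</ul>
"""

collector = LinkCollector()
collector.feed(SAMPLE_HTML)
links = collector.links
print(len(links), "links found")
```

Each collected URL would still need the manual accelerator/incubator/broken-link check described above, since the list page itself does not classify its entries.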
Source: http://www.seed-db.com/accelerators/all
- Copied "Seed Accelerators" table to TextPad, data sorted itself into lines. Returned 235 results.
- Clicking on the accelerator name itself links to a page with all of its associated startups, up until 6/2016 cohort
- Startup table includes:
- "state"
- "company name"
- "website and CrunchBase links"
- "cohort date"
- "exit value"
- "funding".
- Many entries for "exit value" are missing, some values for "funding" are missing
- On original seed-db webpage, each accelerator has a link to its associated home page url
- From the table, each listed entry was an accelerator, although 24 accelerators out of 235 were classified as "dead"
- Along with the home url, each accelerator table includes the following:
- Status
- Program (name)
- Location
- Country
- Number of companies
- Cumulative exit values
- Cumulative funding
- Average funding for startups
- Median funding for startups
- Many entries for "median funding" are left empty, as well as entries for all types of funding on the bottom half of the table
Review
- Reliable source for accelerators, includes list of accelerators both dead and active, as well as their associated start-ups
- Web crawling potential is promising; startup table is located within the source for each webpage. Can also mine any category from the accelerator table
- Overall very extensive data for the accelerators that are included on the list, but cross-referencing with other sources shows that seed-db is missing many newer accelerators; the list is not all-inclusive.
- Includes regional distributions for accelerator groups as well. For example, rather than just "Techstars", the group is broken into Austin, Berlin, Boston, Boulder, etc.
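Since the tables sit directly in the page source, a crawler could read them row by row. The sketch below assumes a plain `<table>` with `<tr>`, `<th>` and `<td>` tags; the sample rows and program names are hypothetical placeholders, not values from seed-db.

```python
from html.parser import HTMLParser


class TableRows(HTMLParser):
    """Collect the text of each <th>/<td> cell, grouped by <tr>."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None
        self._cell = None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


# Hypothetical table; column names follow the list above, values are made up.
SAMPLE_TABLE = """
<table>
  <tr><th>Status</th><th>Program</th><th>Location</th><th>Median funding</th></tr>
  <tr><td>Active</td><td>Example Accelerator A</td><td>Austin</td><td></td></tr>
  <tr><td>Dead</td><td>Example Accelerator B</td><td>Boston</td><td>$100,000</td></tr>
</table>
"""

parser = TableRows()
parser.feed(SAMPLE_TABLE)
header, *body = parser.rows
records = [dict(zip(header, row)) for row in body]

# Flag dead programs and empty "Median funding" cells, as noted in the evaluation.
dead = [r["Program"] for r in records if r["Status"] == "Dead"]
missing_median = [r["Program"] for r in records if not r["Median funding"]]
```

Keeping empty cells as empty strings, rather than dropping them, makes the gaps in "exit value" and "median funding" easy to count later.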
Source: http://www.seed-db.com/accelerators
- Very similar to "http://www.seed-db.com/accelerators/all", but contains large regional accelerators as groups, rather than individual accelerators. For example, Techstars appears only once.
- Copied "Seed Accelerators" table to TextPad, data sorted itself into lines. Returned 239 results.
- Clicking on the accelerator name itself links to a page with all of its associated startups, up until 6/2016 cohort
- Startup table includes same information as previous source, "http://www.seed-db.com/accelerators/all". However, accelerators spanning across multiple regions have their startups located under one category on this webpage.
- On original seed-db webpage, each accelerator has a link to its associated home page url
- From the table, each listed entry was an accelerator, although 24 accelerators/groups out of 239 were classified as "dead"
- Along with the home url, each accelerator table includes the same information as the "http://www.seed-db.com/accelerators/all" source
Review
- Reliable source for accelerators, includes list of accelerators both dead and active, as well as their associated start-ups
- Web crawling potential is promising; startup table is located within the source for each webpage. Can also mine any category from the accelerator table
- Overall very extensive data for accelerators that are included on the list, includes large groups as well as individual accelerators. It seems that some accelerators missing from "http://www.seed-db.com/accelerators/all" are located here, since there are 239 returns rather than 235.
Source: https://www.f6s.com/programs?type
- On the webpage, set "Type" to "Accelerator/Program", set "Location" to "North America", and set "Invest in Country" to "United States" to return results
- Highlighted results and scrolled down until all results found; copied results to TextPad
- In TextPad, sorted out lines with "by", as well as miscellaneous categories such as dates and dollar signs through Regular Expressions
- Using the "More Info" line which held constant through the entire list, assigned a sequential number to the line (in order to determine the number of results)
- Obtained a grand total of 1467 results from the list
- Along with the name of the program/accelerator, the data included:
- Dollar value per team
- Equity
- Application Site
- Accelerator URL
- Many entries are not accelerators. A quick glance through the results also turned up various conferences, 3-5 day events, and written literature pertaining to accelerators
- From a sample size of the first 30 entries, determined 10 to be valid accelerators, 3 incubators, 6 conferences/weekends, and the rest to be miscellaneous entries such as startup events or "studios" (perhaps useful but not relevant to search)
- As we go down the list, the number of accelerators proportionately decreases. Can comfortably say that overall accelerator turnout from this website is much less than 33%, probably closer to 10-15%.
Review
- Potentially useful website if crawler could remove the clutter and target solely the accelerators; very useful for identifying new accelerators since data automatically sorted by date and location.
- Large list of sources includes many irrelevant results, such as conferences or weekends which are difficult to identify. The name of the sorting category itself, "Accelerator/Program" suggests that many of the results fall under the "Program" section rather than being valid accelerators.
- Potential site for identifying accelerators, but limited by in-site sorting; useful for URL and perhaps equity, but not very detailed information relating to the accelerator/program.
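The TextPad clean-up described above (dropping "by" lines, dates and dollar amounts, then counting the constant "More Info" line) could be scripted roughly as follows. The pasted text and the date format are assumptions made for illustration; the real f6s output may look different.

```python
import re

# Hypothetical pasted text; names, amounts and dates are placeholders.
RAW = """\
Example Program One
by Example Organizer
$20,000
Jan 5, 2017
More Info
Example Weekend Event
by Another Organizer
More Info
"""

# Lines to drop: organizer lines, dollar amounts, and dates such as "Jan 5, 2017".
NOISE = re.compile(r"^(by\s|\$|[A-Z][a-z]{2} \d{1,2}, \d{4}$)")

lines = [ln.strip() for ln in RAW.splitlines() if ln.strip()]
kept = [ln for ln in lines if not NOISE.match(ln)]

# "More Info" appears once per entry, so counting it gives the number of results.
result_count = sum(1 for ln in kept if ln == "More Info")
names = [ln for ln in kept if ln != "More Info"]
print(result_count, names)
```

This only removes formatting clutter. Telling accelerators apart from conferences and weekend events would still need a separate check, which is the harder problem noted in the review.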
Source: http://gust.com/usa-canada-accelerator-report-2015/
- Selected region of US and Canada
- Scrolled down to the section labeled "Top 20 Active Accelerators" and selected "see the full list" near the bottom of the listed accelerators
- Copied resulting entries into TextPad and sorted out the numbers to leave only the name of the accelerator
- Obtained 100 results for different accelerators
- Accelerator lists included:
- Name and URL
- Number of Start-ups funded (2015 only)
- Accelerator list limited to 2015
Review
- Website provides its own evaluation of an accelerator's success based on various factors and provides data for larger trends.
- Usefulness is questionable because website does not provide much except the URL, and all of the entries are based on success in 2015.
- Other interesting data within website such as "Hot Markets", investment breakdowns by state, etc. All of this data is also limited to 2015.
Source: https://bostonstartupsguide.com/guide/every-boston-startup-accelerator-incubator/
- Scrolled down to the section labeled "Startup accelerators in Boston"
- Copied text beginning from "MassChallenge" (the first paragraph was just a general definition of startups) and continued to copy until "Startup Incubators in Boston"
- After pasting in TextPad, I sorted the data to delete any characters after the "-" and added a sequential number at the beginning of each line
- Returned a total of 17 results for accelerators in Boston
- Accelerator list included:
- Name and URL
- Capital requirements
- Application periods and requirements
- Paragraph describing accelerator and its goals
Review
- Although the guide is dated, useful for identifying strong accelerator programs in Boston
- Limitation: only focuses on Boston, but the description is helpful in identifying the role of the accelerator
- Limited information on accelerator, not very useful by itself without information from the accelerator URL
Source: https://www.corporate-accelerators.net/database/
- Copied and pasted table into Microsoft Excel (Data was already sorted into categories so no need for TextPad)
- Table returned 72 references (but there was a link at the bottom to a larger database)
- The table itself includes:
- Major Company
- Accelerator
- Funding
- Equity
- Website
- Details
- The "Details" link led to a variety of other information including:
- Status (Active or Inactive)
- Locations
- Funding
- Equity
- Term
- Cohort Based? (Regular or Irregular)
- Pitch Day
- Office Space
- Powered by
- Support Offered?
- Launch year
- Focus Areas
- General Description
- Also included a variety of data about the host company
Review
- Solid list for corporate accelerators that also includes a variety of information about the accelerator, the cohorts, etc. Some of the entries are international accelerators, however, so these need to be filtered out
- Only limited to 72 accelerators from major companies
Source: https://github.com/florianheinemann/www-corporate-accelerators-net/blob/master/_data/Accelerators.json
- This source is a .json file from the previous database
- After placing into TextPad, replaced each space with a ###, replaced each new line with a tab, and replaced each ### with a new line. Ultimately returned 80 results
- From the file, the .json includes:
- NAICS and NAICS sector
- Classification
- Sector Description
- Term
- Goal
- Partner
- Also includes most of the information from the previous source, since they are undoubtedly linked
Review
- Another solid list for corporate accelerators with some more information, but ultimately very similar to the previous source.
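Because the source is a .json file, the find-and-replace steps in TextPad could be replaced by a JSON parser. This is a minimal sketch that assumes the file's top level is a list of objects; the key names and entries below are hypothetical and should be checked against the actual file.

```python
import json

# Hypothetical excerpt; key names are assumptions, not the file's real schema.
SAMPLE_JSON = """
[
  {"Accelerator": "Example Corp Accelerator", "Status": "Active", "Term": "3 months"},
  {"Accelerator": "Another Example Program", "Status": "Inactive", "Term": "6 months"}
]
"""

entries = json.loads(SAMPLE_JSON)

# Counting parsed entries replaces the manual line-numbering step.
entry_count = len(entries)
active_names = [e["Accelerator"] for e in entries if e.get("Status") == "Active"]
print(entry_count, active_names)
```

Using `e.get("Status")` rather than `e["Status"]` keeps the filter from failing on entries where a field is absent.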
Source: https://www.quora.com/Where-can-I-find-a-comprehensive-list-of-startup-incubators-and-accelerators-in-the-US
- Since we already looked at the first listed source (seed-db), I clicked on the second link "(by Robert Shedd) http://blog.shedd.us/321987608/" which took me to a page headed "Help for Startups! – A semi-complete list of startup accelerator programs" created by a blogger, Robert Shedd
- List included 102 entries by the blogger, each of which does appear to be an accelerator
- Upon immediate overview, noticed many results from previous sources were missing. Immediately noticed lack of "OwlSpark", the accelerator from Rice.
- Shedd only offers us the accelerator name plus its URL
Review
- Nice list to cross-reference with other sources but does not offer much new insight compared to more powerful engines such as seed-db
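Cross-referencing two name lists, as was done by eye to spot the missing "OwlSpark", could be sketched as a comparison on normalized names. The two lists below are illustrative only and are not the actual contents of either source.

```python
import re


def normalize(name):
    """Lowercase and strip non-alphanumerics so 'Y-Combinator' matches 'Y Combinator'."""
    return re.sub(r"[^a-z0-9]", "", name.lower())


def missing_from(reference, candidate):
    """Return names in `reference` whose normalized form is absent from `candidate`."""
    seen = {normalize(n) for n in candidate}
    return [n for n in reference if normalize(n) not in seen]


# Illustrative lists; real lists would come from the sources evaluated above.
combined_sources = ["OwlSpark", "Capital Factory", "Y Combinator"]
single_list = ["capital factory", "Y-Combinator"]

missing = missing_from(combined_sources, single_list)
print(missing)
```

Name normalization is a rough heuristic. Comparing home page URLs instead would probably be more reliable where both sources provide them.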
List of Sources Obtained from Various Google Searches
Summary: These accelerators are taken from a specific Google search rather than a list. The idea is to compile a list of Google searches that return relevant results of accelerators. This will aid in the creation of a future web crawler.
From "Location + Accelerator" (Only individual results, not lists)
Houston Accelerators
- Examples of single accelerators found
- TMCx: http://www.tmc.edu/innovation/innovation-programs/tmcx/
- RED labs: http://redlabs.uh.edu/8
- SURGE accelerator: https://kirkcoburn.com/
- OwlSpark: http://owlspark.com/
- NextHIT: http://www.houstonhealthventures.com/nexthit-accelerator-program-application/
Los Angeles Accelerators
- Amplify: http://amplify.la/
- Y Combinator: https://www.ycombinator.com/
- Chicklabs: https://www.chicklabsllc.com/
- Disney Accelerator: https://disneyaccelerator.com/
- Launchpad: https://launchpad.la/
New York Accelerators
- DreamIT Ventures: http://www.dreamit.com/#meaningful-experience
- Women Innovate Mobile: http://www.wim.co/
- Techstars NYC: http://www.techstars.com/programs/nyc-program/
- Entrepreneurs Roundtable: http://eranyc.com/
- FirstGrowthVC: http://venturecrush.com/fg/
- New York Digital Health Accelerator: http://digitalhealthaccelerator.com/
- Grand Central Tech: http://www.grandcentraltech.com/
- Accelerator Corp: http://www.acceleratorcorp.com/
- New York Startup Lab: http://nystartuplab.com/
Review
- Some locations return more viable results for a similar sample size. For example, New York returned 9 valid accelerators, whereas Los Angeles and Houston both returned 5 actual accelerators out of the first 20 results: an 80% difference. Some optimization may come from identifying which locations return more accelerators upon searching.
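The comparison above can be checked with a short calculation. The counts come from the lists in this section; the figure of 20 inspected results per search is the approximate number stated earlier, so the hit rates are rough.

```python
# Valid accelerators found per search, out of roughly the first 20 results.
RESULTS_INSPECTED = 20
valid_found = {"New York": 9, "Los Angeles": 5, "Houston": 5}

# Share of inspected results that were actual accelerators.
hit_rate = {city: n / RESULTS_INSPECTED for city, n in valid_found.items()}

# Relative gain of New York over Houston: (9 - 5) / 5.
relative_gain = (valid_found["New York"] - valid_found["Houston"]) / valid_found["Houston"]

for city, rate in hit_rate.items():
    print(f"{city}: {rate:.0%}")
print(f"New York vs. Houston: {relative_gain:.0%} more")
```

The 80% figure is therefore relative to the smaller count (5); in absolute terms New York's hit rate is 45% against 25% for the other two searches.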
Individual Accelerator Evaluations
Summary: The purpose of this section is to create instructions for each accelerator on how to find cohort information from their URLs. Along with specific instructions for obtaining the cohorts for each accelerator chosen, there should be a list of easy-to-obtain and relevant statistics regarding the accelerator, such as information about its team, location, etc. The variable statistics list is cumulative, whereas the cohort directions are unique per the accelerator.
Accelerators Chosen (Format = Name (source))
- Blue Startups (http://www.acceleratorinfo.com/see-all.html)
- Launchpad LA (http://www.acceleratorinfo.com/see-all.html)
- Y Combinator (http://www.seed-db.com/accelerators)
- FlashPoint (http://www.seed-db.com/accelerators/all)
- Prosper Accelerator (https://www.f6s.com/programs?type)
- Axel Springer Plug and Play (http://www.axelspringerplugandplay.com/)
- Techstars (http://www.seed-db.com/accelerators)
- AIA Accelerator (https://github.com/florianheinemann/www-corporate-accelerators-net/blob/master/_data/Accelerators.json)
- Capital Factory (http://blog.shedd.us/321987608/)
- OwlSpark (Google search: "Houston + accelerators")
Accelerator: Blue Startups (http://bluestartups.com/)
Finding the cohort:
- Navigated to "Track Record" page under the "Home" tab; found total number of graduated cohorts to be 7
- Navigated to "Portfolio" tab. Tab includes list of all seven graduated cohorts along with companies emerging from each one. Each cohort is listed under a separate page (ex. "Cohort 1", "Cohort 2", etc) and at the bottom of each cohort page, there is a link to the other 6. Each company has a short description along with its URL.
- An "Alumni News" page at the bottom of "Portfolio" includes articles pertinent to graduated startups.
- Unfortunately does not include the date and year of each cohort class, but perhaps could cross-reference with other sources.
Accelerator: Launchpad LA (http://launchpad.la/)
Finding the cohort:
- Navigated to "Companies" in the top of the homepage
- "Companies" returns all companies backed by Launchpad LA based on their class year and number (cohort)
- Also sorted by active startups vs. inactive startups
- At the bottom of the "Companies" tab, there is a statistical layout returning values for the number of companies started by Launchpad during its time as an accelerator (2012-present), as well as the total funding funneled into the accelerator.
Accelerator: Y Combinator (http://www.ycombinator.com)
Finding the cohort:
- Scrolled down on the home page and clicked on a link entitled "See all companies".
- Navigated to a drop down menu named "All Batches", and clicked on it to expand the list.
- The list is made up of dates ranging from 2005 to 2016. Each date returns a list of launched companies, including most but not all of their URLs, as well as their launch year.
Accelerator: Flashpoint (http://flashpoint.gatech.edu/)
Finding the cohort:
- In the upper right corner, after the animation, there is a menu icon which lets you navigate to a page labeled "Teams"
- The "Teams" page lists each batch of companies emerging from Georgia Tech, although it does not include the dates of these batches. For example, "Batch 1" at the top of the page just lists the companies in the batch without URLs or any additional information.
- On the "Application" page, reached from the menu near the top, there is information regarding Batch 7, which begins in early 2017. This suggests that Batch 6 ended in either spring 2016 or fall 2016.
Accelerator: Prosper Women Entrepreneurs (http://www.prosperstl.com)
Finding the cohort:
- Navigated to "Accelerator" tab and clicked "Companies" when prompted with the drop down menu.
- This tab returned all of the launched company logos which then redirected to the company's home page when clicked.
- No other relevant form of information such as date launched or cohort was included on this page.
Accelerator: Axel Springer Plug and Play (http://www.axelspringerplugandplay.com/)
Finding the cohort:
- Clicked on the "Companies" tab on the home page and was directed to the middle of the page which included a short list of current companies.
- Clicked on the "All Companies" link which returned a page filled with startup logos and brief descriptions of those startups. When clicked, each logo serves to redirect to that startup's home page.
- Companies were not sorted by cohort or in any other relevant way.
Accelerator: Bolt (http://bolt.io/)
Accelerator: AIA (http://www.aia-accelerator.com/)
Accelerator: Capital Factory (https://capitalfactory.com/accelerate/)
Accelerator: OwlSpark (http://entrepreneurship.rice.edu/accelerator/)
List of Promising Variables
- Key People (founders, lead entrepreneurs, strategists, etc.)
- Total number of launched companies
- Funds raised per company (average)
- Features offered by accelerator (perks, space, tools, etc)
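One possible way to hold these variables per accelerator is a small record type. This is only a sketch: the class and field names are hypothetical, and fields are left empty when the value has not been collected yet.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class AcceleratorRecord:
    """One row of the seed list; field names are illustrative assumptions."""

    name: str
    url: str
    key_people: List[str] = field(default_factory=list)
    companies_launched: Optional[int] = None
    avg_funds_raised_per_company: Optional[float] = None
    features: List[str] = field(default_factory=list)

    def missing_fields(self):
        """Names of variables not yet collected (None or empty list)."""
        return [k for k, v in vars(self).items() if v is None or v == []]


# Name and URL are taken from the evaluations above; nothing else is filled in.
record = AcceleratorRecord("Blue Startups", "http://bluestartups.com/")
missing = record.missing_fields()
print(missing)
```

Tracking which fields are still missing makes it easier to see which accelerators need a manual visit and which variables a crawler fails to find.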