Revision as of 21:29, 4 October 2019
Incubator Seed Data Coverage | |
---|---|
Project Information | |
Has title | Incubator Seed Data Coverage |
Has owner | Ed Egan |
Has start date | |
Has deadline date | |
Has project status | Active |
Subsumed by: | Incubator Seed Data, Incubators in Five Ecosystems |
Copyright © 2019 edegan.com. All Rights Reserved. |
Overview
The purpose of this project is to test the coverage and accuracy of the Incubator Seed Data using the hand-collected data on Incubators in Five Ecosystems as a benchmark.
Specifically, this project fulfills point 6 of the Expected Outcomes by June 2019 of the Kauffman Incubator Project:
- 6. The seed data will have at least a 70% baseline accuracy and coverage of incubators compared to results from hand collected data on 5 ecosystems, as measured by the data analysis.
Data
The five ecosystem incubators are:
City | State | Incubator Name |
---|---|---|
Washington | DC | Inclusive Innovation Incubator (In3) |
Washington | DC | AU Entrepreneurship Incubator |
Washington | DC | Global Development Incubator |
Washington | DC | Halcyon Incubator |
Washington | DC | The Hatchery |
Burlington | VT | Vermont Center for Emerging Technologies (VCET) |
Austin | TX | Austin Technology Incubator |
Austin | TX | IncubatorCTX |
Austin | TX | Economic Growth Business Incubator |
Austin | TX | ACC Bioscience Incubator |
Austin | TX | Bunker Labs |
Austin | TX | Galvanize |
St. Paul | MN | University Enterprise Laboratories |
Minneapolis | MN | Discovery Launchpad at UMN |
St. Paul | MN | Lunar Startups |
The datasets to test against are (as tables in the incubators database, also available as tab-delimited text files):
- Incubators -- 2137 records, combining the records in CIAIncubators and USIncubators
- CIAIncubators -- 1603 records, combining incubators identified in Crunchbase, INBIA, and AngelList
- USIncubators -- 707 records, combining state and regional incubator lists found as a part of the US Incubators project
- Data from the Google Crawler run against the five ecosystems
Process
Load FiveEcosystemIncubators.txt into incubators then run the matcher:
perl Matcher.pl -mode=2 -file1="FiveEcosystemIncubators.txt" -file2="Incubators.txt"
Note that Incubators.txt contains substantial name variation for the same firm, so standard name-based matching doesn't work. For example:
Inclusive Innovation Incubator Inclusive Innovation Incubator DC in3dc.com Inclusive Innovation Incubator (In3) - D.C's first co-working, training, & incubator space intentional about diversity & inclusion. Washington 2301-D Georgia Ave, NW Crunchbase
Inclusive Innovation Incubator (In3) Inclusive Innovation Incubator (In3) DC www.in3d.com Inclusive Innovation Incubator (In3) is the District's first community space focused on inclusion innovation and incubation. The incubator is committed to creating a collaborative environment where under-resourced members have access to the space and services needed to build or grow a successful business. Washington DC AngelList,USIncubators
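One way to see why exact name matching misses these two records is to reduce each name to a comparison key. The following is a minimal sketch only: normalize_name is a hypothetical helper, not part of Matcher.pl, and the rule of dropping parenthetical aliases is an assumption that may not suit every record.

```python
import re


def normalize_name(name: str) -> str:
    """Reduce an incubator name to a comparison key (hypothetical helper)."""
    key = name.lower()
    # Assumption: parenthetical text such as "(In3)" is an alias, so drop it.
    key = re.sub(r"\([^)]*\)", " ", key)
    # Replace punctuation and any other non-alphanumeric run with a space.
    key = re.sub(r"[^a-z0-9]+", " ", key)
    # Collapse repeated whitespace and trim the ends.
    return " ".join(key.split())


first = "Inclusive Innovation Incubator"
second = "Inclusive Innovation Incubator (In3)"

print(first == second)                                  # False: raw names differ
print(normalize_name(first) == normalize_name(second))  # True: keys agree
```

A key like this only catches simple variation (case, punctuation, aliases in parentheses), so a manual review would still be needed for names that differ in wording.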
Of the 15 incubator names, 11 were in our incubators table (irrespective of location), and of these y had name variation(s).
Fortunately the count is small, so we can conduct a manual review. For the Google Crawler, we count a hit only if the incubator's own website appears in the results, rather than a news article or other page that merely references the incubator.
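The hit rule for the crawler could be checked mechanically along the following lines. This is a sketch under assumptions: it supposes the crawler results are available as a list of URLs, the helper names domain and is_hit are hypothetical, and the news URL is invented for illustration. Only in3dc.com comes from the records above.

```python
from urllib.parse import urlparse


def domain(url: str) -> str:
    """Return the lowercased host of a URL without a leading 'www.'."""
    if "//" not in url:
        # Bare hosts such as "in3dc.com" need "//" to be parsed as a host.
        url = "//" + url
    host = urlparse(url).hostname or ""
    return host[4:] if host.startswith("www.") else host


def is_hit(incubator_url: str, result_urls: list[str]) -> bool:
    """True only if some result is on the incubator's own domain."""
    target = domain(incubator_url)
    return any(domain(u) == target for u in result_urls)


# Hypothetical crawler output: one news article and the incubator's own site.
results = [
    "https://news.example.com/in3-opens-in-dc",
    "https://www.in3dc.com/about",
]
print(is_hit("in3dc.com", results))      # True: own website is present
print(is_hit("in3dc.com", results[:1]))  # False: only a news article
```

Comparing hosts rather than searching for the name in the page text is what keeps news coverage from counting as a hit.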
Name | Location | Crunchbase | INBIA | AngelList | US Incubators | Google Crawler | Any |
---|---|---|---|---|---|---|---|
Inclusive Innovation Incubator (In3) | Washington, DC | 1 | 0 | 1 | 1 | 1 | 1 |
AU Entrepreneurship Incubator | Washington, DC | 0 | 0 | 0 | 0 | 0 | 0 |
Global Development Incubator | Washington, DC | 0 | 0 | 0 | 1 | 0 | 1 |
Halcyon Incubator | Washington, DC | 0 | 0 | 0 | 1 | 1 | 1 |
The Hatchery | Washington, DC | 0 | 0 | 0 | 1 | 0 | 1 |
Vermont Center for Emerging Technologies (VCET) | Burlington, VT | 1 | 1 | 0 | 0 | 1 | 1 |
Austin Technology Incubator | Austin, TX | 1 | 0 | 1 | 0 | 1 | 1 |
IncubatorCTX | Austin, TX | 0 | 0 | 0 | 0 | 0 | 0 |
Economic Growth Business Incubator | Austin, TX | 0 | 0 | 0 | 0 | 1 | 1 |
ACC Bioscience Incubator | Austin, TX | 0 | 0 | 1 | 0 | 1 | 1 |
Bunker Labs | Austin, TX | 0 | 0 | 0 | 0 | 1 | 1 |
Galvanize | Austin, TX | 0 | 0 | 0 | 0 | 0 | 0 |
University Enterprise Laboratories | St. Paul, MN | 0 | 0 | 0 | 1 | 1 | 1 |
Discovery Launchpad at UMN | Minneapolis, MN | 0 | 0 | 0 | 0 | 0 | 0 |
Lunar Startups | St. Paul, MN | 0 | 0 | 0 | 1 | 0 | 1 |
Total | | 3 (20%) | 1 (7%) | 3 (20%) | 6 (40%) | 8 (53%) | 11 (73%) |
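The totals row can be recomputed from the 0/1 flags in the table. The sketch below transcribes the flags by hand into a dictionary (the names ROWS, SOURCES and coverage are illustrative, not part of the project's code) and derives each source's count, its share of the 15 benchmark incubators, and the "Any" column.

```python
SOURCES = ["Crunchbase", "INBIA", "AngelList", "US Incubators", "Google Crawler"]

# Flags per incubator, in SOURCES order, transcribed from the table above.
ROWS = {
    "Inclusive Innovation Incubator (In3)": (1, 0, 1, 1, 1),
    "AU Entrepreneurship Incubator": (0, 0, 0, 0, 0),
    "Global Development Incubator": (0, 0, 0, 1, 0),
    "Halcyon Incubator": (0, 0, 0, 1, 1),
    "The Hatchery": (0, 0, 0, 1, 0),
    "Vermont Center for Emerging Technologies (VCET)": (1, 1, 0, 0, 1),
    "Austin Technology Incubator": (1, 0, 1, 0, 1),
    "IncubatorCTX": (0, 0, 0, 0, 0),
    "Economic Growth Business Incubator": (0, 0, 0, 0, 1),
    "ACC Bioscience Incubator": (0, 0, 1, 0, 1),
    "Bunker Labs": (0, 0, 0, 0, 1),
    "Galvanize": (0, 0, 0, 0, 0),
    "University Enterprise Laboratories": (0, 0, 0, 1, 1),
    "Discovery Launchpad at UMN": (0, 0, 0, 0, 0),
    "Lunar Startups": (0, 0, 0, 1, 0),
}


def coverage(rows: dict) -> dict:
    """Return {column: (count, rounded percent)} for each source and 'Any'."""
    n = len(rows)
    out = {}
    for i, source in enumerate(SOURCES):
        count = sum(flags[i] for flags in rows.values())
        out[source] = (count, round(100 * count / n))
    # 'Any' counts incubators found by at least one source.
    any_count = sum(1 for flags in rows.values() if any(flags))
    out["Any"] = (any_count, round(100 * any_count / n))
    return out


for column, (count, pct) in coverage(ROWS).items():
    print(f"{column}: {count} ({pct}%)")
```

The "Any" figure works out to 11 of 15, or 73%, which is above the 70% baseline named in the Overview; note that this figure includes hits from the Google Crawler.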