Difference between revisions of "US Incubators"

From edegan.com
Jump to navigation Jump to search
Line 157: Line 157:
 
=Useful Regular Exes=
 
=Useful Regular Exes=
  
1. Replace “\s+$” with [leave blank] removes all the empty lines  
+
1. Replace “\s+$” with [leave blank] to remove all the empty lines  
  
2. Replace "s+$" with [leave blank] removes all the whitespaces
+
2. Replace "s+$" with [leave blank] to removes all the whitespaces
  
 
3. <.*> finds everything that starts with < and ends with >
 
3. <.*> finds everything that starts with < and ends with >
  
 
4. Replace href=" with "\n" to start a new line for each url
 
4. Replace href=" with "\n" to start a new line for each url
 +
 +
5. Replace "\s\s+" with [leave blank] to remove more than one white spaces

Revision as of 15:41, 30 April 2019


Project
US Incubators
Project logo 02.png
Project Information
Has title US Incubators
Has owner Yi Ma
Has start date
Has deadline date
Has project status Active
Copyright © 2019 edegan.com. All Rights Reserved.


Objective

The objective of this project is to assemble a near-population dataset on U.S. incubators! This project uses the Incubator Seed Data.

File Location

E:\projects\Kauffman Incubator Project\01 Classify entrepreneurship ecosystem organizations\YiMaResearch\US Incubators

Progress

Extract incubator data from the following sources

  • National Data
Source Progress How many? Data Method
Whartoneclub Incubators Done 21 url, Company Name, City, State regular expression
InterNational Business Incubation Association or see our INBIA page Done 415
  • Company Name and address
regular expression
Clustermapping Not Done 292
  • Company name with link to a separate page within cluster mapping
  • on that page there is a link to the incubator's website
regular expression
The MBA Is Dead Not Done 186 Results
  • City and Country
  • low equity, high offer, high value
  • high equity, low offer, low value
regular expression
  • Regional Data
Source Progress How many? Region Data Method
Alabama Business Incubation Network Done 12 Alabama Incubator Name, URL, and Brief Description regular expression
Florida Business Incubation Association Done 72 Florida incubator name, address, city, state, phone number and url regular expression
Louisiana Business Incubation Association Done 25 Louisiana
  • incubator name
  • contact name
  • address and phone number
  • link to website
regular expression
Maryland Business Incubation Association Done 35 Maryland Incubator name, short description, and link to another page within main site with contains a link to the incubator home page regular expression
Massachusetts Association of Business Incubators Done 21 Massachusetts incubator name, short description, and link to incubator home page regular expression
Boston Startup Guide Done 10 Boston
  • Company Name and URL
  • Capital Provided & equity taken
  • Application Process
regular expression
Michigan Business Innovation Association Done 15 Michigan company name, ulr, address, url, city, state, zip code regular expression
NH Tech Alliance Done 10 New Hampshire company name, city, url, brief description regular expression
NC Business Incubation Association Done 33 North Carolina Incubator name, address, contact, title, phone number, url and email Manual Data Collection
Oklahoma Business Incubator Association Done 34 Oklahoma Incubator name and link to it regular expression
Incubators/Accelerators In DC Done 55* DC Incubator name and link to it and brief description regular expression

Notes: 55* includes both incubators and accelerators

Useful Regular Exes

1. Replace “\s+$” with [leave blank] to remove all the empty lines

2. Replace "s+$" with [leave blank] to removes all the whitespaces

3. <.*> finds everything that starts with < and ends with >

4. Replace href=" with "\n" to start a new line for each url

5. Replace "\s\s+" with [leave blank] to remove more than one white spaces