Revision as of 11:47, 7 July 2016

Abstract

McNair Project
The Impact of Entrepreneurship Hubs on Urban Venture Capital Investment
Project Information
Project Title
Start Date
Deadline
Primary Billing
Notes
Has project status
	Copyright © 2016 edegan.com. All Rights Reserved.

The Hubs Research Project is a full-length academic paper analyzing the effectiveness of "hubs", a component of the entrepreneurship ecosystem, in the advancement and growth of entrepreneurial success in a metropolitan area.

This research will primarily be focused on large and mid-sized Metropolitan Statistical Areas (MSAs), as that is where the greater majority of Venture Capital funding is located.

A general overview of entrepreneurial ecosystems can be found here: Entrepreneurial Ecosystem.

Data

Venture Capital General Overview

The main goal of the data set is to aggregate company, fund, and round level data to be analyzed at a combined MSA and year level. The data set is compromised of two major parts: a granular company/fund/round and an aggregated CMSA-Year. The data includes all United States Venture Capital transactions (moneytree) from the twenty-five year period of 1990 through 2015.

The Hubs data set, from SDC Platinum, has been constructed in the server:

Data files are in 128.42.44.181/bulk/Hubs
All files are in 128.42.44.182/bulk/Projects/Hubs
psql Hubs2

See the server for the code and ~1st 5 rows of each table

Procedure - Granular Table

Start with separate raw datasets for Companies, Funds, and Rounds
Add Data to Each Individual dataset (e.g. add MSA code)
Clean and standardize names (e.g. company or fund name) for each dataset
Join the Datasets (here we need to exclude undisclosed companies)

Procedure - CMSA-Year Table

Create a consistent CMSA-Year table to be used later
Using the tables from the granular table, parse out the right data
Join the parsed out data with the CMSA-Year Table
Join these Tables

VC Specific Tables and Procedure

Raw data tables

Funds: fund name, first investment date, last investment date, fund closing date, address, known investment, average investment, number of companies invested, MSA, MSA code.
Rounds: round date, company name, state, round number, stage 1, stage 2, stage 3
Combined Rounds: company name, round date, disclosed amount, investor
Companies: company name, first investment, last investment, MSA, MSA code, address, state, date founded, known funding, industry
MSA List: MSA, MSA code, CMSA, CMSA code
Industry List: changes 6 industry categories to 4— ICT, Life Sciences, Semiconductors, Other

Granular Table (Fund-Round-Company)

The final table here contains all venture capital transactions by disclosed funds and portfolio companies, together with their CMSAs. To get the table, we processed the raw data sets in the following steps:

Clean Company data
1. Import raw data companies
2. Add variable 'CMSA' from data set MSA list, update variable 'industry' by joining data set industry list
3. Remove duplicates and remove undisclosed companies
Clean Fund data
1. Import raw data funds
2. Add variable 'CMSA'
3. Remove duplicates and remove undisclosed funds
4. Match fund names with itself using [The Matcher (Tool) |The Matcher] to get the standard fund names
Clean Round data
1. Import raw data rounds and combined rounds
2. Add variables 'number of investment', 'estimated investment' and 'year'
3. Remove duplicates and remove undisclosed funds
Combine Companies and Rounds
1. Combine cleaned companies and rounds data table on company names
2. Add variable 'round number' and 'stage'
3. Remove duplicates
Combine Funds and rounds-companies
1. Match fund names in rounds data table with standard fund names using [The Matcher (Tool) |The Matcher] to standardize fund names in rounds data table
2. Join standard fund names to rounds-companies table
3. Join cleaned funds table to rounds-companies table on standard fund names

CMSA-Year Aggregated Table

The final table contains number of companies and amount of investment, categorized by distance and stages, of each CMSA.

We processed data as follows:

Create the CMSA-Year Table
1. Create single variable tables: Distinct CMSA, year, stage, found year of fund and found year of company.
2. Create the cross production tables: CMSA-year, CMSA-year-fund year founded and CMSA-year-company year founded
Draw data from cleaned companies, funds and rounds tables
1. Create a table with 'CMSA', 'number of companies' and 'year Founded' from cleaned companies table and join it to CMSA -year founded
2. Create a table with 'Company CMSA', 'round year', 'disclosed amount' from rounds-companies combined table, and add stage binary variables. Join it to CMSA-year-company year founded
3. Create a table with 'CMSA', 'fund year', 'number of investors' from cleaned funds table and join it to CMSA-year-fund year founded
Create near-far and stages table
1. Add fund data to rounds-companies
2. Create near-far and stages binary variable
3. Count investment and deals by CMSA and year, categorized by near-far and stages
Combine all tables by CMSA and round-year

Supplementary Data Sets

Supplementary data sets are cleaned and joined back to CMSAyear table on CMSA and year:

Number of STEM graduate student, by university and year(2005 to 2014).
University R&D spending, by university and year(2004 to 2014).
Income per capital, by MSA and year(2000 to 2012)
Wages and salaries, by MSA and year(2000 to 2012)

The datasets can respectively be found at:

E:\McNair\Projects\Hubs\STEM grads for upload v2.xls
E:\McNair\Projects\Hubs\NSF spending for upload.xls
E:\McNair\Projects\Hubs\Income per capita upload.xls
E:\McNair\Projects\Hubs\Wage for upload v2.xls

Hubs Data

As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics.

Variables

Ideally, we would have the following variables (not collected previously):

Onsite VC/Angel/Investors (Count or binary)
Onsite Mentors (binary) --- Are these the same as advisers?
"Office hours" with investors or mentors (binary)
Onsite temporary workshops (binary or count)
Meetups (Binary or count)
Sponsors and Partners (binary and list) --- are these the same?
Alumni Network (binary)
Num of Companies
Nonprofit (binary)
Mission Includes Key Buzzwords (e.g. "ecosystem", "community")

Example of Prior Variables Collected:

Specific Industry (Multinomial)
Num of Events
Price for Single Space
Price for Office
Twitter Activity (Multinomial or Count)
Size (sqft)
Num Conference Rooms
Onsite accelerator (binary)
Onsite code school (binary)
Community Membership (binary)

Example Mechanical Turk

Twitter

Sign up/log in to a Twitter account at https://twitter.com/. In the search bar, key in the company name.
Alternatively, type company name, city/state, and "twitter" into a search engine and click on result from twitter.com with the company name
Sort the searching results by date
Count the number of tweets posted by that company in the most recent complete month (e.g. June 2016)

Nonprofit

Search the company name along with its city/state in a search engine
Click on the result that looks like the company website
Go to links that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'
Look for the key word 'nonprofit'/'non-profit'
If 'nonprofit' is identified, mark as 1, otherwise 0.

Onsite Mentors

Go to the company's website
Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'
If the key words can be identified, mark as 1
If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., look for a subsection or mention of mentor/mentorship/mentoring
If these exist, mark as 1.
If not, go to links related to membership 'benefits,' 'perks,' or related.
Do same process as end of 4 and 5
If there is no mention of mentorship in these sections, type the company, city, and 'mentoring' into a search engine. If a link to a reliable website (such as Desktime) appears and mentorship can be found in the description, mark as 1.
If none of these steps result in a mark of 1, mark as 0

Sponsors and Partners

Go to the company's website
Look for the link or mention of 'Sponsors' or 'Partners', many times of which is often under the section of 'About', 'Community', or related selection
If sponsors or partners can be found mark as 1 and list them, otherwise mark as 0.

Size

To check

Start. Co (Memphis, TN)
startuptown (pittsburgh,PA)
The Hive (Palo Alto) 
StartX, Silicon Valley CA
Alloy 26 (Pittsburgh, PA)
Trailhead( Boise, Idaho)
The Vault (San Francisco)

Resources

Additional Resources

Yael Hochberg and Fehder (2015), located in dropbox
- Use this paper as a guideline on how to conduct the analysis
US Census Bureau data on employment by MSA: http://factfinder.census.gov/faces/tableservices/jsf/pages/productview.xhtml?pid=ACS_14_5YR_B23027&prodType=table
USPTO utility patents by MSA: http://www.uspto.gov/web/offices/ac/ido/oeip/taf/cls_cbsa/allcbsa_gd.htm
MSA level trends: http://www.metrotrends.org/data.cf

Difference between revisions of "The Impact of Entrepreneurship Hubs on Urban Venture Capital Investment"

Revision as of 11:47, 7 July 2016

Contents

Abstract

Data

Venture Capital General Overview

Procedure - Granular Table

Procedure - CMSA-Year Table

VC Specific Tables and Procedure

Raw data tables

Granular Table (Fund-Round-Company)

CMSA-Year Aggregated Table

Supplementary Data Sets

Hubs Data

Variables

Example Mechanical Turk

Resources

Additional Resources

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Sites

Sections

Organizations

Help

Tools