Changes
Jump to navigation
Jump to search
no edit summary
The data set has now been uploaded to the database server, named Hubs.
There are 4 tables: *Rounds: Rounddate, coname, state, roundno, stage1, etc.*CombinedRounds: Coname, rounddate, discamount, fundname*Companies: LastInv, FirstInv, coname, MSA, MSACode, and Address, state, datefounded, totalknownfunding, industry(major)*Funds: fundname, closingdate, lastinv, firstinv, msa, msacode, avinv, nocoinv, totalknowninv, address
Used variables:
Companies: Coname, MSACode, Industry, state
MSALookupTable: MSACode, MSASuper
IndustryLookupTable: IndustryMajor, InduCode
->
CompanyInfo: Coname, MSASuper, InduCode, state
Funds: fundname, msacode, state
MSALookupTable: MSACode, MSASuper
->
FundInfo: fundname, msacode, state
Rounds: coname, rounddate, stagecode, roundno
CombinedRounds: coname, rounddate, discamount, fundname
->
RoundInfoSuper: coname, rounddate, '''nofunds''', discamount
->
RoundInfo: Coname, roundyear, fundname, estamount
Then take:
RoundInfo: Coname, roundyear, fundname, estamount
CompanyInfo: Coname, MSASuper, InduCode, state
FundInfo: fundname, msacode, state
->
SuperRoundInfo: Coname, CoMSASuper, CoInduCode, CoState, FundName, FundMSASuper, FundState, RoundYear, RoundEstAmount
->
MSAPortCos: Count(CoName) As NoPortCosFunded, CoMSASuper, RoundYear
...
===Hub Candidates Data Set===
The Hubs candidate data set is a list of potential hubs found in MSAs throughout the country. Researchers are currently pulling qualitative and quantitative information from the candidate's websites, in an attempt to categorize what can be identified as a hub. This is a difficult data set to pull, as there is little to no quantitative information available for this category of institution, and is dependent on accessibility of information to the public on the internet.
*US Census Bureau data on employment by MSA: http://factfinder.census.gov/faces/tableservices/jsf/pages/productview.xhtml?pid=ACS_14_5YR_B23027&prodType=table
*USPTO tility patents by MSA: http://www.uspto.gov/web/offices/ac/ido/oeip/taf/cls_cbsa/allcbsa_gd.htm
*MSA level trends: http://www.metrotrends.org/data.cfmcf
===To Do===
**complete (Employment and Income_MSA.xls)
*Firm births (BDS)
*SELECT MSAs!!!
===Data Cleaning===
Cleaning tasks:
*Remove PortCos named Undisclosed, etc.
*Remove Funds named Unknown, etc.
*Basic Data cleaning:
**Enormous outliers on funds invested
**Check dates
Lookup tables:*select MSAsSuperMSAs*Industry*Stages