===Primary Data Set===
The Hubs data set, from SDC Platinum, has been constructed on the server. Files are in 128.42.44.182181/bulk/Hubs, and the database is opened with psql Hubs.
The data set includes all United States venture capital transactions (moneytree) from 1990 through 2015.
Data has been aggregated at the portfolio company, fund, and round levels. It will be analyzed at the combined MSA level, in terms of the number of companies funded, the number of funds active, and the flow of investment in a given MSA.
Process
* cleaned tables to eliminate duplications, undisclosed variables
* changed all original characters to include CMSA and Industry Codes (companyinfo3, fundinfocleanfinal, roundinfoclean)
* matched funds to avoid any issues with names (e.g. Fund ABC L.P./Fund ABC LP/Fund ABC)
*matched roundinfoclean investors to fundinfocleanfinal investors (roundinfo.txt >> cleanfundfinal.txt)
* joined round and company tables on conames
* populated data with count of companies (deal flow) and estimated amount ($)
** data set in 181 hubs folder under summarycmsa.txt (38394)
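The fund-matching step can be illustrated with a minimal sketch. It assumes that name variants differ only in a trailing "L.P."/"LP" suffix, spacing, and case, as in the Fund ABC example; the actual rules applied to fundinfocleanfinal and roundinfoclean are not recorded here, and the function name <code>normalize_fund_name</code> is hypothetical.

```python
import re

# Assumption: only a trailing "L.P." / "LP" / "L P" suffix is stripped.
# The real cleaning rules used for the Hubs tables may differ.
_LP_SUFFIX = re.compile(r"[\s,]+l\.?\s?p\.?$", re.IGNORECASE)


def normalize_fund_name(name: str) -> str:
    """Return a matching key: collapsed whitespace, no LP suffix, lowercase."""
    cleaned = " ".join(name.split())       # collapse runs of whitespace
    cleaned = _LP_SUFFIX.sub("", cleaned)  # drop a trailing L.P./LP suffix
    return cleaned.rstrip(" .,").lower()


def match_investors(round_investors, fund_names):
    """Map each round investor to a fund name that shares its normalized key.

    Investors with no matching fund are left out of the result.
    """
    by_key = {normalize_fund_name(f): f for f in fund_names}
    matches = {}
    for investor in round_investors:
        key = normalize_fund_name(investor)
        if key in by_key:
            matches[investor] = by_key[key]
    return matches
```

Matching on a normalized key rather than the raw string keeps the original names intact in both tables, so unmatched investors can still be inspected by hand.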
Key decisions:
* Undisclosed companies were dropped throughout, since they have no address
* Counts are done by joining round and company
* Anything fund related must use a disclosed fund
* Near and far, total invested, fund counts, etc. are all computed using only disclosed funds that match
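As a rough illustration of the first two decisions, the sketch below joins rounds to companies by name, skips companies with no location, and tallies companies funded and amount invested per CMSA. The field names <code>coname</code>, <code>cmsa</code>, and <code>amount</code> and the function <code>summarize_by_cmsa</code> are assumptions made for the example, not the actual column names in the Hubs tables.

```python
from collections import defaultdict


def summarize_by_cmsa(companies, rounds):
    """Count funded companies and sum round amounts per CMSA.

    companies: dicts with 'coname' and 'cmsa' (None when undisclosed).
    rounds:    dicts with 'coname' and 'amount' (None when not reported).
    All field names are hypothetical.
    """
    # Keep only companies with a known location; undisclosed ones are dropped.
    cmsa_of = {c["coname"]: c["cmsa"] for c in companies if c["cmsa"]}

    funded = defaultdict(set)       # CMSA -> distinct company names
    invested = defaultdict(float)   # CMSA -> total amount

    for r in rounds:
        cmsa = cmsa_of.get(r["coname"])
        if cmsa is None:
            continue  # round belongs to a dropped or unknown company
        funded[cmsa].add(r["coname"])
        invested[cmsa] += r["amount"] or 0.0

    return {
        cmsa: {"companies": len(names), "amount": invested[cmsa]}
        for cmsa, names in funded.items()
    }
```

Using a set per CMSA means a company with several rounds is counted once, while every round still contributes to the invested amount.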
'''Glossary of Tables'''