We processed data as follows:
#Create helper tables for the '''CMSA-Year''' table
##Create single-variable tables: distinct CMSA, year, stage, founding year of fund, and founding year of company.
##Create the cross-product tables: CMSA-year, CMSA-year-fund year founded, and CMSA-year-company year founded
##Create a table with 'Company CMSA', 'round year', and 'disclosed amount' from the combined rounds-companies table, and add binary stage variables. Join it to the CMSA-year-company year founded table
##Create a table with 'CMSA', 'fund year', and 'number of investors' from the cleaned funds table, and join it to the CMSA-year-fund year founded table
#Create the '''near-far''' and stages table
##Add fund data to rounds-companies
##Create near-far and stage binary variables
##Count investment and deals by CMSA and year, categorized by near-far and stages
#Combine all tables by CMSA and round year
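
The steps above can be illustrated with a minimal sketch in Python. It is not the code actually used for this project: the sample rows, the tuple layout, the column naming, and the function name <code>build_cmsa_year_table</code> are all hypothetical. It shows only the general pattern of building a CMSA-year cross product and then attaching deal counts categorized by near-far and stage, with zeros where a CMSA-year has no deals.

```python
from collections import Counter
from itertools import product

# Hypothetical sample rows: (company CMSA, round year, stage, investor is near).
rounds = [
    ("Boston", 2000, "seed", True),
    ("Boston", 2000, "early", False),
    ("Boston", 2001, "seed", True),
    ("Austin", 2001, "later", False),
]


def build_cmsa_year_table(rounds):
    """Return {(cmsa, year): {"<near|far>_<stage>_deals": count}}."""
    # Single-variable tables: distinct CMSAs and distinct years.
    cmsas = sorted({r[0] for r in rounds})
    years = sorted({r[1] for r in rounds})

    # Count deals per (CMSA, year, near/far, stage).
    counts = Counter(
        (cmsa, year, "near" if near else "far", stage)
        for cmsa, year, stage, near in rounds
    )
    # Every near/far-stage combination observed in the data becomes a column.
    categories = sorted({(key[2], key[3]) for key in counts})

    # Cross product CMSA x year, so every pair gets a row even with no deals.
    table = {}
    for cmsa, year in product(cmsas, years):
        table[(cmsa, year)] = {
            f"{nf}_{stage}_deals": counts.get((cmsa, year, nf, stage), 0)
            for nf, stage in categories
        }
    return table


table = build_cmsa_year_table(rounds)
```

Starting from the full cross product, rather than from the deals themselves, keeps CMSA-years with no activity in the final table as rows of zeros instead of dropping them.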
==Supplementary Data Sets==