Difference between revisions of "VC Database Rebuild"
(→Plan) |
|||
Line 24: | Line 24: | ||
Match geodata to companybase. Pull geokey into companybase table. Lookup addresses to get geo data as required using geo.py. | Match geodata to companybase. Pull geokey into companybase table. Lookup addresses to get geo data as required using geo.py. | ||
+ | |||
+ | Clean fundbase and check valid key (fundname, statecode, firstinvdate) | ||
+ | |||
+ | Clean firmbase and check valid key (firmname, foundingdate) |
Revision as of 17:20, 14 July 2017
Plan
Rebuild roundbase, round, geo, ipos, mas from SDC data. Create companybase from roundbase Create round from roundbase. Build stageflags from round.
Clean companybase by putting flags for Undisclosed Company, US location. Check if key (coname, statecode, datefirstinv) is valid. Remove duplicates manually/update command from roundbase. Check if round key is valid. Remove duplicates.
Build statelookup tables and roundlookup tables.
Clean firmbase tables. Clean ipo tables. Clean mas table.
Run matcher on ipos, companybase. Matcher on mas, companybase. Fix duplicate matches.
Join ipos and companybase. Check if count is valid. Fix match as required. Pull ipo key into companybase and companybase key into ipo table first. Then join.
Join mas and companybase. Check if count is valid. Fix match as required. Pull mas key into companybase and companybase key into mas table first. Then join.
Join ipocompanybase with macompanybase to get a table of portcos, ipos and mas.
Calculate exit date based on ipo, ma, datelastinv + 5 years.
Pull in sel flag into companybase and build dead or alive flag.
Match geodata to companybase. Pull geokey into companybase table. Lookup addresses to get geo data as required using geo.py.
Clean fundbase and check valid key (fundname, statecode, firstinvdate)
Clean firmbase and check valid key (firmname, foundingdate)