* The IPO records also contain listings on junior and foreign exchanges, as well as some OTC - I left these in and flagged them.
===Industry===
The industry coding is in IndustryCodes.txt. Note that:
I also tried some keyword industry coding from both short and long descriptions. The source code is at the top of BuildBaseTables.sql. The results are in sheets in the IndustryCodes.xlsx file.
===BuildBaseTables.sql===
Build the PortCoGrowthGeoId table that codes the city-state to a geoid.
{| class="wikitable"
|- style="vertical-align:bottom;"
! origin
! count
! Method
|-
| style="vertical-align:bottom;" | 1
| style="vertical-align:bottom;" | 45,111
| style="color:#808080;" | Address is geocoded and in tiger place
|-
| style="vertical-align:middle; color:#808080;" | 2
| style="vertical-align:bottom;" | 270
| style="color:#808080;" | city, statecode matches to only 1 geoid, so use it
|-
| style="vertical-align:middle; color:#808080;" | 3
| style="vertical-align:bottom;" | 1,374
| style="color:#808080;" | city, statecode matches to multiple geoids, use the most popular
|-
| style="vertical-align:middle; color:#808080;" | 4
| style="vertical-align:bottom;" | 964
| style="color:#808080;" | 1:1 straight city<->place and statecode match with tiger
|-
| style="vertical-align:middle; color:#808080;" | 5
| style="vertical-align:bottom;" | 509
| style="color:#808080;" | Use zctaplaceinfo to lookup the best place choice for the zipcode
|- style="vertical-align:bottom;"
| style="vertical-align:middle; color:#808080;" | 6
| 636
| style="color:#808080;" | Unable to code
|-
| style="vertical-align:middle; color:#808080;" | 9
| style="vertical-align:bottom;" | 24
| style="color:#808080;" | Custom coded
|}
===StartupCities===
The original Startup Cities code is in E:\projects\BayesianStartupCities\V1\startupcities.sql. The new version is in e:\projects\BayesianStartupCities\StartupCitiesV2.sql.