== Processing Steps ==
Get the source data:
# Copy over the rpt, ssh, and pl files to E:\projects\vcdb24\SDC, and bulk edit the ssh files.
## Make the final date 2023 and change vcdb23 to vcdb24.
# Run the ssh files against SDC Platinum one last time. Note that SDC Platinum's service will be withdrawn on 31 December 2023.
# Run the [[SDC Normalizer]] script (one of the pl files) on each output
## Fix the header row in USFirms1980.txt before normalizing (the Capital Under Management column name is too long)
## The private and public M&A file sets each have to be combined into a single file (two files in total) after they've been normalized. Then replace \tnp\t and \tnm\t with \t\t in each.
## For RoundOnOneLine, remove the footer, run NormalizeFixedWidth.pl first, then RoundOnOneLine.pl, and then fix the header.
## PortCoLongDescription must be pre-processed from the command line and then post-processed in Excel (see [[VCDB20H1]] and [[Vcdb4#Long_Description]]). However, I didn't load it for this run.
Create the postgres database:
# Create a new database on mother (createdb vcdb24) and set up a directory for the input files: E:\projects bulk\vcdb24
# Copy over (to sql folder) and edit Load.sql. Run it section-by-section.
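
The M&A step above (combine the normalized files, then replace \tnp\t and \tnm\t with \t\t) could be scripted roughly as follows. This is only a sketch and not one of the existing pl files. The function names are hypothetical, and it assumes that every normalized file is UTF-8 text that starts with the same header row, which has not been checked against the real output. It uses lookarounds rather than a literal search-and-replace so that two placeholder fields sitting next to each other are both blanked in one pass.

```python
import re

# A field that is exactly "np" or "nm" and sits between two tabs.
# The lookarounds leave the tabs in place, so adjacent placeholder
# fields are all matched (a literal "\tnp\t" replace would skip every other one).
_PLACEHOLDER = re.compile(r"(?<=\t)(?:np|nm)(?=\t)")


def blank_placeholders(line):
    """Return the line with interior np/nm fields replaced by empty fields."""
    return _PLACEHOLDER.sub("", line)


def combine_normalized(inputs, output, skip_repeated_header=True):
    """Concatenate normalized files into one file, blanking placeholders.

    Assumption: each input starts with the same header row, so the header
    is written once, from the first file only.
    """
    with open(output, "w", encoding="utf-8", newline="") as out:
        for file_index, path in enumerate(inputs):
            with open(path, encoding="utf-8", newline="") as src:
                for line_index, line in enumerate(src):
                    if skip_repeated_header and file_index > 0 and line_index == 0:
                        continue
                    # Guard against a file whose last line has no newline.
                    if not line.endswith(("\n", "\r")):
                        line += "\n"
                    out.write(blank_placeholders(line))
```

It would be run once for the private file set and once for the public file set, giving the two combined files. Like the manual replacement, it leaves the first and last fields of each row alone, since they are not enclosed by tabs.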