==Work Done in Late November by Dylan & Ed==
SBIR Data taken from McNair\Projects\SBIR\Data\Aggregate SBIR\SBIR.txt. -Note! This file needed to be opened in excel to be readable, and took a very long time to open due to its large size. SBIR firm names converted to a pivot table to eliminate exact repeat entries, and then exported to a txt file, NSBIR. NSBIR then matched using The Matcher in mode 2 with the following code : "-file1="NSBIR.txt" -file2="NSBIR.txt" -mode=2" Output then placed in : McNair\Projects\MatchingEntrepsToVC\Matching\Output. The original pre-matched, cleaned NSBIR.txt file is moved to : McNair\Projects\MatchingEntrepsToVC\Matching\IntputInput.
There is a sql file to extract VC portcos (SEL backed only), with key info from vcdb2, and distinct assignee names from allpatentsprocessed here:
*vcbackedselcokeys.txt - extracted with key info from vcdb2. It needs pivot tabling to get unique names.
There These .txt files were made distinct, and then matched against themselves for normalization. The normalized files still need to be matched against each other. They are threelocated in: McNair\Projects\MatchingEntrepsToVC\Matching\Normalized