==Summary==This page <onlyinclude>The [[VCDB20H1]] project documents the build of vcbd20h1-- a '''V'''enture '''C'''apital '''D'''ata'''B'''ase covering until the end of the first half ('''H1''') of 20'''20'''. Vcbd20h1 includes investments, funds, startups, executives, and exits derived from data from [[VentureXpert]]. It This project updates [[vcdb4]], which covered (almost) to the of Q3 2019, to until the end of the first half (H1) of 2020.See also: [[SDC Normalizer]].</onlyinclude>
==SDC files==
However, it appears that Thomson-Reuters have drastically redefined their 'venture related' flag, as about 25% of the data appears missing now: Using the venture-related flag, we now have 42,005 portco records, whereas we did have 49,250 in the last pull around 10 months ago...
I tried re-pulling PortCo without the venture-related constraint but with a flag for venture-related and buyout (STD-VC and STD-Buyout), which appear the only two such options now (there used to be many more to choose from). This gave 68,596 records, of which 42,068 were venture, 26,527 were not venture, 20,208 were buyout, 48,387 were not buyouts, and 2,2256 were both and 8,575 were neither. I noticed that zozi.com was missing from the new pull, and that it had some early stage deals. It is marked as neither venture nor buyout. I therefore repulled everything without the constraint but with a venture-related and buyout flag at the portco level. (I tested it at the round level and it showed no within-firm variation). I omitted "VC" from the new file names. Note that you should select Columinar Columnar Grid and not text as the output method, and don't both appending the session details.
I also re-did the M&A queries, pulling more info from 1980 to 2020H1 in four goes. I removed the 100pc owned constraint from the search, as I'm concerned that when it isn't reported, observations are omitted. It can be added back later. I also included completed vs. withdrawn, pc cash and stock, and some other useful measures. Then I pulled all the non-private acquisitions in two goes as well.
Sub. 111796
Normalizing the files is generally straight-forward. Use NormalizeFixedWidth.pl (see [[Normalizer Documentation]]) and only copy down the missing keys (e.g., coname, statecode, datefirst, etc.). For most files there is nothing to fix, just select the last header row and go. For fund firm fix the header first, so that the capital under management header entry has some space after it(otherwise it blends with the firm status). For RoundOnOneLine, remove the footer, run NormalizeFixedWidth.pl first then RoundOnOneLine.pl, and then fix the header. For fund, fund execs, and portco, remove double quotes after normalization. For M&As, replace \tnp\t and \tnm\t with \t\t.
For PortCoLongDescription (see [[Vcdb4#Long_Description]]):
Other notes:
*The Stage Level 3 (in Round) options now include secondary purchase , secondary buyout, and open market purchase.