VentureXpert Data

From edegan.com
Revision as of 15:21, 28 June 2018 by Adliebster (talk | contribs)
Jump to navigation Jump to search


Augi Liebster (Work Log)

McNair Project
VentureXpert Data
Project logo 02.png
Project Information
Project Title VentureXpert Data
Owner Augi Liebster
Start Date June 20, 2018
Deadline
Primary Billing
Notes
Has project status Active
Copyright © 2016 edegan.com. All Rights Reserved.


Relevant Former Projects

Location

My files are located in the Z drive in a folder called VentureXpert Data:

Z:\VentureXpertDB

Files for the previous project such as LoadScripts or old rpt files are located in the following places:

Database files are located here:

Z:\VentureCapitalData\SDCVCData\vcdb2

SDC files are located here and the normalized versions are copied into the Z folder above:

E:\McNair\Projects\VC Database

Database can be started by typing psql vcdb2 The file containing all the SQL queries used to build vcdb2 is located in the Z drive and named ProcessData2.sql.

Z:\VentureCapitalData\SDCVCData\vcdb2\ProcessData2.sql


Goal

I will be looking to redesign the VC Database in a way that is more intuitively built than the previous one. I will also update the database with current data.

Initial Stages

The first step of the project was to figure out what primary keys to use for each major table that I create. I looked at the primary keys used in the creation of the VC Database Rebuild and found primary keys that are decent. I have updated them and list them below:

  1. CompanyBaseCore- coname, statecode, datefirstinv
  2. IPOCore- issuer, issuedate, statecode
  3. MACore- target name, target state code, announceddate
  4. Geo - city, statecode, coname, datefirst, year
  5. DeadDate - conname, statecode, datefirst, rounddate (tentative could still change)
  6. RoundCore- conname, statecode, datefirst, rounddate
  7. FirmBaseCore - firmname
  8. FundBaseCore - fund name (firstinvedate doesn't work because not every row has an entry)

These are my initial listings and I will come back to update them if needed.

The second part of the initial stage has been to pull data from the SDC Platinum platform with updated dates to make the pull as recent as possible.


VCFund Pull Problem

When pulling the VCFund1980-Present, I encountered two problems. One, is that SDC is not able to sort through the funds that are US only with the built in filters. Two, there are multiple rpt files that specify different variables for the fund pull. I pulled from both to be safe, but in the VC Database Rebuild page there is a section on the fund pull where Ed specifies which rpt file he used to pull data from SDC. Regardless I have both saved in the ExtractedData folder. After speaking with Ed, he told me to use the VCFund1980-present.rpt file to extract the data. Had various problems extracting data including freezing of SDC program or getting error Out of Memory. Check the SDC Platinum (Wiki) page to fix these issues.