Difference between revisions of "Ranking US Cities by Venture Capital"
Line 6: | Line 6: | ||
|Does subsume=Top Cities for VC Backed Companies, | |Does subsume=Top Cities for VC Backed Companies, | ||
}} | }} | ||
− | |||
− | + | ==Project Description== | |
+ | |||
+ | ==Progress Report== | ||
+ | |||
+ | ==Project Location== | ||
+ | |||
E:\McNair\Projects\Ecosystem\Ranking | E:\McNair\Projects\Ecosystem\Ranking | ||
+ | |||
+ | ==Notes== | ||
The main SQL file is: | The main SQL file is: |
Revision as of 12:36, 3 November 2017
Ranking US Cities by Venture Capital | |
---|---|
Project Information | |
Project Title | Ranking US Cities by Venture Capital |
Owner | Ed Egan, Anne Dayton, Diana Carranza |
Start Date | Summer 2017 |
Deadline | |
Primary Billing | |
Notes | |
Has project status | Active |
Subsumes: | Top Cities for VC Backed Companies |
Copyright © 2016 edegan.com. All Rights Reserved. |
Project Description
Progress Report
Project Location
E:\McNair\Projects\Ecosystem\Ranking
Notes
The main SQL file is:
Ranking.sql
This file works on the database cities.
However, the cities database takes vc data from vcdb2. See VC Database Rebuild for information on the build.
The output from vcdb2 is roundleveloutput2, which is exported as roundleveloutput2.txt
roundleveloutput2.txt is imported into cities as roundleveloutput2, and then left joined with populationtablecorrected. This is done in last.population.sql, which is in
Z:\Hubs\2017\sql scripts
Population issue
One issue is that cities without populations in populationtablecorrected but in roundleveloutput2 get dropped later. For example, roundleveloutput2 contains Brooklyn, NY but populationtablecorrected doesn't (its New York count is therefore suspect too).
populationtablecorrected is loaded from files 'population1.txt' and 'population2.txt'. Dups are removed with a max.
'population1.txt' and 'population2.txt' are in:
Z:\Hubs\2017\clean data\Population
Their construction notes are on Hubs#Population_Data