Difference between revisions of "Ranking US Cities by Venture Capital"

From edegan.com
Jump to navigation Jump to search
Line 31: Line 31:
 
  Z:\Hubs\2017\sql scripts
 
  Z:\Hubs\2017\sql scripts
  
===Population issue===
+
===Population data location===
 +
Z:\Hubs\2017\clean data\Population
  
 
One issue is that cities without populations in populationtablecorrected but in roundleveloutput2 get dropped later. For example, roundleveloutput2 contains Brooklyn, NY but populationtablecorrected doesn't (its New York count is therefore suspect too).
 
One issue is that cities without populations in populationtablecorrected but in roundleveloutput2 get dropped later. For example, roundleveloutput2 contains Brooklyn, NY but populationtablecorrected doesn't (its New York count is therefore suspect too).
Line 37: Line 38:
 
'''populationtablecorrected''' is loaded from files 'population1.txt' and 'population2.txt'. Dups are removed with a max.
 
'''populationtablecorrected''' is loaded from files 'population1.txt' and 'population2.txt'. Dups are removed with a max.
  
'population1.txt' and 'population2.txt' are in:
 
Z:\Hubs\2017\clean data\Population
 
  
 
Their construction notes are on [[Hubs#Population_Data]]
 
Their construction notes are on [[Hubs#Population_Data]]

Revision as of 12:46, 3 November 2017


McNair Project
Ranking US Cities by Venture Capital
Project logo 02.png
Project Information
Project Title Ranking US Cities by Venture Capital
Owner Ed Egan, Anne Dayton, Diana Carranza
Start Date Summer 2017
Deadline
Primary Billing
Notes
Has project status Active
Subsumes: Top Cities for VC Backed Companies
Copyright © 2016 edegan.com. All Rights Reserved.


Project Description

Progress Report

Project Location

E:\McNair\Projects\Ecosystem\Ranking

Dependencies

Project Overview

Main SQL file location

Ranking.sql

This file works on the database cities.

However, the cities database takes vc data from vcdb2. See VC Database Rebuild for information on the build.

The output from vcdb2 is roundleveloutput2, which is exported as roundleveloutput2.txt

roundleveloutput2.txt is imported into cities as roundleveloutput2, and then left joined with populationtablecorrected.

Last Population SQL file location

Z:\Hubs\2017\sql scripts

Population data location

Z:\Hubs\2017\clean data\Population

One issue is that cities without populations in populationtablecorrected but in roundleveloutput2 get dropped later. For example, roundleveloutput2 contains Brooklyn, NY but populationtablecorrected doesn't (its New York count is therefore suspect too).

populationtablecorrected is loaded from files 'population1.txt' and 'population2.txt'. Dups are removed with a max.


Their construction notes are on Hubs#Population_Data