Difference between revisions of "Talk:Accelerator Seed List (Data)"

From edegan.com

Revision as of 10:06, 24 April 2017

Hi Veeral,

Intro

Welcome to the project. The documents are here: E:\Mcnair\Projects\Accelerators

SQL documents are here: E:\Mcnair\Projects\Accelerators\SQL_Data

Database Drive is here: Z:\Bulk\Accelerators

Important docs

The SDC pull that includes all of the round data since 1999: E:\Mcnair\Projects\Accelerators\VC_Data_Repeated_Down.txt or E:\Mcnair\Projects\Accelerators\"VC Data.xlsx"

The Cohorts of accelerators (under the updated tab on the bottom): E:\Mcnair\Projects\Accelerators\"Clean Cohort Data.xlsx"

The Crunchbase Snapshots of organizations:

To-do list

Don't worry about this stuff

Rank on VC

  • Getting a VC percentage for each Accelerator

Also categorize

  • Age
  • Nonprofit or not
  • Location

RegEx code for repeating data down in the round data from SDC (the first line is the search pattern, the second is the replacement):

\n([^\t]+\t[^\t]*\t[^\t]*\t[^\t]*\t[^\t]*\t[^\t]*\t[^\t]*\t[^\t]*\t[^\t]*\t[^\t]*\t)(.*)\n\t\t\t\t\t\t\t\t\t\t

\n\1\2\n\1
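As a rough illustration, the same find-and-replace can be scripted. The sketch below assumes Python's `re` module rather than whatever text editor was originally used, and the function name `repeat_down` and the sample rows are made up for the example. One pass only fills the row directly beneath a complete row, so the substitution is repeated until nothing changes. The pattern also needs a newline before the first data row, so the sample starts with a header line.

```python
import re

# Same search pattern as above, assembled piece by piece for readability:
# ten tab-terminated key fields (group 1), the rest of the row (group 2),
# then a following row that begins with ten empty fields.
KEY_FIELDS = r"[^\t]+\t" + r"[^\t]*\t" * 9
PATTERN = re.compile(r"\n(" + KEY_FIELDS + r")(.*)\n" + r"\t" * 10)
REPLACEMENT = r"\n\1\2\n\1"


def repeat_down(text: str) -> str:
    """Copy the ten key fields into blank-prefixed rows until none are left to fill."""
    while True:
        filled, count = PATTERN.subn(REPLACEMENT, text)
        if count == 0:
            return filled
        text = filled


# Hypothetical sample data: one full row followed by two continuation rows.
header = "\t".join(f"col{i}" for i in range(12))
key = "\t".join(["Acme"] + [f"k{i}" for i in range(1, 10)]) + "\t"
sample = "\n".join([
    header,
    key + "round1\t100",
    "\t" * 10 + "round2\t200",
    "\t" * 10 + "round3\t300",
]) + "\n"

result = repeat_down(sample)
```

After the call, each continuation row in `result` should start with the same ten key fields as the row above it.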

=if(isnumber(search("blah",B2))=TRUE,1,0) where blah is the substring (what you're searching for) and B2 is the string (what you're searching in). The result is 1 if the substring is present and 0 if it isn't.

=sum(A1:C1) sums the cells from A1 to C1.
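For anyone working outside Excel, here is a minimal Python sketch of the same two steps. It assumes a plain case-insensitive substring test, which matches SEARCH's case-insensitivity but ignores its wildcard characters. The function name `contains_flag` and the sample cell values are invented for illustration.

```python
def contains_flag(needle: str, haystack: str) -> int:
    """Return 1 if needle occurs in haystack (ignoring case), else 0."""
    return 1 if needle.lower() in haystack.lower() else 0


# Hypothetical row of three cells, standing in for A1:C1.
row = ["Blah Ventures", "Other Fund", "blah capital"]

flags = [contains_flag("blah", cell) for cell in row]  # one 0/1 flag per cell
total = sum(flags)  # same idea as =sum(A1:C1) over the flag cells
```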