Difference between revisions of "Seed DB Parser"

From edegan.com
Jump to navigation Jump to search
Line 1: Line 1:
 
{{Project
 
{{Project
 +
|Has project output=
 
|Has sponsor=McNair Center
 
|Has sponsor=McNair Center
 
|Has title=Seed DB Parser
 
|Has title=Seed DB Parser

Revision as of 12:34, 21 September 2020


Project
Seed DB Parser
Project logo 02.png
Project Information
Has title Seed DB Parser
Has start date
Has deadline date
Has project status
Has sponsor McNair Center
Copyright © 2019 edegan.com. All Rights Reserved.

Location

E:\McNair\Projects\Seed DB\parser.py

ListOfAccs.txt - input file containing a list of accelerators that we want to pull information on

Functionality

Uses Selenium Webdriver to pull cohort companies, timing info from SeedDB website

SeedDB is structured so that there is a page containing a list of accelerators. If you click on an accelerator name, you are then taken to another page of all their cohorts. This second page of all cohorts for each accelerator is stored in a folder called seedDBhtml.

How To Use

1. Change line 14 to be the name of your input file and line 15 to be the name of your desired output file. 2. The format of the input file is:

First line (header): Accelerators
Next lines: company name per row.

2. Type "python3 parser.py" into the command prompt.