Difference between revisions of "Seed DB Parser"
Jump to navigation
Jump to search
Maxine.tao (talk | contribs) |
Leminh.ams (talk | contribs) |
||
Line 15: | Line 15: | ||
==How To Use== | ==How To Use== | ||
1. Change line 14 to be the name of your input file and line 15 to be the name of your desired output file. | 1. Change line 14 to be the name of your input file and line 15 to be the name of your desired output file. | ||
− | + | 2. The format of the input file is: | |
+ | First line (header): Accelerators | ||
+ | Next lines: company name per row. | ||
2. Type "python3 parser.py" into the command prompt. | 2. Type "python3 parser.py" into the command prompt. |
Revision as of 15:35, 3 August 2018
Seed DB Parser | |
---|---|
Project Information | |
Project Title | Seed DB Parser |
Start Date | |
Deadline | |
Primary Billing | |
Notes | |
Has project status | |
Copyright © 2016 edegan.com. All Rights Reserved. |
Location
E:\McNair\Projects\Seed DB\parser.py
ListOfAccs.txt - input file containing a list of accelerators that we want to pull information on
Functionality
Uses Selenium Webdriver to pull cohort companies, timing info from SeedDB website
SeedDB is structured so that there is a page containing a list of accelerators. If you click on an accelerator name, you are then taken to another page of all their cohorts. This second page of all cohorts for each accelerator is stored in a folder called seedDBhtml.
How To Use
1. Change line 14 to be the name of your input file and line 15 to be the name of your desired output file. 2. The format of the input file is:
First line (header): Accelerators Next lines: company name per row.
2. Type "python3 parser.py" into the command prompt.