Difference between revisions of "Demo Day Page Parser"

From edegan.com
Revision as of 16:28, 14 November 2017


McNair Project
Demo Day Page Parser
Project Information
Project Title Demo Day Page Parser
Owner Peter Jalbert
Start Date
Deadline
Primary Billing
Notes
Has project status Active
Copyright © 2016 edegan.com. All Rights Reserved.


Code Location

The code directory for this project can be found at:

E:\McNair\Software\Accelerators

The Selenium-based crawler can be found in the file:

DemoDayCrawler.py

This script runs Google searches on accelerator names and keywords, and saves the resulting URLs and HTML pages for future use.
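
The crawl described above could be sketched roughly as follows. This is not the contents of DemoDayCrawler.py; it is a minimal illustration under stated assumptions. The function names (build_queries, search_url, save_page, crawl), the hashed file names, the urls.txt index file, and the link filter are all hypothetical, and the sketch assumes a local Chrome driver is available to Selenium.

```python
import hashlib
from pathlib import Path
from urllib.parse import quote_plus


def build_queries(accelerators, keywords):
    """Pair every accelerator name with every keyword to form search queries."""
    return [f"{name} {kw}" for name in accelerators for kw in keywords]


def search_url(query):
    """Build a Google search URL for a query string."""
    return "https://www.google.com/search?q=" + quote_plus(query)


def save_page(url, html, out_dir):
    """Write the HTML to out_dir and record the URL in an index file."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    # Hypothetical naming scheme: a short hash of the URL keeps names filesystem-safe.
    name = hashlib.sha256(url.encode("utf-8")).hexdigest()[:16] + ".html"
    (out_dir / name).write_text(html, encoding="utf-8")
    with open(out_dir / "urls.txt", "a", encoding="utf-8") as index:
        index.write(f"{name}\t{url}\n")
    return out_dir / name


def crawl(accelerators, keywords, out_dir, max_results=5):
    """Search each query, then visit and save the first few result pages."""
    # Imported here so the helpers above work without Selenium installed.
    from selenium import webdriver
    from selenium.webdriver.common.by import By

    driver = webdriver.Chrome()
    try:
        for query in build_queries(accelerators, keywords):
            driver.get(search_url(query))
            links = [a.get_attribute("href")
                     for a in driver.find_elements(By.CSS_SELECTOR, "a")]
            # Rough heuristic (an assumption): keep only non-Google links.
            results = [u for u in links
                       if u and u.startswith("http") and "google." not in u]
            for url in results[:max_results]:
                driver.get(url)
                save_page(url, driver.page_source, out_dir)
    finally:
        driver.quit()
```

Keeping the query building and file saving separate from the browser loop means those parts can be checked without launching a browser. A call such as crawl(["Y Combinator"], ["demo day"], "pages") would save up to five result pages per query into a pages directory.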

A script to convert HTML to TXT can be found in the file:

htmlToText.py

This script reads HTML files from one directory and writes them as TXT files to another directory.
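
A directory-to-directory conversion of this kind might look like the sketch below. It is not the contents of htmlToText.py. It assumes the standard library's html.parser is an acceptable way to extract text, and the names TextExtractor, html_to_text, and convert_directory are hypothetical, as is the choice to skip script and style contents.

```python
from html.parser import HTMLParser
from pathlib import Path


class TextExtractor(HTMLParser):
    """Collect visible text, skipping <script> and <style> contents."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep non-empty text that is not inside a skipped element.
        if not self._skip_depth and data.strip():
            self.parts.append(data.strip())


def html_to_text(html):
    """Return the text content of an HTML string, one fragment per line."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.parts)


def convert_directory(in_dir, out_dir):
    """Convert every .html file in in_dir to a .txt file in out_dir."""
    in_dir, out_dir = Path(in_dir), Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for src in sorted(in_dir.glob("*.html")):
        # Crawled pages may be badly encoded, so undecodable bytes are replaced.
        text = html_to_text(src.read_text(encoding="utf-8", errors="replace"))
        dst = out_dir / (src.stem + ".txt")
        dst.write_text(text, encoding="utf-8")
        written.append(dst)
    return written
```

Each output file keeps the stem of its source file, so a saved page and its text version can be matched by name.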