Demo Day Page Parser

Revision as of 16:27, 15 November 2017

A script to convert HTML pages to plain text can be found below. It reads the HTML files in the DemoDayHTML directory and writes the extracted text to the DemoDayTxt directory:

htmlToText.py
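The contents of htmlToText.py are not reproduced on this page, so the following is only a minimal sketch of how such a conversion could be done with the Python standard library. The names `TextExtractor`, `html_to_text`, and `convert_directory` are hypothetical, and the sketch assumes that the input files end in `.html`, that they are UTF-8 encoded, and that each output file keeps the input's base name with a `.txt` extension:

```python
from html.parser import HTMLParser
from pathlib import Path


class TextExtractor(HTMLParser):
    """Collects visible text and skips the contents of <script> and <style>."""

    SKIP = {"script", "style"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside script/style elements.
        if self._skip_depth == 0 and data.strip():
            self.parts.append(data.strip())


def html_to_text(html):
    """Return the visible text of an HTML string, one text chunk per line."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.parts)


def convert_directory(src_dir="DemoDayHTML", dst_dir="DemoDayTxt"):
    """Write one .txt file into dst_dir for every .html file in src_dir."""
    src, dst = Path(src_dir), Path(dst_dir)
    dst.mkdir(parents=True, exist_ok=True)
    written = []
    for html_file in sorted(src.glob("*.html")):
        # Assumption: files are UTF-8; undecodable bytes are replaced.
        html = html_file.read_text(encoding="utf-8", errors="replace")
        out = dst / (html_file.stem + ".txt")
        out.write_text(html_to_text(html), encoding="utf-8")
        written.append(out)
    return written
```

Calling `convert_directory()` from the folder that contains DemoDayHTML would then populate DemoDayTxt. A dedicated HTML library may handle malformed pages more gracefully; the standard-library parser is used here only to keep the sketch dependency-free.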

A script to match keywords (accelerator and cohort names) against the resulting text pages can be found in KeyTerms.py. The script takes the keywords listed in CohortAndAcceleratorsFullList.txt and the text files in DemoDayTxt, and creates a file recording the number of matches of each keyword in each text file.

The script can be found at:

KeyTerms.py

The resulting keyword match file can be found at:

DemoDayTxt\KeyTermFile\KeyTerms.txt
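KeyTerms.py itself is not shown here, so the matching step described above can only be sketched under assumptions. The example below assumes that the keyword list holds one keyword per line, that matching is a case-insensitive substring count, and that the output is tab-separated with one row per file and keyword. The function names are hypothetical, and none of these choices is confirmed by the original script:

```python
from pathlib import Path


def load_keywords(path):
    """Read keywords from a file, assuming one keyword per line."""
    lines = Path(path).read_text(encoding="utf-8").splitlines()
    return [line.strip() for line in lines if line.strip()]


def count_matches(text, keywords):
    """Count non-overlapping, case-insensitive occurrences of each keyword."""
    lowered = text.lower()
    return {kw: lowered.count(kw.lower()) for kw in keywords}


def write_match_table(txt_dir, keyword_file, out_file):
    """Write 'file<TAB>keyword<TAB>count' rows for every text file in txt_dir."""
    keywords = load_keywords(keyword_file)
    out_path = Path(out_file)
    out_path.parent.mkdir(parents=True, exist_ok=True)
    rows = []
    # glob("*.txt") is not recursive, so an output file placed in a
    # subdirectory of txt_dir is not read back in as input.
    for txt_file in sorted(Path(txt_dir).glob("*.txt")):
        text = txt_file.read_text(encoding="utf-8", errors="replace")
        counts = count_matches(text, keywords)
        rows.extend(f"{txt_file.name}\t{kw}\t{n}" for kw, n in counts.items())
    out_path.write_text("\n".join(rows) + "\n", encoding="utf-8")
    return rows
```

A call such as `write_match_table("DemoDayTxt", "CohortAndAcceleratorsFullList.txt", "DemoDayTxt/KeyTermFile/KeyTerms.txt")` would mirror the file layout described above. Substring counting is simple but can over-count short names that appear inside longer words; matching on word boundaries would be a stricter alternative.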

Peterjalbert
