Difference between revisions of "Accelerator Demo Day"
Jump to navigation
Jump to search
Leminh.ams (talk | contribs) |
Leminh.ams (talk | contribs) |
||
Line 11: | Line 11: | ||
The source code and relevant files for the project can be found here: | The source code and relevant files for the project can be found here: | ||
E:\McNair\Projects\Accelerator Demo Day\ | E:\McNair\Projects\Accelerator Demo Day\ | ||
− | + | ==Development Notes== | |
− | + | ===The Crawler Functionality=== | |
+ | To be updated | ||
+ | ===The Classifier=== | ||
+ | ====Input (Features)==== | ||
+ | The input (features) right now is the frequency of X_NUMBER of words appearing in each documents. | ||
+ | ==Reading resources== | ||
+ | http://www.fit.vutbr.cz/research/groups/speech/publi/2010/mikolov_interspeech2010_IS100722.pdf |
Revision as of 14:26, 13 July 2018
Accelerator Demo Day | |
---|---|
Project Information | |
Project Title | Accelerator Demo Day |
Owner | Minh Le |
Start Date | 06/18/2018 |
Deadline | |
Primary Billing | |
Notes | |
Has project status | Active |
Subsumes: | Demo Day Page Parser, Demo Day Page Google Classifier |
Copyright © 2016 edegan.com. All Rights Reserved. |
Contents
Project
This project that utilizes Selenium and Machine Learning to get good candidate web pages and classify webpages as a demo day page containing a list of cohort companies, currently using scikit learn's random forest model and a bag of words approach
Code Location
The source code and relevant files for the project can be found here:
E:\McNair\Projects\Accelerator Demo Day\
Development Notes
The Crawler Functionality
To be updated
The Classifier
Input (Features)
The input (features) right now is the frequency of X_NUMBER of words appearing in each documents.
Reading resources
http://www.fit.vutbr.cz/research/groups/speech/publi/2010/mikolov_interspeech2010_IS100722.pdf