Changes

Jump to navigation Jump to search
522 bytes added ,  16:24, 18 July 2016
*Created conditionals for keys in JSON dictionaries. Successfully ran the tool on my 50 companies and then again on 1500 companies. Changed ratio to .75 and higher to elicit URLs that were close but not exact and got more results.<!-- flush -->
 
'''7/18'''
 
*Created a function that took a URL and gave back all of the substantial text blocks on the page (used to find company descriptions)
**The function explores the HTML source code of the URL and finds all parts of the source code with the <p> tag to indicate a text paragraph.
**Then, the function goes though each paragraph, and if it is above a certain number of characters (eliminate for short, unnecessary information), the function adds the description in a new column of the csv file under "description".
383

edits

Navigation menu