Difference between revisions of "Harrison Brown (Work Log)"
Revision as of 15:30, 14 September 2017
Harrison Brown Work Logs (log page)
Fall 2017 Work
09/07/2017 2:20pm-3:50pm - Set Up Work Log Pages, Slack, Microsoft Remote Desktop
09/11/2017 1pm-5pm - Met with Dr. Egan and got assigned project. Set Up Project Page USITC, Started Coding in Python for the Web Crawler. Look in McNair/Projects/UISTC for project notes and code.
09/13/2017 1pm-3:50pm - Worked on parsing the USITC website Section 337 Notices. Nearly have all of the data I can scrape. Scraper works, but there are a few edge cases where information in the tables is part of a Notice but does not have an Investigation Number. Will hopefully finish this next time. Also added my USITC project to the projects page; I did not have it linked before.
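A minimal sketch of one way the edge case above could be handled; this is not the project's actual code (see the USITC project page for that). It assumes the notices sit in ordinary HTML table rows, that Investigation Numbers look like `337-TA-1234`, and that a row without a number belongs to the most recent notice that had one. The names `RowCollector` and `group_notices` are hypothetical.

```python
import re
from html.parser import HTMLParser

# Assumed format of a Section 337 investigation number, e.g. "337-TA-1000".
INV_NO = re.compile(r"337-TA-\d+")


class RowCollector(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by <tr>."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None   # cells of the row being read, or None outside a row
        self._cell = None  # text pieces of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None and self._row is not None:
            # Join the pieces and collapse runs of whitespace.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None


def group_notices(html):
    """Group table rows into notices keyed by investigation number.

    Assumption: a row with no investigation number continues the
    previous notice. Rows seen before any number are kept with None.
    """
    parser = RowCollector()
    parser.feed(html)
    notices = []
    for cells in parser.rows:
        text = " | ".join(c for c in cells if c)
        if not text:
            continue
        match = INV_NO.search(text)
        if match:
            notices.append({"investigation": match.group(0), "rows": [text]})
        elif notices:
            notices[-1]["rows"].append(text)
        else:
            notices.append({"investigation": None, "rows": [text]})
    return notices
```

Keeping the unnumbered rows attached to the previous notice means nothing is dropped during scraping; whether that grouping is right for every table on the site would need to be checked by hand.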
09/14/2017 1pm-3:50pm - Have a Python program that can scrape the entire webpage and navigate through all of the pages that contain Section 337 documents. You can see where these files are, along with more information, on the USITC project page. It can pull all of the information available in the HTML for each case. The PDFs now need to be scraped; will start work on that next time.
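A rough sketch of the page-to-page navigation described above, using only the standard library. It is an illustration under assumptions, not the program on the project page: it assumes each listing page links to the following one with an anchor whose text starts with "next", and the names `crawl_pages`, `fetch`, and `NEXT_LINK` are hypothetical.

```python
import re
from urllib.parse import urljoin
from urllib.request import urlopen

# Assumption: the link to the following page is an <a> whose text starts with "next".
NEXT_LINK = re.compile(r'<a[^>]*href="([^"]+)"[^>]*>\s*next', re.IGNORECASE)


def fetch(url):
    """Download one page and return its HTML as text."""
    with urlopen(url) as resp:
        return resp.read().decode("utf-8", errors="replace")


def crawl_pages(start_url, fetch_page=fetch, max_pages=1000):
    """Yield (url, html) for each listing page, following "next" links.

    Stops when a page has no "next" link, when a URL repeats,
    or after max_pages pages.
    """
    seen = set()
    url = start_url
    while url and url not in seen and len(seen) < max_pages:
        seen.add(url)
        html = fetch_page(url)
        yield url, html
        match = NEXT_LINK.search(html)
        # Resolve relative links such as "?page=2" against the current URL.
        url = urljoin(url, match.group(1)) if match else None
```

Passing the downloader in as `fetch_page` lets the loop be tried against saved HTML before pointing it at the live site, and the `seen` set guards against a "next" link that loops back to an earlier page.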