Difference between revisions of "Accelerator Website Images"

From edegan.com

Revision as of 14:12, 12 December 2017


McNair Project
Accelerator Website Images
Project Information
Project Title: Accelerator Website Images
Owner: Christy Warden
Start Date: 12/01/2017
Deadline
Primary Billing
Notes
Has project status
Copyright © 2016 edegan.com. All Rights Reserved.


Overview

This code is located in McNair\Software\Accelerators\ImageDownloading\ImageGather.py

The program takes a text file listing accelerator websites and captures screenshots of every internal page linked from each website, down to a chosen link depth.
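
The link-gathering step described above can be illustrated with a minimal sketch. This is not the code in ImageGather.py. The names `collect_internal_links` and `fetch_html` are hypothetical, and the sketch assumes that "internal" means "same host as the starting URL" and that depth counts the number of clicks away from the starting page. It uses only the Python standard library.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urldefrag, urlparse
from urllib.request import urlopen


class _HrefParser(HTMLParser):
    """Collects the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def fetch_html(url):
    """Hypothetical default fetcher: download a page and decode it as text."""
    with urlopen(url, timeout=30) as response:
        return response.read().decode("utf-8", errors="replace")


def collect_internal_links(start_url, depth, fetch=fetch_html):
    """Breadth-first crawl: same-host URLs reachable within `depth` clicks."""
    host = urlparse(start_url).netloc  # assumption: internal == same host
    seen = [start_url]                 # keeps discovery order
    queue = deque([(start_url, 0)])
    while queue:
        url, level = queue.popleft()
        if level >= depth:
            continue  # pages at the depth limit are kept but not expanded
        try:
            html = fetch(url)
        except Exception:
            continue  # skip pages that cannot be downloaded
        parser = _HrefParser()
        parser.feed(html)
        for href in parser.hrefs:
            # Resolve relative links and drop "#fragment" suffixes.
            absolute, _fragment = urldefrag(urljoin(url, href))
            if urlparse(absolute).netloc == host and absolute not in seen:
                seen.append(absolute)
                queue.append((absolute, level + 1))
    return seen
```

The `fetch` parameter is there only so the crawl can be tried against canned HTML without touching the network. With the default, each page is downloaded with `urlopen`.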

How to Use

Open the Python file in Komodo. The function call at the bottom of the file takes two arguments you can change: the path to the input text file, and an integer that sets how many levels deep to pull links from each website. Each line of the text file should contain the name of the website or company, a tab, and the URL of the website.

When you run it, the program pulls and stores all of the internal links up to that depth for each website. It then opens each website in Selenium and takes a screenshot of each section of it. It also produces a file called "TrackFile" that tells you which images correspond to which website.

Unfortunately, running this code as is will overwrite the TrackFile for the Accelerators data I am currently (12/12/17) running. To store new data without overwriting, change the TrackFile name before you run. All output files are currently saved in the same folder as the Python file; to save them elsewhere, add a directory prefix to each of the filenames I provide in the code.
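
As a rough illustration of the input format and the two adjustments mentioned above (a separate TrackFile name and an output directory), here is a minimal sketch. It is not taken from ImageGather.py. The function names, the image naming scheme, and the three-column TrackFile layout are all assumptions made for the example. `screenshot_sections` assumes Selenium and a Chrome driver are installed, and it scrolls one window height at a time as a guess at what "each section" means.

```python
import os


def read_sites(list_path):
    """Parse lines of the form 'name<TAB>url' into (name, url) pairs."""
    sites = []
    with open(list_path, encoding="utf-8") as handle:
        for line in handle:
            parts = line.rstrip("\n").split("\t")
            if len(parts) >= 2 and parts[0].strip() and parts[1].strip():
                sites.append((parts[0].strip(), parts[1].strip()))
    return sites


def image_path(output_dir, site_name, page_index, section_index):
    """Build a screenshot filename under output_dir (hypothetical naming scheme)."""
    safe = "".join(c if c.isalnum() else "_" for c in site_name)
    return os.path.join(output_dir, f"{safe}_p{page_index}_s{section_index}.png")


def append_track_rows(track_path, rows):
    """Append (site_name, page_url, image_file) rows instead of overwriting."""
    with open(track_path, "a", encoding="utf-8") as handle:
        for row in rows:
            handle.write("\t".join(row) + "\n")


def screenshot_sections(url, path_for_section):
    """Scroll through a page one window height at a time, saving each view."""
    from selenium import webdriver  # lazy import: needs selenium and a browser driver

    driver = webdriver.Chrome()
    saved = []
    try:
        driver.get(url)
        total = driver.execute_script("return document.body.scrollHeight")
        step = max(1, driver.execute_script("return window.innerHeight"))
        offset, index = 0, 0
        while offset < total:
            driver.execute_script("window.scrollTo(0, arguments[0]);", offset)
            path = path_for_section(index)  # caller decides where files go
            driver.save_screenshot(path)
            saved.append(path)
            offset += step
            index += 1
    finally:
        driver.quit()
    return saved
```

A run could then pass a fresh name such as `TrackFile_newdata.txt` to `append_track_rows` and an existing folder to `image_path`, which leaves the TrackFile from the Accelerators run untouched.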