Difference between revisions of "PDF Downloader"

From edegan.com
Jump to navigation Jump to search
(Created page with "{{McNair Projects |Has title=PDF Downloader |Has owner=Christy Warden, |Has keywords=PDF, python |Has project status=Complete }}")
 
 
(5 intermediate revisions by 2 users not shown)
Line 1: Line 1:
{{McNair Projects
+
{{Project
 +
|Has project output=Tool
 +
|Has sponsor=McNair Center
 
|Has title=PDF Downloader
 
|Has title=PDF Downloader
 
|Has owner=Christy Warden,
 
|Has owner=Christy Warden,
Line 5: Line 7:
 
|Has project status=Complete
 
|Has project status=Complete
 
}}
 
}}
 +
=Overview=
 +
The code for this function is located at E:/McNair/Software/Google_Scholar_Crawler/pdfdownloader.py
 +
 +
This program takes in a txt file that contains rows of entries where each row is a file name and a link to a pdf, separated by a tab. (For an example, E:/McNair/Projects/Patent_Thickets/ScholarQueries/patent thickets/Query_patent thickets_pdfTable.txt). It also takes in a directory that you want all the PDFs to be placed in. It downloads all the PDFs from the links in the txt file and names them by the file name, also in the text file. It saves all of the PDFs in the output directory provided.
 +
 +
=Dependencies=
 +
urllib
 +
 +
=How to Use=
 +
Open the pdfdownloader.py in Komodo. At the bottom  of the file, type: ''main(your text file, your output directory)''. Click the play button in the top middle of the screen.

Latest revision as of 12:47, 21 September 2020


Project
PDF Downloader
Project logo 02.png
Project Information
Has title PDF Downloader
Has owner Christy Warden
Has start date
Has deadline date
Has keywords PDF, python
Has project status Complete
Dependent(s): Patent Thicket
Has sponsor McNair Center
Has project output Tool
Copyright © 2019 edegan.com. All Rights Reserved.

Overview

The code for this function is located at E:/McNair/Software/Google_Scholar_Crawler/pdfdownloader.py

This program takes in a txt file that contains rows of entries where each row is a file name and a link to a pdf, separated by a tab. (For an example, E:/McNair/Projects/Patent_Thickets/ScholarQueries/patent thickets/Query_patent thickets_pdfTable.txt). It also takes in a directory that you want all the PDFs to be placed in. It downloads all the PDFs from the links in the txt file and names them by the file name, also in the text file. It saves all of the PDFs in the output directory provided.

Dependencies

urllib

How to Use

Open the pdfdownloader.py in Komodo. At the bottom of the file, type: main(your text file, your output directory). Click the play button in the top middle of the screen.