Difference between revisions of "NHL"
imported>WillC |
imported>Sahil |
Revision as of 14:18, 14 March 2016
Old Material
Downloading PostgreSQL on Mac
Download package from:
http://www.enterprisedb.com/products-services-training/pgdownload#osx
Follow the instructions given on the website. Macs already come with Perl. Using the StackBuilder application, which is downloaded through the same link, install the PL/Perl package.
Variables
List of necessary variables and where to find them in the dropbox.
For all skaters we need:
NHLIDDetails.txt (likely a file we generate): ID (int); Playername from NHL; Playername from CapGeek; Playername from GeneralFanager; DOB (transform to ISO8601)
NHLHistoric_Player_summary.txt & NHLPlayer_summary.txt (historic data set includes NHL Player summary except for two games of 2013-2014 season): Playername; Current Team (string); Position (F, D); season (YYYY); goals (int); TOI (float)
NHLPlayer_points.txt: Playername; DOB; PPG (float)
NHLPlayer_bios.txt: playername; dob; game type (overtime or no overtime); weight (int); height (int); age (int), calculated from DOB
NHLPlayer_faceOffPercentageAll.txt: playername; face-off wins (int)
Capgeek_10_processed-notepad.txt: playername; dob; salary (int); length (int); contract start date (MM/DD/YYYY); contract type (EL, RFA, UFA, TFP); caphit (int)
In a separate table: Year and CPI (2010 base year)
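The list above asks for DOB to be transformed to ISO8601 and for age to be calculated from DOB. A minimal Perl sketch of both steps follows. It assumes the raw dates arrive as MM/DD/YYYY (the format the list gives for the contract start date; the DOB format in the source files is not stated), and the helper names to_iso8601 and age_on are hypothetical, not part of any existing script.

```perl
use strict;
use warnings;

# Convert an MM/DD/YYYY string to ISO 8601 (YYYY-MM-DD).
# Assumption: input dates use MM/DD/YYYY. Returns undef for anything else.
sub to_iso8601 {
    my ($date) = @_;
    return unless defined $date;
    if ($date =~ m{^\s*(\d{1,2})/(\d{1,2})/(\d{4})\s*$}) {
        my ($month, $day, $year) = ($1, $2, $3);
        # Only a coarse range check; it does not validate days per month.
        return if $month < 1 || $month > 12 || $day < 1 || $day > 31;
        return sprintf('%04d-%02d-%02d', $year, $month, $day);
    }
    return;
}

# Age in whole years on a reference date.
# Both arguments are ISO 8601 strings (YYYY-MM-DD).
sub age_on {
    my ($dob, $on) = @_;
    my ($by, $bm, $bd) = split /-/, $dob;
    my ($oy, $om, $od) = split /-/, $on;
    my $age = $oy - $by;
    # Subtract one year if the birthday has not yet occurred in the reference year.
    $age-- if $om < $bm || ($om == $bm && $od < $bd);
    return $age;
}

print to_iso8601('7/1/2013'), "\n";               # 2013-07-01
print age_on('1987-08-07', '2016-03-14'), "\n";   # 28
```

Storing dates as YYYY-MM-DD strings also means they sort chronologically as plain text, which may help when matching players by name and DOB across files.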
Next Tasks
Spec General Fanager!
General Fanager Webcrawler
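The Perl libraries I used to create this webcrawler are listed below (HTML::Tree is documented at http://search.cpan.org/~cjm/HTML-Tree-5.03/lib/HTML/Tree.pm):
use strict;
use LWP::Simple;
use HTML::Tree;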
Using the LWP::Simple library makes it easy to pull the HTML off the website by simply doing:
my $content = get($url);  # $url is your URL as a string; get() returns undef on failure
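Since get() from LWP::Simple returns undef when a request fails, it may be worth wrapping the call so that a failed download stops the crawler with a clear message. The following is only a sketch: fetch_page is a hypothetical helper name, and the URL in the commented usage is a placeholder rather than the actual General Fanager address.

```perl
use strict;
use warnings;
use LWP::Simple qw(get);

# Fetch a page and return its HTML as a string.
# Dies with a readable message if the request fails,
# because LWP::Simple's get() signals failure by returning undef.
sub fetch_page {
    my ($url) = @_;
    my $content = get($url);
    die "Could not fetch $url\n" unless defined $content;
    return $content;
}

# Hypothetical usage (placeholder URL, not the real site):
# my $html = fetch_page('http://www.example.com/');
# print length($html), " characters fetched\n";
```

LWP::Simple does not expose the reason for a failure; if status codes or headers are needed, LWP::UserAgent would be the usual alternative.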
Now the HTML::Tree library