Difference between revisions of "NHL"

From edegan.com
imported>WillC
imported>Sahil

Revision as of 14:18, 14 March 2016

Old Material

Downloading PostgreSQL on Mac

Download package from:

http://www.enterprisedb.com/products-services-training/pgdownload#osx

Follow the instructions given on the website. Macs already come with Perl, so only the PL/Perl package is needed: download it using the StackBuilder application, which is installed through the same link.

Variables

List of necessary variables and where to find them in the Dropbox.

For all skaters we need:

NHLIDDetails.txt (likely a file we generate)
 ID (int) 
 Playername from NHL, Playername from CapGeek, Playername from GeneralFanager
 DOB (transform to ISO 8601, YYYY-MM-DD)
NHLHistoric_Player_summary.txt & NHLPlayer_summary.txt (the historic data set includes the NHL player summary except for two games of the 2013-2014 season)
 Playername
 Current Team (string)
 Position (F, D) 
 season (YYYY) 
 goals (int) 
 TOI (float)
NHLPlayer_points.txt
 Playername
 DOB
 PPG (float)
NHLPlayer_bios.txt
 playername
 dob 
 game type (overtime or no overtime)
 weight (int)
 height (int)
 age (int) - calculated from DOB
NHLPlayer_faceOffPercentageAll.txt
 playername
 face-off wins (int) 
Capgeek_10_processed-notepad.txt
 playername
 dob
 salary (int)
 length (int)
 contract start date (MM/DD/YYYY)
 contract type (EL, RFA, UFA, TFP)
 caphit (int)
 
In a separate Table:
 Year and CPI (2010 Base Year)
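
Two of the fields above need a small transformation: DOB is to be stored as ISO 8601, and age is calculated from DOB. The following is a minimal sketch of both steps, assuming the raw DOB uses the same MM/DD/YYYY layout as the contract start date (the source files may differ). The sub names to_iso8601 and age_on are hypothetical helpers, not part of any existing script.

```perl
use strict;
use warnings;
use Time::Piece;    # core module, no install needed

# Convert a MM/DD/YYYY date string to ISO 8601 (YYYY-MM-DD).
# Assumption: the input really is MM/DD/YYYY; strptime dies otherwise.
sub to_iso8601 {
    my ($mdy) = @_;
    return Time::Piece->strptime($mdy, '%m/%d/%Y')->strftime('%Y-%m-%d');
}

# Age in whole years on a given date. Both arguments are ISO 8601 strings.
sub age_on {
    my ($dob, $on) = @_;
    my ($by, $bm, $bd) = split /-/, $dob;
    my ($y,  $m,  $d)  = split /-/, $on;
    my $age = $y - $by;
    # Subtract one year if the birthday has not yet occurred in year $y.
    $age-- if $m < $bm || ($m == $bm && $d < $bd);
    return $age;
}
```

For example, to_iso8601('08/07/1987') gives the string to store in the DOB column, and age_on can then be called with that string and the date of interest.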

Next Tasks

Spec General Fanager!

General Fanager Webcrawler

The Perl pragma and libraries I used to create this webcrawler are:

use strict;
use LWP::Simple;
use HTML::Tree;

The LWP::Simple library makes it easy to pull the HTML off the website with a single call:

my $content = get($url);   # $url holds your URL as a string
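
One detail worth handling: get from LWP::Simple returns undef when the request fails, so the result should be checked before it is used. A minimal sketch follows; the sub name fetch_html and the example URL are placeholders for illustration, not the actual crawl target.

```perl
use strict;
use warnings;
use LWP::Simple qw(get);

# Fetch a page and return its HTML as a string.
# LWP::Simple::get returns undef on any failure, so die in that case.
sub fetch_html {
    my ($url) = @_;
    my $content = get($url);
    die "Could not fetch $url\n" unless defined $content;
    return $content;
}

# Example call (placeholder URL):
# my $html = fetch_html('http://www.example.com/');
```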

Now the HTML::Tree library
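
As a rough illustration of what the HTML::Tree distribution offers, the sketch below parses an HTML string with HTML::TreeBuilder and collects the text of each table cell, row by row. It assumes the data of interest sits in ordinary table rows, which has not been checked against the General Fanager pages; the sub name table_rows is hypothetical.

```perl
use strict;
use warnings;
use HTML::TreeBuilder;    # part of the HTML::Tree distribution

# Return a list of array refs, one per <tr>, each holding the
# trimmed text of that row's <td> cells. Rows without <td> are skipped.
sub table_rows {
    my ($html) = @_;
    my $tree = HTML::TreeBuilder->new_from_content($html);
    my @rows;
    for my $tr ($tree->look_down(_tag => 'tr')) {
        my @cells = map { $_->as_trimmed_text } $tr->look_down(_tag => 'td');
        push @rows, \@cells if @cells;
    }
    $tree->delete;    # release the parsed tree
    return @rows;
}
```

The string returned by get can be passed straight to table_rows; each row can then be joined with tabs and written to a text file for loading into the database.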