Changes
Jump to navigation
Jump to search
→Data
Of the 2865 NHL players with statistics, we have CapGeek contract records for 1455 (51%). Part of the discrepancy is due to retirement. 442 players played their last game before 2001 and do not have CapGeek records. This leaves 968 (34%) of the NHL players that do not have contracts in CapGeek for some other reason - most likely that CapGeek's coverage is incomplete.
====Team Level Data====
The crawler was modified to collect team level data, from pages like:
http://www.nhl.com/ice/teamstats.htm?season=20122013&gameType=2&viewName=summary
Data was collected for all teams for all seasons (1997-1998 to 2011-2012, except 2004-2005), for both the regular season and the playoffs (for teams that made the playoffs). The following variables were joined to the dataset at the (team1,season) level:
*teamcode -- three letter team code for the join
*teamseason -- season xxxxyyyy for the join
*teamgp -- games played (reg season)
*teamp -- points (reg season)
*teamgperg -- goals per game (reg season)
*teamgaperg -- goals against per game (reg season)
*po_teamgp -- games played (playoffs)
*po_teamp -- points (playoffs)
*po_teamgperg -- goals per game (playoffs)
*po_teamgaperg -- goals against per game (playoffs)
Note that some players are recorded for multiple teams during a single season. This recording is reflected in the team1, team2, team3 and team4 variables (no player played for more than four teams), as well as in the teamstring variable (which lists the teams as a comma separated string) and the teamcount variable (which counts the number of teams that a player played for). The join was done at the team1 level. As such, the team data will be 'incorrect' (or at least potentially unfairly allocated) for players that have a teamcount > 1.
==Variables==