Lecture 24 - Wharton Statistics Department

Transcription

Lecture 24 - Wharton Statistics Department
Administrative Notes
Statistics 111 - Lecture 24
•  Homework 7 posted. Due in recitation on Friday,
April 22
Fitting Curves to Data: Application to
Fielding in Baseball
April 14, 2016
Stat 111 - Lecture 24 - Baseball!
Current Methods and Data
Bayesball Model
•  No recitation on Friday, April 15th
1
SAFE
April 14, 2016
Future
Current Methods and Data
Stat 111 - Lecture 24 - Baseball!
Bayesball Model
2
SAFE
Future
Quantifying Fielding Performance in Baseball
Overall goal: accurate evaluation of the fielding
performance of each major league baseball player
Measuring Fielding in Baseball: Present
and Future
Historical Method: Errors
Errors only punishes for bad plays, no corresponding
reward for good plays
No accounting for relative difficulty of each play
Shane T. Jensen
Historical Method: Fielding Percentage
Department of Statistics, The Wharton School,
University of Pennsylvania
Percentage of time a player properly handles the ball
Ambiguity in the denominator: players with poor range
could have high FP due to less opportunities
April 14, 2016
Need to take into account the relative difficulty of
individual balls-in-play (BIP)
Current Methods and Data
Bayesball Model
SAFE
Future
Available Data
Each season has ≈120000 balls-in-play (BIP)
I have worked with BIP data from 2002-08 (seven seasons)
Three BIP types: 42% grounders, 33% flys, 25% liners
BIP velocity information as ordinal category
Flyballs Caught by CF
Flyballs Not Caught by CF
400
400
300
300
Y Coordinate
Y Coordinate
Bayesball Model
SAFE
Current Methods: Ultimate Zone Rating
Ball-in-play data available from Baseball Info Solutions
200
100
200
100
0
0
−300
Current Methods and Data
−200
−100
0
X Coordinate
100
200
300
−300
−200
−100
0
X Coordinate
100
200
300
Ultimate Zone Rating: divides field up into zones and
tabulates success/failures of each fielder within zones
Future
Current Methods and Data
Bayesball Model
SAFE
Future
Current Methods: Ultimate Zone Rating cont’d
Current Methods and Data
Bayesball Model
SAFE
Future
Other Current Methods
Plus-Minus system (John Dewan): uses zones like UZR
Average success rate sk calculated within each zone k
Fielder gets credit of 1 − sk for each successful play, debit
of −sk for each unsuccessful play in zone
Aggregating over zones gives plus-minus value
Version with run values: defensive runs saved (DRS)
Difference between fielders success rate and average
success rate calculated for each zone
Differences weighted by run value and then aggregated
zone for overall rating
Probabilistic Model of Range (David Pinto): uses angles
to represent BIP direction (instead of zones)
Advantage: UZR zones are proxy for difficulty of BIP
Predicted outs for each direction calculated over all players
Actual outs for each direction calculated for individual
players and compared to predicted
Different PMR charts for grounders vs. liners vs. flys
Advantage: runs saved/cost is an easy to interpret scale
Disadvantage: zones are an ad hoc discretization of the
continuous fielding surface
Big Zone Metric (Peter Jensen):
Uses publicly available MLB Gameday data instead of BIS
Data is less resolute, so larger zones are used
Current Methods and Data
Bayesball Model
SAFE
Future
Continuous Fielding Curves
(x , y )ij location, velocity Vij and type of the BIP
Allows for principled sharing of information within and
between individual players
Bayesball Model
These probability functions will be smooth parametric
curves that can vary between different players
SAFE
Future
Representation for Different BIP Types
Flyballs and Liners
Y Coordinate
Y Coordinate
Grounder
Trajectory
CF Location at (0,324)
BIP
Location
100
150
SS
Location
100
Right
−200
−100
0
X Coordinate
100
200
"
log
−50
#
= βi0 + βi1 Dij + βi2 Dij Fij + βi3 Dij Vij
Logistic regression for grounders:
θ Angle
50
−100
pij
1 − pij
Dij = distance to BIP, Vij = vel, Fij = 1 if forward (vs. back)
Left
0
0
SAFE
Logistic regression for fly-balls/liners:
log
200
Distance
Bayesball Model
Logistic regression used to model smooth curves for
probability pij of successfully fielding BIP j by player i
Grounders
400
Forward
Current Methods and Data
Logistic regression for each smooth curve
Two-dimensional curves needed for flys/liners: success
depends on velocity, direction and distance to BIP
One-dimensional curves needed for grounders: success
depends on velocity, direction and angle to BIP
200
Future
Observed successes and failures are modeled as Binary
outcomes from an underlying probability pij
Each pij is a function of available data for that BIP:
Even more sophisticated approach embeds smooth
fielding curves within a Bayesian hierarchical model
300
SAFE
The outcome of each play is either a success or failure:
!
1 if the j th BIP hit to the i th player leads to out
Sij =
0 if the j th BIP hit to the i th player leads to hit
High-resolution data could also be used to fit smooth
fielding curves to the continuous playing surface
Backward
Bayesball Model
Count Data
Zone-based methods break up the field into discrete bins
for computational convenience
Current Methods and Data
Current Methods and Data
0
X Coordinate
50
100
"
pij
1 − pij
#
= βi0 + βi1 θij + βi2 θij Lij + βi3 θij Vij
θij = angle to BIP, Vij = velocity, Lij = 1 if left (vs. right)
Future
Current Methods and Data
Bayesball Model
SAFE
Future
Individual Grounder Curves
Current Methods and Data
Bayesball Model
SAFE
Future
Individual Fly/Liner Curves
Compare curves of individual fielders β$i of to aggegrate
model β$+ for all fielders at that position
Compare curves of individual fielders β$i of to aggegrate
model β$+ for all fielders at that position
P(Success) for Everett, Jeter vs. average SS
1.0
0.8
Average
Jeter
Everett
P(Success)
0.6
0.4
0.2
0.0
3rd Base
22.5
SS Location
7.5
2nd Base
7.5
22.5
1st Base
37.5
52.5
67.5
Degrees from SS
SAFE
Future
Numerical Summary of Overall Performance
Current Methods and Data
Bayesball Model
SAFE
Future
Differential Weighting in SAFE
Our full aggregation also weights grid points by BIP
frequency, run value, and shared consequence
Beyond comparing curves between players, we can derive
an overall numerical estimate of fielder performance
(a) P(Success) for Jeter vs. Average
(b) Density Estimate of Grounder Angle
1.0
0.8
P(Success)
SAFE: Spatial Aggregate Fielding Evaluation
For each player, aggregate differences between individual
curve (based on β i ) and overall curve (based on µ )
Average
Jeter
0.6
Density
Bayesball Model
0.4
0.2
0.0
3rd Base SS Location
22.5
Aggregation done by numerical integration over fine grid
of values (1D grid for grounders, 2D grid for flys/liners)
7.5
7.5
2nd Base
22.5
3rd Base SS Location
1st Base
37.5
52.5
67.5
22.5
7.5
7.5
2nd Base
22.5
1st Base
37.5
52.5
Degrees from SS
Degrees from SS
(c) Run Consequence for Grounders
(d) Shared Responsibility of SS
0.65
67.5
1.0
0.8
Responsibility Fraction
Current Methods and Data
Runs
0.60
Estimates and standard errors of β i gives us the mean and
95% confidence interval of SAFE for each player
0.55
0.50
0.6
0.4
0.2
0.0
3rd Base SS Location
22.5
7.5
7.5
2nd Base
22.5
1st Base
37.5
52.5
67.5
Degrees from SS
3rd Base SS Location
22.5
7.5
7.5
2nd Base
22.5
1st Base
37.5
52.5
67.5
Degrees from SS
SAFE value: runs saved/cost of fielder vs. average
Current Methods and Data
Bayesball Model
SAFE
Future
Results for Corner Infielders: Best/Worst Posterior SAFE values
Ten Best 1B Player-Years
Name and Year
Mean
95% Interval
Doug Mientkiewicz , 2007
7.2
( 2.8 , 11.3 )
Andy Phillips , 2007
7.1
( 2.6 , 11.4 )
Rich Aurilia , 2007
6.6
( 2.7 , 10.2 )
Albert Pujols , 2007
5.5
( 3.1 , 8.2 )
Doug Mientkiewicz , 2006
5.5
( 1.8 , 9.1 )
Albert Pujols , 2006
5.1
( 1.9 , 8.1 )
Kendry Morales , 2006
5.0
( -0.5 , 10.3 )
Ken Harvey , 2003
5.0
( 1.5 , 8.0 )
Howie Kendrick , 2006
4.5
( -0.8 , 9.6 )
Albert Pujols , 2008
4.1
( 1.0 , 6.8 )
Ten Best 3B Player-Years
Name and Year
Mean
95% Interval
Marco Scutaro , 2003
12.6
( 10.0 , 16.6 )
Mark Bellhorn , 2004
10.4
( 4.0 , 17.1 )
Hank Blalock , 2002
10.0
( 4.2 , 16.5 )
Sean Burroughs , 2004
8.9
( 3.4 , 14.2 )
David Bell , 2003
7.4
( 1.7 , 13.3 )
Scott Rolen , 2002
7.4
( 1.9 , 12.1 )
Hank Blalock , 2002
7.3
( 1.4 , 11.3 )
Damian Rolls , 2005
7.2
( 0.1 , 13.6 )
Pedro Feliz , 2002
7.1
( 0.5 , 13.3 )
Joe Crede , 2002
7.0
( 0.0 , 15.8 )
Ten Worst 1B Player-Years
Name and Year
Mean
95% Interval
Richie Sexson , 2002
-4.9
( -8.2 , -1.9 )
Robert Fick , 2002
-5.0
( -11.3 , 2.0 )
Mo Vaughn , 2002
-5.1
( -9.7 , -0.3 )
Dmitri Young , 2003
-5.5
( -9.9 , 0.1 )
Tony Clark , 2005
-6.3
( -11.7 , -1.6 )
Fred McGriff , 2002
-6.4
( -9.4 , -2.8 )
Mike Jacobs , 2002
-6.4
( -9.4 , -2.9 )
Ben Broussard , 2005
-6.7
( -10.4 , -2.2 )
Nomar Garciaparra , 2003
-7.2
( -11.1 , -3.5 )
Jason Giambi , 2003
-7.7
( -13.4 , -3.2 )
Ten Worst 3B Player-Years
Name and Year
Mean
95% Interval
Eric Munson , 2003
-7.1
( -12.4 , -2.8 )
Michael Cuddyer , 2005
-7.3
( -11.4 , -2.9 )
Michael Cuddyer , 2004
-7.4
( -14.1 , -2.3 )
Garrett Atkins , 2007
-7.8
( -12.4 , -2.4 )
Fernando Tatis , 2002
-8.1
( -14.2 , -2.0 )
Chone Figgins , 2006
-8.8
( -18.7 , -1.4 )
Travis Fryman , 2002
-9.4
( -15.2 , -4.4 )
Joe Randa , 2006
-9.8
( -17.3 , -2.8 )
Ryan Braun , 2007
-10.9
( -17.4 , -2.9 )
Jose Bautista , 2006
-11.6
( -17.4 , -5.9 )
Current Methods and Data
Bayesball Model
SAFE
Future
Results for Middle Infielders: Best/Worst Posterior SAFE values
Ten Best 2B Player-Years
Name and Year
Mean
95% Interval
Julius Matos , 2002
18.1
( 12.4 , 22.1 )
Erick Aybar , 2007
17.6
( 10.0 , 24.6 )
Junior Spivey , 2005
14.5
( 4.7 , 27.1 )
Tony Graffanino , 2006
14.1
( 4.6 , 27.6 )
Adam Kennedy , 2008
11.3
( 1.7 , 18.6 )
Willie Bloomquist , 2005
10.9
( 4.3 , 17.8 )
Jose Valentin , 2006
10.9
( 4.2 , 17.9 )
Chase Utley , 2008
10.8
( 5.7 , 17.5 )
Chase Utley , 2005
10.8
( 3.1 , 17.7 )
Craig Counsell , 2005
10.8
( 5.3 , 18.0 )
Ten Best SS Player-Years
Name and Year
Mean
95% Interval
Pokey Reese , 2004
22.6
( 12.0 , 31.2 )
Adam Everett , 2007
20.4
( 10.4 , 27.4 )
Adam Everett , 2006
17.1
( 9.0 , 21.8 )
Craig Counsell , 2006
14.7
( 6.9 , 21.1 )
Jorge Velandia , 2003
14.2
( 3.0 , 24.0 )
Alex Cora , 2005
14.1
( 3.0 , 24.6 )
Alex Rodriguez , 2003
13.5
( 3.5 , 24.4 )
Maicer Izturis , 2004
13.2
( 3.8 , 22.2 )
Marco Scutaro , 2008
13.0
( 4.0 , 20.1 )
Brent Lillibridge , 2008
11.8
( 5.0 , 19.1 )
Ten Worst 2B Player-Years
Name and Year
Mean
95% Interval
Ronnie Belliard , 2008
-9.8
( -19.5 , 2.6 )
Geoff Blum , 2005
-10.2
( -17.5 , -1.7 )
Miguel Cairo , 2004
-10.9
( -17.9 , -3.1 )
Terry Shumpert , 2002
-11.0
( -22.2 , 0.7 )
Roberto Alomar , 2003
-12.1
( -19.3 , -4.6 )
Enrique Wilson , 2004
-12.3
( -18.9 , -6.2 )
Alberto Callaspo , 2008
-12.4
( -20.4 , -4.5 )
Dave Berg , 2002
-13.5
( -25.1 , -2.4 )
Luis Rivas , 2002
-13.8
( -20.9 , -6.4 )
Bret Boone , 2005
-15.4
( -22.4 , -8.1 )
Ten Worst SS Player-Years
Name and Year
Mean
95% Interval
Erick Almonte , 2003
-13.8
( -26.9 , 2.3 )
Derek Jeter , 2007
-13.9
( -21.7 , -5.8 )
Michael Morse , 2005
-14.2
( -23.0 , -4.5 )
Damian Jackson , 2005
-14.5
( -30.6 , -3.5 )
Brandon Fahey , 2008
-15.1
( -22.4 , -8.2 )
Marco Scutaro , 2006
-15.1
( -22.0 , -10.0 )
Derek Jeter , 2003
-15.6
( -24.8 , -6.4 )
Michael Young , 2004
-15.6
( -23.6 , -7.2 )
Josh Wilson , 2007
-15.8
( -26.5 , -6.4 )
Derek Jeter , 2005
-18.5
( -29.1 , -9.2 )
Current Methods and Data
Bayesball Model
SAFE
Future
Results for Outfielders: Best/Worst Posterior SAFE values
Ten Best Center Fielders
Name and Year Mean 95% Interval
J Michaels , 05
17.9 ( 3.3 , 32.5 )
C Figgins , 03
15.5 ( 3.8 , 31.2 )
J Hairston Jr. , 05 13.7 ( 0.3 , 28.6 )
A Jones , 05
11.8 ( 2.2 , 20.7 )
D Glanville , 04
11.1 ( -3.1 , 30.5 )
J Payton , 05
10.2 ( 0.0 , 17.8 )
J Edmonds , 05
10.1 ( -0.5 , 20.5 )
J Gathright , 05
10.1 ( -6.6 , 25.0 )
D Erstad , 03
10.0 ( -1.2 , 20.7 )
C Patterson , 04
9.8 ( 1.9 , 17.9 )
Ten Best Right Fielders
Name and Year
Mean 95% Interval
G Matthews Jr. , 02 14.4 ( 5.7 , 22.3 )
D Mohr , 05
11.8 ( 2.3 , 28.0 )
T Nixon , 05
11.5 ( 3.3 , 18.1 )
G Matthews Jr. , 05 10.5 ( 2.6 , 19.0 )
R Langerhans , 05
10.5 ( 4.6 , 19.3 )
T Nixon , 04
9.4 ( 0.7 , 19.4 )
A Escobar , 03
8.7 ( -0.1 , 19.3 )
A Ochoa , 02
8.7 ( -2.3 , 20.9 )
E Marrero , 02
8.7 ( -0.9 , 20.7 )
J Drew , 03
8.4 ( -1.4 , 20.8 )
Ten Worst Left Fielders
Name and Year Mean 95% Interval
K Mench , 02
-10.7 ( -19.4 , -2.3 )
K Mench , 03
-11.1 ( -14.9 , -7.3 )
A Piatt , 02
-12.4 ( -16.8 , -6.9 )
L Berkman , 05 -13.0 ( -16.7 , -8.2 )
M Ramirez , 07 -13.5 ( -19.1 , -5.4 )
R Sierra , 03
-13.8 ( -16.2 , -10.3 )
B Kielty , 06
-15.2 ( -16.7 , -9.1 )
T Womack , 05 -17.1 ( -25.0 , -5.0 )
L Berkman , 02 -18.2 ( -18.9 , -17.0 )
M Ramirez , 06 -19.5 ( -24.8 , -13.5 )
Ten Worst Center Fielders
Name and Year Mean 95% Interval
C Hermansen , 02 -9.5 ( -23.6 , 4.3 )
D Roberts , 05
-9.8 ( -21.0 , 2.2 )
R Ledee , 02
-10.0 ( -19.6 , 0.5 )
K Griffey Jr. , 04 -12.5 ( -24.4 , -1.3 )
B Williams , 04 -13.2 ( -24.5 , -3.1 )
S Green , 05
-13.3 ( -28.3 , 2.8 )
L Terrero , 05
-13.6 ( -29.4 , 5.8 )
B Williams , 05 -14.2 ( -23.4 , -5.3 )
M Grissom , 05 -20.3 ( -34.2 , -9.4 )
J Cruz , 05
-22.4 ( -36.2 , -5.4 )
Ten Worst Right Fielders
Name and Year
Mean 95% Interval
G Kapler , 05
-8.1 ( -13.2 , -1.6 )
L Walker , 05
-8.2 ( -17.2 , 1.2 )
J Guillen , 05
-8.6 ( -17.0 , 0.7 )
K Mench , 02
-8.6 ( -17.7 , 0.7 )
E Kingsale , 04
-9.2 ( -13.1 , -2.9 )
W Pena , 02
-9.7 ( -17.6 , 0.2 )
C Wilson , 02
-11.1 ( -22.1 , -2.6 )
J Gonzalez , 05
-13.2 ( -16.4 , -10.4 )
M Tucker , 03
-14.1 ( -21.8 , -8.1 )
Sheffield , 04
-14.7 ( -21.6 , -9.5 )
Bayesball Model
SAFE
Bayesball Model
SAFE
Future
Publicity and Feedback
BIP data allows more detailed examination of differences
between players
Parametric approach: smooth probability function
reduces variance of results by sharing information between
all points near to a fielder
SAFE run value aggregates individual differences while
weighting for BIP frequency, run value, and shared
consequence between positions
Current Methods and Data
Bayesball Model
SAFE
Video-based Data
New system tracks players and BIPs with video cameras
Boston Globe (Gideon Gil, 02/16/08):
“Numbers tell a glove story”
Wired (Greta Lorge, 02/16/08):
“Statistics in the Outfield”
AP (Randolph E. Schmid, 02/16/08):
“Baseball’s top fielders ranked in
new statistical system”
New York Post had different take on study:
“You’ve Got To Be Kidding!”
Jeter himself responded in NY Post:
“Must have been a computer glitch”
Current Methods and Data
Bayesball Model
SAFE
Field F/X cont’d
How will Field F/X improve fielding estimation?
Real starting positions and speed for each player
Real hang time on flys/liners instead of current proxies
based on distance/velocity
Real trajectories on all BIPs: was that liner to the
shortstop 10 feet (catchable) or 20 feet (uncatchable) off
the ground?
Issue is availability of data. Current access limited to only
a few people (I’m not currently one of them).
Future
Summary of Our Approach
Ten Best Left Fielders
Name and Year Mean 95% Interval
E Brown , 07
14.4 ( 2.2 , 27.9 )
D Dellucci , 06
13.7 ( 5.7 , 20.4 )
R Johnson , 05
12.1 ( 2.3 , 21.0 )
C Crisp , 05
11.2 ( 4.1 , 17.8 )
S Hairston , 07
11.1 ( 1.1 , 23.5 )
S Podsednik , 07 11.1 ( 6.1 , 17.8 )
M Byrd , 05
10.7 ( 0.5 , 22.7 )
G Vaughn , 02
10.7 ( 1.9 , 16.9 )
O Palmeiro , 02 10.6 ( 0.8 , 22.2 )
T Long , 04
10.3 ( -0.8 , 21.7 )
Current Methods and Data
Current Methods and Data
This data will revolutionize the estimation of fielding ability
Future
Future