Thursday, August 30, 2012

Database Cleanup, the Wild Pitch, and the IBB

Each year I take some time to clean up our database.  Over the past few years, defense and basic stats have been checked against baseball-reference for accuracy and adjusted where necessary.

This year I am going through the same exercise, but for the first time I am including Information Only stats.  These are stats that are not strictly required to enter a player into the game.  They include:
  • For Batters: R, RBI, SH
  • For Pitchers: W, L, Sv, G, GS, CG, ShO, R, GF
Here are the critical errors found this offseason.  There were many more (hundreds of) corrections to the Information Only stats, and a handful of additional minor corrections to the required stats.  These were minor enough (an AB here, one triple there, 1/3 of an inning, etc.) that they will not have an impact on player valuation, so I didn't list them out.

Major Batter Corrections
  • 1930 Goose Goslin (overall line)
  • 1947 Harry Walker (10 more doubles)
  • 1964 Ken Boyer (10 more homers)

Major Pitcher Corrections
  • 1959 Hoyt Wilhelm (10 more K's)
  • 1983 Kent Tekulve (100 fewer batters faced; this worries me, as his Opp BA will skyrocket)
  • 1995 Jose Mesa (5 fewer HBP's)
  • 2008 Cory Wade (88 more batters faced)
  • 2009 Ted Lilly (10 more walks)
  • 2011 Antonio Bastardo (78 more batters faced)

Intentional Walks
Throughout league history I have completely ignored the IBB, mainly because baseball-reference never included the stat on its main player pages.  However, a few years ago they altered their page layout and made room for it, which makes data entry a lot easier for me.  Until now, I let DMB self-estimate the IBB based upon pitcher walk rate and era.  This will change going forward, and I have made retroactive updates to virtually every player.

This may impact certain relievers significantly.  Jim Brewer, for instance, walked 25 batters in 78 innings.  DMB estimated that 4 of those walks (16%) were intentional.  In truth, Brewer intentionally walked 11 batters (44%), and I expect this to have a measurable impact on how Brewer performs.  The impact should be fewer unintentional walks (14 instead of 21).
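To make the Brewer arithmetic concrete, here is a minimal Python sketch. The function name and the per-9-innings figure are my own illustration, not anything DMB exposes; it only splits a walk total into intentional and unintentional parts using the numbers quoted above.

```python
def walk_breakdown(walks, intentional, innings):
    """Hypothetical helper: split a walk total into IBB and non-IBB parts."""
    unintentional = walks - intentional
    return {
        "ibb_share": intentional / walks,          # fraction of walks that were intentional
        "unintentional": unintentional,            # walks the pitcher did not issue on purpose
        "ubb_per_9": unintentional * 9 / innings,  # unintentional walks per 9 innings
    }

# Jim Brewer: 25 walks in 78 innings
estimated = walk_breakdown(25, 4, 78)   # DMB's estimate of 4 IBB
actual = walk_breakdown(25, 11, 78)     # the real total of 11 IBB

print(estimated["unintentional"], actual["unintentional"])  # 21 14
print(f"{estimated['ibb_share']:.0%} vs {actual['ibb_share']:.0%}")  # 16% vs 44%
```

With the real IBB total, a third of the walks DMB was treating as unintentional drop out of that bucket (21 down to 14), which is why I expect relievers like Brewer to be the ones most affected.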

Major IBB Updates
  • +10 - 1986 Mark Eichhorn
  • +8 - 1972 Jim Brewer
  • +7 - 1963 Dick Radatz
  • +6 - 1983 Steve Howe, 1987 Dave Smith, 1966 Phil Regan, 2002 Chris Hammond, 1969 Fritz Peterson
  • +5 - 2008 Chad Bradford, 1984 Willie Hernandez
  • -5 - 1978 Ron Guidry, 1982 Mario Soto, 1999 Randy Johnson, 2009 Tim Lincecum, 1968 Denny McLain, 1971 Wilbur Wood, 1971 Tom Seaver, 1968 Dave McNally, 2005 Andy Pettitte
  • -6 - 1979 J.R. Richard, 1997 Al Leiter, 1985 Sid Fernandez, 1991 Nolan Ryan, 2009 Felix Hernandez, 1972 Don Sutton, 2005 Chris Carpenter
  • -7 - 1961 Whitey Ford, 1971 Vida Blue, 1995 Hideo Nomo
  • -8 - 1965 Sam McDowell, 1987 Nolan Ryan, 1998 Kerry Wood, 2008 Tim Lincecum, 2007 Chris Young

Wild Pitch Changes
Wild Pitches have historically been one of the "Information Only" stats in our league.  This upcoming year, however, I decided to implement the Wild Pitch rating system DMB employs:
This number indicates how often a pitcher throws a wild pitch when there are runners on base. The wild pitch rating tends to range from 0 to 60 with an average of 15. Use the formula:

  rating = (wild pitches * 1000) / (batters faced * .43)

For example, if a pitcher threw four wild pitches in a season in which he faced 1000 batters, his rating is 9. Why .43? Because about 43% of batters faced occur with runners on base, though this number rises and falls over time and will vary for individual pitchers.
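
As a quick sanity check, the quoted formula can be sketched in Python. The function name, the `runners_on_share` parameter, and rounding to the nearest whole number are my assumptions; the DMB text only gives the formula and the worked example of 4 wild pitches in 1000 batters faced.

```python
def wild_pitch_rating(wild_pitches, batters_faced, runners_on_share=0.43):
    """Hypothetical helper: wild pitches per 1000 batters faced with runners on.

    runners_on_share is the assumed fraction of batters faced with runners
    on base (.43 in the quoted formula).
    """
    if batters_faced <= 0:
        raise ValueError("batters_faced must be positive")
    return (wild_pitches * 1000) / (batters_faced * runners_on_share)

# The worked example from the quoted text: 4 WP, 1000 BF
print(round(wild_pitch_rating(4, 1000)))  # 9
```

Since the 43% figure drifts by era and by pitcher, it is left as a parameter here rather than hard-coded.
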
Using that formula, the worst offenders will be:

80 - 1996 Ruffin,Bruce
79 - 2006 Rodriguez,Francisco
77 - 2005 Turnbow,Derrick
76 - 1890 Neale,Joe
70 - 2011 Holland,Greg
68 - 1998 Hoffman,Trevor
66 - 1998 Gordon,Tom
64 - 2002 Eischen,Joey
63 - 2002 Romero,J.C.
62 - 1981 Ryan,Nolan
60 - 1986 Murphy,Rob
59 - 1993 Bedrosian,Steve
58 - 1885 Ramsey,Toad
58 - 1993 Ward,Duane
57 - 1995 Holmes,Darren
57 - 1995 Nomo,Hideo
56 - 1998 Brocail,Doug
54 - 1999 Rocker,John
53 - 1967 Niekro,Phil
52 - 1872 Spalding,Al
51 - 2011 Robertson,David
50 - 1989 Davis,Mark
50 - 1989 Russell,Jeff
48 - 2008 Rodriguez,Francisco
48 - 2006 Reyes,Dennys
47 - 1993 Wetteland,John
47 - 1995 Mesa,Jose
46 - 1881 Whitney,Jim
45 - 2000 Nen,Robb
45 - 2003 Guardado,Eddie
44 - 1992 Guzman,Juan
44 - 2006 Liriano,Francisco
44 - 2008 Buchholz,Taylor
43 - 2010 Feliz,Neftali
43 - 2009 Bailey,Andrew
43 - 1970 Richert,Pete
43 - 1965 Wilhelm,Hoyt
43 - 2008 Lincecum,Tim
42 - 1991 Fassero,Jeff
42 - 2010 Jimenez,Ubaldo
41 - 2011 Bastardo,Antonio
41 - 2004 Nathan,Joe
41 - 2007 Marmol,Carlos
41 - 2001 Fox,Chad
40 - 2009 Hernandez,Felix
40 - 2008 Marmol,Carlos
40 - 1972 Marshall,Mike
40 - 1977 Sutter,Bruce

While the list is mostly relievers, there are a few great pitchers on it nonetheless.  If I am interpreting the formula correctly, this means that Bruce Ruffin will throw 80 wild pitches for every 1000 batters he faces WITH RUNNERS ON BASE.

I don't expect this to require draft strategy changes.  We are talking about wild pitches now occurring against roughly 1% to 8% of batters faced with runners on base instead of virtually never.
