From the File Drawer: Does Spring Training Predict Wins? March 18, 2014
Posted by tomflesher in Baseball.Tags: Baseball, file drawer, Spring training
1 comment so far
Nope.
It’s an idle curiosity, and more information is never a bad thing, but first, you need to establish whether there’s actually any information being generated. It would be useful, potentially, to have a sense of at least how the first few weeks of the season might go, so I decided to crunch some numbers to see whether I could torture the data far enough to get a good predictive measure. I grabbed the spring training and regular season stats from 2012 and 2013 and started at it.
First round.
Correlation! Correlations are useful. The correlation of spring winning percentage and regular-season winning percentage? A paltry .069. That’s not even worth looking at. This is going to be harder than I thought.
Second round.
Well, maybe if we try a Pythagorean expectation, we might get something useful. Let’s try the 2 exponent…. Hm. That correlation is even worse (.063). Well, maybe the 1.82 “true” exponent will help…. .065. This isn’t going to work very well.
Third round.
Okay. This is going to involve some functional-form assumptions if we really want to go all Mythbusters on the data’s ass and figure out something that works. First, let’s validate the Pythagorean expectation by running an optimization to minimize the sum of squared errors, with runratio = Runs Allowed/Runs Scored and perc = regular season winning percentage:
> min.RSS <- function(data,B) {with(data,sum((1/(1 + runratio^B) – perc)^2))}
> result<-optimize(min.RSS, c(0,10),data=data)
> result
$minimum
[1] 1.799245
$objective
[1] 0.04660422
That “$minimum” value means that the optimal value for B (the pythagorean exponent) is around 1.80 (to the nearest hundredth). The “$objective” value is the sum of squared errors. Let’s try the same thing with the Spring data:
> spring.RSS <- function(data,SprB) {with(data,sum((1/(1 + runratio.spr^SprB) – Sprperc)^2))}
> springresult<-optimize(spring.RSS, c(0,10),data=data)
> springresult
$minimum
[1] 2.243336
$objective
[1] 0.1253673
Alarmingly, even with the same amount of data, the sum of squared errors is almost triple the same measure for the regular-season data. The exponent is also pretty far off. Now for some cross-over: can we set up a model where the spring run ratio yields a useful measure of regular-season win percentage? Let’s try it out:
> cross.RSS <- function(data,crossB) {with(data,sum((1/(1 + runratio.spr^crossB) – perc)^2))}
> crossresult<-optimize(cross.RSS,c(0,10),data=data)
> crossresult
$minimum
[1] 0.08985465
$objective
[1] 0.3214856
> crossperc <- 1/(1 + runratio.spr^crossresult$minimum)
> cor(perc,crossperc)
[1] 0.05433157
.054, everybody! That’s the worst one yet!
Now, if anyone ever asks, go ahead and tell them that at least based on an afternoon of noodling around with R, spring training will not predict regular-season wins.
Just for the record, the correlation between the Pythagorean expectation and wins is enormous:
> pythperc<-1/(1 + runratio^result$minimum)
> cor(perc,pythperc)
[1] 0.9250366
Spitballing: Position Players Moving Around January 24, 2014
Posted by tomflesher in Baseball, Economics.Tags: Lucas Duda, Spitballing
add a comment
Earlier this week, I posted about Lucas Duda and how he’s being forced out of his natural position. This came up a few years ago for the Mets as well, when Angel Pagan was being forced out of the outfield – many fans suggested pushing him to second base (in the hole now filled by Daniel Murphy). The sense seems to be that players can move freely around the field, going wherever the team needs them. There are a couple of theories on this, and a couple of good examples, but it doesn’t always work out.
Typically, the best moves take someone from a more defensively-demanding position and move him to one that’s less so. Victor Martinez still catches occasionally, but he’s made a move almost entirely to the DH role and played more games at first base last year. Alex Rodriguez has also made some moves in that direction, moving from the very demanding shortstop position to the slightly less difficult third base (perversely, to allow the much lousier Derek Jeter to stay in his position), and mostly toward DH these days. Johnny Damon and Jorge Posada were among the revolving door of older Yankees to do time out of position at first base over the past few years, with varying degrees of success. On the other hand, even moves down the defensive spectrum don’t always work. Gary Sheffield was famously described as “painful to watch” at first by Michael Kaye. Kevin Youkilis was solid in his move from third to first, but the extra speed required for left field left him looking like he couldn’t hack it, and even though the corner positions are great places to stick a team’s best sluggers, it rarely makes sense to move a broken-down catcher there instead of to first or (rarely) third. Even a solid third baseman wouldn’t necessarily have the ability to cover ground needed by an outfielder, even if he had the requisite ability to predict the ball’s flight – a skill that probably needs time in the field to develop.
Pitching is kind of a weird exception. The best example in recent memory has to be Rick Ankiel, whose meltdown on the mound during the World Series led to his second career as an outfielder. On the opposite side, Juan Salas went from being a cannon-armed third baseman to pitching reasonably well, and the Dodgers’ Kenley Jansen has saved 53 games for the Dodgers since being converted from light-hitting catcher to closer. (He also has a lifetime .500/.667/.500 batting line, in the “Utterly Meaningless Statistics” category.) Similarly, Ike Davis went from being his college team’s Friday-night starter to the least defensively-demanding position (first base) in the majors.
Defensive position moves tend to be difficult to make. In Duda’s case, he’d technically be moving up the defensive spectrum, but it’s hard to even consider the speed required to be a competent outfielder on the same scale as the abilities of an infielder. It’s unremarkable to me that Johnny Damon was able to move to first, but putting Youkilis in left field a few years ago was a true head-scratcher. In order to move a player freely between the infield and the outfield, you’ll need a special kind of player unless you’re willing to give up a lot defensively. As an economist, I’m all about specialization given constraints; Duda’s constraints are just too tight to make this move work.
Is Bobby Abreu a good investment for the Phillies? January 23, 2014
Posted by tomflesher in Baseball, Economics.Tags: Bobby Abreu, Jim Thome, Kelly Dugan, Phillies, Tyson Gillies, Zach Collier
add a comment
Bobby Abreu signed with the Phillies on a minor league deal, offering him $800,000 if he makes the major league squad. He’s coming off a solid Winter League season in Venezuela, in which he hit .322/.416/.461. His deal is a bit smaller than the one the Phils offered Jim Thome for 2012, when Thome was 41 (Abreu is 39). Of course, Thome was coming off of a much heavier-slugging season- his OPS in 2011 was .838, almost as high as Abreu’s Venezuelan OPS (and swamping his 2012 Majors OPS of .693). He might play the field on occasion (as Thome did, playing first base in 2012 for the first time since the Bush administration), but the Phils’ corner outfield is pretty solidly set up with Marlon Byrd and Domonic Brown starting.
Thome and Bobby both represent an odd trend – it’s not surprising, really, that the Phillies would want to bring back some of their old sluggers for nostalgia purposes, and they did employ Matt Stairs for longer than they should have – but the trend for a while was toward specialization of pinch hitters into the DH role in the American League. Thome started four games at first base for the Phillies in 2012, but otherwise appeared almost exclusively as a pinch hitter or DH (and in fact was traded to Baltimore once the Phillies’ interleague play ended). Bobby still has more in the tank defensively than Thome did, it seems, but he probably won’t start man
y more games than Thome did.
Given that the Phillies are going to use Abreu the way they used Thome, this doesn’t look like a bad deal. In order to be a reserve outfielder and present some value, Abreu will only have to beat out a few arms in spring training. He’s not in direct competition with John Mayberry, since Mayberry’s a right-handed bat. The Phils have three left-handed minors outfield prospects on their 40-man roster – Zach Collier, Kelly Dugan, and Tyson Gillies. Based on his 2013 numbers, Collier probably isn’t ready – at AA Reading, he hit .222/.310/.348. Dugan, who like Collier was born in September of 1990, looks like he might be better off, but only slightly – his .264/.299/.472 line in 56 AA games (plus slightly better numbers in 56 games at high A) indicate some solid power, but not much plate discipline. Of the three, Gillies (who’s two years older) may be the most mature, but his .264/.312/.477 line doesn’t represent much of a marginal improvement over Dugan. Plus, when he was promoted to AAA Lehigh Valley, he struggled, with a sub-.600 OPS.
From a development perspective, Collier and Dugan might be a better investment, but neither of them is a franchise player, at least based on numbers alone. Abreu represents a nice left-handed insurance bat off the bench.
What can the Mets do with Lucas Duda? January 22, 2014
Posted by tomflesher in Baseball.Tags: Chris Young, Curtis Granderson, Eric Young Jr., Juan Lagares, Lucas Duda
add a comment
As MetsBlog reported the other day, Lucas Duda is likely to be an outfielder this year. (In fact, he’s listed there on the current Mets roster.) For a number of good reasons, this probably isn’t a great idea. MetsMerized pointed out that this is probably a flailing attempt to find a good home for Duda, so let’s consider a few options.
They appear to be considering leaving him in the outfield, but this isn’t a great idea. Curtis Granderson is probably going to play every day, with Eric Young, Juan Lagares, and Chris Young rounding out the outfield. Lagares and Chris Young each hit righty, but Eric Young is a switch hitter and Granderson is a lefty, so Duda’s left-handed bat doesn’t offer a huge upgrade. Duda’s OPS is much better versus right-handers than Lagares’ (.831 vs .620), so that might present a decent platoon split, but starting Duda against right-handers would put Duda’s significantly worse defense in the field. Duda isn’t comfortable in the outfield, and his numbers bear this out – Duda’s 2013 defensive WAR is -2.1, including his time at first base (his natural position), whereas Lagares’ is 3.5. Duda’s further time on-base is useful, but his defense is too rough to count on. Eric Young hovers around the 0 dWAR mark and has a .647 OPS mark, still representing a defensive upgrade.
The Mets will probably start Ike Davis, keeping Duda from his natural position. Though Davis has struggled offensively, he’s been defensively quite good.
Probably the best thing the Mets could do with Duda is to try to shop him around. Don’t get me wrong, I like Duda, but he’s best shipped off to a team that can use him as a platoon first baseman or DH. We don’t even need to get much for him in return – picking up a few young, cheap bullpen arms might be a viable option. I’m not an expert on the trade market this year, but someone can make more use of Duda than we can, even if arbitration finds him worth $1.9 million.
Ike Davis and his 12% raise January 21, 2014
Posted by tomflesher in Baseball, Economics.Tags: Ike Davis, Mets, Pythagorean expectation
add a comment
So, Ike Davis was pretty lousy last year. He batted .205/.326/.334 in an injury-shortened season with 106 total bases on 377 plate appearances, meaning he expected to make it to first a bit over a quarter of the time. Throw in his paltry home run figures and a handful of doubles, and you’re not looking at a major-league first baseman; his 0.2 wins above replacement put him in the company of Lyle Overbay and Garrett Jones.
Now that that’s out of the way, I’d like to point out that Overbay played 142 games and Jones played 144; Davis definitely presented more bang for your buck than those two, especially since he was earning $3.125 million. He’ll be getting a 12% raise this year, having re-signed for $3.5 million. Again, his numbers were pretty lousy.
But if you add up all of Davis’s appearances as a starter, you’ll see that the Mets scored 354 runs in those games, and allowed 376, meaning that the Pythagorean expectation for those games is 0.46989 – that corresponds to an expectation of about 76 wins over a 162-game season (or 41 wins over Davis’ tenure). The Mets’ overall winning percentage was .457 (74 wins), and their Pythagorean expectation was about .45, corresponding to around 73 wins; but without Davis, the team scored 265 runs and allowed 308, leading to an expectation of .425 and around 69 wins on the season. Additionally, the team actually won only 39 of the 87 games Davis started, for about a .45 winning percentage – right on with their season-long expectation, and two wins below expectation.
Now, there are some caveats. When Davis was active, the team was still doing its best to win, and players like John Buck and Marlon Byrd were still active. Toward the end of the season, the Mets moved more toward development and away from trying to win every game. It’s therefore entirely possible that the effect of having Davis start the game are wrapped up in the team’s changing fortunes. Still, the team would have been expected to perform better with Davis in the lineup, at least according to the Pythagorean expectation formula, and actually underperformed.
Comparing Contracts: Parnell and Gee January 20, 2014
Posted by tomflesher in Baseball, Economics.Tags: Bobby Parnell, comparing contracts, Dillon Gee, Mets
add a comment
A few days ago, Bobby Parnell and Dillon Gee both re-signed with the Mets; though there are some incentives in Parnell’s deal, he’ll be making $3.7 million to Gee’s $3.625 million. Those numbers were oddly close (and the contracts similar despite the difference in position), so I decided to check out the players’ recent statistics. Since the players are each negotiating one-year deals, and these players are neither very old or very young, it seems reasonable to treat the best predictor of future performance as the players’ most recent performance.
Gee started 32 games (almost exactly every fifth game) in 2013 to a 3.62 ERA and a .301 opposing BABIP. The median numbers for starters with 162 or more innings pitched were about 3.51 and .295, so Gee is performing almost exactly like a full-time starter (and thus presumably a bit better than your average pitcher). Gee’s performance corresponds to 2.2 wins above replacement, a shade below the median of 3.0 for full-time starters.
I’m not Parnell’s biggest fan, and his season was shortened by an injury (causing him to miss all of August), so I expected the numbers not to operate in his favor. However, his 2.16 ERA is well below the median of relievers with 40 appearances or more, and his 0.7 WAR is right on the median. Oddly, his BABIP at .268 is much lower than the median of .290, indicating that he’s benefiting, to some degree, from good fielding behind him. If we restrict the numbers to only pitchers with 15 saves or more (all 32 of them), those medians adjust to 2.645, 1.4, and .277, respectively, keeping him on the good side of ERA and BABIP but cutting his WAR performance considerably. Let’s see if we can extrapolate – in 104 team games, Parnell played 49, meaning that he played in about 47% of the team’s games. At that pace, he probably would have been put into about 27 more games, meaning his current stats are about 65% of what his season stats might have been. In that case, let’s hold his BABIP and ERA constant and extend his WAR to 1.08 (by dividing by .65). That would have ranked him with Huston Street and Addison Reed – much better company than his current competition. It also, interestingly, would have put him much closer to Gee’s WAR, at a higher-leverage position.
Again, I’m not Parnell’s biggest fan, and I was skeptical about this deal. Assuming that the injury hasn’t harmed him, though, Parnell’s contract really does make sense compared to Gee’s.
Quick thoughts on the Mets August 11, 2012
Posted by tomflesher in Baseball.Tags: A.J. Ellis, BABIP, Mike Baxter, R.A. Dickey
add a comment
- So, I’m a little late to the party on this one, but way back on August 4, Mike Baxter tied the National League record for most walks in a 9-inning game with 5. 5 was, incidentally, his total number of plate appearances. That was unusual in part because prior to August 4, Baxter had made 82 plate appearances (mostly as a pinch hitter) and walked in 8 of them, for a rate of .0972 walks per plate appearance. That makes the probability of having five consecutive plate appearances all end in walks about .09755, or a little under 9 in every million five-PA strings. In total this year he’s walked 52 times in 342 plate appearances, for a rate of about .15 walks every appearance. The Pride of Whitestone seems to be normalizing upward.
- R.A. Dickey pitched a complete game gem Thursday afternoon. Batters facing Dickey have a .277 batting average on balls in play, compared with a league average of .299. Dickey may be benefiting from a slightly lower-than-expected BABIP, but he’s helping himself avoid the unpredictability of balls in play with a league-leading 166 strikeouts (tied with Stephen Strasburg). He’s leading the league in WHIP with just 1.004 walks plus hits per inning pitched. It’s a shame he’s on this year’s squad, or he’d be receiving serious consideration for the Cy Young. As it stands, Strasburg has a much better case on player value grounds.
- Just as a side note, A.J. Ellis of the Dodgers has had two games where he walked in every plate appearance – both of them were 4-plate-appearance games. His stats are otherwise pretty similar to Baxter’s. He just likes to bunch them up a bit more.
Pujols is happy to be in the AL West. That’s where the Rangers are. August 2, 2012
Posted by tomflesher in Baseball.Tags: ALbert Pujols, Angels, BABIP, Mike Trout
add a comment
Albert Pujols hit two home runs in the Angels’ 11-10 extra-innings loss to the Rangers yesterday night. It was his first multi-homer game since, well, the Angels 6-2 win over the Rangers the previous night. Albert’s last multi-homer game was an October 22 win over… yep, the Rangers.
Although he’s dragging a bit in comparison to previous years (currently hitting .049 home runs per plate appearance, as opposed to last year’s .057 and 2010’s .06), there’s an argument to be made that he’s the victim of bad luck. For example, the league’s batting average on balls in play (BABIP) is .292, and Pujols’ is a full .016 below that at .276. In his 401 at-bats, that’s about 6 hits that average defense wouldn’t have fielded. Mike Trout, on the other hand, is up at a BABIP of .400. That’s about 36 hits on his 333 at-bats that are above his expectation if he had the league’s average BABIP. This is emphatically not to say that Trout’s season is a fluke, or that Pujols’ is, but sometimes the human element of the game has odd results.
Game-Ending Outs: Maybe A-Rod isn’t as bad at this as we thought July 19, 2012
Posted by tomflesher in Baseball.Tags: A-Rod, Craig Kimbrel, game-ending outs, game-ending strikouts
add a comment
So, a friend of mine made the following comment on Facebook the other day:
Someone has to look this up for me. Has any player, ever, made the last out for his team more often than A-Rod? He’s like the bizarro Mo.
At the time, Alex Rodriguez and Curtis Granderson were tied with 5 game-ending outs apiece for the Yankees. Since then, I thought it would be interesting to see what the average “game-ending out” looks like, at least according to Baseball Reference.
As of July 18, there were 1264 game-ending outs in the majors this year. Aaron Hill, Jesus Guzman, and Kyle Seager are ties for the lead with 9 apiece, with a spate of other batters tied for second at 8. Unsurprisingly, the 8th batting-order position makes the game-ending out most often; managers (of course) tend to arrange their strongest batters earlier in the lineup. By and large, game-ending outs tend to be short at-bats, with 850 coming with 4 or fewer pitches.
450 were strikeouts, with the league-leading total of 5 shared by Edwin Encarnacion, Giancarlo Stanton, and Ryan Ludwick. Craig Kimbrel of Atlanta leads the league in game-ending strikeouts, having thrown 15 of them. Kimbrel also led last year, with 31, which surprised me. Mariano Rivera had less than 2/3 as many, with only 20.
The Spectrum Club, 2011 Edition January 19, 2012
Posted by tomflesher in Baseball.Tags: Darnell McDonald, Don Kelly, Michael Cuddyer, Mike McCoy, Mitch Maier, Skip Schumaker, Spectrum Club, utility pitchers, Wilson Valdez
add a comment
2011 yielded 5 new members to the prestigious* Spectrum Club; the Spectrum Club is a collection of baseball players who have played at each end of the defensive spectrum, playing at least one game each as a pitcher and designated hitter. Those players were Michael Cuddyer, Don Kelly, Mitch Maier, Mike McCoy, and Darnell McDonald.
Of these five, Kelly was the most versatile, playing at every position except second base and shortstop this year. Maier and McDonald were the least: each played three outfield positions in addition to pitching and hitting, while Cuddyer played first base, second base, and right field. McCoy, a typical utilityman, played second, third, short, center, and right. Kelly’s tenure on the mound was the shortest (one batter, one out), with everyone else pitching a full inning. McDonald gave up two runs on a hit and two walks in six batters faced; Maier faced four and gave up one hit, but no runs; Cuddyer allowed one hit and walked one for six batters faced and no runs; and McCoy pitched a perfect inning.
There’s no telling who will join these fellows next year – Skip Schumaker and Wilson Valdez each pitched an inning this year, but while Valdez is a journeyman, he’s unlikely to DH, and Schumaker is locked in with the Cardinals for the next two years.
*not a guarantee