Is scoring different in the AL and the NL? May 31, 2011
Posted by tomflesher in Baseball, Economics.Tags: American League, Baseball, baseball-reference.com, bunts, Chow test, linear regression, National League, R, structural break
1 comment so far
The American League and the National League have one important difference. Specifically, the AL allows the use of a player known as the Designated Hitter, who does not play a position in the field, hits every time the pitcher would bat, and cannot be moved to a defensive position without forfeiting the right to use the DH. As a result, there are a couple of notable differences between the AL and the NL – in theory, there should be slightly more home runs and slightly fewer sacrifice bunts in the AL, since pitchers have to bat in the NL and they tend to be pretty poor hitters. How much can we quantify that difference? To answer that question, I decided to sample a ten-year period (2000 until 2009) from each league and run a linear regression of the form
Where runs are presumed to be a function of hits, doubles, triples, home runs, stolen bases, times caught stealing, walks, strikeouts, hit batsmen, bunts, and sacrifice flies. My expectations are:
- The sacrifice bunt coefficient should be smaller in the NL than in the AL – in the American League, bunting is used strategically, whereas NL teams are more likely to bunt whenever a pitcher appears, so in any randomly-chosen string of plate appearances, the chance that a bunt is the optimal strategy given an average hitter is much lower. (That is, pitchers bunt a lot, even when a normal hitter would swing away.) A smaller coefficient means each bunt produces fewer runs, on average.
- The strategy from league to league should be different, as measured by different coefficients for different factors from league to league. That is, the designated hitter rule causes different strategies to be used. I’ll use a technique called the Chow test to test that. That means I’ll run the linear model on all of MLB, then separately on the AL and the NL, and look at the size of the errors generated.
The results:
- In the AL, a sac bunt produces about .43 runs, on average, and that number is significant at the 95% level. In the NL, a bunt produces about .02 runs, and the number is not significantly different from saying that a bunt has no effect on run production.
- The Chow Test tells us at about a 90% confidence level that the process of producing runs in the AL is different than the process of producing runs in the NL. That is, in Major League Baseball, the designated hitter has a statistically significant effect on strategy. There’s structural break.
R code is behind the cut.
Complete Game Shutout… PSYCH! May 30, 2011
Posted by tomflesher in Baseball.Tags: Angels, Hank Conger, Jason Vargas, Jered Weaver, Mariners, Orioles, Phillies, pitcher, Reds, Roy Halladay, Travis Wood, Twins, weird lines, Zach Britton
add a comment
Jered Weaver pitched a brilliant game Saturday night for the Angels against the Twins. He’s had a strange opening to the season, starting with six straight wins and then beginning May with four straight losses followed by a no-decision. Saturday, on four days rest, he pitched nine scoreless innings with 2 hits, 0 runs, 2 walks, 7 strikeouts, no hit batsmen, a Game Score of 88, and a career-high 128 pitches. It’s a good thing he grabbed another win… wait, no he didn’t. The game went into extra innings, the Angels lost, and Weaver walked off the mound with a no decision.
Put another way, if anyone had managed to hit a home run, or if Hank Conger had singled instead of popping fly to third in the eighth, Weaver would have a two-hit complete game shutout, and we’d be talking about how he still had it. Instead, he gets a no decision, and the Angels lost the game.
That doesn’t happen a whole lot, but it does happen enough to take notice. For example, on May 12, a 2-1 win for the Orioles over the Mariners was 0-0 into the 12th. So, both the Mariners’ Jason Vargas (9 IP, 7 H, 0 R, o ER, 1 BB, 4 K, 76 GSc) and the Orioles’ Zach Britton (9 IP, 3 H, 0 R, 0 ER, 0 BB, 5 K, 86 GSc) left with complete game shutouts that weren’t.
Similarly, last year, on July 10, Roy Halladay was outpitched by the Reds’ Travis Wood in an 11-inning 1-run loss. Wood managed a game score of 93 on one hit, no walks, and 8 strikeouts, whereas Halladay had a paltry 85 on 5 hits, 1 walk and 9 strikeouts. Neither man got the win, which went to Phillies reliever Jose Contreras.
Zambrano Back on the Horse May 27, 2011
Posted by tomflesher in Baseball.Tags: Carlos Zambrano, Chris Young, Cubs, Dan Haren, Mel Stottlemyre, Mets, Micah Owings, Pitchers batting
add a comment
Last night, Carlos Zambrano pitched on one day’s rest after pinch-hitting against the hapless Mets for two RBIs on Tuesday. We’ve talked about Zambrano’s pinch-hitting prowess before, but last night he was an awesome 3 for 3 from the plate, including a double. In fact, in 26 plate appearances, Zambrano’s got 9 hits for a .375 batting average and, since he has no walks, a .375 on-base percentage. Not only is that impressive, but I hear he can pitch, too.
I figured that was pretty impressive. It can’t be often that a pitcher gets three at-bats and hits for all of them, can it? It’s happened 450 times since 1919, including, surprisingly, once already this year. The Mets’ Chris Young managed a 3-for-3 night while notching the win against the Phillies back on April 5.
In recent memory, the most at-bats by a pitcher who hit each time was Dan Haren, who grabbed a 4-for-4 as a Diamondback against the Cardinals last year (also as the winning pitcher). Haren also gave up a whopping 7 runs, so he’s lucky he was hitting.
Micah Owings has had two games where he pitched and hit in all of at least 3 plate appearances, including a 4-for-4 from 2007 in which three of his 4 hits were doubles.
Finally, Mel Stottlemyre (in 1964) and two pitchers from the 1920s had 5-for-5 games. Stottlemyre’s two-hit gem included him hitting a double and pitching to a game score of 83.
Complete Game in a Non-Quality Start May 26, 2011
Posted by tomflesher in Baseball.Tags: Cheap Wins, complete games, Cubs, Dillon Gee, Edwin Jackson, Freddy Garcia, Mets, Pete Harnisch, quality starts
add a comment
Dillon Gee of the Mets was credited with a complete game in last night’s win over the Cubs. His line: 6 IP, 4 H, 4 R, 4 ER, 2 BB, 4 K, 0 HR, and 1 HBP, for a game score of 50. He qualified for a quality start under the Game Score definition, but not under the six-inning, three-run criterion. That makes it a form of Cheap Win, where a pitcher is credited with a win even though he didn’t pitch as effectively as expected.
Since the game was shortened by rain, Gee got a complete game, even though that usually involves 8 innings for the visiting pitcher on a losing team or 9 inning for a winning pitcher regardless. That made me wonder how many pitchers from the modern era, when complete games are less common than in previous years, have pitched complete games in non-quality starts.
A quality start, under the Game Score definition, is a start with less than 50 points. That represents that a pitcher had negative value for his team. It can’t be especially common, can it?
According to this list I queried from Baseball Reference, a non-quality start complete game hasnt been pitched since 2006 when Freddy Garcia pitched a rain-shortened 5-inning complete game for the White Sox to defeat the Blue Jays 6-4, with a game score of 42. The last nine-inning complete game non-quality start was Pete Harnisch with the Reds, who won a 10-6 slugfest in August of 2000 on 124 pitches with only one walk and three strikeouts. Aside from the six earned runs (all scored in the first three innings) it wasn’t a bad performance, somewhat reminiscent of Edwin Jackson‘s ugly but effective no-hitter last year.
Wilson Valdez, Utility Pitcher Extraordinaire May 26, 2011
Posted by tomflesher in Baseball.Tags: Brent Mayne, Carlos Fisher, Danys Baez, free baseball, Phillies, position players pitching, Reds, ridiculously long games, Roy Halladay, Scott Rolen, Wilson Valdez
1 comment so far
Interested in position players who pitched? Check out The Best Game Ever and a previous post on what I like to call Utility Pitchers.
So, the Phillies and the Reds went into extra innings last night and Wilson Valdez was the winning pitcher. His line: 1.0 IP, 0 H, 0 R, 0 ER, 0 BB, 0 K, 0 HR, 4 BF on 10 pitches. He did have a hit batsman – Scott Rolen – but that’s not surprising, since Valdez has never pitched professionally at any level.
First of all, let me say that I’m thoroughly impressed with the way both managers managed the game. Ordinarily, a 19-inning game is full of spot relievers going a few innings each and at some point the managers seem to lose control of the situation and start panicking. The most common solution is to throw starters in on their throw day, which is how Mike Pelfrey got his save last year. Instead, Reds manager Dusty Baker seemed to know that Carlos Fisher, who has never started a game at the Major League level, had 5 2/3 innings of starter-quality stuff in him. Similarly, the Phillies’ Charlie Manuel relied on Danys Baez, who hadn’t pitched more than four innings since the Bush administration, for five innings that would have made any manager happy. To offer some perspective, if Baez had pitched his five innings at the beginning of the game and been lifted, his game score would have been 67; Fisher’s would have been 58 had he been removed from the game at the moment he gave up his run. That’s not only a quality start for each pitcher, but both of the relievers put together a higher game score than their team’s starter.
Oh, yeah, and the Phillies’ starter was Roy Halladay.
Also, Wilson Valdez had an incredible night. In addition to becoming the first position player to be the winning pitcher since 2000, Valdez started the game at second base and went 3 for 6 with a walk. To compare, when catcher Brent Mayne was the Rockies’ winning pitcher in 2000, he came in off the bench and didn’t bat at all.
Hats off to Charlie Manuel and Dusty Baker for managing a smart game, and bravo to Wilson Valdez for solid inning pitched and a great night at the plate.
Is ‘luck’ persistent? May 25, 2011
Posted by tomflesher in Baseball, Economics.Tags: American League, Baseball, Pythagorean expectation, wins above expectation
2 comments
I’ve been listening to Scott Patterson’s The Quants in my spare time recently. One of the recurring jokes is Wall Street traders’ use of the word ‘Alpha’ (which usually represents abnormal returns in finance) to refer to a general quality of being skillful or having talent. That led me to think about an old concept I haven’t played with in a while – wins above expectation.
As a quick review, wins above expectation relate a team’s actual wins to its Pythagorean expectation. If the team wins more than expected, it has a positive WAE number, and if it loses more than expected, it has wins below expectation, or, equivalently, a negative WAE. It’s tempting to think of WAE as representing a sort of ‘alpha’ in the traders’ sense – since the Pythagorean Expectation involves groups of runs scored and runs allowed, it generates a probability that a team with a history represented by its runs scored/runs allowed stats will win a given game. If a team has a lot more wins than expected, it seems like that represents efficiency – scoring runs at crucial times, not wasting them on blowing out opponents – or especially skillful management. Alternatively, it could just be luck. Is there any way to test which it is?
It’s difficult. However, let’s break down what the efficiency factor would imply. In general, it would represent some combination of individual player skill (such as the alleged clutch hitting ability) and team chemistry, whether that boils down to on- or off-field factors. Assuming rosters don’t change much over the course of the year, then, efficiency also shouldn’t change much over the course of the year. Similarly, if a manager’s skill was the primary determinant of wins above expectation, then for teams that don’t change managers midyear, we wouldn’t expect much of a change throughout the course of the season. Most managers work up through the minors, so there probably isn’t a major on-the-job training effect to consider.
On the other hand, if wins above expectation are just luck, then we wouldn’t need to place any restrictions on them. Maybe they’d change. Maybe they wouldn’t. Who knows?
In order to test that idea, I pulled some data for the American League off Baseball Reference from last season. I split the season into pre- and post-All-Star Break sets and calculated the Pythagorean expectation (using the 1.81 exponent referred to in Wikipedia) for each team. I found WAE for each team in each session, then found each team’s ‘Alpha’ for that session by dividing WAE by the number of games played. Basically, I assumed that WAE represented extra win probability in some fashion and assumed it existed in every game at about the same level. The results:
As is evident from the table, a whopping 10 out of the 14 teams see a change in the sign of Alpha from before the All-Star Game to after the All-Star Game. The correlation coefficient of Alpha from pre- to post-All-Star is -.549, which is a pretty noisy correlation. (Note also that this very closely describes regression to the mean.) It’s not 0, but it’s also negative, implying one of two things: Either teams become less efficient and/or more badly managed, on average, after the break, or Alpha represents very little more than a realization of a random process, which might just as well be described as luck.
Justin Verlander, Second No-Hitter of 2011 May 8, 2011
Posted by tomflesher in Baseball.Tags: Bleacher Report, Justin Verlander, no-hitters, poisson distribution, Ricky Romero
add a comment
Yesterday, the Tigers’ Justin Verlander threw 2011’s second no-hitter, blanking the Blue Jays on a phenomenal 9-0 win. Verlander’s line was 9.0 innings pitched, 0 runs, 0 hits, 0 errors, 1 walk, 4 strikeouts, 108 pitches (74 for strikes) and a game score of 90. The Jays’ Ricky Romero took the loss.
It was Verlander’s second no-hitter. Although Bleacher Report’s Josh Rosenblat suggests that this is the start of a second “Year of the Pitcher,” I would be careful about making that conclusion. Remember that no-hitters are probably poisson distributed, so we should expect clumps of them. If pitchers keep it up, I’ll start crunching more numbers, but for now it’s just an odd coincidence.
Francisco Liriano, First No-Hitter of 2011 May 4, 2011
Posted by tomflesher in Baseball.add a comment
Last night, Francisco Liriano of the Minnesota Twins pitched the first no-hitter of the year in a 1-0 win over the Chicago White Sox. Edwin Jackson, who threw a no-hitter of his own last year, took the loss.
Liriano’s line featured 9.0 innings pitched, 0 hits, 0 runs, 6 walks, 2 strikeouts, 30 batters faced, 123 pitches (66 of them strikes), and a game score of 83. Though Justin Morneau had no hits on the night, he walked once and was credited with making several excellent plays at first base to maintain Liriano’s win. The only run of the night was a home run by designated hitter Jason Kubel.