
Baseball Prospectus home

October 27, 2011

Manufactured Runs

Matchup Madness

by Colin Wyers


With the Cardinals facing elimination, Game Six will be an all-hands-on-deck endeavor. Both managers are scouring their rosters for any potential advantage, and as part of that effort, they’ll probably be referring to historical batter-pitcher matchups. Should La Russa lean heavily on a player like Octavio Dotel, who has historically done well against Rangers hitters like Adrian Beltre and Michael Young? Or should he opt for the players with the best overall performance, regardless of what the matchups say?

Let’s say we want to predict the outcome of a particular batter-pitcher matchup. To measure outcomes, I’m going to lean heavily on True Average (TAv), which is scaled to look like batting average but captures a player’s total batting value (so a player gets a little credit for a walk and a bit more for a single, all the way up to a home run).

How would we predict the outcome if we knew nothing about how a particular batter-pitcher pair had fared against each other? First we’d want to know the talent level of the batter and pitcher involved. In this case, we’ll use the batter’s TAv and the pitcher’s TAv against over the previous three seasons. To turn those two values into a single expected TAv for the matchup, we can use something called the log5 method.
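For readers who want to see the arithmetic, here is a minimal sketch of one common form of log5 for rate stats, the version that adjusts for the league average. The exact variant used in this study isn't spelled out, so treat the function name, the sample inputs, and the .260 league baseline (TAv is scaled so that average is .260) as assumptions for illustration only.

```python
LEAGUE_TAV = 0.260  # assumed league-average TAv


def log5(batter, pitcher, league=LEAGUE_TAV):
    """Expected matchup rate from the batter's rate, the pitcher's rate
    allowed, and the league rate (league-adjusted log5)."""
    num = batter * pitcher / league
    den = num + (1 - batter) * (1 - pitcher) / (1 - league)
    return num / den


# Hypothetical inputs: a .300 TAv batter facing pitchers who allow .240 and .260.
print(round(log5(0.300, 0.240), 3))  # 0.278
print(round(log5(0.300, 0.260), 3))  # 0.3
```

Against a league-average pitcher the formula hands back the batter's own rate, and a better-than-average pitcher pulls the expectation down, which is the behavior we want from a neutral baseline.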

Once we have that expected value, we can also look at the TAv from that batter-pitcher matchup in all previous seasons. Running this on data from 1951 through 2011 gives us sixty years of data and over 16,000 data points to look at.

Using a technique known as ordinary least squares regression, we can see how well our expected TAv and our prior batter-pitcher matchup TAv predict future batter-pitcher matchup TAv. After controlling for whether the batter has the platoon advantage, what we find is that our log5 estimate of the outcome of a batter-pitcher matchup is 67 times more predictive than the batter’s past performance against that pitcher. Now, that’s slightly better for the batter-pitcher matchup data than we might have expected; on average, there were 78 times as many plate appearances (PA) behind the log5 expectation as there were behind the batter-pitcher matchup. (Since both batter PA and pitcher PA against go into the log5 expectation, I used what’s known as a harmonic mean of the two to come up with the PA total for the log5 expectation.)
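To make the regression step concrete, here is a rough sketch of ordinary least squares with two predictors, solved through the normal equations in plain Python. It is not the code behind the study: the function name `ols_fit` is hypothetical, the platoon control is left out, and the six data rows and the weights used to build the targets are made up so that the fit has a known answer.

```python
def ols_fit(rows, targets):
    """Least-squares coefficients [intercept, b1, ..., bk] via the normal equations."""
    X = [[1.0] + list(r) for r in rows]  # prepend an intercept column
    k = len(X[0])
    # Augmented system (X'X | X'y).
    A = [[sum(x[i] * x[j] for x in X) for j in range(k)]
         + [sum(x[i] * y for x, y in zip(X, targets))]
         for i in range(k)]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(k):
            if r != col:
                factor = A[r][col] / A[col][col]
                A[r] = [a - factor * b for a, b in zip(A[r], A[col])]
    return [A[i][k] / A[i][i] for i in range(k)]


# Made-up (log5 expectation, prior matchup TAv) pairs, not data from the study.
rows = [(0.250, 0.100), (0.270, 0.400), (0.260, 0.260),
        (0.290, 0.000), (0.240, 0.520), (0.280, 0.310)]
# Targets built from arbitrary weights (0.8 and 0.1), with no noise added.
targets = [0.03 + 0.8 * e + 0.1 * m for e, m in rows]

intercept, w_log5, w_matchup = ols_fit(rows, targets)
print(round(w_log5, 3), round(w_matchup, 3))
```

Because the toy targets contain no noise, the fit simply recovers the weights that were put in. With real matchup data, the relative size of the two fitted weights is what tells us how much each predictor contributes.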

We can conclude that one plate appearance against a specific pitcher is slightly more predictive than a plate appearance against any pitcher at all. But that effect is dwarfed by the number of plate appearances a batter makes against all pitchers, and historical trends are widening the gap. In the 1960s, the batters and pitchers with the most history against each other might have given us as many as 240 plate appearances from which to draw conclusions. On average, there were 14 plate appearances between a batter and pitcher in previous seasons to draw from. Since 2000, however, the most plate appearances between a batter and pitcher in past seasons is only 148, and the average has fallen to just seven. In other words, the frequency of a batter seeing any particular pitcher has dropped by half. Expansion, interleague play, and free agency have combined to reduce the number of times any particular batter and pitcher have faced off in the past.
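The per-plate-appearance comparison can be checked on the back of an envelope. The sketch below shows how a harmonic mean combines two PA counts, then divides the two ratios quoted in this piece (78 and 67). The sample PA counts are made up, and the last step assumes predictive weight scales in proportion to plate appearances, which is a simplification.

```python
def harmonic_mean(a, b):
    """Harmonic mean of two positive counts; it is pulled toward the smaller one."""
    return 2 * a * b / (a + b)


# Hypothetical sample sizes behind one log5 estimate.
batter_pa = 1800   # batter's plate appearances over three seasons (made up)
pitcher_pa = 600   # plate appearances against the pitcher (made up)
print(harmonic_mean(batter_pa, pitcher_pa))  # 900.0

# Figures quoted in the article.
pa_ratio = 78          # PA behind the log5 estimate per matchup PA, on average
predictive_ratio = 67  # log5 weight relative to the matchup weight
per_pa_edge = pa_ratio / predictive_ratio
print(round(per_pa_edge, 2))  # 1.16
```

On that reading, a plate appearance against the specific pitcher is worth roughly 16 percent more than a generic one, an edge that is real but swamped by the sheer volume of generic plate appearances.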

Consider one area in which improperly valuing matchup data could potentially still cost the Cardinals in the World Series. If the Cardinals are able to hold on tonight, they would face a Game 7, and one of their starting pitcher options would be bringing Edwin Jackson back on short rest. Jackson is the Cardinals pitcher with the most history against the Rangers (due to his time in the American League). He has been effective against Michael Young, holding him to a .192 TAv in 21 plate appearances, but otherwise the Rangers have largely teed off on him—Ian Kinsler owns a .372 TAv in 16 plate appearances, Endy Chavez has a .359 TAv in all of three plate appearances, and David Murphy has a .488 TAv in 10 plate appearances. None of that is significant enough that La Russa should consider it strongly in deciding which pitcher to use for Game 7. And if La Russa does bring in Jackson, Murphy’s history against Jackson shouldn’t factor into Washington’s decision-making process; his 10 plate appearances against Jackson are nowhere near enough to outweigh Murphy’s underwhelming career numbers.

But what about cases where a batter has really owned a pitcher in the past, just utterly demolished him? Let’s restrict ourselves to cases with a prior TAv of .520 or better against a pitcher, or at least twice the average TAv. (By happy coincidence, that’s just about two standard deviations above the average, for those of you who care about such things.)

Historically, these have been more predictive of batter success than ordinary batter-pitcher matchups. But they are still dwarfed by the predictive power of our log5 expectation, by a factor of about 24. A manager is likely doing himself a favor if he puts a guy with that kind of extreme success in the lineup in place of a batter who’s otherwise reasonably close in ability. However, such cases are extremely rare, and even in these extreme cases, the whole of a batter’s historical performance (combined with knowledge of the platoon advantage) is still a much better gauge of how a batter will perform against a pitcher going forward.

It’s easy to read words like “harmonic mean” and “ordinary least squares” and dismiss these findings as something disconnected from what takes place on the field. But this study (and any other like it) is as much about those things as archaeology is about pickaxes and hammers. They’re just tools to expose the truth lurking within masses of data generated not inside some sterile sabermetric laboratory, but by the actions of the players themselves—in this case, an exhaustive record of batter-pitcher matchups over sixty years of baseball history. The data isn’t telling us that batters can’t pick up certain cues about a pitcher, or that a pitcher’s repertoire is equally suited to all batters. However, 10, 50, or even 100 plate appearances aren’t enough to tell us whether what we’re seeing is one player with a special edge against another, or simply a small-sample-size fluke, and there’s too much at stake for La Russa and Washington to let themselves be overly swayed by such statistics to the detriment of their teams.






Batter            Pitcher
Michael Cuddyer   John Danks
Mark Teahen       Mark Buehrle
Prince Fielder    Carlos Zambrano
Todd Helton       Livan Hernandez
Jim Thome         Justin Verlander
Carlos Lee        Carlos Zambrano
Derrek Lee        Javier Vazquez
Joe Mauer         Justin Verlander
Bobby Abreu       Javier Vazquez
Albert Pujols     Ryan Dempster



A version of this story originally appeared on ESPN Insider.

Colin Wyers is an author of Baseball Prospectus. 