May 24, 2009

Prospectus Idol Entry

Dare to Compare

by Ken Funck

Back in the mid-1970s I was an infielder for the New York Yankees. I was pretty good, too, leading the team in a number of offensive categories and appearing in a few All-Star games. My best season was probably 1977, when I managed to finish the season with a .633 batting average. So for at least a few years, you could have said I was a better hitter than Rod Carew.

Of course, if you had actually said that, you'd be crazy. Intuitively, we all know that hitting success in the Elmhurst Baseball League isn't exactly the same as hitting success in the American League. The differences between the two in terms of level of competition, playing environment, even the actual rules of the game are so vast that there's no need to even try to quantify them. But what about when the differences aren't quite so obvious, and the idea of comparing two sets of statistics isn't so laughable: say, if we wanted to compare a prospect in the Midwest League to one in the Carolina League, or Roy Halladay to Jake Peavy, or even Ty Cobb to Pete Rose? Maybe that last one is laughable, but for the other cases there's a whole category of metrics designed to help make such comparisons possible: translated statistics.

The idea behind a translated statistic (you'll also frequently see the term "equivalent"; don't get tripped up by the semantics, it's pretty much the same thing) is to take a player's "raw" stats, which accurately record what happened during a play, a game or a season, and translate them in such a way as to make them more directly comparable to those of other players who accumulated similar "raw" stats, but possibly in a very different environment.

"Raw" stats come in three delicious flavors:

Counting Stats, which basically aggregate certain events that occur during games. Hits, RBIs and Earned Runs fall into this category. Counting stats are building blocks for most other statistics, but as fun as they are to look at and memorize, and as important as they often are for a fantasy baseball team, they lose a lot of their utility when comparing the actual productivity of two players. Counting stats don't account for variance in playing time: the more plate appearances you have, the more chances you have to hit a home run. They also don't account for variance in opportunity: the more frequently you bat with runners on base, the easier it is to accumulate high RBI totals. These problems can be partially resolved by instead using...
Rate Stats, which are usually represented as a percentage or ratio. Batting Average, OBP and WHIP are all in this category, and they're all calculated by dividing the number of times something happened (e.g., a Hit) by the number of opportunities for it to happen (e.g., an At-Bat). This makes rate stats much better to use for comparisons between players, though by no means perfect.
Value Stats, which can be represented as either a number or a rate, but are still derived by performing straightforward math functions on counting stats. Two examples are Runs Created and Offensive Winning Percentage. These stats often try to combine various counting and rate stats to come up with a single number that represents the total offensive contribution a player is providing to his team.
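
To make the step from counting stats to rate stats concrete, here is a minimal Python sketch that derives batting average and slugging percentage from Ron Santo's 1968 counting stats, as listed in the Actual Stats table in this article. The function names are my own and purely illustrative.

```python
def batting_average(hits, at_bats):
    """A rate stat: events (hits) divided by opportunities (at-bats)."""
    return hits / at_bats


def total_bases(hits, doubles, triples, home_runs):
    """Every hit is worth one base; extra-base hits add the remainder."""
    return hits + doubles + 2 * triples + 3 * home_runs


def slugging(hits, doubles, triples, home_runs, at_bats):
    """Total bases per at-bat."""
    return total_bases(hits, doubles, triples, home_runs) / at_bats


# Ron Santo, 1968: 577 AB, 142 H, 17 2B, 3 3B, 26 HR
ba = batting_average(142, 577)
slg = slugging(142, 17, 3, 26, 577)
print(f"BA {ba:.3f}, SLG {slg:.3f}")  # BA 0.246, SLG 0.421
```

Both results match the BA and SLG columns in the table, which is the point: a rate stat is only ever as good as the counting stats underneath it.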

While rate stats and value stats definitely make it easier to compare players than counting stats do, they still rely entirely on the counting stats themselves. The problem when it comes to comparing players is that the environment, often called the "context," in which those hits, walks and strikeouts were recorded often varies wildly. Comparing Ken Funck's ability to turn around 45 mph fastballs at a .633 clip to Rod Carew's MVP season is an extreme example, of course, but even when looking at two major league players there is a plethora of external factors that might complicate the comparison: ballpark factors, league factors, quality of competition, weather, umpiring, etc., ad nauseam.

And that's where translated statistics come in: their job is to try to strip away the external factors that muddy up "raw" stats, so that players can be compared more accurately. Baseball Prospectus hosts a king's ransom of translated statistics; just take a stroll through the BP Glossary and you'll see even Neifi Perez couldn't swing a bat in there without hitting something that's been adjusted for ballpark, league difficulty, era and/or quality of opponent.

A good example is Clay Davenport's Equivalent Average (EqA), often used as the chassis for much of BP's statistical work. The BP Glossary defines EqA thusly:

A measure of total offensive value per out, with corrections for league offensive level, home park, and team pitching. EQA considers batting as well as baserunning, but not the value of a position player's defense. The EqA adjusted for all-time also has a correction for league difficulty. The scale is deliberately set to approximate that of batting average. League average EqA is always equal to .260. EqA is derived from Raw EqA, which is (H + TB + 1.5*(BB + HBP + SB) + SH + SF) divided by (AB + BB + HBP + SH + SF + CS + SB). REqA is then normalized to account for league difficulty and scale to create EqA.

Note that the main ingredient when cooking up EqA is REqA, and further note that you can prepare REqA yourself by merely sprinkling some simple math operators over a generous bed of counting stats. So REqA is a "raw" stat (it says so right on the box), subject to the same problems of context that apply to every other "raw" stat. It's the next process, applying "corrections for league offensive level, home park and team pitching," that turns REqA into EqA, and defines EqA as a translated stat. The mechanism for this is described in more (but not complete) detail in the above link, but here's the 30,000-foot explanation: REqA is shape-shifted into runs produced, compared to the league's run scoring environment to determine how much better or worse than average those runs produced are, and then that difference is applied to a base EqA of .260. The same process can be applied to any major league player in any season, translating their "raw" stats into a fictional league where the average player has an EqA of .260. Once translated, players from different teams, leagues and eras can be compared with ease.
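
For the do-it-yourself crowd, the Raw EqA recipe quoted from the Glossary can be sketched in a few lines of Python. This covers only the "raw" step; the normalization that turns REqA into EqA is not spelled out in this article, so it is deliberately left out. The stat line below is invented for illustration and does not belong to any real player.

```python
def raw_eqa(h, tb, bb, hbp, sb, cs, sh, sf, ab):
    """Raw EqA, following the formula in the Glossary definition quoted above:
    (H + TB + 1.5*(BB + HBP + SB) + SH + SF) / (AB + BB + HBP + SH + SF + CS + SB)
    """
    numerator = h + tb + 1.5 * (bb + hbp + sb) + sh + sf
    denominator = ab + bb + hbp + sh + sf + cs + sb
    return numerator / denominator


# Hypothetical season, made up purely to exercise the formula
example = dict(h=150, tb=250, bb=60, hbp=5, sb=10, cs=5, sh=2, sf=5, ab=550)
print(round(raw_eqa(**example), 3))
```

Note that the number that comes out is not on the batting-average scale; putting it there is the job of the normalization step described above.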

Let's take EqA for a spin by comparing the raw statistics of two Chicago Cub hitters from different eras:


Actual Stats     Year   AB  H  2B 3B HR  R RBI    BA    OBP    SLG    OPS    EqA*
Ron Santo        1968  577 142 17  3 26 86  98  0.246  0.354  0.421  0.775  0.301
Henry Rodriguez  1998  415 104 21  1 31 56  85  0.251  0.334  0.530  0.864  0.284
*calculated for season

Just eyeballing the counting and rate stats above would lead someone to the conclusion that Hammerin' Hank's 1998 season was far more impressive than Santo's rather pedestrian summer 30 years before. Rodriguez hit more home runs in many fewer at bats, leading to an edge of more than 100 points in slugging percentage and nearly 90 points in OPS. But the elephant in the room is the final stat: EqA, calculated here to compare Santo and Rodriguez directly to their peers that season. Remember, EqA is calibrated to set an average player at .260; thus Santo's .301 is very good. But to really see the difference, take a gander at the same statistics translated for all time (taken from each player's DT card):


Translated Stats  Year   AB  H  2B 3B HR  R  RBI    BA    OBP    SLG    OPS   EqA**
Ron Santo         1968  565 142 23  2 40 106 121  0.251  0.380  0.512  0.892  0.298
Henry Rodriguez   1998  404  98 18  1 31  52  76  0.243  0.326  0.522  0.848  0.279
**calculated for all time

After this translation we can directly compare counting and rate stats. When accounting for the intimidating environment of the Swingin' (and Missin') Sixties, as well as the offensive fireworks of the late nineties, Santo has made up nearly the entire gap in slugging percentage, and opens up a 44-point lead in OPS. The value of translating statistics in this way should be pretty obvious to both the casual fan who wants to win a bar bet and the fantasy baseball owner who wants to see how much a player's stats are being boosted or suppressed by their home ballpark.
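
If you'd like to check the arithmetic, here is a small Python sketch that measures Santo's edge over Rodriguez in SLG and OPS, before and after translation. The numbers come straight from the two tables above; the helper name is my own.

```python
# (SLG, OPS) for each player, copied from the two tables above
actual = {"Santo": (0.421, 0.775), "Rodriguez": (0.530, 0.864)}
translated = {"Santo": (0.512, 0.892), "Rodriguez": (0.522, 0.848)}


def santo_edge_in_points(stats):
    """Santo minus Rodriguez, in points (thousandths), for SLG and OPS."""
    (s_slg, s_ops), (r_slg, r_ops) = stats["Santo"], stats["Rodriguez"]
    return round((s_slg - r_slg) * 1000), round((s_ops - r_ops) * 1000)


print(santo_edge_in_points(actual))      # (-109, -89)
print(santo_edge_in_points(translated))  # (-10, 44)
```

A negative number means Rodriguez is ahead: he leads by 109 points of slugging on the raw line, but by only 10 once both seasons are translated, while the OPS column flips in Santo's favor.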

Best of all, Equivalent Average is just one of many translated stats that have been developed, some focusing on total value, some on just a specific aspect of play (e.g., fielding, baserunning, relief pitching), some focused on comparing minor league and major league performance. Virtually anything can be considered a factor that might affect a stat. Armchair analysts have spent countless hours perfecting new and different translation methods to normalize for new and different variables; spend a quiet evening with your favorite search engine and a refreshing beverage, and you'll find a wealth of equivalent stats that delight and amuse. If you've got the math chops and some database software, you can even roll your own. And if it's worthwhile, easy to use and (most importantly) defensibly accurate, you too might get to see your name in lights at the top of a sortable stat column.

Ken Funck is an author of Baseball Prospectus.

36 comments have been left for this article.

Kevin Goldstein

BP staff

This one is hard for me because it's a little more basic, with a lot more hand holding, and also because I have my own problems with translations where I don't think we have apples to apples comparisons. For example, I think if you dropped Jack Cust into the 1920s, he'd be better than Babe Ruth. But that's just me, and has nothing to do with this piece. Getting away from the baseball for a second, I think you are a good WRITER, and I'm interested in seeing what you do with a different subject.

May 24, 2009 00:27 AM

Will Carroll

BP staff

I love the construct here. He really draws me in from the first sentence and then takes me through the process in a clear fashion. He opens up a bit, quoting some from Clay that could be a bit intimidating to the intended audience, but pulls it right back. It works because it's like he's going "this is complex, but you're smart enough to understand. Let me show you." He nails the tone. There's some nitpicks here and there, but solid work.

May 24, 2009 08:07 AM

Christina Kahrl

BP staff

This was exactly what I wanted from a Basics piece: a basic explanation of a complicated concept, yet engaging, because Ken's a writer who isn't afraid to mix up a bit of self-mockery with a confidence in his use of examples and (supported and supportable) assertions. Were I to have points to give, I'd assign them for the tidy and appropriate reductionism as far as the tasty types of stats folks can sample from.

May 24, 2009 10:38 AM

Richard Bergstrom

(36532)

I don't think this article was "bad", but something about it just wasn't quite right. If the idea is to explain "the basics", then why move in, out, then back in to discussing EqA? It seems you were discussing the concepts of equivalencies instead of EqA itself. EqA itself isn't discussed until halfway through the article, then disappears for a few paragraphs to talk about a "DT Card" (Davenport Translation) thrown in there, then the article concludes with EqA. It seems to me it would have been best to stick with either EqA or with the DT Card. Also, if this article is intended for new people, it only makes sense to say what a "DT Card" is, who it is named after (Clay Davenport) and explain it to the reader...

I also have a problem with the Santo/Rodriguez comparison... I'm a Cubs fan and have only heard "Hammerin' Hank" used in reference to Henry Aaron... Henry Rodriguez was "Oh Henry!". Also, if you're explaining equivalencies, why not pick two players who play the same position, or swing from the same side of the plate? It just seems like this is a case where, all things being equal, there were examples of equivalency that would be less open to questions.

Then, to cap it off, you say that "armchair analysts have spent countless hours perfecting new and different translation methods", which seemed to be a bit dismissive. There are other elements of a condescending tone in this piece.

The article, besides elements of the tone and the organization, was pretty understandable and the writing style was easy to read. I just wanted to see it structured better and for the focus to be sharper without the writing style trying to be so "cute".

May 24, 2009 01:30 AM