Navigation: Jump to content areas:


Pro Quality. Fan Perspective.
Login-facebook
Around SBN: An Explanation For Some Of The Perplexing HOF Snubs

THERE IS CLUTCH! (or, The Case of the Lucky Lefties)

Okay…maybe I overstated my case, but I do have a somewhat scientific way to show that left-handed power hitters hit relatively better with runners on base than with bases empty compared with right-handed power hitters. 

 

Here’s my theory:

Due to the location of first base (the right side of the infield), teams employ large shifts for left-handed power hitters when it is possible, frequently placing three infielders on the right side of the diamond.  However, when there are runners on base, it is more difficult to position infielders in such a way that minimizes the hitter’s chance of hitting safely if they hit the ball in play.  The result is that for a given batting average, a left-handed power hitter is actually more likely to get those hits when runners are on base.  These are naturally higher leverage situations in general.  Hence, a given batting line for a left-handed power hitter is more valuable than the equivalent batting line for right-handed power hitters.

 

Star-divide

The process:

I don’t really know the details of the defensive positioning used against every hitter, so the numbers may not be the best reflection of the truth, but even with this noisy data, I was actually able to find significant results anyway. 

 

I found the top 20 non-switch hitters in career slugging percentage who started their careers after 1956 (the era for which Baseball-Reference.com has data on splits) and who had over 3000 career PA.  There were 12 righties and 8 lefties, and I checked their BABIP with runners on base and with bases empty.  The 12 righties hit an average of .009 points better with runners on base than with bases empty (the fact that they hit better is unsurprising, considering batting with runners on is correlated with batting against poorer pitchers and batting against fielders who are not positioning themselves only to limit the chance that the hitter reaches base, but also to avoid stolen bases and complete double plays).  However, the 8 lefties hit an average of .022 points better with runners on base than with bases empty.  Given the number of balls in play that each group had, this difference was significant at the 95% level.

 

The 12 righties with their respective BABIP with runners on base, BABIP with bases empty are as follows, and difference)

 

Albert Pujols (.317, .321, -.004)

Manny Ramirez (.345, .332, +.013)

Mark McGwire (.265, .247, +.018)

Alex Rodriguez (.325, .319, +.006)

Vladimir Guerrero (.327, .312, +.015)

Albert Belle (.288, .299, -.011)

Juan Gonzalez (.304, .305, -.001)

Frank Thomas (.311, .298, +.013)

Mike Piazza (.316, .312, +.004)

Miguel Cabrera (.354, .336, +.018)

Jeff Bagwell (.325, .308, +.017)

Frank Robinson (.300, .290, +.010)

 

The 8 lefties:

 

Barry Bonds (.308, .267, +.041)

Todd Helton (.346, .327, +.019)

Larry Walker (.331, .332, -.001)

Jim Thome (.327, .317, +.010)

David Ortiz (.324, .283, +.041)

Ken Griffey (.297, .285, +.012)

Carlos Delgado (.303, .303, .000)

Jason Giambi (.327, .277, +.051)

 

The totals were that the twelve righties hit .314 on 30,470 balls in play with runners on and .305 on 31,796 balls in play with bases empty.  The eight lefties hit .319 on 20,161 balls in play with runners on and .297 on 22,752 balls in play with bases empty. 

 

Obviously, the story could just be that David Ortiz, Jason Giambi, and Barry Bonds are uber-clutch and happen to left handed, but I would think that the shift is the reason.  To check whether they just happened to be better hitters with runners on base, I checked homerun percentage by the lefties and the righties—and the righties had 6.5% homeruns with runners on, and 6.6% with bases empty, and the lefties had 6.3% homeruns with runners on, and 6.8% homeruns with bases empty.

 

Maybe, I’m not using the right statistic, so for the sake of transparency, here’s how I developed the standard deviation I used to develop the t-statistic: the 20 hitters combined to hit .316 with runners on base and .302 with bases empty.  This established the variance with runners on base as .316*(1-.316) and with bases empty as .302*(1-.302).  I divided each of these by the number of balls in play that righties had with runners on base and bases empty, respectively.  That was the variance for the righties.  Then I divided .316*(1-.316) and .302*(1-.302) by the number of balls in play that lefties had with runners on base and bases empty, respectively.  That was the variance for the lefties.  I added these and took the square root and found that the standard error should be .0058.  Given that the difference between the .022 that the lefties hit better with runners on base and the .009 that the righties hit better with runners on base was .014, this established a t-stat of .013/.0058=2.30.  There is only a 2.2% chance this would happen by randomness—making this a statistical significant difference.

 

All in all, I think this is a sign that perhaps using purely sabermetric theories which assume that no hitter is significantly better than others at performing in the clutch, and thereby evaluating hitters’ cumulative numbers, is not accurate.  This is not quite a testament to the glory of the RBI, but it demonstrates some of the value in looking at situational context hitters as different.  I tend to doubt the impact of psychology on performance and mainstream sportswriters frequent use of antiquated ideas of clutchness, but there are probably characteristics that affect hitters’ abilities to succeed in important situations.  The characteristic presented here may be one of them.

Comment 21 comments  |  0 recs  | 

Do you like this story?

Comments

Display:

Excellent Work

As soon as you stated your theory, I thought, “Ah… that makes complete logical sense.” Unsurprising that the statistics you pulled yielded significant results.

However, I wonder what the data looks like with a larger sample size, perhaps lowering the PA standard to 2,500 (or even 2,000) instead of 3,000.

http://crashburnalley.com/

by Crashburn Alley on Jan 30, 2009 2:25 AM EST reply actions  

Nice study

Never thought of that before

by Brendan Scolari on Jan 30, 2009 3:09 AM EST reply actions  

Interesting and thought-provoking

Thanks for sharing this!

http://www.thegoodphight.com

by WholeCamels on Jan 30, 2009 9:40 AM EST reply actions  

Great

It’s one of those things that after you suggest it, it makes perfect sense. Teams employ a shift for a reason. When they can’t employ a shift, the hitter should be able to get hits more frequently. I’m checking some of the stat sites, but I can’t find any one the details BABIP with base runners in different positions. Teams will still employ shifts with a runner on second provided that there is not also a runner on third.

by uneasy rider on Jan 30, 2009 11:07 AM EST reply actions  

It might be interesting to track how the 2009 Phillies potentially being “too left-handed” may work to their benefit (i.e., balancing the detrimental effects of being susceptible to LOOGYs vs. the potential benefit of the “lucky lefty” phenomenon discussed here).

http://www.thegoodphight.com

by WholeCamels on Jan 30, 2009 11:17 AM EST reply actions  

Thanks, everyone.

Crashburn— I only used 3000 PA since that was the cutoff on Baseball-Reference to be eligible for a rate stat on the career leaderboard. I think adding more data is wise. I just didn’t want to go further down the list I was approaching guys like Utley who don’t regularly get the shift (or at least they didn’t until recently). I wanted to keep it to guys with .550+ slugging percentages, so fewer PA is the way to go. In reality, I think identifying who was actually shifted against would be the best methodology but I’m not sure who. But I know Ortiz, Bonds, and Giambi, who definitely had the largest gaps, and Howard didn’t have 3000 PA but he has a huge on base vs baes empty gap as well. I’m pretty sure at least Thome and Deglado were shifted against too, but clearly there is enough variance that they didn’t get the same realized difference as the other guys.

Uneasy Rider— Good way of looking at it. I think that might actually justify looking at pitchers who always pitch out of the stretch. After all, if most pitchers do use the windup with bases empty specifically because they prefer it, they must be not pitching their best with runners on. For given BB/K/HR rates, pitchers who use the same motion when there are runners on base or not might be less prone to surrender runs with runners on base. Just another thought of this kind of thinking. As to which sites detail the BABIP— Baseball-Reference is great with that. If you look at the yearly stats, you can actually look at sOPS+ which tells you how good they are relative to the rest of the league in those particular circumstances. For instance, everyone hits bad down 0-2 in the count, but Jimmy Rollins might not be anywhere near as bad as other hitters so his sOPS+ would come out over 100.

Whole Camels— that’s an interesting idea. The only hitters who get shifted against are Utley and Howard, so it’s relatively limited. This logic doesn’t hold for light hitting lefties. The effect is probably there though.

If the difference is actually about .022 points of batting average for lefties and .009 for righties, then that means that about 1.5 more hits per year would come with runners on base for lefties, and that might make about 1 run of difference a year per. It’s not a huge effect but keep in mind that if a one-win player is worth $5MM, then 1 run is worth $500K, which makes Ryan Howard and Chase Utley probably $500K more valuable than their slash stats would suggest. That’s not insigificant.

As to whether it would counteract the LOOGY usage, I’m not sure. I think that if Ryan Howard hits .230 vs LHP and .300 vs RHP then that means he gets 1 hit fewer every extra 14 at-bats he gets against lefties. I think I found in a previous article that he gets about 10% more at-bats against lefties than the average Phillie, so that would mean he gets about 4 fewer hits and 4 more outs than he would have if he got the same number of at bats against lefties, which probably amounts to about 4 runs or so. The question is how much of that is because of the lefties batting around him. Maybe it actually is about 1/4 of that, so these two effects might actually cancel. Very back of the envelope approximation in this paragraph, but I think it’s roughly true.

by Matt Swartz on Jan 30, 2009 11:52 AM EST reply actions  

Interesting study

Very interesting, and a very reasonable hypothesis. As others have already said, thanks for sharing this.

I haven’t been able to think it all the way through but I am not sure that your method is correct. Thank you for sharing what you did, otherwise criticism (constructive I hope!) would be impossible.

I think you are trying to test for the difference of a difference, – change in BABIP men on base vs bases empty between RH and LH hitters. This is hard and makes my head hurt to think about. I think in this case your estimates for variance are likely too low because uncertainty in the estimate of BABIP difference clouds the estimate of RH vs LH BABIP difference.

You could test the RH vs LH hitters separately using paired sample t-tests. A sample pair being each batter’s BABIP with men on vs bases empty. It would be interesting to see what the significance of the advantage is for rights compared to lefties. Is the effect for lefties markedly stronger?

This might also allow more hitters to be included as each is his own control.

by astoddard on Jan 30, 2009 12:00 PM EST reply actions  

I think that's what I did?

I think I used a paired sample t-test. Well maybe not paired, but I think the formula is adjusted when the samples aren’t the same size. I haven’t done statistics courses in a few years, but I had a lot and I think I used the right number. If someone knows, please let me know if this was right.

I had four samples:
—n(LM) & n(RM) of balls in play for lefties and righties with runners on, and n(LE) & n(RE) of balls in play for lefties and righties with bases empty.
—The hits in play in each of these subsamples were x(LM), x(RM), x(LE), and x(RE).

I computed the following statistics:
—p(LM)=x(LM)/n(LM), probability of a hit on a ball in play for lefties with men on base
—p(RM)=x(RM)/n(RM), probability of a hit on a ball in play for righties with men on base
—p(LE)=x(LE)/n(LE), probability of a hit on a ball in play for lefties with bases empty
—p(RE)=x(RE)/n(RE), probability of a hit on a ball in play for righties with bases empty
—p™=(x(LM)+x(RM)) / (n(LM)+n(RM)), probability of a hit on a ball in play for all of the guys in the sample with men on
—p(TE)=(x(LE)+x(RE)) / (n(LE)+n(RE)), probability of a hit on a ball in play for all of the guys in the sample with bases empty

The variance of the outcome of a ball in play with men on was:
—var™=p™)
and with bases empty
—var(TE)=p(TE)
(1-p(TE))

So the standard error for the righties should have been
SE®=square root of(var™/n(RM) + var(TE)/n(RE))
and similarly for the lefties to get SE(L)
SE(L)=square root of(var™/n(LM) + var(TE)/n(LE))

Then I got the standard deviation of the difference by taking the square root of (SE(L)2 + SE®2). That’s the .0058 value in the article. Since the difference was .013, seems like the t-stat should be .013/.0058=2.30 which is statistically significant.

Can someone check my work, because I’m not 100% sure but I felt pretty confident I was doing it right at the time?

by Matt Swartz on Jan 30, 2009 12:32 PM EST up reply actions  

fighting the shift

What I’ve always wanted to see is some kind of analysis of bunting into the shift. Color guys always say “Howard could lay one down right here and walk to first base,” but of course he never does, ostensibly because his job is to hit for power, not bunt and just get to first. But if you’re of the mentality that OBP = good, then taking a near-automatic pass to first should be a good thing, right? So I’d like to see a study on the likelihood of scoring in an inning where your power hitter swings away into a shift, vs. where he bunts and you assume say a 75% chance of getting to first.

by Carlos del Vaca on Jan 30, 2009 12:08 PM EST reply actions  

This is an interesting idea but it would be extraordinary difficult to test the mathematics of this. It would require determining the effect of bunting x% of the time on the fielders positioning and the resulting BABIP. Then you’d need to test the bunting success of these hitters. I think that’s more of a qualitative analysis you’d use to check that stuff.

by Matt Swartz on Jan 30, 2009 4:35 PM EST up reply actions  

# of outs is an important factor

If you look at league performance with RISP (I’m a Twins fan, so I was looking at AL numbers), you see this:

.261/.324/.412, 3128 AB/team — 2008 AL, none on
.297/.362/.452, 749 AB/team — 2008 AL, RISP, <2 outs
.245/.350/.386, 659 AB/team — 2008 AL, RISP, 2 outs

This, to me, indicates at the very least, that situational pitching/defense occurs and has a tangible effect on the league-wide numbers. The question, as always, exists as to whether or not this change is noticeably different from hitter to hitter, pitcher to pitcher, or defense to defense, but I think that any study that deals with this issue has to acknowledge that the league-wide average change substantially depending on the game situation.

I think you are definitely headed in the right direction on this by considering bases empty vs. men on, but given the drastic changes that the number of outs can have on the league-wide averages, I think that the generic “men on” split is a bit muddy.

A couple of other comments:

1) I think that it would be a good idea to look at batting average in addition to BABIP. If you insist on using the term clutch, it also matters how often a hitter makes contact in those situations.

2) FWIW, I think that these discussions are generally more productive if they are framed in the context of situational hitting rather than clutch hitting. Clutch ruffles a lot of feathers because it has to do with the perception of who has the largest testicles, and things like that, whereas I think people are more receptive to the idea that certain hitters have skills that allow them to succeed in different situations, regardless of their intestinal fortitude.

3) A sample that includes non-elite hitters might be nice, too. (And larger samples in general are going to be more convincing.)

4) I’m not sure about BABIP, but I do know that Joe Mauer sees his batting average improve substantially from none on to RISP/2 (.302/.352) and no one really plays a shift on him. Justin Morneau also sees his average improve from none on to RISP/2, but some teams do play the shift on him.

by ubelmann on Jan 30, 2009 2:33 PM EST reply actions  

Thanks for the comments. Here are my thoughts.

1) I’m not sure to what extent league-wide differences would matter in this situation. Both groups are pretty similar in terms of distribution across era and across leagues. Is there something systematically biased about those results? If this approach is not credible, it’s because there is something different about the two groups of hitters that is correlated with BABIP split differences like the ones I have described above, but different than the main difference I am discussing above. I think that league differences would be an example, but I’m not really sure the AL and NL differ in their men on/bases empty splits or that the groups above are composed differently.

2) The number of outs thing is interesting, but I counted sacrifice flies as outs in my BABIP formula to avoid some of the large differences you describe with <2 outs vs 2 outs. I’m also guessing that the hitters above had a pretty even distribution of at-bats with differing number of outs. Again, you do hit on another cause for variance in outcomes, but I’m not quite sure that the hitters differ with respect to their at-bats in those situations so that probably doesn’t affect the value of the t-statistic above.

3) Yes, I don’t mean “intestinal fortitude” clutch. I was being facetious. The idea was that the hitters had a naturally tendency to succeed in higher lever situations. For instance, Ryan Howard’s skill level might dictate that he could be expected to hit .265/.365/.565 next year but he may actually hit something like .255/.355/.555 with bases empty and .285/.385/.585 with runners on bases, thus provided more value than .265/.365/.565 suggests, since if a right handed slugger who also hits .265/.365/.565 might hit .260/.360/.560 with bases empty and .275/.375/.575 with runners on. Thus, the lefty is ‘clutch’ by design rather than the intestinal fortitude.

4) I don’t think non-elite hitters are not a useful sample. The whole idea is that context-neutral performance but lefty sluggers is misguided since they will systematically achieve more in higher leverage situations.

5) Larger samples are more helpful, but I do have 130,000 balls in play in my sample and the t-stat considers sample size. As Crashburn wrote above, it would be useful to check hitters with less than 3000 PA. I’d love a list of hitters who regularly were shifted against for the majority of their careers. That might help.

6) Using BABIP instead of AVG was precisely because I don’t believe that clutchness exists on much of a significant level in high leverage situations, and that the ability to make contact and the ability to hit homeruns should not be affected. That’s why I checked and found that the homerun rates did not also reflect a similar skill for lefties— in fact, there was a slightly negative “clutchness” in homerun hitting by lefties though not significant I don’t think. It was specifically hitting through the shift that made these guys so valuable.

by Matt Swartz on Jan 30, 2009 4:54 PM EST up reply actions  

timely

I have been getting the sense that Howard’s arb argument will rest in part on his better numbers with runners on. The absence of the shift once it’s a RISP situation would seem to explain it, though I wonder if at some point teams will decide it’s worth shifting against Howard even with a guy on second, letting him take third.

In any event, great work.

by dajafi on Jan 30, 2009 4:13 PM EST reply actions  

Haha, I hope Howard’s agent Casey Close doesn’t read TGP. I’d hate to give Ryan Howard the break he needs to make the $4MM extra, and take away the budget for a midseason pitching upgrade or something. Unless he’s hiring entry level Econ PhDs. In that case, read away, Casey! :)

by Matt Swartz on Jan 30, 2009 4:58 PM EST up reply actions  

Don’t lefties have a big advantage when there’s a runner on first base? The first baseman has to hold the runner leaving the big hole over there for the lefty hitter to take advantage. Maybe I missed it here. That could explain some things anyway.

by smitty99 on Feb 1, 2009 11:50 AM EST reply actions  

right, right. that’s some of the reason. my point is not that lefties are stronger willed, but that they succeed relatively more often with runners on base, precisely because of their actual dominant hand. the 1B has to hold the runner on first if there is a runner there, the SS needs to stay near 2B in case he steals, if there’s a runner on second, the 3B can’t be too far from third. all of these requirements make it harder to defend against david ortiz, ryan howard, and jason giambi in high leverage situations in the way that you’d like to with no runners on.

by Matt Swartz on Feb 1, 2009 12:21 PM EST reply actions  

Gotcha. The only reason I mention the first base thing is defenses do that regardless of the hitter. Whether it’s a guy who is shifted for like Howard or not. Lefty hitters have advantages because most pitchers are right handed as well. That’s one reason the worries regarding the Phils’ supposed lefty dominant lineup are a little over blown in my view.

by smitty99 on Feb 1, 2009 2:22 PM EST reply actions  

The lefty dominant lineup is probably overblown only in the sense that I think the effects on bullpen selection only affects the Phillies by a run or two per year. Left handed hitters numbers accurately reflect the fact that most pitchers are right-handed already. Ryan Howard’s context-neutral statistics are that he is a .950 OPS hitter because he faces mostly righties, so he is no more or less valuable than a .950 OPS righty who also naturally faces mostly righties. Obviously if 70% of pitchers would lefties, Howard would be worse off and Manny (or whoever you think is a .950 OPS righty) would be better off, but then Manny would have a 1.000 OPS and Howard would have a .900 OPS. Their numbers reflect the distribution of pitchers already.

The point of this exercise was to launch interest in the question as to when context-neutral statistics are not the best thing to evaluate performance. For instance, if I said that Burrell hit .222 with RISP a few years ago, most of us would have said that was a silly way to predict his performance with RISP the following year and that the best approximation for doing so would be his total average. That’s true. But there can still be reasons hitters systematically hit worse in certain situations. I guess my point is partly that David Ortiz’s reputation as super-clutch is actually true but not because of mental willpower. It’s because he pulls everything and that’s more likely to be a hit with runners. Ortiz actually homers less frequently with runners on base. He gets way more hits because he his hardly hit balls to rightfield systematically turn into hits when it’s more important and outs when it’s less important.

Sabermetric orthodoxy would say that the added run value a hitter will provide can be discerned from their AVG/OBP/SLG/SB/CS and adding in maybe some baserunning stats. I’m saying that’s not quite right. You need to consider the fact that some hitters will hit slightly better in higher leverage situations for systematic reasons rather than post hoc intestinal fortitude reasoning. Ryan Howard is probably about $500K more valuable (1 run) than his context-neutral stats would suggest. So is David Ortiz. So is Jason Giambi. Because these hitters will systematically overperform the implied run value of their AVG/OBP/SLG/SB/CS stats, they are more valuable than the right handed equivalent.

You make a good point about the lefty hitters all getting to hit more with runners on first base. Some thoughts on that:
1) I would guess that those situations are higher leverage on average— certainly higher leverage than no one on, but maybe lower leverage than bases loaded or ducks on the pond— first base occupied does somewhat depend on whether a that underweights situations where there’s a runner on 2nd or 3rd or both, or even a slow runner on 2nd allowing the 1B to play off the bag. I still think you’re right though.
2) Also, I wonder if righties also are more likely to ground into double plays because they hit more of their groundballs to infielders with stronger arms.
3) I’m not sure how often non-power hitters pull the ball. Looking through numbers, it seems like non-power hitters frequently hit the ball the other way nearly as often as they pull it. I’d love to get more data on this, but there seems to be no real way to collect this kind of stuff. I have an email out to Sean Forman at Baseball-Reference about getting more of this data. I would absolutely love if anybody knew how to get a hold of it!!

by Matt Swartz on Feb 1, 2009 3:14 PM EST up reply actions  

Good stuff Matt. Bill James online has some charts regarding where guys hit the ball. They are all individual charts though so it would take a lot of work to compile the data you’re looking for I think.

One thing regarding Ortiz. He was really clutch for a couple of seasons but he wasn’t for most of his career.

I think the stuff you are doing is really great and a lot better than just blindly accepting “there’s no such thing as clutch” and leaving it at that. It’s always a good thing to keep looking at stuff from a new and fresh perspective and see if we can figure out a little more about what’s what. It’s really what makes baseball the greatest game ever. We’ll never come close to knowing everything.

One thing for future study might be to see if there is a type of hitter who hits best in a bases loaded situation. I always wondered about that ever since I saw how incredible Pat Tabler was in those situations. Because Tabler really wasn’t very good. But you always read about his gaudy batting with the sacks full stats (He really was awesome).

http://www.baseball-reference.com/pi/bsplit.cgi?n1=tablepa01&year=00

by smitty99 on Feb 1, 2009 4:25 PM EST reply actions  

B-R has splits data too, but it’s a matter of tabulating it for hundreds of hitters. I did it for about 20 people to see how long it would take me to finish and realize that I’d need to be copying and pasting for a few days straight to even compile the data. I’ll check out Bill James online but it’s probably still the same issue.

Ortiz was bound to fluctuate around some mean. Cumulatively he hits .041 points better with runners on, which is about the same as Bonds and Giambi. It’s a little on the high side, but I think it’s indicative that there’s a trend there that amounts to about 1 run more valuable than a right-handed Oritz would be over the course of a season.

That’s absolutely amazing about Tabler. I have no idea what to even say about that. Thanks for that link. Hitting .489 in 88 at-bats with the bases-loaded for a .282 hitter is ridiculous. The average hitter hits 29 points better with bases loaded. hitting 207 points differently in 88 at-bats by chance would only happen about 1 out of every 5,000 times. I guess about 1 out of every 5,000 hitters is bound to be this lucky, but it does raise a big question mark as to whether that’s actually the best conclusion.

by Matt Swartz on Feb 1, 2009 5:03 PM EST up reply actions  

I wonder though if the Tabler type — the high averafge, low walk, low power hitter — a guy who isn’t really all that good — might be one of the better hitters to have in the bases loaded situation. Pitchers really hate to walk in a run. So the lack of patience doesn’t hurt as much in this situation. And the high contact rate may be of use here. It’s a theory anyway.

by smitty99 on Feb 1, 2009 5:37 PM EST reply actions  

Comments For This Post Are Closed


User Tools

Analysis and features focusing on Philadelphia Phillies baseball.

FanPosts


Blog Lords

Wholecamels_small WholeCamels

Boys_small jonk

198222_nlds_reds_phillies_baseball_small FuquaManuel

Dsc04697_small David S. Cohen

Meltingface_small dajafi

Phillyfriar__new2__small PhillyFriar

Associate Blog Lords

Bugs_small taco pal

Greg_luzinski_small Wet Luzinski

Cptjackalbatross_small RememberthePhitans

Phillies1980logo_small schmenkman

Madmen_icon_small lizroscher

Blogger Emeritus

Colevatar_small Matt Swartz