Throwing Strikes: How Important Is It?

The title is a little misleading, as throwing strikes is important. If you don’t throw strikes you can’t win, simple. But, how important are those few extra percentage of strikes you throw. 

I looked at multiple categories of numbers, and looks to see if there was a correlation with strike percentage. I only graphed the two extremes, and the one with the least correlation to keep this from getting out of hand. To the graph-mobile!
Click on graphs for larger view
strk_bb.png
That one should have been obvious, right? The more strikes you throw, the less you walk people. Next up:

Read more of this post

Strikeouts and Pitch Counts

Back in the days when men were men, nobody worried about pitch counts. If you tried to take Ed Walsh out of a game after 100 pitches, he’d probably tell you he had at least another 200 pitches in him. Despite the current efforts of Nolan Ryan to go back ye olden days of not counting pitches, the current way of the world is that every other team does it, so we might as well pay attention to it.

These days, a pitcher is usually taken out after around 100-110 pitches, give or take a few. Often times, this means taking out a guy throwing a shutout in the 6th inning because he had reached the 100 pitch limit. There are two ways around this: letting the pitcher throw more pitches (which could potentially increase the risk of injury), or becoming more efficient (i.e., throwing fewer pitches per at bat). I’m here to talk about the latter.
 
The conventional wisdom goes something like this: It takes at least three pitches to strike someone out, but only one is required to get a groundball out. Therefore, a pitcher could decrease his pitch count by not attempting to strike out as many hitters as he can. That seems good enough for most people, but the astute readers of StatSpeak know that this can’t end there. A strikeout results in an out 100% of the time (ignoring the rare dropped third strike), but a ball in play results in an out only 71% of the time. That “other 29%” results in more batters coming to the plate, which results in more pitches having to be thrown to those additional batters. On one hand, we have more strikeouts leading to more pitchers per at bat, but also leading to fewer batters coming to the plate. On the other, we have more balls in play (fewer strikeouts) leading to fewer pitches per at bat, but also leading to more batters coming to the plate. Which of the effects is stronger?

Thanks to the work of Tom Tango, it has been shown that the average strikeout requires 4.8 pitches, the average walk takes 5.5 pitches, and if the plate appearance results in batter contact then it takes an average of 3.3 pitches (the data he used are all publicly available, by the way). Before you say “but so and so is different,” these numbers have been tested against extreme pitchers here. So these averages apply very well to all pitchers, whether they follow the norm, or if they are unusual cases like Randy Johnson and Brad Radke. We can use these estimates to see how an increased strikeout rate affects a pitch count.

How about a real-life test of the estimator? Joba Chamberlain has received some criticism from mainstream media-types about needing to be more efficient with his pitches, so he’s as good an example as any (and since he’s a Yankee, I know this will get on Pizza Cutter’s nerves). Joba has faced 256 batters this year, striking out 55 and walking 28. Plug those numbers into the formula (remember to subtract K’s and BB’s from the total batters faced when using the formula), and you get just under 989 pitches thrown. How many has he actually thrown this year? 984. I hope that difference of only 5 pitches helps to ease your concerns about accuracy. 

Now back to the question at hand. Prorated to 9 innings, this is a fairly typical pitching line: 9 innings, 6 strikeouts, 4 walks, and one home run. If 30% of balls in pay fall in for hits, that also means that there are 10 hits allowed in those 9 innings. In that “typical” game, a pitcher is expected to throw 153.1 pitches in 9 innings. What about games that aren’t normal, like one where the pitcher racks up a ton of strikeouts?

Here’s an extreme example: Take the exact pitching line from above, but change strikeouts from 6 to 27. So the new pitching line is 9 IP, 27 strikeouts, 4 BB, 1 HR. Using the formula above, the pitcher would be expected to throw 154.9 pitches. The effect is actually smaller than that, and here’s why: If a pitcher strikes out 27 batters, would you really expect the ONLY guy to make contact to hit the ball over the fence? When a pitcher is that dominating, what are the chances that he’d give up a home run at a rate of one per 9 innings? I’d say very slim. Fewer balls in play means fewer fly balls, which in turn means fewer home runs, and fewer pitches. So the real pitch count would be lower than 154.9, but for simplicity’s sake I’m going to call it even.

Let’s look at the other extreme–a pitcher who doesn’t strike out a single batter all game. Such a pitcher would be expected to allow a little over 12 hits per game. His expected pitch count for a game that included 4 walks, no strikeouts, and 12 hits including one home run would be 151.2 pitches. The caveat above about home runs also applies here, but in the opposite direction–a pitcher who has ever batter put the ball in play on him would likely allow more than one home run per 9 innings, so he’d likely throw slightly more than 151.2 pitches.

So what did we learn from this exercise? Even in the most extreme cases, striking out lots of batters will not increase your pitch count by any noticeable effect. Even when comparing two pitchers with polar opposite strikeout tendencies, the difference comes out to fewer than four pitches per 9 innings, with the real-life effect likely being even smaller than that due to the home run issue mentioned above. Next time you hear someone saying that a pitcher needs to “pitch to contact” in order to decrease his pitch count, you’ll know that it makes no difference. 

Creating a dynamic FIP with BaseRuns

If you’re interested in starting a fistfight at the next SABR convention (not that I’m advising this) simply start bringing up DIPS in casual conversation loudly enough and I’m sure you can get something going. Voros McCracken set up the sabermetric version of the “less filling, tastes great” argument when he wrote:

There is little if any difference among major-league pitchers in their ability to prevent hits on balls hit in the field of play.

Suffice it to say that not everyone agrees with this.

But what everyone does agree on is that pitchers have far less control over the outcome of a ball in play than they do over the so-called Three True Outcomes: the walk, the strikeout and the home run.

From this, McCracken constructed dERA, essentially a run estimation model that attempts to isolate a pitcher’s performance from that of his defense.

For those looking for a quick-and-dirty shortcut for dERA, Tom Tango’s FIP is generally relied upon:

(13*HR+3*BB-2*K)/IP+3.2

3.2 is the league factor that puts FIP on the same scale as ERA.

FIP is also often used as a sort of component ERA, to estimate a player’s ERA from his projected component stats. There is, of course, Bill James’ Component ERA for those purposes as well. (Confoundingly enough, Component ERA is traditionally abbreviated ERC. Since "Earned Runs Created" describes what ERC is and does perfectly, that’s what I tell myself ERC stands for.)

So I decided to run a comparison of some of these run estimators.

Read more of this post

Recapping the BIP

Before even getting into the meat of this article, no, the title does not refer to Bip Roberts… so I’ll understand if hardcore fans of his are now turned off.  What the title does refer to, however, is balls in play and how they pertain to the statistics BABIP, FIP, and ERA.  I have written a lot here and on my other stomping grounds of late about how some of these statistics are affected and, seeing as it is a holiday weekend with not much interweb usage, it seemed like the logical time to recap everything into one neat package.  For starters, what are these three statistics?
BABIP: Batting Average on Balls In Play is a statistical spawn of the DIPS theory discovered by Voros McCracken at the turn of the century.  Essentially Voros found that pitchers have next to no control over balls put in play against them, which is why certain pitchers would surrender a ton of hits one year and much less the next.  From a control standpoint, the goal of the pitcher would be to get an out.  Once a ball is put in play, unless it is hit right back to the pitcher many defensive aspects have to coincide for an out to result.  Take a groundball for instance, one between shortstop and third base: both fielders have to understand whose territory the ball occupies and that fielder has to have the proper range in order to field it, all in a very short amount of time. 
There are plenty of other variables as well but what should be clear is that the pitcher has no control over them.  He may have control over sustaining a certain percentage of balls in play each year but the hits that result are almost entirely out of his hand.  In fact, the only aspects of pitching over which he has any type of control are walks, strikeouts, and home runs allowed.  Everything else is dependant on the fielding and luck.
BABIP is calculated by dividing the Hits minus Home Runs by the Plate Appearances excluding Home Runs, Walks, Strikeouts, and Sacrifice Flies.  If Player A has 30 hits out of 90 at-bats he will post a .333 batting average.  But if 8 of those 30 hits are home runs and 8 of the outs are strikeouts, in BABIP terms he would be 22 for 74, or .297.  This explains that, of all balls put in play–any hit or batted out other than a home run–29.7% fell in for hits.
FIP: a creation of Tom Tango’s, Fielding Independent Pitching takes the three controllable skills of walks, strikeouts, and home runs allowed, properly weights them, and then scales the result similar to the familiar ERA.  The end result explains what a pitcher’s skillset suggests his ERA should be around.  Someone with an ERA much lower than their FIP is usually considered to be lucky while the inverse is also true.  The statistic is kept at Fangraphs and ERA-FIP was recently added as well in order to allow readers a glimpse at those under- or overperforming their controllable skills.
ERA: arguably the most popular pitching barometer, ERA can be calculated by multiplying the earned runs of a pitcher by nine and dividing that product by the total number of innings pitched.  While not a terrible stat it suffers from some pretty drastic noise.  For starters, what are earned runs?  The surname ‘earned’ implies there are other runs that can be given up and that these must satisfy a specific criteria.  For instance, if a fielder botches a routine play with two outs, and the pitcher then gives up seven runs, none will be earned because the inning was extended by the poor play of the fielder.  This gets into all sorts of questions regarding exactly what an error is and how that factors into a pitcher’s performance.
Earned runs are also a direct result of hits, which have been proven to be largely accrued through chance via the DIPS theory.  So, if pitchers cannot control the percentage of hits they give up on balls in play, then fluctuations in hits can either inflate or deflate an ERA regardless of the pitcher’s skill level.  Therefore the FIP is more indicative of performance level because it only measures the three aspects of pitching he has control over which should not suffer from much fluctuation at all, as Pizza Cutter showed not too long ago that these skills were some of the quickest to stabilize.
Controlling BABIP
At Fangraphs we occasionally call upon a statistic we titled xBABIP, which refers to what the BABIP of a pitcher can be expected to be given his percentage of line drives.  Dave Studeman found a few years back that the general range of BABIP could be predicted with very good accuracy by adding .12 to the LD%; if a pitcher surrendered 22.1% line drives his xBABIP would be ~.341.  Using this for predictive purposes would not be correct due to the fact that the general baseline for pitchers is .300.  What we can do is evaluate performance at a given time and attribute line drives to a rather high or low BABIP.  For instance, saying that Player B’s BABIP of .275 as of today primarily due to his ultra-low 14-15% LD rate would be correct; saying that it will continue like this would not.  The line drive percentage may change as the season goes on.  In summation, we can use something like this when evaluating the past for pitchers but not the future.
David Appelman showed not too long ago that, in 2007, 15% of flyballs fell in for hits, 24% of grounders turned into hits, and a whopping 73% of line drives also followed suit.  Due to this, the ideal xBABIP calculation would be .15(FB) + .24(GB) + .73(LD).
I have done studies here recently, and Jonathan Hale at Baseball Digest Daily has done others in the past as well, that show how aspects like velocity, movement, and location can all affect the BABIP of a given pitcher.  It also been shown, again by Studeman, that elite relievers have the ability to consistently post lower BABIPs than others.  More studies have shown that pitchers, if any, have very weak control over their BABIP but instead of deeming it control I would be more inclined to say that these pitchers are merely taking advantage of “cold spots.” 
If just 15% of flyballs result in hits and such a large number of line drives do, then we could intuitively expect someone with consistently low LD rates and higher FB rates to post lower BABIPs.  From a movement perspective, I found that those with above average vertical movement in different horizontal movement subgroupings post lower BABIPs as well.  Higher vertical movement usually correlates to flyballs, and voila, flyballs have the lowest percentage of hits.
This was just a recap of the three statistics and explanations pertaining to their usage.  Based on this, if we see someone like Carlos Zambrano, whose ERA consistently beats his FIP, based on consistently posting lower BABIPs, we could somewhat safely assume that he might not be controlling anything persay but rather taking advantage of all the aspects proven to result in lower BABIPs.  His controllable skills may not be as good as his ERA would suggest but movement, velocity, and location may have combined to greatly aid his efforts.

Does Movement Influence BABIP?

A couple weeks back, Pizza Cutter found an interesting oddity in that Troy Percival had consistently posted very, very low BABIPs. In response, Dave Studeman brought up Mariano Rivera–another pitcher with consistently low BABIPs–and how it has been somewhat proven that elite relievers can register atypical results with this statistic. Mentioned on a few other sites was the idea that movement may be a central cause for these lower batting averages on balls in play; due to said movement, the sweet part of the bat would fail to meet the ball as it normally would on more “standard” pitches.
Last week, we explored the relationship between fastballs 92+ mph and BABIP, examining how it differed at each mile per hour interval. 92 mph to 96 mph clocked in between .290-.310–the established general range of BABIP for pitchers–before dipping to .273 at 97 mph and shooting back up to .293 for all thrown 98 mph or higher. The 97 and 98+ groups were too small in their sample sizes to definitively fail the 5% hypothesis; we would need around 1,650 balls in play and, combined, had 1,032. Still, the combo of 97 and 98+ offered a .279 BABIP, perhaps suggesting that the .293 at 98+ was the anomaly, not the .273.
Today we will look at the movement within the same 92+ mph range in order to attempt to answer the question posed in the title. First, though, a pre-requisite of sorts with regards to movement: the relationship between horizontal and vertical components is not extremely known yet other than some telltale signs aiding in the classification of pitches. For instance, a two-seam fastball will have much higher horizontal movement than vertical movement; however, four-seam fastballs generally have lower horizontal movement and higher vertical movement.
I queried my database for all fastballs 92+ mph and separated the results into groups by movement rather than velocity intervals. The signs (+-) were reversed so that righties and lefties could be grouped together as well. First, here is a sample size grid of sorts, showing all balls in play for each horizontal group and vertical subgroup; note that the subgroups differ for each horizontal movement grouping so they will be called simply below average or above average as they were essentially determined by the average or a similar type of cutoff point. The reasoning for this is the aforementioned relationship between movement components; for fastballs, lower horizontal movement will usually correlate with higher vertical movement with the inverse also being true.

Horizontal

Below Vert BIP

Above Vert BIP

0-4 in

3,735

2,456

4-8 in

6,823

4,718

8-12 in

4,355

3,227

12+ in

408

335

BABIP takes a while to stabilize, moreso than many other statistics, so I wanted to have at least 2,000 balls in play for each sub-grouping, preferably more. From 0-12 inches of horizontal movement we have large enough samples to notice discrepancies. Greater than 12 inches, however, offers just 743 balls in play. While I definitely plan to explore this and the velocity articles later in the year when more data is available, for now, I am going to exclude the group with more than 12 horizontal inches.
Looking at the other three groups and their two subgroupings each, here are the Ball%, Strike%, HR%, and BABIP:

Horiz.

Vert.

B%

K%

HR%

BABIP

0-4

Below

35.9

45.6

0.53

.289

0-4

Above

34.9

49.8

0.48

.286

4-8

Below

35.8

43.7

0.64

.302

4-8

Above

35.8

48.2

0.58

.292

8-12

Below

35.6

41.4

0.54

.315

8-12

Above

36.5

45.6

0.58

.298

The percentage of balls essentially stays in the same general range while the strikes fluctuate. The subgroupings with above average vertical movement have much higher strike percentages than others. So, judging by this it seems before we even get to BABIP, that higher vertical movement in these larger groups result in a higher percentage of strikes.

The BABIPs for horizontal movement groups with below average vertical movement register: .289, .302, and .315. The BABIPs for horizontal movement with above average vertical movement clock in at: .286, .292, .298. Judging from these results it would appear that, yes, movement does have some type of effect on BABIP. Each horizontal group posted higher counts when they had below average vertical movement, and at every interval as well; .289 to .286, .302 to .292, and .315 to .298. Additionally, all pitches 92+ mph with 0-4 inches of horizontal movement, regardless of whether or not they fell above or below the vertical cutoff point, produced a BABIP lower than .290, which is generally the lower edge of the .290-.310 range we expect it to fall into.

Tomorrow I’ll come right back with the total number of unique pitchers and those comprising at least 1% and at least 5% of the sample, in order to see if the results are skewed in any way. For now, though, it appears that, regardless of your horizontal movement, having above average vertical movement will produce a lower BABIP at each horizontal interval.

Heater Getting Hotter

Yesterday we looked at the averages of fastballs from different velocity groups as a means to compare certain pitchers to their like-throwing peers as opposed to an extremely broad group.  This way, we can compare Matt Cain’s movement to the average movement for all 94 mph fastballs to determine how effective it has been.
In doing so an anomaly surfaced: all velocity groups had a BABIP between .290-.310 except those thrown 97 mph.  Those heaters registered a .273 BABIP, nearly 20 points below the others.  Sure enough, fastballs registering 98 mph or higher jumped back to .293, leading many of us to believe something screwy, flukey, or any other adjective ending with the suffix “-y” slapped on its end, was taking place.  After exploring some logical possibilities, like a split-half reliability test, or a look at BABIP by count and location, the results either stuck or were inconclusive due to small sample sizes at work.
We had a really nice discussion in the comments section wherein more possibilities were tossed around.  The first of these suggestions involved testing the sample size via a Bernoulli Trial.  As was shown by commenter Adam Guetz, for an observed .273 when a .295 was expected, we would need approximately 1,650 balls in play.  For 97 mph pitches there were 707 balls in play, less than half of what is required, and just 325 balls in play for 98+ mph.  While the sample sizes of actual pitches thrown are large enough to conduct certain analyses, those of balls in play for anything 97 mph or higher were not.  Here are the BIP sample sizes:

  • 92 mph, 18.85 % BIP and 7,759 total
  • 93 mph, 18.05% BIP and 6,023 total
  • 94 mph, 18.05% BIP and 4,389 total
  • 95 mph, 17.04% BIP and 2,827 total
  • 96 mph, 17.26% BIP and 1,596 total
  • 97 mph, 16.69% BIP and 707 total
  • >98 mph, 16.11% BIP and 325 total

The samples from 92-96 appear large enough, but the combination of 97 and 98+ still comes a good 500 pitches below 96 mph on its own.  Another suggestion called for the total number of different pitchers as each interval as well as the number of those comprising certain percentages of the samples.  This way, we might be able to deduce that 97 mph pitches were skewed due to a small group representing the whole; for the lower velocities, which are more common, it is much more likely for the pitches to be more evenly divided amongst a larger group of pitchers.  Here are the number of pitchers for each group, those comprising 1% of the sample, and those comprising 5% of the sample:

  • 92 mph: 574 total pitchers, 8 at 1%, 0 at 5%
  • 93 mph: 485 total pitchers, 18 at 1%, 0 at 5%
  • 94 mph: 516 total pitchers, 21 at 1%, 0 at 5%
  • 95 mph: 337 total pitchers, 25 at 1%, 0 at 5%
  • 96 mph: 237 total pitchers, 28 at 1%, 1 at 5%
  • 97 mph: 160 total pitchers, 25 at 1%, 4 at 5%
  • >98 mph: 102 total pitchers, 18 at 1%, 8 at 5%

In the 97 mph group, the four pitchers with at least 5% of the sample combine to represent 23% of the total.  For 98+ mph, the eight pitchers with at least 5% of the sample combine to represent 56% of the total.
From these results it seems that 92-96 mph are safe from a drastic case of small sample size syndrome.  Anything abobe 97 mph, though, seems to be the opposite as they suffer from a small sample of balls in play as well as skewed results due to a small group of pitchers representing most of the total pitches. 
Another commenter, Dave Evans, pointed out that he received a significance of 0.55 when comparing 97 and 98+, meaning their BABIPs were not statistically significantly different; for significance, that value would need to be equal to or below 0.01.  This led me to group 97 and 98+ together, to enlarge the sample.  The result was 1,032 balls in play, 288 hits in play, and a .279 BABIP.  This suggested the possibility that perhaps it was not 97 mph that deserved the adjective+suffix “-y” treatment but rather 98+ mph pitches.  Granted, it is still a small sample, even moreso for BABIP, but perhaps we will find out, as more data becomes available, that 97 mph is the threshold, as Pizza Cutter noted, for “blowing it by the hitter.”
It will require several hundred more pitches in play to determine this with any certainty but I will be keeping very close tabs as the season progresses.  For now, though, we can effectively compare individual pitchers to the average movement components, B%, K%, and BABIP for their specific velocity, not an entire group, at least for heaters 92 mph to 96 mph.

Breaking Down the Heater

Back on December 20th, John Walsh wrote a very interesting article at The Hardball Times, taking everything recorded by the Pitch F/X system in 2007 and, amongst others, calculating the average velocity, horizontal movement, and vertical movement for the four major pitches: fastball, curveball, slider, and changeup.  The results showed that the average fastball clocked in at 91 mph with -6.2 inches of horizontal movement and 8.9 inches of vertical movement.  The author acknowledged that he did not differentiate between four-seamers, two-seamers, and cutters, but rather lumped them all together in determining the averages; two-seamers and cutters differ in velocity and movement components from four-seamers.

While I plan on calculating the averages for all different sub-groupings of pitches at some point, what recently piqued my interest was finding the averages for different velocity groupings.  As in, what is the average horizontal movement for all 94 mph fastballs?  Or, the BABIP for 98 mph fastballs? 
With that knowledge we could effectively compare certain pitchers to the means of their velocity grouping rather than overall averages of every grouping.  Instead of comparing, say, Edwin Jackson’s 94 mph fastball to a group including those who throw slower, we can compare him to his “peers.” 
I started at 92 mph and queried my database for groupings (92-92.99, 93-93.99, etc) all the way up until 98+ mph.  I figured 92 mph would be a solid starting point since the sample size would be extraordinarily large–large enough for four-seamers to overcome the two-seamers and cutters that may inevitably sneak in.  Anything 98 mph or higher was grouped together to ensure a large enough sample since, as you will see below, the higher the velocity, the smaller the sample:

Velocity

Sample

%

92 mph

41,157

31.4

93 mph

33,368

25.5

94 mph

24,315

18.6

95 mph

16,586

12.7

96 mph

9,245

7.1

97 mph

4,236

3.2

>98 mph

2,018

1.5

All of the sample sizes here were large enough for analysis.  Even though the 98+ group appears to be 1/20th the size of the 92 mph group, that speaks more for the latter than against the former.
Next, how do the movement components look for each group?

Velocity

Horiz.

Vert.

92 mph

-6.34

9.24

93 mph

-6.28

9.51

94 mph

-6.16

9.80

95 mph

-5.98

10.07

96 mph

-5.84

10.23

97 mph

-5.89

10.41

>98 mph

-6.03

10.38

It should be fairly apparent that the tendency is for horizontal movement to decrease and vertical movement to increase as the velocity increases, at least through 96 mph.  At 97 mph, both movement components increase.  At 98+ mph, the vertical movement stays stagnant while the horizontal movement jumps quite a bit.
The next area to discuss includes B%, K%, HR%, and BABIP:

Velocity

B%

K%

HR%

BABIP

92 mph

35.9

44.6

0.65

.302

93 mph

36.3

45.1

0.55

.303

94 mph

35.5

45.9

0.55

.292

95 mph

35.8

46.4

0.76

.303

96 mph

35.2

47.0

0.54

.291

97 mph

36.1

46.8

0.41

.273

>98 mph

33.9

49.3

0.69

.293

The percentage of balls doesn’t move too much until its dip of over two percentage points at 98+ mph.  The amount of strikes, however, seems to increase.  There is no real discernible pattern in the home run percentages; the most came on 95 mph heaters while the least came on those registering 97 mph.

Speaking of the 97 mph group, notice anything odd?  Perhaps that their BABIP is .273, a full eighteen points below any other group?  Prior to getting the results I expected each group to fall somewhere in the .290-.310 range; that all of them did except the .273 struck me as very peculiar.

I spoke to several other analysts, all of whom initially mentioned small sample size syndrome, only to redact the assessment after learning the sample sizes in question.  The dropoff in home run percentage was tossed around, as well, since less home runs means more balls in play to be counted in the BABIP formula.  This is a “could be,” though, rather than a “definitely why.”  As was mentioned in these discussions, too, it could be nothing; perhaps there were more warning track flyballs that just missed leaving the yard as opposed to weaker hit balls.

Now, while the 4,236 pitches at 97 mph constitutes a large enough sample to analyze, the balls in play were not large enough yet to break into individual counts or locations.  When they do get big enough this could serve as a means of explanation; perhaps something in either or both does not jive with the other velocity groups.  Of those with significance, however, there was a .263 BABIP on 0-0 counts, and a .286 BABIP on pitches in the middle of the strike zone.

Pizza Cutter, or “The Master of Statistical Reliability” as I like to call him (yeah, a nickname for a nickname), suggested that BABIP is one of those stats that is super-unreliable, even with my large sample of pitches.  I did a split-half reliability test, randomly splitting the sample in half, and calculating the BABIP of each half.  For those unfamiliar, this serves to test the reliability of the sample; if it truly is large enough then no matter how we cut the sample in half we will have fairly convergent results.  If the results were wildly divergent then we are dealing with an unreliable sample.  The BABIPs of the two groups were .271 and .275, which essentially threw that idea out of the window.

Something interesting to consider was how, in each of these tables, all patterns seemed to stop when they reached 97 mph or higher.  The horizontal movement increased instead of its decreasing trend; vertical movement decreased after its increase at 97; the percentage of strikes ceased increasing; and home runs reached their low.  Could be something, could be nothing, but interesting nonetheless.

For now I am going to chalk this BABIP drop as an extreme random statistical variation and hope that you loyal readers out there might chime in with some more ideas to investigate.  Otherwise, though, when gauging the movement components, percentage of balls/strikes/home runs, or even BABIP, we can compare individual pitchers to their “like-minded” averages by velocity grouping.  If I get enough feedback involving different aspects to measure regarding these fastballs we will look at that soon, in the next day or two.  Otherwise, next week I have something similar to this, looking at BABIP by movement.

Juuust A Bit Outside

On Thursday we took a look at the pitchers with the highest percentage of Pitch F/X-recorded pitches right down the middle of the plate.  I listed the top thirty out of the 165 pitchers with significant numbers and found that Ted Lilly of the Cubs has thrown the highest percentage; on top of that, the next pitcher on the list found himself relatively far off.  Today we are going to look at the opposite: The pitchers with the highest percentage of pitches outside the zone.
Now, outside the zone calls for four general parameters: very high, very low, outside to the left, and outside to the right.  I feel like I’m typing the Cha-Cha slide.
For now I am going to focus on the left/right parameters outside the strike zone, and we will explore high/low a bit later in the year as I have other ideas centering around those parameters.  As discussed previously, the strike zone on a general pitch location chart goes from -0.83 to 0.83 on the horizontal axis and 1.6 to 3.5 on the vertical axis.  To track pitches down the middle the axis numbers were set much smaller.  To track pitches outside the zone the horizontal axis numbers branch out in different directions.  For pitches outside to the left I set my database to give me all pitches with a PX (horizontal location in the data) less than -1.55 as well as greater than +1.55.
This provided me plenty of pitches to analyze but keep in mind that the data was not insanely consistent last year with regards to who gets recorded and where the recording takes place.  This year it has become more consistent and uniform but there may be data discrepancies due to some players having insufficient data.  For instance, Player A might be known to throw a ton of pitches out of the zone but, because the Pitch F/X system did not track many of his starts, he might not qualify. 
To help ensure the pitchers in the below leaderboard did not fall into this statistical fallacy, a minimum of 240 raw pitches was set.  That certainly whittled the list down.  The total tracked pitches were then recorded for all remaining pitchers, and they were then sorted by % instead of raw total.  Here are the top ten:
1) Livan Hernandez, 15.83%
2) Derek Lowe, 12.43%
3) Jake Peavy, 12.39%
4) Chad Gaudin, 11.56%
5) Braden Looper, 11.56%
6) John Smoltz, 11.40%
7) Jamie Moyer, 11.20%
8) Justin Germano, 11.01%
9) Jeff Francis, 10.73%
10) A.J. Burnett, 10.71%
I did not necessarily predict that Livan would be atop this leaderboard but, at the same time, it was not very surprising to find his name there, with a significant lead over the next pitcher nonetheless.  Moyer didn’t surprise me either as he’s a notorious “junkballer.”  Here are 11-20:
11) Jarrod Washburn, 10.59%
12) Carlos Zambrano, 10.05%
13) Shaun Marcum, 10.03%
14) Tim Hudson, 9.78%
15) Javier Vazquez, 9.66%
16) Kevin Millwood, 9.63%
17) Jose Contreras, 9.54%
18) Miguel Batista, 9.33%
19) Roy Halladay, 9.29%
20) Vicente Padilla, 9.14%
Something really interesting here is the emergence of Burnett, Marcum, and Halladay.  I noted in the comments on Thursday that, of pitchers with significant data, Burnett, Marcum, and Halladay were in the bottom ten of percentage of pitches thrown right down the middle; here they are in the top twenty of pitches thrown outside the zone.  I noted at Fangraphs a week or two ago that the Blue Jays rotation, arguably the best in the bigs both last year and this year, consisted of three guys (McGowan, Marcum, Litsch) who threw four or five different pitches at least 10% of the time, somewhat of an extreme rarity.  Additionally, Halladay has a potent three-pitch combo, and Burnett has a plus-fastball and plus-curveball.
Put together it seems like the Blue Jays pitchers are spreading their pitch selections quite liberally, rarely making mistakes in throwing the ball right down the middle, and not worrying about being outside the strike zone.  Perhaps this means nothing with regards to their performance, but it is interesting nonetheless that a rotation like this appears in the leaderboards in three different areas of selection/location.
As we get deeper into the season enough data will be compiled to look at both down the middle and outside pitches solely for 2008, when the data is tracked in each park.  For now, though, we’ll have to settle with Ted Lilly and Livan Hernandez.  If only those two faced each other this year.

Right Down the Middle

Last week I took a look at the relationship between pitches and home runs, checking to see if there were any noticeable discrepancies between those that sail out of the stadiums and those that do not.  The results showed that fastballs turned into souvenirs when they came in with lesser velocities and movements as well as with poor location; breaking balls were hit out when they hung in the zone.

While conducting these analyses I became very interested in pursuing the idea of mistake pitches and balls thrown not just in the zone but right down the middle.  Of all the balls that were hit for home runs from the top home run surrendering pitchers this year, at least 80% were smackdab in the middle of the plate.  Since this piqued my interest I decided to check out which pitchers threw down the middle most often.
The strike zone, in Pitch F/X terms, is generally -0.85 to 0.85 on the horizontal axis and 1.6 to 3.5 on the vertical axis.  I went smaller, looking at pitches in the middle of that zone, as evidenced by this picture:

strikezone.JPG


Probing my database for pitches in the smaller box–what I would consider to be down the middle–I found a ton of pitches.  Keep in mind, though, that the results below are from pitches tracked by the Pitch F/X system; there are some pitchers that might have a higher total or percentage but did not have the luxury of having their relevant data recorded.
I found 165 pitchers with a significant number of pitches down the middle.  Luckily, in terms of using neat/even numbers in a list, the top 30 percentages happened to consist of everyone with at least 14% of their pitches thrown down the middle.  Here are the top ten:
1) Ted Lilly, 18.6%
2) Paul Byrd, 16.7%
3) Josh Beckett, 16.3%
4) Micah Owings, 16.1%
5) Tim Lincecum, 15.9%
6) John Danks, 15.8%
7) Felix Hernandez, 15.7%
8) Greg Maddux, 15.5%
9) Joe Blanton, 15.5%
10) Justin Verlander, 15.4%
Lilly threw just about two percent more pitches down the middle than his closest competitor whereas #2-#10 were separated by a total 1.3 percent.  Numbers 11-20:
11) Andy Sonnanstine, 15.3%
12) Kevin Millwood, 15.2%
13) Cole Hamels, 15.1%
14) Aaron Harang, 15.0%
15) Brian Bannister, 14.8%
16) Daisuke Matsuzaka, 14.7%
17) Vicente Padilla, 14.7%
18) Matt Cain, 14.7%
19) Javier Vazquez, 14.7%
20) Randy Wolf, 14.6%
And the last group with at least 14% of their pitches down the middle:
21) Brad Penny, 14.5%
22) Roy Oswalt, 14.5%
23) Johan Santana, 14.4%
24) Nate Robertson, 14.3%
25) Ervin Santana, 14.2%
26) Miguel Batista, 14.2%
27) Jon Garland, 14.1%
28) John Lackey, 14.1%
29) CC Sabathia, 14.1%
30) Jarrod Washburn, 14.0%
Unfortunately, just as David Appelman found a couple of years ago, there is not much correlation between pitches thrown down the middle and, well, anything else at all.  I thought there might be something significant between down the middle pitches and line drives–it’s been theorized before that line drives might correlate quite well with mistake pitches–but, alas, there was not; at least not yet.
Additionally, I would like to explore this at the end of this season, or perhaps further into the year, when all pitchers would have the same (or close to it) amount of data recorded.  For now, though, at the very least, it’s somewhat interesting to see which pitchers throw the most down the middle.
On Saturday we will look at the opposite, pitchers who throw the most OUT of the zone and then compare the results (Balls, Called K, Swing K, etc) between pitches down the middle and those out of the zone.

Pitcher fatigue, batted balls, and DIPS

Are you tired, Mr. Starter?  I’ve been asking this question of my magic spreadsheet for a while now, and last week, during my look at fatigue factors like pitch count, days of rest, mileage on the arm for the season, and number of times through the lineup, I promised a follow up study on how fatigue affected what happens to a batted ball.  Here it is.
I isolated all batted balls from 2000-2006 (isolated is a strong word: that’s still 500K+ events!)  Like in my last two articles, I first calculated the batter’s and pitcher’s GB rate, LD rate, and FB rate (including pop ups), and then the expected probabilities of each for such a matchup, using the odds ratio method.  Then I took the natural log of that number.  I entered this into a binary logit regression as a control for batter and pitcher tendencies, and then on top of that entered my fatigue variables.  Pitch count was present in all the regressions, then one of the other fatigue variables (rest, mileage, times through order, number of pitches thrown last time out), and then the interaction of pitch count and whatever fatigue measure was under consideration.  (In my day job, we call this a modeator analysis.)
First, what effect does pitch count have on the batted ball profile in general.  As the game wears on, there’s an effect for pitch count such that grounders go down and fly balls go up.  How poetic.  Line drives don’t seem to respond to pitch count.
Let’s talk about mileage.  Like last week, I kept the sample to guys who went a maximum of 10 days between starts on the season to get rid of players who had “hidden mileage” in AAA or the bullpen, plus those who had been injured.  The results: as the season wears on, and the pitcher has more pitches on that arm, there’s a very real effect.  When the ball comes off the bat, it’s more likely to be a flyball and less likely to be a grounder.  Of course, the effect is not going to overwhelm a pitcher’s own tendencies to induce ground balls, but it’s certainly not going to help.  Tired pitchers get the ball up in the zone more often and that’s more likely to be hit in the air.  Now, last week, I didn’t find that there was a significant increase in home runs (or anything other than walks), in terms of outcomes, but batted ball percentages are usually much more statistically stable than are outcome measures.  It’s possible that what used to be ground ball outs are becoming fly ball outs, but I’m not convinced.
Next, let’s talk about rest.  Turns out that days of rest didn’t have any effect whatsoever on the batted ball profile, once you control for batter and pitcher matchup.  In my previous study from last week, I found that a short-rested pitcher was more likely to give up a home run.  So, while he might not throw any more fly ball pitches, his pitches that do go for fly balls are more likely to leave the yard.  A ground ball pitcher likely wouldn’t have the same problem, because… well a mistake on a ground ball pitch is going to just be a slightly harder hit ground ball… maybe a better chance for a single.  This also brings up the old chestnut about starting sinkerballer pitchers on short rest because their ball is “heavier” and sinks better, leading to more ground balls.  I decided that was worth a look.  I restricted my sample to pitchers who had GB percentages over 50% for the year.  That’s not an exact proxy for sinkerballers, but I’m guessing there’s a few sinkers and splitters being thrown there.  The result: no effect.  Looks like the sinker is sinking any more heavily because of the short rest, but it doesn’t seem to harm the pitcher either.
Finally, let’s talk about the number of times that the pitcher has seen this guy before in this game.  Because I’m controlling for the expectations of this pitcher/batter matchup and for pitch count, any effects for time through the lineup are likely due to the pitcher actually gathering intelligence about the batter.  Are there effects?  Yes, there are.  As the lineup cycles around more, the batter is more likely to hit a ground ball and less likely to hit a line drive.  (That’s a good trade for the pitcher!)  So, it looks as though the pitcher is actually gathering some sort of intelligence on the batter and is perhaps gaining some small advantage.  Oftentimes, Sabermetric analysis has a tendency to reduce at bats to simple agglomorations of probabilities.  Here’s some evidence that we need to take a look at the mental aspect of the game.  Of course the batter and pitcher are trying to learn about one another.  It looks like the pitcher is the one who has the advantage.  Perhaps a pitcher gets the batter out with his “stuff” early and his brain late.
One more area of interest.  Does fatigue affect DIPS?  For a long time, it’s been assumed that balls in play went for hits at a rate that had more to do with the defense than the pitcher.  That’s been based mostly on season-to-season intercorrelations.  But, what about within a game?  The answer is… yes, there is an effect.  At lower pitch counts, a ball in play is less likely to be a hit, again, controlling for batter/pitcher rates.  Additionally, there’s an effect for number of times through the lineup (already controlling for the fact that there will be a pitch count effect.)  So, we would expect that starters who are efficient with their pitch count to have a lower BABIP overall.  Fresher pitchers throw pitches that are better able to be turned into outs.  This might explain my question concerning Troy Percival.
There are a few more factors that could be studied.  I didn’t consider age (younger pitchers probably bounce back faster) nor body type (through BMI?), and I haven’t yet looked at relievers.  And then there’s the work that’s sure to come from the Pitch F/X folks who can break this stuff down on a molecular level.