## Throwing Strikes: How Important Is It?

The title is a little misleading, as throwing strikes is important. If you don’t throw strikes you can’t win, simple. But, how important are those few extra percentage of strikes you throw.

I looked at multiple categories of numbers, and looks to see if there was a correlation with strike percentage. I only graphed the two extremes, and the one with the least correlation to keep this from getting out of hand. To the graph-mobile!
Click on graphs for larger view
That one should have been obvious, right? The more strikes you throw, the less you walk people. Next up:

## Strikeouts and Pitch Counts

Back in the days when men were men, nobody worried about pitch counts. If you tried to take Ed Walsh out of a game after 100 pitches, he’d probably tell you he had at least another 200 pitches in him. Despite the current efforts of Nolan Ryan to go back ye olden days of not counting pitches, the current way of the world is that every other team does it, so we might as well pay attention to it.

These days, a pitcher is usually taken out after around 100-110 pitches, give or take a few. Often times, this means taking out a guy throwing a shutout in the 6th inning because he had reached the 100 pitch limit. There are two ways around this: letting the pitcher throw more pitches (which could potentially increase the risk of injury), or becoming more efficient (i.e., throwing fewer pitches per at bat). I’m here to talk about the latter.

The conventional wisdom goes something like this: It takes at least three pitches to strike someone out, but only one is required to get a groundball out. Therefore, a pitcher could decrease his pitch count by not attempting to strike out as many hitters as he can. That seems good enough for most people, but the astute readers of StatSpeak know that this can’t end there. A strikeout results in an out 100% of the time (ignoring the rare dropped third strike), but a ball in play results in an out only 71% of the time. That “other 29%” results in more batters coming to the plate, which results in more pitches having to be thrown to those additional batters. On one hand, we have more strikeouts leading to more pitchers per at bat, but also leading to fewer batters coming to the plate. On the other, we have more balls in play (fewer strikeouts) leading to fewer pitches per at bat, but also leading to more batters coming to the plate. Which of the effects is stronger?

Thanks to the work of Tom Tango, it has been shown that the average strikeout requires 4.8 pitches, the average walk takes 5.5 pitches, and if the plate appearance results in batter contact then it takes an average of 3.3 pitches (the data he used are all publicly available, by the way). Before you say “but so and so is different,” these numbers have been tested against extreme pitchers here. So these averages apply very well to all pitchers, whether they follow the norm, or if they are unusual cases like Randy Johnson and Brad Radke. We can use these estimates to see how an increased strikeout rate affects a pitch count.

How about a real-life test of the estimator? Joba Chamberlain has received some criticism from mainstream media-types about needing to be more efficient with his pitches, so he’s as good an example as any (and since he’s a Yankee, I know this will get on Pizza Cutter’s nerves). Joba has faced 256 batters this year, striking out 55 and walking 28. Plug those numbers into the formula (remember to subtract K’s and BB’s from the total batters faced when using the formula), and you get just under 989 pitches thrown. How many has he actually thrown this year? 984. I hope that difference of only 5 pitches helps to ease your concerns about accuracy.

Now back to the question at hand. Prorated to 9 innings, this is a fairly typical pitching line: 9 innings, 6 strikeouts, 4 walks, and one home run. If 30% of balls in pay fall in for hits, that also means that there are 10 hits allowed in those 9 innings. In that “typical” game, a pitcher is expected to throw 153.1 pitches in 9 innings. What about games that aren’t normal, like one where the pitcher racks up a ton of strikeouts?

Here’s an extreme example: Take the exact pitching line from above, but change strikeouts from 6 to 27. So the new pitching line is 9 IP, 27 strikeouts, 4 BB, 1 HR. Using the formula above, the pitcher would be expected to throw 154.9 pitches. The effect is actually smaller than that, and here’s why: If a pitcher strikes out 27 batters, would you really expect the ONLY guy to make contact to hit the ball over the fence? When a pitcher is that dominating, what are the chances that he’d give up a home run at a rate of one per 9 innings? I’d say very slim. Fewer balls in play means fewer fly balls, which in turn means fewer home runs, and fewer pitches. So the real pitch count would be lower than 154.9, but for simplicity’s sake I’m going to call it even.

Let’s look at the other extreme–a pitcher who doesn’t strike out a single batter all game. Such a pitcher would be expected to allow a little over 12 hits per game. His expected pitch count for a game that included 4 walks, no strikeouts, and 12 hits including one home run would be 151.2 pitches. The caveat above about home runs also applies here, but in the opposite direction–a pitcher who has ever batter put the ball in play on him would likely allow more than one home run per 9 innings, so he’d likely throw slightly more than 151.2 pitches.

So what did we learn from this exercise? Even in the most extreme cases, striking out lots of batters will not increase your pitch count by any noticeable effect. Even when comparing two pitchers with polar opposite strikeout tendencies, the difference comes out to fewer than four pitches per 9 innings, with the real-life effect likely being even smaller than that due to the home run issue mentioned above. Next time you hear someone saying that a pitcher needs to “pitch to contact” in order to decrease his pitch count, you’ll know that it makes no difference.

## Creating a dynamic FIP with BaseRuns

If you’re interested in starting a fistfight at the next SABR convention (not that I’m advising this) simply start bringing up DIPS in casual conversation loudly enough and I’m sure you can get something going. Voros McCracken set up the sabermetric version of the “less filling, tastes great” argument when he wrote:

There is little if any difference among major-league pitchers in their ability to prevent hits on balls hit in the field of play.

Suffice it to say that not everyone agrees with this.

But what everyone does agree on is that pitchers have far less control over the outcome of a ball in play than they do over the so-called Three True Outcomes: the walk, the strikeout and the home run.

From this, McCracken constructed dERA, essentially a run estimation model that attempts to isolate a pitcher’s performance from that of his defense.

For those looking for a quick-and-dirty shortcut for dERA, Tom Tango’s FIP is generally relied upon:

(13*HR+3*BB-2*K)/IP+3.2

3.2 is the league factor that puts FIP on the same scale as ERA.

FIP is also often used as a sort of component ERA, to estimate a player’s ERA from his projected component stats. There is, of course, Bill James’ Component ERA for those purposes as well. (Confoundingly enough, Component ERA is traditionally abbreviated ERC. Since "Earned Runs Created" describes what ERC is and does perfectly, that’s what I tell myself ERC stands for.)

So I decided to run a comparison of some of these run estimators.

## Recapping the BIP

Before even getting into the meat of this article, no, the title does not refer to Bip Roberts… so I’ll understand if hardcore fans of his are now turned off.  What the title does refer to, however, is balls in play and how they pertain to the statistics BABIP, FIP, and ERA.  I have written a lot here and on my other stomping grounds of late about how some of these statistics are affected and, seeing as it is a holiday weekend with not much interweb usage, it seemed like the logical time to recap everything into one neat package.  For starters, what are these three statistics?
BABIP: Batting Average on Balls In Play is a statistical spawn of the DIPS theory discovered by Voros McCracken at the turn of the century.  Essentially Voros found that pitchers have next to no control over balls put in play against them, which is why certain pitchers would surrender a ton of hits one year and much less the next.  From a control standpoint, the goal of the pitcher would be to get an out.  Once a ball is put in play, unless it is hit right back to the pitcher many defensive aspects have to coincide for an out to result.  Take a groundball for instance, one between shortstop and third base: both fielders have to understand whose territory the ball occupies and that fielder has to have the proper range in order to field it, all in a very short amount of time.
There are plenty of other variables as well but what should be clear is that the pitcher has no control over them.  He may have control over sustaining a certain percentage of balls in play each year but the hits that result are almost entirely out of his hand.  In fact, the only aspects of pitching over which he has any type of control are walks, strikeouts, and home runs allowed.  Everything else is dependant on the fielding and luck.
BABIP is calculated by dividing the Hits minus Home Runs by the Plate Appearances excluding Home Runs, Walks, Strikeouts, and Sacrifice Flies.  If Player A has 30 hits out of 90 at-bats he will post a .333 batting average.  But if 8 of those 30 hits are home runs and 8 of the outs are strikeouts, in BABIP terms he would be 22 for 74, or .297.  This explains that, of all balls put in play–any hit or batted out other than a home run–29.7% fell in for hits.
FIP: a creation of Tom Tango’s, Fielding Independent Pitching takes the three controllable skills of walks, strikeouts, and home runs allowed, properly weights them, and then scales the result similar to the familiar ERA.  The end result explains what a pitcher’s skillset suggests his ERA should be around.  Someone with an ERA much lower than their FIP is usually considered to be lucky while the inverse is also true.  The statistic is kept at Fangraphs and ERA-FIP was recently added as well in order to allow readers a glimpse at those under- or overperforming their controllable skills.
ERA: arguably the most popular pitching barometer, ERA can be calculated by multiplying the earned runs of a pitcher by nine and dividing that product by the total number of innings pitched.  While not a terrible stat it suffers from some pretty drastic noise.  For starters, what are earned runs?  The surname ‘earned’ implies there are other runs that can be given up and that these must satisfy a specific criteria.  For instance, if a fielder botches a routine play with two outs, and the pitcher then gives up seven runs, none will be earned because the inning was extended by the poor play of the fielder.  This gets into all sorts of questions regarding exactly what an error is and how that factors into a pitcher’s performance.
Earned runs are also a direct result of hits, which have been proven to be largely accrued through chance via the DIPS theory.  So, if pitchers cannot control the percentage of hits they give up on balls in play, then fluctuations in hits can either inflate or deflate an ERA regardless of the pitcher’s skill level.  Therefore the FIP is more indicative of performance level because it only measures the three aspects of pitching he has control over which should not suffer from much fluctuation at all, as Pizza Cutter showed not too long ago that these skills were some of the quickest to stabilize.
Controlling BABIP
At Fangraphs we occasionally call upon a statistic we titled xBABIP, which refers to what the BABIP of a pitcher can be expected to be given his percentage of line drives.  Dave Studeman found a few years back that the general range of BABIP could be predicted with very good accuracy by adding .12 to the LD%; if a pitcher surrendered 22.1% line drives his xBABIP would be ~.341.  Using this for predictive purposes would not be correct due to the fact that the general baseline for pitchers is .300.  What we can do is evaluate performance at a given time and attribute line drives to a rather high or low BABIP.  For instance, saying that Player B’s BABIP of .275 as of today primarily due to his ultra-low 14-15% LD rate would be correct; saying that it will continue like this would not.  The line drive percentage may change as the season goes on.  In summation, we can use something like this when evaluating the past for pitchers but not the future.
David Appelman showed not too long ago that, in 2007, 15% of flyballs fell in for hits, 24% of grounders turned into hits, and a whopping 73% of line drives also followed suit.  Due to this, the ideal xBABIP calculation would be .15(FB) + .24(GB) + .73(LD).
I have done studies here recently, and Jonathan Hale at Baseball Digest Daily has done others in the past as well, that show how aspects like velocity, movement, and location can all affect the BABIP of a given pitcher.  It also been shown, again by Studeman, that elite relievers have the ability to consistently post lower BABIPs than others.  More studies have shown that pitchers, if any, have very weak control over their BABIP but instead of deeming it control I would be more inclined to say that these pitchers are merely taking advantage of “cold spots.”
If just 15% of flyballs result in hits and such a large number of line drives do, then we could intuitively expect someone with consistently low LD rates and higher FB rates to post lower BABIPs.  From a movement perspective, I found that those with above average vertical movement in different horizontal movement subgroupings post lower BABIPs as well.  Higher vertical movement usually correlates to flyballs, and voila, flyballs have the lowest percentage of hits.
This was just a recap of the three statistics and explanations pertaining to their usage.  Based on this, if we see someone like Carlos Zambrano, whose ERA consistently beats his FIP, based on consistently posting lower BABIPs, we could somewhat safely assume that he might not be controlling anything persay but rather taking advantage of all the aspects proven to result in lower BABIPs.  His controllable skills may not be as good as his ERA would suggest but movement, velocity, and location may have combined to greatly aid his efforts.

## Does Movement Influence BABIP?

A couple weeks back, Pizza Cutter found an interesting oddity in that Troy Percival had consistently posted very, very low BABIPs. In response, Dave Studeman brought up Mariano Rivera–another pitcher with consistently low BABIPs–and how it has been somewhat proven that elite relievers can register atypical results with this statistic. Mentioned on a few other sites was the idea that movement may be a central cause for these lower batting averages on balls in play; due to said movement, the sweet part of the bat would fail to meet the ball as it normally would on more “standard” pitches.
Last week, we explored the relationship between fastballs 92+ mph and BABIP, examining how it differed at each mile per hour interval. 92 mph to 96 mph clocked in between .290-.310–the established general range of BABIP for pitchers–before dipping to .273 at 97 mph and shooting back up to .293 for all thrown 98 mph or higher. The 97 and 98+ groups were too small in their sample sizes to definitively fail the 5% hypothesis; we would need around 1,650 balls in play and, combined, had 1,032. Still, the combo of 97 and 98+ offered a .279 BABIP, perhaps suggesting that the .293 at 98+ was the anomaly, not the .273.
Today we will look at the movement within the same 92+ mph range in order to attempt to answer the question posed in the title. First, though, a pre-requisite of sorts with regards to movement: the relationship between horizontal and vertical components is not extremely known yet other than some telltale signs aiding in the classification of pitches. For instance, a two-seam fastball will have much higher horizontal movement than vertical movement; however, four-seam fastballs generally have lower horizontal movement and higher vertical movement.
I queried my database for all fastballs 92+ mph and separated the results into groups by movement rather than velocity intervals. The signs (+-) were reversed so that righties and lefties could be grouped together as well. First, here is a sample size grid of sorts, showing all balls in play for each horizontal group and vertical subgroup; note that the subgroups differ for each horizontal movement grouping so they will be called simply below average or above average as they were essentially determined by the average or a similar type of cutoff point. The reasoning for this is the aforementioned relationship between movement components; for fastballs, lower horizontal movement will usually correlate with higher vertical movement with the inverse also being true.

 Horizontal Below Vert BIP Above Vert BIP 0-4 in 3,735 2,456 4-8 in 6,823 4,718 8-12 in 4,355 3,227 12+ in 408 335

BABIP takes a while to stabilize, moreso than many other statistics, so I wanted to have at least 2,000 balls in play for each sub-grouping, preferably more. From 0-12 inches of horizontal movement we have large enough samples to notice discrepancies. Greater than 12 inches, however, offers just 743 balls in play. While I definitely plan to explore this and the velocity articles later in the year when more data is available, for now, I am going to exclude the group with more than 12 horizontal inches.
Looking at the other three groups and their two subgroupings each, here are the Ball%, Strike%, HR%, and BABIP:

 Horiz. Vert. B% K% HR% BABIP 0-4 Below 35.9 45.6 0.53 .289 0-4 Above 34.9 49.8 0.48 .286 4-8 Below 35.8 43.7 0.64 .302 4-8 Above 35.8 48.2 0.58 .292 8-12 Below 35.6 41.4 0.54 .315 8-12 Above 36.5 45.6 0.58 .298

The percentage of balls essentially stays in the same general range while the strikes fluctuate. The subgroupings with above average vertical movement have much higher strike percentages than others. So, judging by this it seems before we even get to BABIP, that higher vertical movement in these larger groups result in a higher percentage of strikes.

The BABIPs for horizontal movement groups with below average vertical movement register: .289, .302, and .315. The BABIPs for horizontal movement with above average vertical movement clock in at: .286, .292, .298. Judging from these results it would appear that, yes, movement does have some type of effect on BABIP. Each horizontal group posted higher counts when they had below average vertical movement, and at every interval as well; .289 to .286, .302 to .292, and .315 to .298. Additionally, all pitches 92+ mph with 0-4 inches of horizontal movement, regardless of whether or not they fell above or below the vertical cutoff point, produced a BABIP lower than .290, which is generally the lower edge of the .290-.310 range we expect it to fall into.

Tomorrow I’ll come right back with the total number of unique pitchers and those comprising at least 1% and at least 5% of the sample, in order to see if the results are skewed in any way. For now, though, it appears that, regardless of your horizontal movement, having above average vertical movement will produce a lower BABIP at each horizontal interval.

## Heater Getting Hotter

Yesterday we looked at the averages of fastballs from different velocity groups as a means to compare certain pitchers to their like-throwing peers as opposed to an extremely broad group.  This way, we can compare Matt Cain’s movement to the average movement for all 94 mph fastballs to determine how effective it has been.
In doing so an anomaly surfaced: all velocity groups had a BABIP between .290-.310 except those thrown 97 mph.  Those heaters registered a .273 BABIP, nearly 20 points below the others.  Sure enough, fastballs registering 98 mph or higher jumped back to .293, leading many of us to believe something screwy, flukey, or any other adjective ending with the suffix “-y” slapped on its end, was taking place.  After exploring some logical possibilities, like a split-half reliability test, or a look at BABIP by count and location, the results either stuck or were inconclusive due to small sample sizes at work.
We had a really nice discussion in the comments section wherein more possibilities were tossed around.  The first of these suggestions involved testing the sample size via a Bernoulli Trial.  As was shown by commenter Adam Guetz, for an observed .273 when a .295 was expected, we would need approximately 1,650 balls in play.  For 97 mph pitches there were 707 balls in play, less than half of what is required, and just 325 balls in play for 98+ mph.  While the sample sizes of actual pitches thrown are large enough to conduct certain analyses, those of balls in play for anything 97 mph or higher were not.  Here are the BIP sample sizes:

• 92 mph, 18.85 % BIP and 7,759 total
• 93 mph, 18.05% BIP and 6,023 total
• 94 mph, 18.05% BIP and 4,389 total
• 95 mph, 17.04% BIP and 2,827 total
• 96 mph, 17.26% BIP and 1,596 total
• 97 mph, 16.69% BIP and 707 total
• >98 mph, 16.11% BIP and 325 total

The samples from 92-96 appear large enough, but the combination of 97 and 98+ still comes a good 500 pitches below 96 mph on its own.  Another suggestion called for the total number of different pitchers as each interval as well as the number of those comprising certain percentages of the samples.  This way, we might be able to deduce that 97 mph pitches were skewed due to a small group representing the whole; for the lower velocities, which are more common, it is much more likely for the pitches to be more evenly divided amongst a larger group of pitchers.  Here are the number of pitchers for each group, those comprising 1% of the sample, and those comprising 5% of the sample:

• 92 mph: 574 total pitchers, 8 at 1%, 0 at 5%
• 93 mph: 485 total pitchers, 18 at 1%, 0 at 5%
• 94 mph: 516 total pitchers, 21 at 1%, 0 at 5%
• 95 mph: 337 total pitchers, 25 at 1%, 0 at 5%
• 96 mph: 237 total pitchers, 28 at 1%, 1 at 5%
• 97 mph: 160 total pitchers, 25 at 1%, 4 at 5%
• >98 mph: 102 total pitchers, 18 at 1%, 8 at 5%

In the 97 mph group, the four pitchers with at least 5% of the sample combine to represent 23% of the total.  For 98+ mph, the eight pitchers with at least 5% of the sample combine to represent 56% of the total.
From these results it seems that 92-96 mph are safe from a drastic case of small sample size syndrome.  Anything abobe 97 mph, though, seems to be the opposite as they suffer from a small sample of balls in play as well as skewed results due to a small group of pitchers representing most of the total pitches.
Another commenter, Dave Evans, pointed out that he received a significance of 0.55 when comparing 97 and 98+, meaning their BABIPs were not statistically significantly different; for significance, that value would need to be equal to or below 0.01.  This led me to group 97 and 98+ together, to enlarge the sample.  The result was 1,032 balls in play, 288 hits in play, and a .279 BABIP.  This suggested the possibility that perhaps it was not 97 mph that deserved the adjective+suffix “-y” treatment but rather 98+ mph pitches.  Granted, it is still a small sample, even moreso for BABIP, but perhaps we will find out, as more data becomes available, that 97 mph is the threshold, as Pizza Cutter noted, for “blowing it by the hitter.”
It will require several hundred more pitches in play to determine this with any certainty but I will be keeping very close tabs as the season progresses.  For now, though, we can effectively compare individual pitchers to the average movement components, B%, K%, and BABIP for their specific velocity, not an entire group, at least for heaters 92 mph to 96 mph.

## Breaking Down the Heater

Back on December 20th, John Walsh wrote a very interesting article at The Hardball Times, taking everything recorded by the Pitch F/X system in 2007 and, amongst others, calculating the average velocity, horizontal movement, and vertical movement for the four major pitches: fastball, curveball, slider, and changeup.  The results showed that the average fastball clocked in at 91 mph with -6.2 inches of horizontal movement and 8.9 inches of vertical movement.  The author acknowledged that he did not differentiate between four-seamers, two-seamers, and cutters, but rather lumped them all together in determining the averages; two-seamers and cutters differ in velocity and movement components from four-seamers.

While I plan on calculating the averages for all different sub-groupings of pitches at some point, what recently piqued my interest was finding the averages for different velocity groupings.  As in, what is the average horizontal movement for all 94 mph fastballs?  Or, the BABIP for 98 mph fastballs?
With that knowledge we could effectively compare certain pitchers to the means of their velocity grouping rather than overall averages of every grouping.  Instead of comparing, say, Edwin Jackson’s 94 mph fastball to a group including those who throw slower, we can compare him to his “peers.”
I started at 92 mph and queried my database for groupings (92-92.99, 93-93.99, etc) all the way up until 98+ mph.  I figured 92 mph would be a solid starting point since the sample size would be extraordinarily large–large enough for four-seamers to overcome the two-seamers and cutters that may inevitably sneak in.  Anything 98 mph or higher was grouped together to ensure a large enough sample since, as you will see below, the higher the velocity, the smaller the sample:

 Velocity Sample % 92 mph 41,157 31.4 93 mph 33,368 25.5 94 mph 24,315 18.6 95 mph 16,586 12.7 96 mph 9,245 7.1 97 mph 4,236 3.2 >98 mph 2,018 1.5

All of the sample sizes here were large enough for analysis.  Even though the 98+ group appears to be 1/20th the size of the 92 mph group, that speaks more for the latter than against the former.
Next, how do the movement components look for each group?

 Velocity Horiz. Vert. 92 mph -6.34 9.24 93 mph -6.28 9.51 94 mph -6.16 9.80 95 mph -5.98 10.07 96 mph -5.84 10.23 97 mph -5.89 10.41 >98 mph -6.03 10.38

It should be fairly apparent that the tendency is for horizontal movement to decrease and vertical movement to increase as the velocity increases, at least through 96 mph.  At 97 mph, both movement components increase.  At 98+ mph, the vertical movement stays stagnant while the horizontal movement jumps quite a bit.
The next area to discuss includes B%, K%, HR%, and BABIP:

 Velocity B% K% HR% BABIP 92 mph 35.9 44.6 0.65 .302 93 mph 36.3 45.1 0.55 .303 94 mph 35.5 45.9 0.55 .292 95 mph 35.8 46.4 0.76 .303 96 mph 35.2 47.0 0.54 .291 97 mph 36.1 46.8 0.41 .273 >98 mph 33.9 49.3 0.69 .293

The percentage of balls doesn’t move too much until its dip of over two percentage points at 98+ mph.  The amount of strikes, however, seems to increase.  There is no real discernible pattern in the home run percentages; the most came on 95 mph heaters while the least came on those registering 97 mph.

Speaking of the 97 mph group, notice anything odd?  Perhaps that their BABIP is .273, a full eighteen points below any other group?  Prior to getting the results I expected each group to fall somewhere in the .290-.310 range; that all of them did except the .273 struck me as very peculiar.

I spoke to several other analysts, all of whom initially mentioned small sample size syndrome, only to redact the assessment after learning the sample sizes in question.  The dropoff in home run percentage was tossed around, as well, since less home runs means more balls in play to be counted in the BABIP formula.  This is a “could be,” though, rather than a “definitely why.”  As was mentioned in these discussions, too, it could be nothing; perhaps there were more warning track flyballs that just missed leaving the yard as opposed to weaker hit balls.

Now, while the 4,236 pitches at 97 mph constitutes a large enough sample to analyze, the balls in play were not large enough yet to break into individual counts or locations.  When they do get big enough this could serve as a means of explanation; perhaps something in either or both does not jive with the other velocity groups.  Of those with significance, however, there was a .263 BABIP on 0-0 counts, and a .286 BABIP on pitches in the middle of the strike zone.

Pizza Cutter, or “The Master of Statistical Reliability” as I like to call him (yeah, a nickname for a nickname), suggested that BABIP is one of those stats that is super-unreliable, even with my large sample of pitches.  I did a split-half reliability test, randomly splitting the sample in half, and calculating the BABIP of each half.  For those unfamiliar, this serves to test the reliability of the sample; if it truly is large enough then no matter how we cut the sample in half we will have fairly convergent results.  If the results were wildly divergent then we are dealing with an unreliable sample.  The BABIPs of the two groups were .271 and .275, which essentially threw that idea out of the window.

Something interesting to consider was how, in each of these tables, all patterns seemed to stop when they reached 97 mph or higher.  The horizontal movement increased instead of its decreasing trend; vertical movement decreased after its increase at 97; the percentage of strikes ceased increasing; and home runs reached their low.  Could be something, could be nothing, but interesting nonetheless.

For now I am going to chalk this BABIP drop as an extreme random statistical variation and hope that you loyal readers out there might chime in with some more ideas to investigate.  Otherwise, though, when gauging the movement components, percentage of balls/strikes/home runs, or even BABIP, we can compare individual pitchers to their “like-minded” averages by velocity grouping.  If I get enough feedback involving different aspects to measure regarding these fastballs we will look at that soon, in the next day or two.  Otherwise, next week I have something similar to this, looking at BABIP by movement.