The High Boskage House Baseball-Analysis Web Site
baseball team and player performance examined realistically and accurately

  email me search site site directory  

The HBH Baseball-Analysis Formula
and Real-World Results


Summary of Results

Full results appear farther down this page, but first this summary:

The graph farther below shows the results of applying the full High Boskage House formula for runs scored (derived from team Plate Appearances, Hits, Total Bases, Walks, Steals, Caught Stealing, and Sacrifice Bunts) to almost a half-century, 48 full years, of actual major-league data: all teams in all years from 1954 through 2001 inclusive. As to why those year limits: at the early end, there were scoring-rule changes in prior years that make certain raw data suspect or unavailable; at the late end, that's when I did the job and I simply haven't had the time or energy to add on (making the graph is the worst part); and that's it.

Shown below are actual team runs scored, predicted team runs scored (rounded to the nearest whole number), and the difference between prediction and actuality as a percentage. As expected, the average size of runs projection errors is well under 3% (about 2½%). For those who know statistical theory, the Standard Deviation is 22.31 runs (without any regression "curve-fitting" cooking of the results).

The overall results are these:

For 1138 Team-Seasons Evaluated:
"Expected" Average Error Size: 19.92 runs/team/season 2.84%
Actual Average Error Size: 17.58 runs/team/season 2.51%
Cumulative Error: -0.02 run/team/season circa 0%

The "Error Sizes" disregard whether the error is high or low--they measure its size. The "Cumulative Error" allows plus and minus errors to cancel; as we should expect, it is virtually zero, far less than a run a team a season.

(The statistical formulae used to calculate those data are shown at the bottom of this page.)



Full Results

And now the year-by-year, team-by-team numbers. (Note: "Error" rates from years of less than 162 games--such as 1981--will, of course, be a bit larger than typical.) Since "a picture is worth a thousand words", here is a graph of the results: the red line is exact accuracy, and, as you can see, the results are a truly beautiful approximation to that red line.


Projected-vs.-Actual Runs Graph

Notice especially that accuracy remains excellent at the extremes: down at 329 annual runs (the lowest) and also up at 993 annual runs (the second highest), there are points bang on the line. That indicates a truly general predictive method of accuracy, not just one that is OK where the data bunch up.

For those who might want to see those data in tabular form, we have provided it on a separate page (separate owing to its length--it may take some time to fully download). If you want to check it, though here's the tabulated data.



The Calculations

The methodology of the TOP (projected-runs) calculation is explained elsewhere on this site.

As to the statistical measures:

The expected error is calculated using the standard statistical probability formulae. The expected average error is 79.79% of one Standard Deviation. The Standard Deviation for any one team-season is, in turn, the square root of npq, where n is the number of data samples, p is the probability of a success, and q is the probability of a failure (by definition, then, q=1-p). For this tabulation, a "success" is a run scored and a "data sample" is a batter at the plate. Thus, the probability of a success is the calculated expected runs divided by the team's total of batter plate appearances (we use the calculated, not the actual, runs because we want to know the S.D. for the projection, although in practice it would matter little which was used). So, the expected average error per team-season is just:

err = 0.7979 x SquareRoot(PA x (TOP/PA) x (1 - TOP/PA))

We then average the individual expected errors for an overall average expected error figure (the individual expected-error figures per team-season will be rather similar because neither PAs nor TOPs vary all that much from one to another).






You loaded this page on Friday, 3 July 2009, at 5:14 pm EDT;
it was last modified on Monday, 16 March 2009, at 1:14 am EDT.

Site Mechanics:

Search this site:


Custom Search
(the usual Google search rules apply)


Site Directory:

 This site's Front Page
 Late News about the site



(team and player performance evaluations, updated daily)
The Performance Stats:
    Team Measures:
    Player Measures:

(meanings and explanations of the things on this site)
Baseball-Analysis Background:
    For You Rookies:
 what this site is all about--what it is telling you about baseball, and how, and why
    Some Baseball Analysis Theory:
 a semi-technical backgrounding on modern baseball analysis
    Baseball Stat Definitions:
 the standard and the unique statistics we present
    The "Quality of Pitching" Measures:
 why they are the best way to evaluate pitching
    The SillyBall:
 why baseball before and after 1993 is really two different games
    Fielding and Defense in Baseball
 how important defense is or isn't in baseball, and how to correctly evaluate it
    Baseball Data Normalization:
 why raw stats need "correction", and how and why we can and cannot apply it
    "Steroids" and Other "Performance-Enhancing Drugs":
 why just about everything you think you know about them is wrong
(now a full-fledged site of its own)



(miscellaneous but not unimportant)
Some Miscellaneous Information:
    The Team-Performance Table
 there is a lot in that Table, and this explains what it all is
    The HBH Baseball-Analysis Formula Tested
 what we get when we apply it to half a century of team stats
    The Pitfalls of Park Factors
 an explicit, detailed demonstration of how and why they are so dubious
    About High Boskage House
 who we are and why we might know what we're talking about
    Links To A Select Few Other Useful Baseball Sites
 including those that link to this one



(new, used--find any book, anywhere in the world)
The High Boskage House Baseball Shop:
    What Makes This "Baseball Shop" Special:
    Baseball Books Available Today:


Site Info:

owl logo This site is one of The Owlcroft Company family of web sites. Please click on the link (or the owl) to see a menu of our other diverse user-friendly, helpful sites.       Pair Networks logo Like all our sites, this one is hosted at the highly regarded Pair Networks, whom we strongly recommend. We invite you to click on the Pair link (or their logo) for more information on getting your site or sites hosted on a first-class service.

Comments? Criticisms? Questions?

Please, e-mail me by clicking here.

(Or, if you cannot email from your browser, send mail to webmaster@highboskage.com)

All content copyright © 1999 - 2009 The Owlcroft Company.

This web page is strictly compliant with the W3C (World Wide Web Consortium)
Extensible HyperText Markup Language (XHTML) Protocol v1.0 (Transitional).
Click on the logo below to test us!

So if your browser experiences any difficulties with this page(or, really, even if it doesn't seem to),
just click on the logo below to find out all about (and even get)--


Get the Firefox browser!
(It's free!)



---=== end of page ===---