Clay Davenport

Last updated

Clay Davenport is a baseball sabermetrician who co-founded Baseball Prospectus (BP) in 1996. He co-edited several of the Baseball Prospectus annual volumes and is a writer for BaseballProspectus.com. Much of his work for BP was behind the scenes, where he maintained and implemented advanced statistics for the website.

Contents

For most of the time during which he contributed to Baseball Prospectus, Davenport's main employment was as a meteorologist. In March 2010, it was announced that he had moved to full-time status at Baseball Propectus. [1] In 2011, he departed Baseball Prospectus, but maintains his own website ClayDavenport.com on which he continues to publish baseball analysis and projections. [2] As he commented there in May 2011:

As my title indicates, this is a place for me to keep some statistics I happen to care about. These are statistics that I've run at Baseball Prospectus for many years, but BP has decided to discontinue them – or at least transform into something I no longer recognize. Baseball Prospectus was founded on the premise that, since no one was publishing the baseball book we wanted to read, we would print one ourselves. In that same spirit, since BP is not publishing the stats I want to see, the way I want to see them, I'll put them up myself. [3]

In a later post, he characterized the reason for his departure from BP:

I'm Clay Davenport, one of the founders of Baseball Prospectus. I still have a (looser than before) affiliation with BP, so don't expect to see me using this site to dish dirt or run anybody into the ground. I'm old enough and stubborn enough to have my own way of doing things, and some of those things are contrary to the way BP wants to do things, which is why I wound up out here. [4]

Davenport is a native of Hampton, Virginia and now lives in Maryland.

Baseball analysis and sabermetrics

Davenport is known for creating the Pythagenport Formula, [5] (designed to find the best exponent for the Pythagoras winning percentage equation), for inventing the statistic Equivalent Average (EqA) (now called "True Average" or "TAv"), and for the "Davenport Translations" or DT's. The DT's are estimated Major League equivalent performance statistics based on player statistics from minor league and international baseball. DT's were first published by Davenport on the rec.sports.baseball Usenet site in 1995, before Baseball Prospectus was founded. [6]

The DT's are also used to standardize the records of players who played in different eras and playing conditions, not only in different leagues and levels of baseball. This allows comparison, for example, of the number of home runs hit by Babe Ruth and modern players, to estimate how many each would have hit in a season or a lifetime if they had all played under the same playing conditions (parks, leagues, levels of competition, and eras). [7]

Davenport introduced the DT's to the on-line baseball research community in 1995 as follows:

Hello. My name is Clay Davenport, and I attend the University of Chicago as a physics genius.

While these Translations look like player stats, they are NOT the players' actual statistics. The Translations are an attempt to show how well the player would have performed in a standard league (the American League of 1992), knowing how well he played in his actual league. We know that some leagues are tougher than others; that's why we have the majors, AAA, AA, and so on. We know that some leagues are easier to hit in; we know that some parks favor the pitchers; and we know that these effects are not constant from one year to the next. We can estimate how big a difference each of those makes and correct for them, and that is what the Translations try to do. How well they work I shall leave for you to judge. [8]

Meteorology

A graduate of the University of Chicago, Davenport was employed for many years as a software contractor with the National Oceanographic and Atmospheric Administration (NOAA) in the Satellite and Information Service, where he developed models for predicting rainfall from satellite imagery. He has likened some of that work to his baseball analysis: "The biggest similarity between handling the two types of statistics is that they each involve making forecasts that are there for everyone to see, and you end up being wrong a lot," Davenport said. "You learn to develop a thick skin." [9]

In 2000, Davenport developed the Hydro-Estimator, a set of computer programs to estimate precipitation in real time.

"The Hydro-Estimator (H-E) version of the Auto-Estimator (AE) was developed by Clay Davenport, a contractor working for the ORA Hydrology Team under the direction of Dr. Rod Scofield. The Hydro-Estimator algorithm differs from the original AE by using a brightness temperature screening technique. It adjusts the rain rate assigned to each picture element (pixel) according to the surrounding pixel temperatures. This helps separate raining and non-raining pixels and decreases the need for radar screening. It also helps focus rainfall estimate totals into more clearly defined maxima. There is less of a tendency for overestimating for very cold cloud tops using the H-E, and it does a much better job of estimating for large mesoscale convective complexes (MCC's). The H-E also has a different way of handling the moisture corrections, and also produces more frequent products every 15 minutes for all except the 24-hour totals. The 1 hour H-E totals are available on the NWS AWIPS system as a graphic for the whole CONUS every hour." [10]

Because of these programs, according to Davenport, "we are now capable of producing rainfall estimates for every system visible from satellite, which allows it to be used for other purposes in the United States and around the world, for example, drought monitoring in Africa, forest fire protection in Brazil and landslide studies in Venezuela." [11]

Related Research Articles

Sabermetrics or SABRmetrics is the empirical analysis of baseball, especially baseball statistics that measure in-game activity.

Bill James American baseball writer and statistician

George William James is an American baseball writer, historian, and statistician whose work has been widely influential. Since 1977, James has written more than two dozen books devoted to baseball history and statistics. His approach, which he termed sabermetrics in reference to the Society for American Baseball Research (SABR), scientifically analyzes and studies baseball, often through the use of statistical data, in an attempt to determine why teams win and lose.

In statistics, maximum likelihood estimation (MLE) is a method of estimating the parameters of a probability distribution by maximizing a likelihood function, so that under the assumed statistical model the observed data is most probable. The point in the parameter space that maximizes the likelihood function is called the maximum likelihood estimate. The logic of maximum likelihood is both intuitive and flexible, and as such the method has become a dominant means of statistical inference.

Pythagorean expectation is a sports analytics formula devised by Bill James to estimate the percentage of games a baseball team "should" have won based on the number of runs they scored and allowed. Comparing a team's actual and Pythagorean winning percentage can be used to make predictions and evaluate which teams are over-performing and under-performing. The name comes from the formula's resemblance to the Pythagorean theorem.

Equivalent Average (EqA) is a baseball metric invented by Clay Davenport and intended to express the production of hitters in a context independent of park and league effects. It represents a hitter's productivity using the same scale as batting average. Thus, a hitter with an EqA over .300 is a very good hitter, while a hitter with an EqA of .220 or below is poor. An EqA of .260 is defined as league average.

In baseball, value over replacement player is a statistic popularized by Keith Woolner that demonstrates how much a hitter or pitcher contributes to their team in comparison to a replacement-level player who is an average fielder at that position and a below average hitter. A replacement player performs at "replacement level," which is the level of performance an average team can expect when trying to replace a player at minimal cost, also known as "freely available talent."

In baseball statistics, pitch count is the number of pitches thrown by a pitcher in a game.

In baseball, defense-independent pitching statistics (DIPS) measure a pitcher's effectiveness based only on statistics that do not involve fielders. These include home runs allowed, strikeouts, hit batters, walks, and, more recently, fly ball percentage, ground ball percentage, and line drive percentage. By focusing on these statistics, which the pitcher has almost total control over, and ignoring what happens once a ball is put in play, which the pitcher has little control over, DIPS can offer a clearer picture of the pitcher's true ability.

Baseball Prospectus organization

Baseball Prospectus (BP) is an organization that publishes a website, BaseballProspectus.com, devoted to the sabermetric analysis of baseball. BP has a staff of regular columnists and provides advanced statistics as well as player and team performance projections on the site. Since 1996 the BP staff has also published a Baseball Prospectus annual as well as several other books devoted to baseball analysis and history.

Robust statistics are statistics with good performance for data drawn from a wide range of probability distributions, especially for distributions that are not normal. Robust statistical methods have been developed for many common problems, such as estimating location, scale, and regression parameters. One motivation is to produce statistical methods that are not unduly affected by outliers. Another motivation is to provide methods with good performance when there are small departures from parametric distribution. For example, robust methods work well for mixtures of two normal distributions with different standard-deviations; under this model, non-robust methods like a t-test work poorly.

Football Outsiders (FO) is a website started in July 2003 which focuses on advanced statistical analysis of the NFL. The site is run by a staff of regular writers, who produce a series of weekly columns using both the site's in-house statistics and their personal analyses of NFL games.

In statistics, M-estimators are a broad class of extremum estimators for which the objective function is a sample average. Both non-linear least squares and maximum likelihood estimation are special cases of M-estimators. The definition of M-estimators was motivated by robust statistics, which contributed new types of M-estimators. The statistical procedure of evaluating an M-estimator on a data set is called M-estimation.

Nate Silver American pundit and writer

Nathaniel Read Silver is an American statistician and writer who analyzes baseball and elections. He is the founder and editor-in-chief of FiveThirtyEight and a Special Correspondent for ABC News.

In estimation theory and decision theory, a Bayes estimator or a Bayes action is an estimator or decision rule that minimizes the posterior expected value of a loss function. Equivalently, it maximizes the posterior expectation of a utility function. An alternative way of formulating an estimator within Bayesian statistics is maximum a posteriori estimation.

PECOTA, an acronym for Player Empirical Comparison and Optimization Test Algorithm, is a sabermetric system for forecasting Major League Baseball player performance. The word is a backronym based on the name of journeyman major league player Bill Pecota, who, with a lifetime batting average of .249, is perhaps representative of the typical PECOTA entry. PECOTA was developed by Nate Silver in 2002–2003 and introduced to the public in the book Baseball Prospectus 2003. Baseball Prospectus (BP) has owned PECOTA since 2003; Silver managed PECOTA from 2003 to 2009. He was responsible for the PECOTA projections for the 2003–2009 baseball seasons. Beginning in Spring 2009, BP assumed responsibility for producing the annual forecasts. The first baseball season for which Silver played no role in producing the PECOTA projections was 2010.

Joseph S. (Joe) Sheehan was born in New York City on February 26, 1971, and attended Regis High School. He graduated from the University of Southern California in 1994, with a degree in journalism. Sheehan lives in the New York City area. He is one of the founders and was a co-editor of the first annual book of sabermetric baseball forecasts and analyses by Baseball Prospectus in 1996 as well as several later volumes.

Rany Jazayerli, a Chicago-area dermatologist, is a co-founder of and writer for Baseball Prospectus. He developed the statistical concept of Pitcher Abuse Points (PAP), which relates to high pitch counts in baseball.

Extrapolated Runs (XR) is a baseball statistic invented by sabermetrician Jim Furtado to estimate the number of runs a hitter contributes to his team. XR measures essentially the same thing as Bill James' Runs Created, but it is a linear weights formula that assigns a run value to each event, rather than a multiplicative formula like James' creation.

In baseball, wOBA is a statistic, based on linear weights, designed to measure a player's overall offensive contributions per plate appearance. It is formed from taking the observed run values of various offensive events, dividing by a player's plate appearances, and scaling the result to be on the same scale as on-base percentage. Unlike statistics like OPS, wOBA attempts to assign the proper value for each type of hitting event. It was created by Tom Tango and his coauthors for The Book: Playing the Percentages in Baseball.

Wins Above Replacement or Wins Above Replacement Player, commonly abbreviated to WAR or WARP, is a non-standardized sabermetric baseball statistic developed to sum up "a player's total contributions to his team". A player's WAR value is claimed to be the number of additional wins his team has achieved above the number of expected team wins if that player were substituted with a replacement-level player: a player who may be added to the team for minimal cost and effort.

References

  1. Dave Pease, "Clay Davenport Now at BP Full Time," BaseballProspectus.com, March 1, 2010 Archived March 3, 2010, at the Wayback Machine .
  2. ClayDavenport.com
  3. Clay Davenport, "Hello Everybody," ClayDavenport.com, May 15, 2011. [retrieved February 20, 2012]
  4. Clay Davenport, "If You Don't Know Me ... ", ClayDavenport.com, May 15, 2011. [retrieved February 20, 2012]
  5. Clay Davenport and Keith Woolner, "Revisiting the Pythagorean Theorem: Putting Bill James' Pythagorean Theorem to the Test", BaseballProspectus.com, June 30, 1999.
  6. See, for example the 1994 figures at https://groups.google.com/group/rec.sport.baseball.analysis/tree/browse_frm/month/1995-02/.
  7. Will Carroll and Clay Davenport, "The Answer: No Asterisks Necessary,"BaseballProspectus.com, July 15, 2007.
  8. https://groups.google.com/group/rec.sport.baseball.analysis/tree/browse_frm/month/1995-01/. In this initial release as well as later, Davenport explained how the DT's were different from and more useful than the Major League Equivalencies (MLE's) that Bill James had first developed. See also Clay Davenport, "DTs vs. MLEs - A Validation Study," BaseballProspectus.com, January 30, 1998.
  9. John Leslie, "Clay Davenport is Team Member of the Month," NOAA Report 13, No. 5, May 2004 Archived 2006-10-01 at the Wayback Machine .
  10. See http://www.ssd.noaa.gov/PS/PCPN/program.html
  11. Ibid.