View the history of baseball through these graphs. Use the dropdown menus below to select any year from 1901 onwards and any league to see how teams compared within that season. The first graph shows how runs scored and allowed lead to wins (and losses). The second graph shows how runs scored are related to getting on base and slugging. The third shows how runs allowed are related to pitching and fielding. As you travel through time, you'll see what made the great teams great, the bad teams bad and just how lucky some teams can be.
Further down the page, you will find other graphs. One set of graphs shows have stats have changed through the years and how they are correlated with each other. Another set of graphs takes one team at a time and illustrates its performance through the years. By going between all the graphs, you'll find lots of interesting baseball facts and insights.
There are no graphs about players on these pages. If you'd like to view player graphs, I highly recommend Fangraphs. For instance, here is the graph page for Hall of Famer Chipper Jones.
There are only two years of Federal League graphs, of course--1914 and 1915. I will be slowly but surely adding written summaries of each season. If you have friendly comments or helpful suggestions, I'm at daveATbaseballgraphsDOTcom. All team links lead to Baseball Reference. The underlying data is courtesy of the Lahman database and the code was written by Claude.
Better teams (upper right) score more and allow fewer runs
Win% lines of .400, .500 and .600 are based on the Pythagorean Formula
Hover over a data point to see the number of games won by each team
Run Differential
This graph shows how well each team performed scoring and allowing runs, which are the drivers of baseball success. The Runs Allowed axis (the Y axis) is in reverse, so that the best defensive teams are at the top of the graph and the best scoring teams are on the right.
The dotted lines are based on the Pythagorean Theorem (RS^2)/(RS^2+RA^2), which is a pretty good predictor of winning percentage. The lines are drawn at different winning percentages so you can compare teams in different areas of the graph. The best teams will be above the .600 line.
If you hover over a datapoint, you'll see how many actual wins that team achived. If you go back up to the Standings table, the "PythVar" is the difference between the actual wins and Pythagorean wins. Differences between Pythagorean and actual wins is typically based on random distribution patterns more than anything else.
This graph allows you to overlay up to three statistics with completely different scales on the same graph using normalized values. There are two different modes for comparing stats.
0-1 Scale (Min-Max): Normalizes where 0 = historical minimum and 1 = historical maximum across ALL years (1901-2024). This lets you see where any value ranks in the all-time range. The middle of the axis (0.5) isn't the average or the median. It's the point betweeen the minimum and the maximum.
Z-Score: Shows how many standard deviations each value is from the historical average. 0 = average, +1 = top 16%, +2 = top 2.5%, etc. Great for spotting unusual or exceptional periods in baseball history.
Hover over any point to see the actual raw values alongside the normalized values.
Defense Efficiency Ratio (or DER, for those in the acronym biz) is a pretty simple statistic. It takes all balls that were batted into play (so, no Home Runs) and calculates how many were turned into outs. It's a simple measure but it reflects a lot of complex stuff, such the quality of the fielders, the gloves, the ballpark configurations, how hard the ball was hit, where it was hit, and probably a few more things I haven't thought of.
Which is why I find the first graph on the right, the League Statistics Over Time graph, so fascinating. It's the story of what has happened to batted balls in play since 1901. (Click to embiggen the graph if you'd like) The story it tells is this: there were wild swings in fielding results during the Deadball Era, followed by a steady period of relatively low DER in the Live Ball Era. In the beginning years of World War II, DER took a jump up and didn't look down again for several decades (that gap between the AL and NL in the '60s is interesting, isn't it?). This jump may have been due to improvements in technology, such as better fielding gloves or park conditions, among other things.
But the thing that really catches my eye (even without the arrow) is the downturn in the early '90s. If you pull up the graph yourself and mouse over the data points, you'll see that the percent of batted balls fielded for outs dropped from 70% in 1992 to 68% 1996. That doesn't sound like a lot, but keep in mind that we're talking over 100,000 balls in play each year. It adds up. What's more, this became a long-term change. DER has crept back up but is only now (2024) back up to 70%.
What happened?
Well, we should first acknowledge that the Colorado Rockies, with their cavernous ballpark, joined the majors in 1993. If you look them up in the Team Timeline Explorer you'll see that their DER is typically two percentage points lower than the major league average. But that's how much DER fell across both leagues during that time, and the Rockies were only one team.
This is where the Multi-Stat Timeline comes in handy. In the second graph, you'll spot two statistics: DER and Slugging Percentage. It's a fascinating graph, isn't it? The two stats seem to dance around each other. When Slugging goes up, the percent of fielded balls goes down. And vice versa. Perhaps balls were hit harder, which made them harder to field. Perhaps the old defensive whizzes who were in the lineup just for their gloves went by the wayside as hitting was emphasized more and more. Perhaps the ball was juiced. Whatever the cause, decline in fielding outcomes is another way the game has changed right under our noses.
Try playing with the graphs yourself to see if you can spot any particular reasons for this intriguing phenomenon.
Everyone knows that 1968 was the Year of the Pitcher, right? That was the year teams scored only 3.4 runs a game, the lowest total outside of the Deadball Era. Bob Gibson set the modern record with a phenomenal 1.12 ERA. Denny McLain won 30 games, etc. etc. Well, what if I told you that there was a recent year in which pitchers dominated just as much as they did in 1968; that there was a year in which their strikeouts, walks and home runs allowed were at virtually the same level as the Year of the Pitcher? See if you can spot it in the League Statistics Over Time graph on the right. (You can click on the graph to embiggen it and get a closer view.)
If you picked 2014, give yourself a prize. In 2014, FIP was actually a bit lower (0.49) than it was in 1968 (0.52). When comparing 1968 to 2014...
You may ask, why didn't anyone notice this at the time? Well, you see, the FIP statistic that we use here doesn't include a constant to make it equal to the league's ERA. We use "pure" FIP, which only considers the three main components of FIP. So the more recent Year of the Pitcher was overlooked because they just looked at regular old ERA. And ERA masked what was going on. In fact, the major leagues scored 4.1 runs per game in 2014, far above the 3.4 registered in 1968. And why was that?
Have a gander at the second graph, the Multi-Stat Timeline. The blue line is FIP, the orange line is Runs Per Game and the black dotted line with the triangles is DER. As we've already discussed, fewer balls have been turned into outs in recent years and in retrospect you could have called 1968 "the year of the fielder." 71.3% of batted balls in play were turned into outs in 1968, the highest total in baseball history. In 2014, that figure dropped to 68.8%. As we mentioned in the previous article, a lot of factors go into DER, but FIP is a purer stat. It measures the things that pitchers control most. And pitchers in 2014 were as effective as any year since the Deadball Era.
Here is a graph of the number of triples hit per game on the X axis and each team's Slugging Percentage on the Y axis. Each dot represents one team in one year-there are 2,692 dots in all. And, as you can see, this graph is a mess. It's one big blob of dots with a slightly downward trend, meaning that the more triples that are hit, the lower the Slugging Percentage tends to be.
Which makes no sense, right? If you hit more triples, your Slugging Percentage should go up. After all, each triple is worth three bases. But what we have here is a countervailing trend over the years. As baseball time has passed, batters have learned to hit more home runs and fewer triples. As a result, Slugging Percentage has gone up while triples have gone down. It's not that triples don't increase Slugging Percentage, it's that the wider trend has wiped out the natural logic of it all.
If you look at this graph for each era instead of over all baseball history (go ahead and try it), you'll find correlations that make more sense. Here is a list of the correlation between triples and Slugging Percentage in each era:
See that? Through most of baseball history, triples have been positively correlated with Slugging Percentage. It's only in the last few eras, when so few triples have been hit at all, that there is almost no correlation between triples and SLG. And there was never an era in which triples had a negative impact on Slugging (that negative correlation in the Steroid Era is so weak that it can be ignored). So when you investigate the correlation between statistics, be sure to weed out the long-term trends from the short-term effects.
For about ten years, from 2005 to 2015, I wrote a lot of words about baseball. I was also managing the Hardball Times website and coordinating the publication of our baseball annuals. If you're interested in the state of baseball analysis and commentary at the time, you might find some of my old work interesting.
I created the following PDFs from the Hardball Times website which is now hosted by Fangraphs. Not all of the formatting from the old website made it to Fangraphs, so please forgive some of the odd displays, such as bold titles and tables. I think they're still readable. Enjoy.
I wrote a regular column at the Hardball Times called Ten Things I Didn't Know Last Week. Each week, I tried to find the right mix of storytelling and analysis. It was a good run for a few years. Here's a PDF of some of my favorite columns.
I also wrote a Ten Things article in each Hardball Times Annual as a way of summarizing the year just ended. These are my articles from 2006 through 2010.
These are a few basic columns that I wrote, covering subjects such as regression to the mean (which was used in at least one statistics class that I know of), APBA, and the mind of Brian Sabean. Most of these articles were pretty simple, but sometimes you have to make the point. Included in here is one of my personal favorites "Why wOBA Works". I think the baseball world could still get something from that one.
Back in the day, we didn't have all the crazy tracking technology that baseball utilizes today. We didn't know the speed of a batted ball, let alone the bat. We were just learning about pitch speeds and we were only categorizing batted balls in line drives, ground balls, and such. But at the Hardball Times, I like to think we were cutting edge with the data as it unfolded. These insights aren't as relevant today, but they helped lead us to the current day.
Inspired by Bill James' Win Shares, I developed an unhealthy obsession with win-based statistics. I took Bill's Win Shares and added my own take called Win Shares Above Bench (WSAB). This was an early precursor to Wins Above Replacement (WAR), which you might have heard of.
As part of my WSAB meanderings, I developed a unique approach to evaluating contracts that people still use today, substituting WAR for WSAB.
Yeah, I really got into Win Probability Added (WPA), which is a different kind of win-based statistic. First, I spilled some explanatory and analytic words about the development of WPA.
Then I spent some time using WPA as a basis for baseball commentary and analysis. This might be the work I'm proudest of. It includes multiple entries from the Hardball Times Annual and an article for which I won the first ever SABR Analytic Award for Historical Analysis/Commentary.
I've also developed an online tool to look up win probabilities called The WPA Toy.
In 2009, I decided to write a weekly column that used everything we had learned about batted ball data up to that point. It didn't really take, but I did develop a way of reporting the impact of batted balls that is useful. These are all the reports from that year, which is also a nice way of reading about how the season unfolded.