Only 30 days and fewer than 30 games per team remain in major-league baseball’s regular season. There are all-but-certain winners in three of six divisions: the New York Yankees, American League (AL) East; Houston Astros, AL West; and Los Angeles Dodgesr, National League (NL) West.

The Boston Red Sox, last year’s AL East and World Series champions, probably won’t make it to the AL wild-card playoff game. The Milwaukee Brewers, last year’s NL Central champs, are in the same boat. The doormat of AL Central, the Detroit Tigers, are handily winning the race to the bottom, with this year’s worst record in the major leagues.

Anecdotes, however, won’t settle the question whether major-league baseball is becoming more or less competitive. Numbers won’t settle the question, either, but they might shed some light on the matter. Consider this graph, which I will explain and discuss below:

*Based on statistics for the National League and American League compiled at *Baseball-Reference.com*.*

Though the NL began play in 1876, I have analyzed its record from 1901 through 2018, for parallelism with the AL, which began play in 1901. The rough similarity of the two time series lends weight to the analysis that I will offer shortly.

First, what do the numbers mean? The deviation between a team’s won-lost (W-L) record and the average for the league is simply

D_{t} = R_{t} – R_{l }, where, R_{t }is the team’s record and R_{l }is the league’s record in a given season.

If the team’s record is .600 and the league’s record is .500 (as it always was until the onset of interleague play in 1997), then D_{t }= .100. And if a team’s record is .400 and the league’s record is .500, then D_{t} = -.100. Given that wins and losses cancel each other, the mean deviation for all teams in a league would be zero, or very near zero, which wouldn’t tell us much about the spread around the league average. So I use the absolute values of D_{t }and average them. In the case of teams with deviations of .100 and -.100, the absolute values of the deviations would be .100 and .100, yielding a mean of .100. In a more closely contested season, the deviations for the two teams might be .050 and -.050, yielding a mean absolute deviation of .050.

The smaller the mean absolute deviation, the more competitive the league in that season. Season-by-season plots of the means are rather jagged, obscuring long-term trends. I therefore used centered five-year averages of mean absolute deviations.

Both leagues generally became more competitive from the early 1900s until around 1970. Since then, the AL has experienced two less-competitive periods: the late 1970s (when the New York Yankees re-emerged as a dominant team), and the early 2000s (when the Yankees were enjoying another era of dominance that began in the late 1990s). (The Yankees’ earlier periods of dominance show up as local peaks in the black line centered at 1930 and the early 1950s.)

The NL line highlights the dominance of the Chicago Cubs in the early 1900s and the recent dominance of the Chicago Cubs, Los Angeles Dodgers, and St. Louis Cardinals.

What explains the long-term movement toward greater competitiveness in both leagues? Here’s my hypothesis:

Integration, which began in the late 1940s, eventually expanded the pool of baseball talent by opening the door not only to American blacks but also to black and white Hispanics from Latin America. (There were a few non-black Hispanic players in major-league ball before integration, but they were notable freckles on the game’s pale complexion.) Integration of blacks and Latins continue for decades after the last major-league team was nominally integrated in the late 1950s.

Meanwhile, the minor leagues were dwindling — from highs of 59 leagues and 448 teams in 1949 to 15 leagues and 176 teams in 2017. Players who might otherwise have stayed in the minor leagues have been promoted to the major leagues more often than in the past.

That tendency was magnified by expansion of the major leagues. The AL started in 1961 (2 teams), and the NL followed suit in 1962 (2 teams). Further expansion in 1969 (2 teams in each league), 1977 (2 teams in the AL), 1993 (2 teams in the NL), and 1998 (1 team in each league) brought the number of major-league teams to 30.

While there are now 88 percent more major-league teams than there were in 1949, there are far fewer teams in major-league and minor-league ball, combined. Meanwhile the population of the United States has more than doubled, and that source of talent has been augmented significantly by the recruitment of players of Latin America.

Further, free agency, which began in the mid-1970s, allowed weaker teams to attract high-quality players by offering them more money than stronger teams found it wise to offer, given roster limitations. Each team may carry only 25 players on its active roster until the final month of the season. Therefore, no matter how much money a team’s owner has, the limit on the size of his team’s roster constrains his ability to sign the best players available for every position. So the richer pool of talent is spread more evenly across teams.