What Do Economic Models Really Tell Us About Elections?

Yesterday’s column on the weak historical relationship between the unemployment rate and presidential performance generated a mostly favorable reaction. But I also got a few replies wondering how to reconcile these findings against the claims, made with some frequency by political scientists, that presidential elections can be forecast with pinpoint accuracy provided that you know the economic fundamentals.

Might it be that the unemployment rate is the “wrong” variable? It has a couple of flaws. It tends to be more a lagging than a leading indicator. It affects a relatively small number of voters (although underemployment is another matter). There is ample room to critique the way that it is calculated, such as by excluding “discouraged workers” from the calculation, and it’s subject to reasonably large measurement errors.

Apart from the unemployment rate, the economic measures that we hear about most frequently are probably gross domestic product (G.D.P.) growth and inflation. Do either of these do a better job of forecasting elections?

Yes, but they still don’t do terribly well.

Here’s the comparison over the past 100 years of real (inflation-adjusted), per-capita G.D.P. growth over the course of the president’s term to performance of the incumbent party:

G.D.P. growth explains about 12 percent of election results. That’s something — but it still leaves 88 percent unexplained. If this was as well as we could do, the slogan “it’s the economy, stupid” might have to be revised to “sure, you’d rather have a better economy than a weaker one; it probably makes some difference at the margin” — which doesn’t make for a very good bumper sticker.

Roughly the same is true of the inflation rate:

However, this one is a little tricky because both high inflation and deflation have deleterious effects on the consumer. A quadratic curve does somewhat better than a straight-line estimate: low (but positive) inflation helps the incumbent president, while either deflation or high inflation hurt.

You can do a bit better still if you combine these measures. One relatively elegant method is to subtract the inflation rate from the per-capita G.D.P. growth rate. This explains about 43 percent of the variance in election results:

So we can breathe a sigh of relief. The economic fundamentals clearly do make some difference — quite a bit really.

But we shouldn’t expect any miracles. This cute little model would have called the wrong winner in six of the past 25 elections: 1912, 1948, 1952, 1960, 1968, and 2000 (although it would have gotten the winner of the popular vote right in that year). And it would have missed the margin in the popular vote by about 8 points on average, or roughly 4 points for each of the two major candidates (since a vote for one of them means a vote against the other).

This seems like a healthy state of affairs. Simple economic variables can account for a little less than half of the variability in election results. The other half falls into the “everything else” category, including factors such as foreign policy successes and failures, major scandals, incumbency, candidate quality, controversial social legislation and structural factors like changes in partisanship. Technically speaking, some of the variability may also be explained by economic factors that weigh upon voters’ minds, but which are not easily quantified by measures like G.D.P. and inflation.

Nevertheless, the heuristic “it’s half the economy and half everything else, stupid” is a pretty good way to think about presidential elections.

Some models, however, claim to have much more predictive power than this, using economic variables (sometimes along with noneconomic variables) to explain as much as 90 percent of presidential election results. You should be very skeptical of these claims.

Perhaps the best-known of these models is the so-called “Bread and Peace” model designed by Douglas Hibbs of the University of Gothenberg. There are a lot of things to admire about this model. Most notably, it’s not larded down with superfluous variables. Instead, it is based on just two: growth in real, per-capita disposable income (weighted to place more emphasis on the later years of a president’s term than the earlier ones), and the number of military fatalities resulting from U.S.-initiated foreign conflicts. (The latter definition would apply to wars like Iraq or Vietnam — but not to something like the Gulf War or World War II where the U.S. was responding to another country’s attack.)

Mr. Hibbs, using data from 1952 through 2008, claims to be able to explain almost 90 percent of the variance in presidential election results based on these variables alone, missing the results by just a point or two on average:

There are a couple of common critiques of this model. One is that the way it defines wars is a bit problematic. For instance, because the Korean War resulted in 14 times more U.S. fatalities than the Iraq War, it had about 14 times more effect on Mr. Hibbs’s model. Perhaps that’s a defensible position — U.S. soldiers being killed in action is a very noticeable impact for the public — but by other measures we might have expected Iraq to have a larger impact: it was more unpopular than the Korean War, and it was quite a bit more expensive.

Nevertheless, the proof is the model’s predictive power. Mr. Hibbs first released forecasts using this model in 1992, and since then it has performed acceptably well but not superbly, with fairly big misses in 1996 and 2000 but good results in the other years, including 2008.

Still, there are some signs that the model is not quite as accurate as claimed. Using these two variables to forecast the results of the elections of 1952 through 1988 — the dataset that Mr. Hibbs originally had to work with — would have missed the incumbent party’s vote share by just 0.8 percentage points, on average. Since then, on the out-of-sample results, the average miss has been 2.6 percentage points.

We can, of course, wait to see how this model does in 2012, 2016, 2020 and so forth to get a better sense for how accurate it really is. But I don’t have that much patience! So here’s what we can do instead — it’s a technique that I’ve applied to other models of this type. We can plug in the data from elections prior to 1952, which were outside the period that the model to build its estimates. How would the model have performed in 1948, for example?

I dug deep into the bowels of the Census Bureau’s Web site and discovered data on disposable income growth dating back to the 1920s. (The data is annual rather than quarterly, but this is easy to adjust for and should make only an extremely minor difference.) The other variable that Mr. Hibbs uses — military casualties — isn’t pertinent to these years because the only major conflict that the U.S. fought in during this period was World War II, a “good” war in which the United States was compelled into action because of Axis hostilities and that therefore would not meet Mr. Hibbs’s definition.

Here, again, is how the model performed from 1952 through 2008:

Now, let’s plug in 1948:

Uh-oh. The model did really, really badly in 1948. Disposable income growth was actually negative in the four years preceding 1948, according to Mr. Hibbs’s formula. It would have predicted, therefore, just a 43 percent share of the two-party vote (that is, excluding votes for third parties) for Harry Truman. Instead, Truman got 52 percent of the two-party vote and won the election over Thomas Dewey, much to the Chicago Tribune’s surprise.

But 1948 was a weird year — lots of erratic economic data in the postwar period. How would the model have performed in 1944?

Another bad year for the model. Income growth was prodigious in these years, and the model would have expected Franklin Roosevelt to win a landslide victory, getting 66 percent of the two-party vote. Roosevelt booked a solid win, but his actual total of 54 percent of the two-party vote is not particularly close to the model’s estimate. This election occurred during World War II; perhaps Mr. Hibbs’ distinction between good and bad wars is less salient to the American public than we might think. But this is a poor result.

The 1940 election occurred before the bombing of Pearl Harbor, and the model performed better in that year, missing Roosevelt’s vote share by 4 points.

But 1936 constituted another big problem. Disposable income growth between 1932 and 1936 was an astounding 8.5 percent. That should have translated, according to the formula, to Roosevelt receiving 77 percent of the two-party vote. Roosevelt performed really, really well in 1936, winning all but two states, but he won only 62 percent of the vote rather than 77 percent.

A similar problem is apparent in 1932. Herbert Hoover presided over the biggest economic disaster in United States history, with per capita income growth declining by 8.4 percent. Mr. Hibbs’s formula would have called for him to win just 16 percent of the major-party vote. Hoover did, in fact, lose in a landslide, but things weren’t quite that bad.

1928, a seemingly normal year (no depressions, no wars), was another bad one for the model. Although some other economic measures were decent or good, disposable income growth was slow enough that the model would have predicted that Hoover would receive 47 percent of the vote and lose the election. Instead, he won in a landslide, defeating the Democratic candidate, Al Smith.

If it’s any consolation, the model would have performed quite well in 1924.

Overall, however, the model performs quite poorly on out-of-sample results. In the years from 1924 through 1948, and from 1992 through 2008, it would have missed the incumbent party’s vote share by an average of 7.8 points, and a median of 4.7 points. By contrast, a naive strategy of simply guessing that the incumbent party would win exactly half the vote would have done better, missing by an average of 5.8 points and a median of 4.3 points.

If we redo the model using all the data from 1924 through 2008, it explains about 60 percent — not 90 percent — of the variance in the presidential vote. Importantly, the coefficient on the growth variable is also quite a bit lower, meaning that the electorate is somewhat less sensitive to economic performance. Each percentage point rise in income growth translates to a 1.5 percentage point rise in the incumbent party’s vote share, rather than 3.6 percent as the original model implied.

And if we remove the war causalities variable and just focus on what the model tells us about the economy, it explains almost exactly half the vote. That’s good, but not much better than our simple G.D.P.-and-inflation model.

To be clear, I think Mr. Hibbs’s model is the best of its kind. But there’s nothing magical about it, and the fact that it performed so uncannily well from 1952 through 1988 is not a good reflection of its predictive power. Core economic variables explain about half of the presidential vote — the rest is up to the candidates and the voters.

At the risk of beating a dead horse, let me reiterate that this is a result that I find intuitively appealing.

I’d be worried if, as our study of the unemployment rate seemed to imply, the economy had no effect on election results at all; that clearly seems wrong given the effect it has upon people’s lives and the media’s (appropriate) attention to it.

But I’d be just as worried if one or two economic variables explained 90 percent of the results. Wars matter, above and beyond what can be measured with a single variable based on military causalities. Watergate mattered. September 11 mattered. Monica Lewinsky mattered. The fact that parties have nominated candidates as strong as Dwight Eisenhower and as weak as George McGovern — that matters. It matters that the electorate goes through phases of being relatively more and relatively less partisan.

In 2012, things like President Obama’s unpopular health care bill, the Republicans’ unpopular Medicare bill, and the death of Osama bin Laden are likely to matter. So will the economy (those numbers are getting worse for Mr. Obama).

But the results are not quite baked in, as some would have you believe. Until we reach the point where the polls become more reliable — the nice thing about polls is that they permit voters to determine for themselves what matters to them, rather than having preferences inferred by a statistical formula — my advice is to look at more rather than fewer pieces of evidence.

Look at a broad range of economic variables — maybe you should pay a little bit more attention to disposable income growth, and a little bit less to unemployment — but look at all of the major ones. Look at Mr. Obama’s approval ratings — we’re just now getting to the point where they can start to have a tiny bit of predictive power. Look at the number of people who say the country is on the right track or the wrong track. Don’t dismiss the importance of factors like the health care bill and the death of Bin Laden just because their impact is harder to quantify. Furthermore, consider the potential impact of “black swan” types of events: everything from a debt default to a Tea Party candidate running as an independent.

It’s much too soon to reduce the election down to a four-word catch phrase, or to a two-variable formula. It’s the economy, stupid. And everything else too.

FiveThirtyEight

What Do Economic Models Really Tell Us About Elections?

Comments