2020 Presidential Elections Analysis

Tahmid Ahmed

View the Project on GitHub TahmidAhmed2000/Gov1347

11/1 - Predicting the 2020 Election

Overview

The time has come! It is time to make our final prediction for the 2020 Presidential election. The upcoming election will be a very unique one. America has not witnessed events like the Coronavirus pandemic and civil unrest from the death of George Floyd in a long time, which may impact voting behaviors. Furthermore, the pandemic has played detrimental effects on the economy, also possibly impacting voting behaviors. In this blog, I will talk about the model we will use to forecast the 2020 election.

Models Logistics

Poll Model

R_pv2p ~ avg_pollyr

Figure 1.

The results of this model is seen in Figure 1. Based on this model, Biden is expected to win with an electoral count of 310 votes to Trump’s 228 votes.

I decided to use a poll model because I learned from Nate Silver that polls are very good predictors of the election. While in 2016, polls were not as representative of the actual results, it is important that many pollsters have now improved in their methods. Now many pollsters are asking less biased questions and polling at more areas.

Fundamental Model

R_pv2p ~ GDP_growth_qt + turnoutpct_change + net_app

Figure 2.

The results from this model is seen in Figure 2. Based on this model, Biden is expected to win with an electoral count of 402 votes to Trump’s 136 votes.

I noticed that using the fundamental model has played a significant impact on battleground states in comparison to the polls model. I noticed many battleground states like Texas and Georgia have become blue from using the fundamental model.

Ensemble Model

Trump vote share = 0.96*Poll + 0.04*Fundamental

Now that we have our poll model and fundamental model, I decided to then use an ensemble model where I weighted the poll model by 0.96 and the fundamental model by 0.04.

Figure 3.

Just like the FiveThirtyEight model, I relied heavily on polling data. The ensemble model predicts Biden to win with 310 electoral votes and Trump to lose with 228 votes. While there are not necessarily any glaring predictions for a state, it is important to understand that there is uncertainty with this model, which I will talk about later on.

Validation of Models

Figure 4.

Figure 5.

Figure 6.

In this section, I will discuss the validation I used for my models.

Figure 7.

Predictability of Ensemble Model

Figure 8.

Sensitivity Analysis

I also decided to do a sensitivity analysis on our model. Figure 9.

Conclusion

My final prediction is using the results from our ensemble model, which means that Biden is projected to have 310 electoral votes while Trump is projected to have 228 electoral votes. The 95% prediction interval for Biden is 144 electoral votes to 432 electoral votes based on Figure 8. Given the nature of the ensemble model, there is quite a lot of uncertainty with this model, particularly with battleground states. While the battleground states did have smaller prediction intervals than other states according to Figure 8, they have been very close to 50%. This means they are quite easy to flip, or swing, come election time.

I prefer using the ensemble model because it weighs the poll model more heavily, especially since fundamentals tend to be noisy predictors as the election nears. However, given the impact of the Coronavirus pandemic and the expected record-breaking voter turnout, I do expect fundamentals to have some sort of impact on the election.

Additionally, I am interested to see how bias my models are in general after we find out the actual results of the election. There is a chance that polling data might be biased and could repeat what happened in 2016. Given that my polling model and ensemble model mainly represent what polls are saying, it would be interesting to see if the poll and ensemble model are very different from the actual results. This might tell America that we again need to do a better job in polling our citizens, especially since polling was heavily flawed in 2016.