Monkey CageAnalysis Is the media biased toward Clinton or Trump? Here is some actual hard data.
There are endless disputes this year about how the media is covering the presidential campaign. See, for example, this exchange between my Post colleague Chris Cillizza and the political scientist Norm Ornstein. Mostly lacking in these disputes, however, is systematic data about how, and how much, the media is covering Hillary Clinton and Donald Trump. Kalev Leetaru and I wrote a long post about media coverage during the primary. But what about now?
An interesting new analysis comes from the team at the Data Face, Jack Beckwith and Nick Sorscher. I asked them a few questions about what they found. Below is a lightly edited transcript.
How did you go about measuring media coverage of Hillary Clinton and Donald Trump?
We compiled a total of 21,981 articles written about the election dating back to July 1, 2015. To be included in our data set, each article had to reference either Donald Trump or Hillary Clinton in its headline (but not both). The articles came from the websites of eight major media outlets: the New York Times, The Washington Post, Chicago Tribune, Wall Street Journal, Slate, Politico, Fox News and the Weekly Standard. We wanted a mixture liberal and conservative outlets, at least according to conventional wisdom.
We looked at the number of articles that were published about each candidate over time, which captures their ability to dictate the news cycle. And using the actual text of the articles, we evaluated the tone of the coverage how positive or negative it was toward each candidate and how it has shifted throughout the campaign.
How do you know whether a story is positive or negative about either of the candidates?
We did this via a computer algorithm, which is becoming increasingly common as social scientists work with huge data sets of text. There are a variety of approaches to whats often called sentiment analysis, but our methodology was this: for each article, the algorithm identified every adjective. Then, using a very large word bank, it scored the adjectives on a scale of -1.0 (most negative) to +1.0 (most positive). The computer then averaged those values to generate an overall sentiment score for each article.
This obviously isnt perfect. A computers sense of sentiment can be tripped up by things like satire, slang or misspellings. But given that we were working with news articles (the Onion wasnt among our outlets), we believe these concerns are less relevant. Moreover, sentiment analysis has been shown to be surprisingly effective in predicting the stock market, summarizing customer feedback and delineating a populations political views.
Lets talk about what youve found. First, how much coverage have the two candidates received in these outlets, and how has that changed over time?
Findings: https://www.washingtonpost.com/news/monkey-cage/wp/2016/09/20/is-the-media-biased-toward-clinton-or-trump-heres-some-actual-hard-data/