2016 Postmortem

scheming daemons

(25,487 posts) Wed Oct 12, 2016, 01:55 PM Oct 2016

Want to know why the LA Times/Dornsife poll has been an outlier all election season?

There is a 19-year-old black man in Illinois who has no idea of the role he is playing in this election.

He is sure he is going to vote for Donald J. Trump.

And he has been held up as proof by conservatives — including outlets like Breitbart News and The New York Post — that Mr. Trump is excelling among black voters. He has even played a modest role in shifting entire polling aggregates, like the Real Clear Politics average, toward Mr. Trump.

How? He’s a panelist on the U.S.C. Dornsife/Los Angeles Times Daybreak poll, which has emerged as the biggest polling outlier of the presidential campaign. Despite falling behind by double digits in some national surveys, Mr. Trump has generally led in the U.S.C./LAT poll. He held the lead for a full month until Wednesday, when Hillary Clinton took a nominal lead.

Our Trump-supporting friend in Illinois is a surprisingly big part of the reason. In some polls, he’s weighted as much as 30 times more than the average respondent, and as much as 300 times more than the least-weighted respondent.

http://www.nytimes.com/2016/10/13/upshot/how-one-19-year-old-illinois-man-is-distorting-national-polling-averages.html

31 replies

= new reply since forum marked as read

Highlight:

Want to know why the LA Times/Dornsife poll has been an outlier all election season? (Original Post) scheming daemons Oct 2016 OP

What the heck Johnny2X2X Oct 2016 #1

Many polls weight responses if they are unable to hit targets of certain demographics. Ace Rothstein Oct 2016 #3

That's what Trump means when he says Happyhippychick Oct 2016 #2

I read the article but cannot understand why they have a poll that they know is inaccurate? UCmeNdc Oct 2016 #4

They didn't do it on purpose, and the design is defensible. DanTex Oct 2016 #10

Being weighted 30 times more than the average respondent doesn't seem defensible muriel_volestrangler Oct 2016 #13

The fact that there is that much overweighting is taken into account in DanTex Oct 2016 #15

Or you recruit a few more people in the required categories muriel_volestrangler Oct 2016 #19

If that can be done without compromising the randomness of the sample, then yes. DanTex Oct 2016 #20

Here's how they present it: muriel_volestrangler Oct 2016 #21

Thanks. DanTex Oct 2016 #22

Not completely... Adrahil Oct 2016 #23

Well, all designs are vulnerable to a bad sample. DanTex Oct 2016 #24

That's true to an extent, of course... Adrahil Oct 2016 #26

I don't agree that it's doubly vulnerable. DanTex Oct 2016 #28

Not they are not stuck with it due to weighting... Adrahil Oct 2016 #30

Agreed about the "previous voting" weighting. DanTex Oct 2016 #31

lol, well the overall flaw with that poll is they just poll the same group again and again... qdouble Oct 2016 #5

Not a flaw. Feature that makes it a DIFFERENT kind of poll: a Tracking Poll. Bernardo de La Paz Oct 2016 #6

It's clearly a flaw in it being presented as being representative of popular opinion when it is not. qdouble Oct 2016 #8

Yes & NO. The poll is NOT flawed. The media are flawed (or worse). . . . . nt Bernardo de La Paz Oct 2016 #9

That's reasonably true.... Adrahil Oct 2016 #12

Good points. . . . nt Bernardo de La Paz Oct 2016 #14

Lie, Damned Lies, and Dark n Stormy Knight Oct 2016 #7

Nate Cohn is pretty awesome. Adrahil Oct 2016 #11

Nate Silver should stop counting this poll at all bluestateguy Oct 2016 #16

Trumpkin's favorite poll. nt oasis Oct 2016 #17

Its a bogus poll workinclasszero Oct 2016 #18

I went to USC, and I feel ashamed Yavin4 Oct 2016 #25

I'll defend USC conducting this poll but LA Times should not have put it's name behind it RAFisher Oct 2016 #27

A experimental should be a subject, not a tool Foggyhill Oct 2016 #29

Johnny2X2X

(24,207 posts)

1. What the heck

Reply to scheming daemons (Original post)

Wed Oct 12, 2016, 01:59 PM

Oct 2016

They are weighting their respondents? What the heck kind of poll is this?

Ace Rothstein

(3,373 posts)

3. Many polls weight responses if they are unable to hit targets of certain demographics.

Reply to Johnny2X2X (Reply #1)

Wed Oct 12, 2016, 02:04 PM

Oct 2016

Demographics such as gender, race and age.

Happyhippychick

(8,422 posts)

2. That's what Trump means when he says

Reply to scheming daemons (Original post)

Wed Oct 12, 2016, 02:02 PM

Oct 2016

"There's my African American"!

UCmeNdc

(9,655 posts)

4. I read the article but cannot understand why they have a poll that they know is inaccurate?

Reply to scheming daemons (Original post)

Wed Oct 12, 2016, 02:12 PM

Oct 2016

Why design a poll that cannot accurately represent the voting groups. It seems silly and stupid. Once they knew they had a panelist that falsely represented the black vote why keep polling? They had to know their numbers were not accurate.

I do not understand why they kept the whole poll running?

DanTex

(20,709 posts)

10. They didn't do it on purpose, and the design is defensible.

Reply to UCmeNdc (Reply #4)

Wed Oct 12, 2016, 02:35 PM

Oct 2016

All polls reweight the sample to fit demographics. The problem with the LAT poll is that, basically, they got unlucky, and one part of the sample that was heavily overweighted ended up being highly unrepresentative.

That doesn't mean the methodology was bad, it's part of sampling error and could happen with any poll.

The problem is made worse by the fact that this is a panel, so the same people are polled every time. Which means that the unrepresentative sample stays in there the whole time. If they did brand new polls every time with the same methodology, this sample would have been an outlier, but other results they got would be more in line with the poll averages.

The problem is, it is very hard to justifiably changing a methodology once you decide on it just because the numbers don't come out the way you think they should. This is a problem that comes up in some scientific and medical research: people gather data, do some analysis, it doesn't work out well, so then they do some different analysis on the same data and publish it. That's not kosher, you have to decide on what you are going to do with the data before you gather the data in order for the statistical tests to be valid.

So, all in all, I think that the LAT pollsters are doing the right thing. The designed a methodology honestly, and they aren't changing the methodology after the fact, which would result in bias. They decided to run a panel, the designed the panel study soundly, and they are running it like they said. Even if they stopped running the panel because they don't think it's accurate, even that would result in a form of "survivor bias" in poll averages, because it would mean that some soundly designed polls were excluded because of the results they gave.

I agree with Nate Silver's comments on the LAT poll a while ago. What poll readers should do is not just dismiss the poll, but instead adjust it for it's house effect.

muriel_volestrangler

(106,212 posts)

13. Being weighted 30 times more than the average respondent doesn't seem defensible

Reply to DanTex (Reply #10)

Wed Oct 12, 2016, 02:48 PM

Oct 2016

The article goes on to say that on his own, he's made Trump's black support look like it's in double digits, and:

He is also the reason Mrs. Clinton took the lead in the U.S.C./LAT poll for the first time in a month on Wednesday. The poll includes only the last seven days of respondents, and he hasn’t taken the poll since Oct. 4. Mrs. Clinton surged once he was out of the sample for the first time in several weeks

http://www.motherjones.com/kevin-drum/2016/10/lat-poll-finally-makes-it-unanimous-donald-trump-loser

I can see defending weighting in a poll that increases a group from, say, 15% of actual respondents to 25% of the poll results, if you can't get hold of enough people in that demographic. But a weighting of 30 times for an individual is taking the piss.

DanTex

(20,709 posts)

15. The fact that there is that much overweighting is taken into account in

Reply to muriel_volestrangler (Reply #13)

Wed Oct 12, 2016, 03:00 PM

Oct 2016

the margin of error calculations, which means that, yes, it is defensible (even though it definitely appears strange).

That one person can apparently result in a 1% change in the outcome. Well, if you did a 100-person poll then every single person would account for 1% of the outcome. 100 people is a very small sample, so you'd have a big margin of error. In the LAT case, not many people (only one, it looks like), have that much influence, so the margin of error is smaller than it would be in a 100-person poll. But the margin of error calculations take into account the amount of reweighting.

As Nate Cohn points out in the article, if you cap the amount of overweighting, then you reduce the sampling error, but you end up introducing a bias, because your re-weighted sample will not approximate true demographics correctly.

muriel_volestrangler

(106,212 posts)

19. Or you recruit a few more people in the required categories

Reply to DanTex (Reply #15)

Wed Oct 12, 2016, 03:18 PM

Oct 2016

It looks like an attempt to claim a level of detail for the results without paying for the number of respondents to justify that, to me. They don't give a margin of error exactly like most polls do; they draw a chart with an "area of uncertainty", currently 4% wide, but which has been 6% at times (when this one guy is in, that's 1% of it, I suppose). But that doesn't get reported in words, just the basic figure which is given to a ridiculous 3 significant figures.

The only times the figures have gone outside the area of uncertainty have been for leads for Trump, which, given the results of so many other polls with the more common methods, seems to show this poll has a fundamental flaw.

DanTex

(20,709 posts)

20. If that can be done without compromising the randomness of the sample, then yes.

Reply to muriel_volestrangler (Reply #19)

Wed Oct 12, 2016, 03:49 PM

Oct 2016

The way it's typically done is to gather the sample before asking about demographics, though I don't know about the LAT poll. If that's the case, extending the sample by first asking people about their demo information would result in two different sampling techniques.

The other thing is, if you went that route, it would make the poll more expensive, because you'd end up contacting a lot more people, only to discard a lot of them because they weren't in the demographics you needed. If you did that, IMO you'd be better off just including everyone, and still having the reweighting, because that would result in a smaller margin of error.

In fact, discarding people because they don't fall into a demo that you need is itself a form of overweighting: in this case it's not that people's responses in other demographics would count for less, it's that if you fell into another demographic and were contacted, you'd have a very high chance of not even being able to respond at all. Which means that, relative to everyone who was contacted, there would still be huge disparities in effective influence. The problem to me isn't really that the overweighting is so severe, it's that one individual ended up with so much sway. If their sample were 10X bigger, then the most influential person would only have 0.1% sway, which would be fine.

I haven't actually seen the way they present the results. Whether they call it "margin of error" or "area of uncertainty" or whatever else their marketing department decided on, it's still basically the same thing: two standards of deviation.

It's interesting that their margin of error (or whatever they call it) is that large. The article says the sample size is about 3,000, and without reweighting that would mean a margin of error of about 1.8%. 5% is what you'd get with a sample of size 400, so that is an indication of the serious reweighting they are doing.

I agree that 3 significant digits is absurd. On the other hand, most polls only release 1 significant figure, and I think they should release 2. A typical poll with a margin of error of 4% means standard deviation of 2%. That means that, with one significant figure, rounding error could be a quarter of a standard deviation.

muriel_volestrangler

(106,212 posts)

21. Here's how they present it:

Reply to DanTex (Reply #20)

Wed Oct 12, 2016, 04:05 PM

Oct 2016

http://www.latimes.com/nation/politics/trailguide/la-na-trailguide-updates-trump-falls-behind-in-the-usc-la-times-1476283455-htmlstory.html

DanTex

(20,709 posts)

22. Thanks.

Reply to muriel_volestrangler (Reply #21)

Wed Oct 12, 2016, 04:20 PM

Oct 2016

I guess it means that if the two lines are in that region, then they are separated by less than the traditional margin of error.

Although if they are doing it right, that would be the MoE for the difference between the numbers, which is generally close to twice as large as the MoE for one individual number. So it's possible that they aren't, as a whole, as reweighted as I thought.

Also, in my last post, I miscounted the number of significant figures (forgot about the first digit). I actually agree with them presenting 3 digits, one after the decimal, for the I reasons said.

Adrahil

(13,340 posts)

23. Not completely...

Reply to DanTex (Reply #10)

Wed Oct 12, 2016, 04:43 PM

Oct 2016

The basic design might be sound, but this particular sample shows how vulnerable the design is to a bad sample. That's a serious weakness.

DanTex

(20,709 posts)

24. Well, all designs are vulnerable to a bad sample.

Reply to Adrahil (Reply #23)

Wed Oct 12, 2016, 04:52 PM

Oct 2016

Other pollsters have outliers too. The problem is LAT is conducting a panel, so now they have the same sample for the entire election season, and they got a bad one. Bad luck, but it could happen to any panel.

Adrahil

(13,340 posts)

26. That's true to an extent, of course...

Reply to DanTex (Reply #24)

Wed Oct 12, 2016, 05:05 PM

Oct 2016

But as this article makes clear, this poll design is DOUBLY vulnerable.

First, it weights demo groups to a fine level of detail, leading to weightings far in excess of what's normally seen, meaning that if you have a bad sample, its effect is magnified. A lot.

Second, if you DO get a bad sample, the design of this poll means you're stuck with it, meaning that sample bias is perpetuated, rather than averaged out.

That's a pretty big flaw, IMO.

DanTex

(20,709 posts)

28. I don't agree that it's doubly vulnerable.

Reply to Adrahil (Reply #26)

Wed Oct 12, 2016, 05:35 PM

Oct 2016

If they did the calculations correctly, which I assume they did, their vulnerability to a bad sample is accurately characterized by their margin of error. Yes, by less finely dicing up the demo groups, they could have reduced their vulnerability, which would have reduced their MoE. On the other hand, they had a relatively large sample, at 3000, which made up for that.

They would have been equally vulnerable if they went with a more typical sample size of 1000, with less fine dicing and reweighting. But in the end, they got unlucky with the sample, which could have happened to any poll.

And the fact that they are stuck with it is unfortunate, but that's not due to the weighting, it's because they are performing a panel and not a bunch of separate independently sampled polls.

Adrahil

(13,340 posts)

30. Not they are not stuck with it due to weighting...

Reply to DanTex (Reply #28)

Wed Oct 12, 2016, 08:15 PM

Oct 2016

That's the panel, as you said. And it's also true that the MoE can account for sampling errors, though in this case, it seems they are probably hanging onto the edge of that MoE. It is NOT clear if the poll accounts for weird "previous voting" weighting. I mean, if you look in the article, they show what the poll looks like with usual weighting and without the weird vote reporting weighting.

I don't advocate throwing out data just because one disagrees, but I think it's clear their design choices have led to their extreme outlier status.

DanTex

(20,709 posts)

31. Agreed about the "previous voting" weighting.

Reply to Adrahil (Reply #30)

Wed Oct 12, 2016, 08:28 PM

Oct 2016

I think that was an error, and a preventable one: after an election people are more likely to retrospectively claim they voted for the winner, so weighting according to previous vote will predictably result in overweighting the GOP. So, yeah, that was a bad choice by them.

But the way they split up the demo in very fine ways, with their large sample size, I think is defensible even if it resulted in extreme overweighting for some respondents.

According to the article, it seems that about half of their systematic error is due to the "previous vote" weighting, and the other half is due to demo splitting. I think the first half can be attributed to poor methodology, but the second half I think is mainly bad luck. Even with standard demo weighting, you can end up with a skewed sample, and you can also end up with a sample that is more skewed with standard demo weighting than with their finer grained demo subgroups.

qdouble

(891 posts)

5. lol, well the overall flaw with that poll is they just poll the same group again and again...

Reply to scheming daemons (Original post)

Wed Oct 12, 2016, 02:13 PM

Oct 2016

so if it initially had a pro-Trump bias, it will keep it, even if he's looking much worse in every other poll.

Bernardo de La Paz

(60,320 posts)

6. Not a flaw. Feature that makes it a DIFFERENT kind of poll: a Tracking Poll.

Reply to qdouble (Reply #5)

Wed Oct 12, 2016, 02:21 PM

Oct 2016

That's what you have to do to make a Tracking Poll: Poll the same people.

So ignore the absolute numbers on it and watch the swings. It will tend to give a more accurate view of swings and a less accurate view of actual numbers.

qdouble

(891 posts)

8. It's clearly a flaw in it being presented as being representative of popular opinion when it is not.

Reply to Bernardo de La Paz (Reply #6)

Wed Oct 12, 2016, 02:26 PM

Oct 2016

It should be obvious from my first post that I understand the methodology is different than other polls.

Bernardo de La Paz

(60,320 posts)

9. Yes & NO. The poll is NOT flawed. The media are flawed (or worse). . . . . nt

Reply to qdouble (Reply #8)

Wed Oct 12, 2016, 02:33 PM

Oct 2016

Adrahil

(13,340 posts)

12. That's reasonably true....

Reply to Bernardo de La Paz (Reply #6)

Wed Oct 12, 2016, 02:40 PM

Oct 2016

Nate Silver has said that.... that the poll is still useful for trends, BUT.... as this article shows, the very unusual weightings and the affect of those weightings means the poll can miss trends. Virtually every other poll showed a significant move to Clinton after the first debate, but this one missed it. It's odd weighting and bad sample explain that. It doesn't mean the poll is useless, but it does significantly lower it's value.