August 12, 2008
How We Choose Polls to Plot: Part II
[Update: In Part III of this thread, Mark Blumenthal answers to some of the comments below and poses a new question].
By Charles Franklin on August 12, 2008 10:16 AM | Permalink
Comments
Your comment that LV skews in one direction scares me somewhat. Is it real, or is it an effect of the LV screening?
In general, do you worry at all that there can be correlations between the LV screens and the poll results? How are these LV filters constructed? Some algorithm applied to personal information and previous voting record questions, I would guess. Since things like age, gender, education, ... all correlate with political preference, if these are also then part of the LV screen, how do you disentangle the effect?
(Of course, if the LV screen consists of a single "are you likely to vote?" question, the point is moot.)
In physics we worry about "trigger bias" or "filter bias", and we estimate this by having a family of filters with varying levels of selection, so one can judge how warped the answer is getting. In an ideal world the pollster would give you a family of LV screens (say, LV_soft, LV_medium, LV_hard) and you could judge.
Amit
Charles, you ask, "Do LV samples more accurately predict election outcomes than do RV samples?"
Isn't that the wrong question, though, except for polls taken just before Election Day? After all, pollsters continually tell us that their results are *not* a prediction, but a "snapshot" of *current* preferences.
The problem is that to get that "snapshot" they ask a question about a fictitious event--an election held today. But of course if the election really were held today, some people who tell the pollsters that they haven't been taking much interest in the campaign--one common screen question for likely voters-- would be very interested indeed. (For that matter, some people who aren't even registered now might long since have registered.) So to defend likely voter models by saying "They aren't intended to predict whether the respondent will vote in November but whether he would vote today" is in effect to pile one fiction on top of another.
Looking at the plot, it appears that Likely Voters show the highest variability as a function of time, while Registered Voters show the least. Is there some reason why LVs should be more volatile than RVs? If not, shouldn't one suspect that the higher variability of the LV votes is an artifact of the LV screening process?
It would be great to see trend estimates for the different types of polls, since RV and LV polls are measuring different potential electorates. Actually, isn't it the case that all RV polls are measuring the same potential electorate, while the various LV polls are measuring multiple and different potential electorates? What's nice about the RV polls is that at least the instrument tapping the potential electorate ("are you registered to vote?") is stable and consistent across polls, and we can be reasonably confident that different pollsters are actually mapping the same potential. On the other hand, what's nice about the LV polls is that the pollster's historical knowledge about who actually goes to vote (why the pollster's get paid the big bucks) may increase the predictive capacity of the poll. As a reader of the LV polls, it would be nice to know the criteria used to define the LV population. Perhaps pollster.com should only include LV polls that publish these criteria. Maybe all of them do.
@ Amit:
There's good reason to think that the LV screens correlate highly to at least one demographic factor: age. For example, CNN's current screen asks about prior voting habits. People who have recently turned 18 obviously haven't voted before. Of course, there is also a strong correlation between age and candidate preference. I've also seen it argued that the "cutoff" method CNN uses (only counting the most likely voters rather than weighting choices by probability of voting) exacerbates this bias. Nate on fivethirtyeight.com has a post about this potential "long tail" effect.
@ Independent:
I think there has been some research showing that LV screens introduce higher volatility. This may be because part of some screens (including CNN's current screen) is current interest in the race, which is of course highly volatile.
What the trend estimate likely vs registered voting intention is showing is what happened for the most part in the primaries Obama polled less in the actual poll than in the opinion polls and certainly what happened in the last General Kerry polled less in the actual. The question is how much pollsters have refined. Rasmusson looks to have skewed slightly further towards the Repubs as well.
The bottom line is how you are presenting does allow us to extrapolate in a one stop fashion snapshots of our own without necessarily "cherry picking"!
Prof Franklin,
Reasonable defense for your decisions, especially in view of the fact that most pollsters don't report all three sets of results.
However, I was struck by your comment:
"...If a pollster doesn't "believe" their LV results, then it is a strange professional judgement to report them anyway. If they think that RV results "better" represent the electorate than their LV results, they need to reconsider why they are defining LV as they do..."
I couldn't agree more. And with that in mind, I was struck by Frank Newport's "defense" of the recent USA Today/Gallup poll that had McCain ahead among LV's and trailing among RV's. In effect, despite trumpeting the LV results in USA Today, Newport claimed that the RV results were more reliable.
I realize that pollsters may not control (or even influence) the editorial decisions of their clients,(e.g. USA Today), but it is rather disconcerting to see poll results that are effectively disowned by the firm (e.g. Gallup) who conducted the poll.
One further comment. I'd be much less skeptical of LV screens (especially this far ahead of the election) if (a) pollsters routinely published the details of their LV screens (which they don't); the LV screens were rougly comparable (which they apparently are not); and (c) if LV weighting weren't so absurdly "kludgy" (as Gallup's, at least, appears to be.
thoughtful you say that "in the primaries Obama polled less in the actual poll than in the opinion polls"
According to a pro-Obama commentator at http://www.fivethirtyeight.com/2008/08/persistent-myth-of-bradley-effect.html this is false:
"On average, Barack Obama overperformed the Pollster.com trendline by 3.3 points on election day." Unfortunately, that commentator included caucuses as well as polls in his chart, but I think the basic point remains valid: people have it so much etched in their memory how Obama did worse than the polls indicated he would in New Hampshire that they forget that there were many other states where he actually outperformed the polls.
As for 2004, Bush outperformed the final polls (I mean the final polls in the week before the election, not that ridiculous exit poll) but only very slightly. They showed on average a Bush victory by 1.5%; he won by 2.45%. http://www.mysterypollster.com/main/2005/01/final_results.html
@David_T You are quite right. Caucuses are different to the polling booths.I'lltry to pull out other examples other than NH!
Just a note that earlier this afternoon I posted Part III of this thread, which addresses the comments from Amit and Independent
I smiled at "it is a good problem in general." Yeah, like an especially sweet Sudoku.
Thinking about the "accuracy" of various poll estimates at this point in the campaign gets pretty metaphysical, pretty fast. My gut tells me that some LV screen ought to be better than none at all -- people who say they are unlikely to vote, probably are. Of course that doesn't mean that every LV screen is better than none at all. But I agree that there's a lot to be said for a simple, inflexible rule at least as a baseline.
I'd say the difference between LV and RV trendlines here is pretty large, although I take your point. Can this become a semi-regular feature?
We all look forward to the era where pollster.com users can eliminate or "drag" individual points and watch the trendline update dynamically... no, sorry, that's just sick.
Posted on August 12, 2008 12:32 PM