Pollster.com

April 22, 2007 - April 28, 2007

 

California Polls, Turnout and the Democratic Vote

Today, in our ongoing conversation about whether national surveys may be overstating Hillary Clinton's lead, let's look at some recent statewide surveys from California. In that state at least, the data suggest that if anything, the opposite may be true.

First, consider past turnout in California. On March 2, 2004, 3.1 million Californians voted in the Democratic presidential primary. Turnout amounted to roughly 14.8% of eligible adults [and 11.7% of all voting age adults -- see update below].

Now consider four recent surveys, each with a unique methodology. The Public Policy Institute of California (PPIC) conducted a survey of 2,000 California adults, March 13-20, using a traditional random-digit-dial (RDD) methodology. They used screen questions to select, first registered voters, then 498 likely Democratic primary voters (those who consider themselves "strong Democrats" or say they plan to vote in the primary on February 5, 2008). The Democratic sample amounts to 25% of the adults interviewed.

The firm SurveyUSA polled Californians, March 3-5, using an automated "Interactive Voice Response" methodology that requires respondents to answer using the keys on their touch-tone phone, but also uses the traditional RDD methodology to reach a random sample of all households with a working landline telephone in the state. They interviewered 2,400 adults and selected 562 likely Democratic primary voters; the Democrats were 23% of the adult sample.

The Field Poll, the granddaddy of all California polls, has recently shifted to so-called "registration-based-sampling" (RBS). Their March 20-31 survey was based on a sample drawn from a list of registered voters in California. They interviewed 1,093 registered voters then used screen questions to select 417 562 likely Democratic primary voters. With a back-of-the-envelope calculation (that California's 15.8 million registered voters last fall were 74% of the voter eligible population), the Democrats in the Field poll amount to roughly 28% of California eligible adults [and 23% of voting age California adults - see Update II below].

Finally, Working Californians (a labor friendly non-profit) yesterday released a survey of 400 likely Democratic primary voters in California conducted April 9-12 by Democratic pollster Mark Mellman (who polled for John Kerry in 2004 but is not currently affiliated with any of the 2008 presidential candidates). Via email, Mellman confirms that he, like most campaign pollsters, not only sampled from a voter list but takes the method one step further than Field. Rather than simply screening for likely primary voters, Mellman also selected from the registered voter lists only individuals with some past history of primary voting.

Here are the results:

04-27%20california.png

A few things stand out. First, despite their wide variation in methodology, all of these surveys use "tighter" screens than most of the national polls, and yet Hillary Clinton leads by a healthy margin in all four. She runs as strongly here as in the national surveys, despite the arguably tighter screens used by PPIC, Field and SurveyUSA.

Second, the one automated poll from SurveyUSA has Clinton leading by roughly the same margin (13 points) as the two interviewer surveys conducted at about the same time by PPIC (+11) and Field (+13). SurveyUSA produced a slightly smaller undecided (6%) than the other polls (9% to 14%), as they typically do. No evidence of a "don't tell mama" effect here.

Third, and perhaps more important, the survey with the arguably tightest screen - the one with the least potential to include non-primary voters - gives Clinton her largest lead (+19) and Obama his smallest share of the vote (19%).

UPDATE:Over at MyDD, reflecting in part on my last post on this subject, Chris Bowers has concluded that "the existing evidence no longer provides any clear support to the Inflated Clinton Poll Theory." He adds:

Simply put, there are conflicting indications as to whether Clinton or Obama would benefit more from a tighter poll sample that focused only on Democrats who are likely to make up the caucus and primary electorate, rather than all Democratic self-identifiers and leaners who are registered to vote. Given all of this, I think it is time that I move on and start blogging about other subjects again.

Agreed -- and we will do the same. See you Monday.

UPDATE II: Mark DiCamillo, director of the Field Poll, left the following comment below:

[W]hen making your "back of the envelope" calculations to compare The Field Poll's RBS sample to the other polls' RDD samples, I would take issue with your decision to calculate registered voters as a percentage of California's citizen-eligible population (74%) rather than its adult population (66%). When RDD samples are implemented, the survey universe includes all adults, not just citizen-eligibles. So, if you had calculated registered voters using its 66% share of all adults, this would have reduced the proportion of eligible Democrats sampled by The Field Poll's to 25% of California adults, rather than 28% of citizen-eligible cited in your post.

He is right, and I apologize for the oversight. The voting age population (VAP) in California is a more appropriate statistic because the voter eligible population (VEP) statistic that I used excludes non-citizens that will be contacted by an RDD survey. As DiCamillo points out, the adults contacted by an RDD survey will include non-citizens, especially in California, a state with the largest percentage of voting age non-citizens in the nation (18.9% in August 2006).

As for his estimate that registered voters are 66% of the adult population, DiCamillo may have been using older data. GMU Prof. Michael McDonald's invaluable archive of turnout statistics reports tells us that the voting age population for California was 26.6 million in the August 2006, which would make registered voters 59.5% of that population. As such, the Democrats on the Field poll represent roughly 23% of California adults.

Of course, the same point applies to my turnout calculation. The California Democratic primary turnout of 2004 amounted to 11.7% of that state's adult population at the time.

This difference does not affect my larger point: Turnout was greater for California's Democratic primary in 2004, then for Democrats nationally, yet California's pollsters are using tighter turnout screens than the national pollsters. And Clinton's lead still looks roughly the same as in the national polls.

By Mark Blumenthal on April 27, 2007 5:27 PM | | Comments (3) | TrackBacks (0)

POLL: Harris Approvals

A new Harris Interactive online survey of 1,001 adults (conducted 4/20 through 4/23) finds:

  • 28% of Americans approve of the job Bush is doing as president, the "lowest ever" since he took office; 70% disapprove.

  • 25% approve of the job Cheney is doing; 68% disapprove.

  • 22% approve of the job Majority Leader Harry Reid is doing; 52% disapprove.

See also Charles Franklin's analysis of how this poll compares to other recent job approval ratings.

By Eric Dienstfrey on April 27, 2007 4:29 PM | | Comments (0) | TrackBacks (0)

Bush Approval: Harris at 28%, Trend at 34.1%

1BushApproval2ndTerm20070424.png

After a flurry of new polls in the last 24 hours, a new Harris poll taken 4/20-23/07 finds approval at 28%, disapproval at 70%. With this addition, the estimated approval trend stands at 34.1%.

The Harris result is a good deal below the estimated trend, and below other recent polls. In part this reflects a typical Harris "house effect". With it's atypical four point question format ("Excellent, good, fair or poor" rather than "approve or disapprove"), Harris' approval results are often a bit below that of other pollsters. The plot below, however, shows that this low result is not only due to house effects. The current poll is noticeably further below the trend estimate than are other recent Harris results.

2LastSixPolls20070424.png

How far the new poll is from the trend is apparent in the next figure below, in which Harris clearly falls outside the 95% confidence interval for "normal" variability. While it is possible the new poll at 28% represents a sudden negative shift, this is quite a distance away from other contemporaneous results from CBS News, NBC/WSJ and Pew.

3BushResiduals20070424.png

In light of the outlier analysis, it seems likely that the new Harris poll does not represent a sudden shift in support for President Bush. It is more reasonable that our prior estimate of 34.7% is closer to the national mood, rather than the 34.1% resulting from inclusion of this poll. In either case, the overall picture of relatively little trend in the last 5 months holds. We have been in a period of approval moving up or down a point, but never establishing strong evidence for a trend. While it could change at any point, the post-2006 election period has so far held in rough equilibrium.

4BootApproval20070424.png

5Sentitivity2ndTerm20070424.png

Cross-posted at Political Arithmetik.

By Charles Franklin on April 27, 2007 3:37 PM | | Comments (2) | TrackBacks (0)

Tag Clouds for the Democratic Debate

Janet Harris, a friend and Pollster reader, sends an interesting bit of analysis she did on last night's MSNBC Democratic debate. Using the free site TagCrowd, she created a set of "tag clouds" that provides a visual depiction of the words used most often last night by each of the candidates.

For those not familiar with the term, you have probably seen tag clouds appearing on many web sites (and hopefully, very soon here on Pollster). They were apparently first implemented on the photo sharing site Flickr, and typically provide a visual representation of the most popular "tags" assigned to web pages. The type size of each word varies according to its frequency of usage. The larger the type size, the more often each candidate used that word.

Here are the clouds Janet created:

BidenTag400.png

ClintonTag400.png

DoddTag400.png

EdwardsTag400.png

GravelTag400.png

KucinichTag400.png

ObamaTag400.png

RichardsonTag400.png

She also created a PDF version suitable for printing.

Now of course, this is a quick blog post, which probably raises as many questions for me as it answers. Each of the clouds consists of the 50 words used most often, omitting common words like "and," "of," "the," etc. I am not sure if the scale of the words is comparable across clouds -I suspect that Professor Franklin will feel strongly that they should be. Finally, for what it's worth, Janet also sends along this total word count for each candidate:

  • 1,872 - Senator Obama
  • 1,766 - Senator Clinton
  • 1,518 - Senator Edwards
  • 1,281 - Governor Richardson
  • 1,180 - Representative Kucinich
  • 961 - Senator Biden
  • 912 - Senator Dodd
  • 753 - Senator Gravel

A few quick observations, with an assist from Janet (who is the president of the media analysis firm, Upstream Analysis):

  • Notice the more frequent use of wonkier language by Chris Dodd, particularly the use of "administration," "multinational," "stateless," etc.
  • Now contrast that to John Edwards, whose answers tend to use everyday language and deliver a message loud and clear message: "America," "believe," "united."
  • The one-issue emphasis of Kucinich and Gravel - "war" -- is obvious.

Obviously, this feature is a bit off-topic for a site devoted to polling methodology, but it does deal with the graphic analysis of political data. I can certainly see potential applications of this sort of graphic for those that conduct and transcribe focus groups and other "qualitative" analysis.

But enough wonkiness. Readers, what do you see in these clouds? Our comment section is wide open...

By Mark Blumenthal on April 27, 2007 11:07 AM | | Comments (42) | TrackBacks (0)