In Response To Nate Silver

Someone in the comments was kind enough to point out that Nate Silver has penned a little ditty over at FiveThirtyEight.com entitled “Bad Math and the Bradley Effect”, which purports to be a response to my earlier article speculating on what the scenario is for McCain to win. Ideally I’d spend my Sunday early afternoon doing something more productive, but since he’s basically challenging my intellectual honesty by accusing me of cherry-picking polls, I suppose that it deserves a response.

Silver’s biggest complaint with my methodology seems to be that I define my dataset as something less than every election that was held during the primary season. Of course, one of the biggest challenges of any type of statistical analysis is defining your dataset. I did not go into my reasoning for my dataset selections, since (a) I was trying to explain the scenario (which I considered unlikely) for McCain winning, not explore under strict adherence to political science principles whether a given effect occurred or not and (b) it is a blog post, not an academic paper. But since it has been brought up, the reasoning was as follows.

I excluded caucus states, such as Iowa and Nevada. I think anyone with a basic understanding of the dynamic of caucuses would understand my decision to do this, and would know that it had nothing to do with whether or not the results in those states fit any particular hypothesis. They were excluded because they are relatively low turnout affairs (Iowa being something of an exception to this) with Byzantine voting rules where it is almost impossible to know the true first-choice preferences of everyone who turns out to vote. These are rules that zero states will follow in November. And the caucus states require attendees to have all-day availability, which means only a certain type of person can attend. Because of this, they are notoriously difficult to poll, and it is more likely that the pollster error is just due to a bad turnout model than it is anything else. This is a reasonable choice (as Silver actually seems to concede). 

And while it is always good to have as many datapoints as possible, in my estimation, including caucus states meant that I would be comparing apples with oranges, which is actually the last thing you want to do when hypothesis-testing. 

Silver knows this, given that in his recent piece on cellphones, he excludes a number of pollsters from his dataset, for a variety of (perfectly justifiable) reasons. While it would have given him more datapoints if he had included pollsters who only recently began polling or who conducted internet polling, Silver correctly decided that including them would damage the integrity of his data. It’s the same process with excluding caucus states.

 Next, I excluded Florida because it was a part of the Old Confederacy (see below) and also because, as we were reminded again and again in the primaries by Obama supporters, neither candidate campaigned there. It is therefore difficult to use its results as indicative of how the candidates would have fared in a full-on election.

Finally, I excluded the results from the Old Confederacy, but left in Texas. My reasoning for excluding these states is simple, and was (perhaps too succinctly) summarized in the following phrase in my post, where I described the South as a region “where [Obama] was buoyed by unusually high African American turnout...” In other words, the makeup of the electorate in those states is fundamentally dissimilar from the other states in the dataset. African Americans made up 48% of the electorate in Alabama, 49% in Georgia, 47% in Louisiana, 48% in Mississippi, 34% in North Carolina, 55% in South Carolina, 29% in Tennessee, 19% in Texas, and 29% in Virginia. This is unique to the Confederacy: The only other states where African American turnout exceeded 20% of the electorate were Delaware (29%), Illinois (21%), New Jersey (23%) and Maryland (37%).

And it is pretty clear that the African American population has a significant effect on whether Obama over- or under-performed in the polls- - the r-square when you compare AA% in a state to the pro- or anti-Obama effect in the state is .45, with a t-stat of 4.6 for the variable (which seems to argue for the existence of such an effect). Since the goal is to figure out how white voting behavior will change on election day relative to the polls, if at all, it seemed to argue for excluding states where whites make up an unusually small portion of the electorate. I guess another approach in states like NC and SC would be to compare the proportion of white voters Obama was predicted to get by particular pollsters versus what exit polls showed, but this has its own problems. And I’m lazy.

Silver argues that “the particular geographics [sic] of the Confederacy are not especially relevant electorally.” In many contexts that may be true, but in this context it is not. Having African Americans comprise around 50% of the electorate – something, incidentally, that many pollsters weren’t predicting, especially at first – would drown out any Bradley effect in a way that wouldn’t occur in other states where African Americans comprise a much smaller portion of the electorate. Moreover, the South behaved differently than the rest of the country in the primary season. You could explain 80% of the variance – at the county level! – of the voting in the South between Hillary and Obama solely by looking at the percentage of African American and college-educated voters in the county. That is unique to the South, and did not hold up elsewhere. Finally, there aren’t any states in the country that will have African Americans making up 40-50% of the electorate, except maybe Mississippi.

 I’ll also add that Silver’s assertion that Kentucky and Tennessee are two peas in a pod is just silly. Tennessee has no analogue to the Old Seventh congressional district in Kentucky, which is mining country organized by the UMW in the 30s, and which is basically an extension of West Virginia. Kentucky has no good analogue to Memphis, and its Fifth District is only a much smaller and weaker analogue to Tennessee’s First, Second, and Third Districts. And more importantly, the African American percentage in the Kentucky Democratic primary electorate was 9%, versus 29% in Tennessee. For whatever similarities they might have, their Democratic primary electorates are very dissimilar in the way that is most germane to this model.

 The decision to re-include Texas is probably the best criticism of my methodology, but it is also the one with the least overall effect, given that the Obama barely underperformed there. The reason for including it is pretty obvious if you look at the statistics above, and consider my overall reasoning for excluding the Confederacy. The AA population is comparatively small relative to the remaining Southern states, and is more akin to the general Democratic electorate. Perhaps it would have been simpler to say “exclude all states where African Americans comprised over 25% of the voting electorate,” which would have had the same effect, although Maryland would have been excluded, and would have been more consistently applied. The only problem is that any percentage applied would have been probably even more arbitrary than the methodology I chose – why not 20%, which would have excluded New Jersey? Why not 30%, which would have included TN and VA? Regardless, if it makes people feel better, we can still exclude Texas, which changes my results a couple hundredths of a point.

I’m not 100% certain, and am genuinely curious, how Silver is calculating the confidence intervals for my results, so I can’t really respond to the statistical significance charge. I must admit, however, that I find it odd for Silver to chide me for not reaching the 90th percentile in statistical significance, given that his data dredging...er...stepwise regression process demands significance only at the 85th percentile (just do cntrl-F and enter 85 to find it). The use of RCP averages versus pollster.com is much easier to defend. As you know from watching the primary results, Obama surged in late January after his South Carolina win. I’m no expert in Pollster.com’s methodology, but my impression from looking at some of their results is that they are much slower to phase out old polls than was RCP. Indeed looking at the FAQ, I’m not certain they phase out old polls at all. In some ways and in some applications, their estimates are superior to RCP’s, but in a race where you have a last minute surge by a candidate, an average that only includes the last few days’ polling is going to be the best estimate to use.

 There are several examples to point to of how this affects the results, but perhaps California is the best one. RCP’s final average for California included polls concluded only four days before the primary, which fully captured the poll bounce Obama was seeing. This is demonstrated in their chart of the race. The Pollster.com polling shows a much more gradual improvement in Obama’s polling, in part (I think) because the relatively recent, but nonetheless outdated polling from a few weeks prior was dragging their analysis down. In other words, my sense was that, especially in the Super Tuesday polls, the Pollster.com method was understating Obama’s strength in the polls, and hence overstating the degree to which he overperformed. 

Indeed, if you look at Silver’s chart, which is organized sequentially, the numbers become an awful lot redder (indicating Obama underperforming) when you get past Super Tuesday, and there is a lot less difference between his findings regarding Obama’s performance and mine. Take the following chart, which shows my result (a negative value means Obama underperformed), 538’s results, and then the difference between the two. I’ve highlighted any state where I found a pro-Obama effect at least two points higher than Silver’s in blue, and any state where I found an anti-Obama effect at least two points higher than Silver’s in red. The Super Tuesday states are between AL and TN. Super Tuesday is really where the Silver and I find different results, after that we are rarely more than a couple points off in our results. I think this is almost entirely due to the last-minute poll bounce that RCP captured, and which Pollster didn’t. 

State Oxendine Result 538 Result Difference
NH -10.9 -9.8 -1.1
SC +17.3 +14.3 +3
AL +9.2 +15.6 -6.4
AZ -2.8 -.3 -2.5
CA -10.8 -2.3 -8.5
CT +1 +5.8 -4.8
GA +17.3 +21.4 -4.1
IL -1.5 -4.1 -2.6
MA -8.4 -4.2 -4.2
MO +7 +2.4 +4.6
NJ -2.1 +.1 -2.2
NY -.3 +2.5 -2.8
TN -.3 +8.7 -9
MD +1.2 +4.7 -3.5
VA +10.5 +6.2 +4.3
WI +13.1 +10.3 +2.8
OH -3 -2.7 -.3
TX -1.8 -1.6 -.2
RI -13 -7.7 -5.3
VT -2 -.6 -1.4
MS +8.6 +9.1 -.5
PA -3.1 -1.7 -1.4
IN +3.6 +3.1 +.5
NC +6.7 +7.2 -.5
WV -6.3 -4.3 -2
KY -6.6 -.4 -6.2
OR +5.6 +5.5 +.1

 

 The criticism that the results aren’t robust if they change when the averaging mechanism is changed is also silly. If one averaging mechanism somehow systemically biases the results relative to other averaging mechanisms, which I think is the case here, then of course which one you choose makes a difference, and it should make a difference. This is especially true if the averaging mechanisms are interpreting different data, which I also suspect is the case here. 

In the end, I suppose reasonable minds can differ over whether to use Pollster.com or realclearpolitics.com. Without really knowing how Pollster’s regression works, it is probably impossible to argue it conclusively. But I really thought Silver's response to using RCP rather than Pollster was silly. It tipped me off that his principal interest is in NOT finding the Bradley Effect, rather than letting the chips fall where they may. If he finds no Bradley effect and I find a Bradley effect, there is evidence for the Bradley effect. That doesn't mean that it is there or not – and remember, the real point of the article was to speculate on what would have to happen for McCain to win, not to prove or disprove anything -- all it means is that more research must be done. 

Finally, Silver writes:

The other, more important question is why we should simply dismiss the results in the South, where Obama significantly overperformed his numbers, by 7.2 points on average, according to my definition of the region and by 9.9 points according to his…

The easy answer is that I don’t dismiss it. Had Silver bothered to read the entire article, he would see that I wrote, under the heading “Youth/African American Vote”:

Regardless, I’ve covered this to some extent here, with the salient point being this: yes, Obama will likely increase African American turnout, but the states where this could make a real difference – with the exception of Virginia – are either so deeply red or so deeply blue that it is unlikely that AAs will be game-changers. Improving Obama’s vote share in Mississippi from 40% to 43% doesn’t do him a lick of good.

Emphasis added. Silver and I seemingly agree that any pro-Obama effect from high African American turnout is likely to be muted in the general election, with AA’s share of the electorate likely to be at least halved relative to the primary election. Higher African American turnout might absolutely flip Virginia if McCain is only leading by a couple of points on election day, as he is today, and as I concede in my article. It might make a difference in North Carolina, but given that McCain is, as of this writing, leading by nine in the RCP average and five in Pollster.com, I’m not sure it will happen.  Even Silver’s own averages have McCain up five in NC, though they only have him up one in Florida (notwithstanding that only one poll this month has Obama leading, and that only three polls this month have McCain’s margin at less than five, but the shortcomings of his model are a story for another time).  I don’t think a reverse Bradley effect will have much of an effect in Florida either, where African Americans make up an even smaller portion of the population than they made up in Texas: only 12% in 2004

But as of this writing, McCain (using Pollster.com) is down three in PA, and down three in MI, and up two in OH. This all may change by election day – heck, it might change this week – but as of right now, I would take a couple extra points for McCain in PA, MI, and OH in exchange for giving up a couple of points in VA, NC, and FL. In a heartbeat, without thinking twice. 

And now I’m going to go play with my kid.

0
Your rating: None

Comments

Was it really kind...

... if my pointer ended up messing with an otherwise perfectly fine Sunday?

This is a good, thorough response.  Hope you can enjoy the remainder of your weekend.

Of course it was kind

I secretly like this kind of stuff.  Okay, not that secretly.

Great Post

Nate Silver is a great number cruncher, and I religiously follow 538.com. But He his far from perfect, and his post on your article was biased, shallow, and really downright rude. You response here was great Sean, keep up the good work.

Thanks Sean for straightening Nate Silver out.

Afetr reading its idiotic announcement that Kentucky is the same as Tennessee, I just wanted to go bitch-slap his dumbass.  As proud, native Kentuckian whoose mother is from Tennessee ( I have family in both states), I know first hand that those two states are more dissimiliar then they they similar to each other.

While 538.com has some interesting analysis that is just plain intellectually lazy and an inexcusably dumb thing to say.  Of course when you have half of the commenters on that site that typically think Democrat-controlled Big Government can do no wrong and genuine free-market capitalism can do no good, I guess you can't expect a whole lot.

If he finds no Bradley effect and I find a Bradley effect...

"If he finds no Bradley effect and I find a Bradley effect, there is evidence for the Bradley effect."

If he found no evidence that the Earth is flat and I find such evidence (ugh, like, I walked out today for miles, man, and it it s FLAT), there is evidence that the Earth is flat, even though my analysis might be completely full of shiitee.

Logic 101 eludes you, "my friend," and statistical analysis does too.  Try the same fo living and see how long you'll last.

A specialist.

 

 

 

I'm guessing you won't be returning

So this probably isn't worth it, but here goes anyways.  Yes, someone who argues that the Earth is flat because he's walked for miles and its f-l-a-t is presenting evidence that it is flat.  In fact, this evidence supported a belief that the Earth was flat for millenia. 

The problem is, Silver's response (and remember, my comment is simply in response to his comments re RCP vs. Pollster) is "I don't want to get in the details of RCP vs. Pollster.com, but changing the method changes the results, therefore his findings are suspect/incorrect."  That does not follow.  To take your horrendous analogy (since it isn't immediately obvious why Pollster.com is better than RCP, much as it is immediately obvious why mathematical calculations are better than walking a few miles) his argument is the equivalent of saying "Well, he says the Earth is flat because he's walked, but I get a different result by looking at my sundial, therefore he's wrong" without getting into why the measurements are a superior method of calculation.  The bottom line is that simply getting a different result using a different method does not mean that the initial result is questionable.  You have to go to the work to show why one method was flawed.  Which Silver failed to do.

But maybe they taught that in Logic 102.

guess again

In your statement that I quoted, you claimed that if one study produces a negative results (no evidence rejecting the null hypothesis), and another one produces a positive result (able to reject the null), the second study unequivocally produces "evidence".  This is loughable.  Long story short: Just because one method produces a positive result, that is no "evidence" as you claim, because the method can be completely flawed, or worse, perhaps biased; so can the method that fails to produce a positive result.  There is as much support for the null as for the alternative hypothesis, contrary to your claim that you provide "evidence."

You seem to agree with my criticism "The bottom line is that simply getting a different result using a different method does not mean that the initial result is questionable. "  Exactly.  But you said the opposite, that a new result contradicting a prior results already establishes evidence.  By your own logic, you are wrong.  It does not establish evidence.

By the way, no "evidence" supported the belief that the earth was flat, especially not for millennia.  Theology and tyrrany did.  The Greeks knew the Earth was not flat, and more anscient civilizations did prior to them too. 

 

 

 

Um

 

As much as I would like to engage in a discussion about the definition of "evidence" (very low standard) versus "proof" (very high standard), I can't say it would be remotely productive.  The bottom line is that the actual argument made by Silver is that because he reaches a different conclusion using a different approach, that this somehow invalidates the first approach, or means that the first approach does not yield robust findings. 

But all that this means is that, at that point, two people have presented evidence, one for a phenomenon, one against it.  Up to and until he takes the time to show why the Realclearpolitics.com averages are so horrendously biased or inadequate as to sever any logical connection between the data and the conclusions that they purport to tend to make more likely, he can't say there's no evidence to support the existence of a Bradley effect in the primaries.  To put it another way, his regression model is a giant exercise in stepwise regression run amok, and is flawed in many ways, but it is still "evidence" that Obama is presently up by 312-226 EVs (yes, I know that's not quite what his model says, but that's the simplest way of putting it).  The fact that I think he makes mistakes and the fact that my model shows a much closer race doesn't mean that there's no evidence for a 312-226 Obama lead, nor, for that matter, does the existence of my model mean that his model's findings aren't robust.

As for the flat Earth, if you really want to be techinical the belief in a round Earth was never really stamped as most scientists and many theologans believed in it throughout the Middle Ages.  It's a rhetorical point, and I think you got the general drift.

Nate Silver's Model Is Off The Rails

 He has built in tremendous bias.  Only 1 state has Obama performing below his real world poll average.  That is Hawaii, which is proof he does not even control for the fact that Obama has a "Favorite Son" status there.

Most states he penalizes McCain 2-5 points vs his unweighted polling average.

It used to be worth reading.  It no longer is.

I think it started off as an honest attempt to gain true insight.  Now it has devolved into gerrymandering to find hidden, non-existent, Obama support by "tweaking" his model.

Objectivity says you lock down your model before the conventions begin...he decided to change his to mute the McCain bounce and is now exaggerating it to account for the fact that numbers for McCain remain favorable in the most critical states.

Look at his results for VA, NV, OH.  A complete farce.

At this point I am not sure I would bother arguing with him.  He is busy arguing that a DailyKos poll should be included in the RCP average.

That is where his mind is right now....