There's a quote I like that I think came from Marvin Minsky, "the US needs a Department of Homeland Arithmetic." We're protecting ourselves from all the wrong risks.
I don't understand why Marvin Minsky hates America so much that he wants to put the government in charge of assessing such risks. The government is an incredibly complex device for distilling a tiny amount of individual ignorance from each voter into a moonshine of batshittery.
Perhaps you have more effect by voting with (i.e. switching) TV channels. Public opinion seems to be very important for government policy, and popular shows can be, and are, seen as a barometer for it, I guess.
Uh, wouldn't this actually be called something like "the US Department of Education"?
It's been a rousing success, too - it's allowed our Ministry of Truth to claim things like "comrades! Our production of students that pass the standardized test is up 85% this quarter!".
It's all about families, man. Good families teach people what's really important - not federal departments of anonymoustaxeatingbureaucrat.
People didn't worry about that (i.e. formal education systems) until quite recently, but I guess what's "important" has gotten more complex and the rate of "dysfunction" has increased.
"Here's how that works: imagine that you've got a disease that strikes one in a million people, and a test for the disease that's 99% accurate. You administer the test to a million people, and it will be positive for around 10,000 of them – because for every hundred people, it will be wrong once (that's what 99% accurate means). Yet, statistically, we know that there's only one infected person in the entire sample. That means that your "99% accurate" test is wrong 9,999 times out of 10,000!"
No, it means that the "99% accurate" test is wrong 9,999 times out of 1,000,000. It would be clear to anyone when stated that way. What's counterintuitive is the author's statement of the result, not the result itself.
A better wording is that your chance of having the disease, given a positive result from the test, is about 1/10,000 (0.01%).
That's a huge increase from the prior 0.0001% chance of having the disease, but it's still not flat-out terrifying. Repeat testing can weed out the false positives at a speed proportional to its accuracy.
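To spell that out with Bayes' theorem (a quick check, assuming "99% accurate" means a 1% false-positive rate, which the article never actually specifies):

    # Bayes' theorem with Doctorow's numbers: a 1-in-a-million prior,
    # assuming "99% accurate" implies a 1% false-positive rate.
    prior = 1e-6
    p = 0.99 * prior / (0.99 * prior + 0.01 * (1 - prior))
    print(p)  # ~0.000099 -- about 1 in 10,100, i.e. roughly 0.01%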
If the test is picking up something in the person being tested, then yes, you'll get the same result every time and repeated testing proves nothing. But you can still repeat using other tests.
If the test gives false positives purely at random, then repeated testing will help. Say the test is wrong 50% of the time, and you do the test five times. If you get the same result every time, and the errors are independent, then the chance that all five runs were wrong is (50/100)^5, so you can be (1 - (50/100)^5) × 100 ≈ 97% sure of the results.
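For what it's worth, here's a sketch of how repeated testing plays out if you push it through Bayes' theorem, assuming independent errors and treating "99% accurate" as both the sensitivity and the specificity (my assumption; the article doesn't separate the two):

    # Posterior probability of disease after k independent positive results,
    # starting from the 1-in-a-million prior discussed above.
    def posterior(prior, accuracy, k):
        p_pos_sick = accuracy ** k        # P(k positives | sick)
        p_pos_well = (1 - accuracy) ** k  # P(k positives | well)
        num = p_pos_sick * prior
        return num / (num + p_pos_well * (1 - prior))

    for k in range(1, 5):
        print(k, posterior(1e-6, 0.99, k))
    # 1 -> ~0.0001  (the one-in-ten-thousand figure above)
    # 2 -> ~0.0097
    # 3 -> ~0.49    (three agreeing positives is roughly a coin flip)
    # 4 -> ~0.99

The base rate is punishing: even with a very accurate test, it takes several independent positives before the diagnosis is more likely than not.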
"But the fact is that attacks by strangers are so rare as to be practically nonexistent. If your child is assaulted, the perpetrator is almost certainly a relative."
Maybe that statement is true in today's world. But for tens of thousands of years, while our brains evolved, I would guess attacks from strangers were a lot more common.
The causation implied here is unlikely at best. More likely might be that until the last century no one had to deal with numbers large enough to need these kinds of statistics.
As someone who deals in diagnostic tests all day, I can tell you nobody's diagnosed on one test. There's a similar problem I worked out on the odds of getting into medical school. I'm sure this is elementary stats and has someone's name on it, but I'll call it the acceptance problem: how many medical schools do you have to apply to in order to have a 90% chance of getting into medical school?
For any school there is an acceptance quotient:
Q = (acceptances sent out)/(number of applications received)
For any given student applying to schools 1 through n, the goal is getting at least one acceptance, and applying to more schools, mathematically, can't possibly hurt in the closed case (neglecting social engineering, time spent on applications, etc.), so the chance of acceptance, Ca, approaches 1 with every new application in the following fashion:

    Ca = 1 - (1 - Q1)(1 - Q2)...(1 - Qn)
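As a sketch of the answer to the acceptance problem (assuming, unrealistically, that schools decide independently and share a single quotient Q; the 10% figure is invented for illustration):

    import math

    # Smallest number of independent applications needed so that
    # Ca = 1 - (1 - Q)**n reaches the target probability.
    def applications_needed(Q, target=0.90):
        return math.ceil(math.log(1 - target) / math.log(1 - Q))

    print(applications_needed(0.10))  # Q = 10% per school -> 22 applications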
Similarly, if the sensitivity of a test is 90%, that means the test identifies 9 of every 10 people with the diagnosis. If I administer n different tests, each with a sensitivity of S, then the chance of accurately diagnosing the disease, Cd, goes up with each additional positive (assuming the tests are independent) but never gets to 1.
So let's say you are doing genetic testing, and any one gene is 1% sensitive for the disease. If you tested 300 genes, you could be no more than 95% certain of the diagnosis:
(1 - (1 - .01)^300) × 100 = 95.09591...%
Now, if your genetic tests were 5% sensitive, your panel could be no more than 95% accurate with 59 tests.
If your test was 50% sensitive, your panel could consist of just 5 tests and already be more than 95% accurate (1 - .5^5 = 96.875%).
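A quick way to sanity-check those panel sizes (a sketch that leans on the same independence assumption questioned downthread):

    import math

    # Minimum number of independent tests, each with sensitivity s, for the
    # panel's combined sensitivity Cd = 1 - (1 - s)**n to exceed the target.
    def panel_size(s, target=0.95):
        return math.ceil(math.log(1 - target) / math.log(1 - s))

    print(panel_size(0.01))  # 299 (the 300 above lands at 95.1%)
    print(panel_size(0.05))  # 59
    print(panel_size(0.50))  # 5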
Of course, if some of the tests are negative, things get more complicated. One of the problems with these data sets is that we have no idea how predictive they are. You can't even calculate the predictive power of the database. There simply haven't been enough events. Then we get into surrogate measures (how many were positive on tests 1 - n and were found to have razor blades in their homes, etc).
The claim that these databases can't be effective isn't true. They could be. P might also equal NP. Whether the hypothesis is strictly true or not, the vague but real set of 'practical concerns' suggests that the truth of the hypothesis is sufficiently difficult to test as to render the null hypothesis the de facto assumption until proven otherwise.
The assumption of independence underlying the math you're doing is mathematically convenient but probably outright false. As one example, imagine the case where all n schools use the exact same admissions criteria. Unless you're sending a random application to each school, your math is shot; you will either get into every school or none of them. I won't even go into the genetic independence issue.
Every mathematical model is a false representation of reality. The predictive accuracy must be validated by experiment. And if you follow the link I referenced, you'll see there's actually some pretty strong data to start from in this case.
I believe the worst damage to statistics is specious reasoning. My favourite chocolate promotion is Mars' '1 in 6 Wins a Free Bar' (currently on here in Oz). If I buy 6 bars, most people would assume I would win once. In fact, I have only a 2/3 chance of winning a free one:
1 - (5/6)^6 = 1 - (5^6/6^6) ≈ 0.665
Buy 12 bars, and there's still a more than 10% chance I won't have won yet...most of the chocolate-buying government-voting lottery-praying public would be stunned.
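A quick check of those numbers (assuming a genuinely independent 1-in-6 chance per bar, which is itself generous to Mars):

    # P(at least one winning bar among n independent 1-in-6 chances)
    def p(n):
        return 1 - (5 / 6) ** n

    print(p(6), p(12))  # ~0.665 and ~0.888 -- an 11% chance of 12 losers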
Although the average number of bars you'd need to buy before you win is indeed six. I think almost no one would put the odds of actually winning one by buying six at 100%.
Why not? It's a one-in-four (or one-in-three, depending on who you believe) chance of being rich in a few years--that means that in the worst case scenario, if you keep at it, starting over when you fail, in about 10-15 years you're very nearly guaranteed to be rich. And you almost certainly won't starve in the process.
Of course, that assumes a reasonable level of intelligence, education, and drive.
Startups are about the best game going, as far as I can tell--I wouldn't be playing if the game was rigged against me (more than a little, anyway...sure, small companies have higher relative regulatory burden, but on the whole the technology game is actually rigged in favor of new companies, from a growth perspective).
I don't get how you're guaranteed to be rich in 10-15 years. If it's a 1/4 chance of being rich every x years, it's still just 1/4 chance over the span of 10-15 years, right?
However, I'm going to guess that you mean every time you try is going to influence the next time you try for the better (as evidenced by some paper about higher success rates for 2nd+ time entrepreneurs I remember on here), which makes sense. You learn from your mistakes, you make contacts, you have a better view of the market--so it shouldn't stay one in four every time you try.
No, just a mismatch in understanding your wording. With your original wording, it didn't make sense, since each roll would have the same probability, no matter how many times you rolled. That's why I figured you meant that rolls weren't independent of each other.
But it seems that you mean: what's the chance that, given x number of rolls, the very last one is a "1" (assuming you only need/want to get rich once)? As the number of rolls increases, the chance of that scenario (a string of non-1s with the last one being a 1) becomes smaller and smaller when taken as a whole.
How else could I possibly mean "keep trying for 10-15 years"? One can't take 2-5 year increments of your life in isolation, since, as you've noted, you only need to get rich once to be rich.
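For concreteness (treating each startup as an independent 1-in-4 roll, which is exactly the assumption this thread is arguing about):

    # P(at least one hit in k independent tries at p = 1/4)
    for k in range(1, 6):
        print(k, 1 - 0.75 ** k)
    # 1 -> 0.25, 2 -> 0.44, 3 -> 0.58, 4 -> 0.68, 5 -> 0.76
    # ~70-75% after 4-5 attempts: good odds, though short of a guarantee.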
The Internet! Where do you get your dubious information from?
Seriously, though, there have been a few studies of varying degrees of reliability that indicate that the new-technology business failure rate over five years is quite a bit smaller than the old "9 in 10 startups fail" wives' tale would have us believe. I wouldn't put significant weight on any particular piece of data, but it seems to be pretty consistently in that range whenever people who I would trust to know the numbers (VCs, angels, journalists covering the field, successful and famously unsuccessful entrepreneurs) talk about it.
And, among my own peers here in the valley who started during WFP07, about 1/7 of them are already rich (by some definition of rich). There were 21 groups, and I believe 3 have had exits, and I'm certain that Octopart, Weebly, Buxfer, Heysan, and Virtualmin have not come fully to fruition yet. Tsumobi might even surprise folks, as they're still slaving away in their secret underground lab in the Balkans (Josh may have actually said, "Boston", it was hard to hear at the Startup School reception due to the size of the crowd). YC certainly makes a notable improvement in the outcomes of their startups, but it's not magical, so I don't think it's a crazy idea to look at YC startups as at least somewhat representative of tech startups in general--where "tech startup" means, to me, folks who actually file the paperwork, build something, and get it into the hands of users...until you've done that, you're just another dork with a big idea (and those probably fail at a much higher rate than 9 in 10).
Anyway, we're only a year and a half into the experiment with the WFP07 group, and I expect the numbers will probably end up in the 1/3 to 1/2 range.
Adam has explained to me why their approach was different from Hecl, but I won't attempt to reproduce that explanation, since I don't actually know what Hecl or Tsumobi are all about. I have vague notions, and when I'm actually talking to Adam or Josh about it it all makes beautiful sense...but then they stop talking, the fog returns, and I have no idea what it is they're building.
What I'm saying is, they're really smart guys working in a field that I know almost nothing about, doing work that walks a razor-thin line between "research" and "product". Thus, one of their biggest problems in reaching a market, reaching investors, or reaching developers is making what they're working on into a concrete solution to a real-world problem that everyone (or at least their customers) can understand quickly. I think they'd be a bargain for anyone that hired them (either by investing in them or acquiring Tsumobi) because they are extremely smart kids with huge ideas, but I'm not sure how many people will see that based on what they're building.
And, while I'm pontificating, I don't think I'd be crazy to suggest that the best thing they could do would be to get their current code into the hands of some customers--even just a few. Because nothing guides you to providing value like having customers. And the more they pay (or the more ownership they have, if it's an Open Source project), the more value they demand...and that's a good thing when it comes to finding a need and filling it.
I guess it depends on what your goal is. My goal is making this startup work, but I'm in a dysfunctional market. I think I'd rather not know the real odds.
Riding the subway earlier today, I watched a guy get on and start preaching how "the lord saves" and "you must embrace Jesus." I wish math could create equivalent evangelism. I'd love to live in a world where a guy gets on the subway and starts preaching the Pythagorean theorem.
If you want a profound analysis, he wrote a (fiction) book recently on this subject, and put it out on the internet under a CC license. It's an enjoyable read; he pretty much distilled a few bits from the book to make that article.
I think it was about as in-depth as you are likely to get published in a (UK) national newspaper. One small but well-written step in the fight against the innumerate sensationalists that dominate the press and no doubt contribute to the populist over-reactions from the UK government.
Something doesn't seem right about Doctorow's example of attacks on children. He writes, "But the fact is that attacks by strangers are so rare as to be practically nonexistent. If your child is assaulted, the perpetrator is almost certainly a relative (most likely a parent)."
I'm sure that's true, but it doesn't answer the real question. For most parents, the question is: given my child's particular environment, what is the greatest threat to him?
Doctorow is saying: given that your child was attacked (and no additional information), the attacker is more likely than not to be a relative. That seems backwards.
He's stating that P(Was Relative | Child Attacked) is relatively high, especially compared to P(Was Total Stranger | Child Attacked). This means that, if you use Bayes' Theorem correctly in everyday life, it should shift your suspicion away from the random photographer.
Of course, since P(Child Attacked) is so very low it's still not a huge deal.
It's not really backwards. That sort of inversion is exactly part of Bayes' Theorem.
If you are related to X people, then the odds that one of them attacks your child are greater than the odds for anyone else on the planet.
Picking numbers from thin air:
Let's say you are related to 30 people, know 200, and encounter 10,000 random strangers, and there's a 1 in 100 chance of an attack. Well, the odds that a specific random stranger attacks your child, given the above assumptions, are less than 1/2,000,000. The odds that a specific person you know attacks your child would be less than 1/20,000, and the odds that a given relative attacks your child would be more than 1/6,000. Now who would you focus on? (What about a predator with a 50% chance of attacking someone? Well, he is still under 1/6,000, because there are so many other people for him to attack.)
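Roughly how those thin-air numbers can fall out (the split of who commits the attacks below is my own illustrative guess, not the parent's):

    # Illustrative only: per-person odds when a child faces a 1-in-100
    # chance of attack. The attack shares below are invented assumptions.
    p_attack = 1 / 100
    groups = {              # name: (head count, assumed share of attacks)
        "relative": (30, 0.50),
        "known":    (200, 0.45),
        "stranger": (10_000, 0.05),
    }
    for name, (count, share) in groups.items():
        print(name, p_attack * share / count)
    # relative ~1/6,000; known ~1/44,000; stranger ~1/20,000,000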
PS: Given my child's particular environment, you should focus on relatives and people you know.
It's not just the probability of an occurrence that leads us to behave in an unreasonable manner. The other important thing is the impact of that 'rare occurrence'. Most critics don't seem to take that into account at all...
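In expected-value terms (all figures invented purely to illustrate):

    # Expected cost = probability x impact; a rare catastrophe can match
    # or dominate a common nuisance. Figures are made up for illustration.
    rare   = 1e-6 * 10_000_000_000  # 1-in-a-million event costing $10B
    common = 1e-2 * 1_000_000       # 1-in-100 event costing $1M
    print(rare, common)             # both come to $10,000.0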
Many if not most citizens of the USA do not understand the basic organization and functions of their government. Most of them cannot name a single sitting Supreme Court justice, have never read the Constitution, and do not know how a bill becomes a law.