{"id":1779,"date":"2012-05-21T10:57:54","date_gmt":"2012-05-21T14:57:54","guid":{"rendered":"http:\/\/scientopia.org\/blogs\/goodmath\/?p=1779"},"modified":"2012-05-21T10:57:54","modified_gmt":"2012-05-21T14:57:54","slug":"willfull-ignorance-about-statistics-in-government","status":"publish","type":"post","link":"http:\/\/www.goodmath.org\/blog\/2012\/05\/21\/willfull-ignorance-about-statistics-in-government\/","title":{"rendered":"Willfull Ignorance about Statistics in Government"},"content":{"rendered":"<p> Quick but important one here.<\/p>\n<p> I&#8217;ve repeatedly ranted here about ignorant twits. Ignorance is a plague on society, and it&#8217;s at its worst when it&#8217;s <em>willful<\/em> ignorance &#8211; that is, when you have a person who knows nothing about a subject, and who refuses to be bothered with something as trivial and useless about <em>learning<\/em> about it before they open their stupid mouths.<\/p>\n<p> We&#8217;ve got an amazing, truly amazing, example of this in the US congress right now.<br \/>\nThere&#8217;s a &#8220;debate&#8221; going on about something called the American Community Survey, or the<br \/>\nACS for short. The ACS is a regular survey performed by the Census administration, which<br \/>\nmeasures a wide range of statistics related to economics.<\/p>\n<p> A group of Republicans are trying to eliminate the ACS. Why? well, let&#8217;s put that question aside. And let&#8217;s also leave aside, for the moment, whether the survey is important or not. You can, honestly, put together an argument that the ACS isn&#8217;t worth doing, that it doesn&#8217;t measure the right things, that the value of the information gathered doesn&#8217;t measure up to the cost, that it&#8217;s intrusive, that it violates the privacy of the survey targets. But let&#8217;s not even bother with any of that.<\/p>\n<p> Members of congress are arguing that the survey should be eliminated, and they&#8217;re claiming that the reason why is because the survey is unscientific. According to Daniel Webster, a representative from the state of Florida:<\/p>\n<blockquote><p>\nWe\u2019re spending $70 per person to fill this out. That\u2019s just not cost effective, <em>especially since in the end this is not a scientific survey. It\u2019s a random survey.<\/em>\n<\/p><\/blockquote>\n<p> Note well the emphasized point there. That&#8217;s the important bit.<\/p>\n<p> The survey isn&#8217;t cost effective, the data gathered isn&#8217;t genuinely useful according to Representative Webster, because <em>it&#8217;s not a scientific survey<\/em>. Why isn&#8217;t it a scientific survey? Because it&#8217;s <em>random<\/em>.<\/p>\n<p> This is what I mean by willful ignorance. Mr. Webster doesn&#8217;t understand what a survey <em>is<\/em>, or how a survey <em>works<\/em>, or what it takes to make a valid survey. He&#8217;s talking out his ass, trying to kill a statistical analysis for his own political reasons without making any attempt to actually understand what it is or how it works.<\/p>\n<p> Surveys are, fundamentally, about statistical sampling. Given a large population, you can create estimates about the properties of the population by looking at a <em>representative sample<\/em> of the population. For example, if you&#8217;re looking at the entire population of America, you&#8217;re talking about hundreds of millions of people. You can&#8217;t measure, say, the employment rate of the entire population every year &#8211; there are just too many people. It&#8217;s too much information &#8211; it&#8217;s pretty much impossible to gather it. <\/p>\n<p> But: if you can select a group of, say, 10,000 people, whose distribution matches the distribution of the wider population, then the data you gather about them will closely resemble the data about the wider population. <\/p>\n<p> That&#8217;s the point of a survey: find a <em>representative<\/em> sample, and take measurements of that sample. Then, with a certain probability of correctness, you can infer the properties of the entire population from the properties of the sample. <\/p>\n<p> Of course, there&#8217;s a catch. The key to a survey is the sample. The sample must be <em>representative<\/em> &#8211; meaning that the sample must have the same properties as the wider population of which it&#8217;s a part. But the point of survey is to <em>discover<\/em> those properties! If you choose your population to match what you believe the distribution to be, then you&#8217;ll bias your data towards matching that distribution. Your sample will only be representative if your beliefs about the data are correct. But that defeats the whole purpose of doing the survey.<\/p>\n<p> So the scientific method of doing a survey is to be random. You don&#8217;t start with any preconceived idea of what the population is like. You just randomly select people in a way that makes sure that every member of the population is equally likely to be selected. If your selection is truly random, then there&#8217;s a high probability (a measurably high probability, based on the size of the sample and the size of the sampled population) that the sample will be representative. <\/p>\n<p> Scientific sampling is <em>always<\/em> random. <\/p>\n<p> So Mr. Webster&#8217;s statement could be rephrased more correctly as the following contradiction: &#8220;This is not a scientific survey, because this is a scientific survey&#8221;. But Mr. Webster doesn&#8217;t know that what he said is a stupid contradiction. Because he doesn&#8217;t <em>care<\/em>. <\/p>\n","protected":false},"excerpt":{"rendered":"<p>Quick but important one here. I&#8217;ve repeatedly ranted here about ignorant twits. Ignorance is a plague on society, and it&#8217;s at its worst when it&#8217;s willful ignorance &#8211; that is, when you have a person who knows nothing about a subject, and who refuses to be bothered with something as trivial and useless about learning [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[2,8,52],"tags":[181,219,220,227,315],"class_list":["post-1779","post","type-post","status-publish","format-standard","hentry","category-bad-math","category-bad-statistics","category-politics-2","tag-ignorant-twits","tag-random-sampling","tag-randomness","tag-sampling","tag-statistics"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_shortlink":"https:\/\/wp.me\/p4lzZS-sH","jetpack_sharing_enabled":true,"jetpack_likes_enabled":true,"_links":{"self":[{"href":"http:\/\/www.goodmath.org\/blog\/wp-json\/wp\/v2\/posts\/1779","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/www.goodmath.org\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/www.goodmath.org\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/www.goodmath.org\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/www.goodmath.org\/blog\/wp-json\/wp\/v2\/comments?post=1779"}],"version-history":[{"count":0,"href":"http:\/\/www.goodmath.org\/blog\/wp-json\/wp\/v2\/posts\/1779\/revisions"}],"wp:attachment":[{"href":"http:\/\/www.goodmath.org\/blog\/wp-json\/wp\/v2\/media?parent=1779"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/www.goodmath.org\/blog\/wp-json\/wp\/v2\/categories?post=1779"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/www.goodmath.org\/blog\/wp-json\/wp\/v2\/tags?post=1779"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}