{"id":1502,"date":"2015-05-06T00:40:17","date_gmt":"2015-05-06T07:40:17","guid":{"rendered":"http:\/\/www.mii.ucla.edu\/causality\/?p=1502"},"modified":"2015-05-06T00:40:17","modified_gmt":"2015-05-06T07:40:17","slug":"david-freedman-statistics-and-structural-equation-models","status":"publish","type":"post","link":"https:\/\/causality.cs.ucla.edu\/blog\/index.php\/2015\/05\/06\/david-freedman-statistics-and-structural-equation-models\/","title":{"rendered":"David Freedman, Statistics, and Structural Equation Models"},"content":{"rendered":"<p>(Re-edited: 5\/6\/15, 4 pm)<\/p>\n<p>Michael A Lewis (Hunter College) sent us the following query:<\/p>\n<p>Dear Judea,<br \/>\nI was reading a book by the late statistician David Freedman and in it he uses the term &#8220;response schedule&#8221; to refer to an equation which represents a causal relationship between variables. It appears that he&#8217;s using that term as a synonym for &#8220;structural equation&#8221; the one you use. In your view, am I correct in regarding these as synonyms? Also, Freedman seemed to be of the belief that response schedules only make sense if the causal variable can be regarded as amenable to manipulation. So variables like race, gender, maybe even socioeconomic status, etc. cannot sensibly be regarded as causes since they can&#8217;t be manipulated. I&#8217;m wondering what your view is of this manipulation perspective.<br \/>\nMichael<\/p>\n<p>&#8212;<br \/>\nMy answer is: Yes. Freedman&#8217;s &#8220;response schedule&#8221; is a synonym for &#8220;structural equation.&#8221; The reason why Freedman did not say so explicitly has to do with his long and rather bumpy journey from statistical to causal thinking. Freedman, like most statisticians in the 1980&#8217;s could not make sense of the Structural Equation Models (SEM) that social scientists (e.g., Duncan) and econometricians (e.g., Goldberger) have adopted for representing causal relations. As a result, he criticized and ridiculed this enterprise relentlessly. In his (1987) paper &#8220;As others see us,&#8221; for example, he went as far as &#8220;proving&#8221; that the entire enterprise is grounded in logical contradictions.  The fact that SEM researchers at that time could not defend their enterprise effectively (they were as confused about SEM as statisticians &#8212; judging by the way they responded to his paper) only intensified Freedman criticism. It continued well into the 1990&#8217;s, with renewed attacks on anything connected with causality, including the causal search program of Spirtes, Glymour and Scheines.<\/p>\n<p>I have had a long and friendly correspondence with Freedman since 1993 and, going over a file of over 200 emails, it appears that it was around 1994 when he began to convert to causal thinking. First through the do-operator (by his own admission) and, later, by realizing that structural equations offer a neat way of encoding counterfactuals.<\/p>\n<p>I speculate that the reason Freedman could not say plainly that causality is based on structural equations was that it would have been too hard for him to admit that he was in error criticizing a model that he misunderstood, and, that is so simple to understand. This oversight was not entirely his fault;  for someone trying to understand the world from a statistical view point, structural equations do not make any sense; the asymmetric nature of the equations and those slippery &#8220;error terms&#8221; stand outside the prism of the statistical paradigm. Indeed, even today, very few statisticians feel comfortable in the company of structural equations. (How many statistics textbooks do we know that discuss structural equations?)<\/p>\n<p>So, what do you do when you come to realize that a concept you ridiculed for 20 years is the key to understanding causation? Freedman decided not to say &#8220;I erred&#8221;, but to argue that the concept was not rigorous enough for statisticians to understood.  He thus formalized &#8220;response schedule&#8221; and treated it as a novel mathematical object. The fact is, however, that if we strip &#8220;response schedule&#8221; from its superlatives, we find that it is just what you and I call a &#8220;function&#8221;. i.e., a mapping between the states of one variable onto the states of another. Some of Freedman&#8217;s disciples are admiring this invention (See R. Berk&#8217;s 2004 book on regression) but most people that I know just look at it and say: This is what a structural equation is.<\/p>\n<p>The story of David Freedman is the story of statistical science itself and the painful journey the field has taken through the causal reformation. Starting with the structural equations of Sewal Wright (1921), and going through Freedman&#8217;s &#8220;response schedule&#8221;, the field still can&#8217;t swallow the fundamental building block of scientific thinking, in which Nature is encoded as a society of sensing and responding variables. Funny, econometrics is yet to start its reformation, though it has been housing SEM since Haavelmo (1943). (How many econometrics textbooks do we know which teach students how to read counterfactuals from structural equations?).<\/p>\n<p>&#8212;<br \/>\nI now go to your second question, concerning the mantra &#8220;no causation without manipulation.&#8221; I do not believe anyone takes this slogan as a restriction nowadays, including its authors, Holland and Rubin. It will remain a relic of an era when statisticians tried to define causation with the only mental tool available to them: the randomized controlled trial (RCT).<\/p>\n<p>I summed it up in Causality, 2009, p. 361: &#8220;To suppress talk about how gender causes the many biological, social, and psychological distinctions between males an females is to suppress 90% of our knowledge about gender differences&#8221;<\/p>\n<p>I further elaborated on this issue in <a href=\"http:\/\/ftp.cs.ucla.edu\/pub\/stat_ser\/r393.pdf\">(Bollen and Pearl 2014 p. 313) <\/a> saying:<\/p>\n<p>&#8220;<a href=\"http:\/\/ftp.cs.ucla.edu\/pub\/stat_ser\/r382.pdf\">Pearl (2011)<\/a> further shows that this restriction has led to harmful consequence by forcing investigators to compromise their research questions only to avoid the manipulability restriction. The essential ingredient of causation, as argued in Pearl (2009: 361), is responsiveness, namely, the capacity of some variables to respond to variations in other variables, regardless of how those variations came about.&#8221;<\/p>\n<p>In (Causality 2009 p. 361) I also find this paragraph: &#8220;It is for that reason, perhaps, that scientists invented counterfactuals; it permit them to state and conceive the realization of antecedent conditions without specifying the physical means by which these conditions are established;&#8221;<\/p>\n<p>All in all, you have touched on one of the most fascinating chapters in the history of science, featuring a respectable scientific community that clings desperately to an outdated dogma, while resisting, adamantly, the light that shines around it. This chapter deserves a major headline in Kuhn&#8217;s book on scientific revolutions. As I once wrote: &#8220;It is easier to teach Copernicus in the Vatican than discuss causation with a statistician.&#8221; But this was in the 1990&#8217;s, before causal inference became fashionable. Today, after a vicious 100-year war of reformation, things are begining to change (See <a href=\"http:\/\/www.nasonline.org\/programs\/sackler-colloquia\/completed_colloquia\/Big-data.html\">http:\/\/www.nasonline.org\/programs\/sackler-colloquia\/completed_colloquia\/Big-data.html<\/a>). I hope your upcoming book further accelerates the transition.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>(Re-edited: 5\/6\/15, 4 pm) Michael A Lewis (Hunter College) sent us the following query: Dear Judea, I was reading a book by the late statistician David Freedman and in it he uses the term &#8220;response schedule&#8221; to refer to an equation which represents a causal relationship between variables. It appears that he&#8217;s using that term [&hellip;]<\/p>\n","protected":false},"author":3,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[5,6,10,37],"tags":[],"class_list":["post-1502","post","type-post","status-publish","format-standard","hentry","category-causal-effect","category-counterfactual","category-definition","category-structural-equations"],"_links":{"self":[{"href":"https:\/\/causality.cs.ucla.edu\/blog\/index.php\/wp-json\/wp\/v2\/posts\/1502","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/causality.cs.ucla.edu\/blog\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/causality.cs.ucla.edu\/blog\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/causality.cs.ucla.edu\/blog\/index.php\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/causality.cs.ucla.edu\/blog\/index.php\/wp-json\/wp\/v2\/comments?post=1502"}],"version-history":[{"count":0,"href":"https:\/\/causality.cs.ucla.edu\/blog\/index.php\/wp-json\/wp\/v2\/posts\/1502\/revisions"}],"wp:attachment":[{"href":"https:\/\/causality.cs.ucla.edu\/blog\/index.php\/wp-json\/wp\/v2\/media?parent=1502"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/causality.cs.ucla.edu\/blog\/index.php\/wp-json\/wp\/v2\/categories?post=1502"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/causality.cs.ucla.edu\/blog\/index.php\/wp-json\/wp\/v2\/tags?post=1502"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}