[go: up one dir, main page]

IDEAS home Printed from https://ideas.repec.org/a/ejw/journl/v9y2012i3p256-297.html
   My bibliography  Save this article

Ziliak and McCloskey's Criticisms of Significance Tests: An Assessment

Author

Listed:
  • Thomas Mayer
Abstract
Stephen Ziliak and D. N. McCloskey have sharply criticized the prevailing use of significance tests. Their work has, in turn, come under vigorous attack. The vehemence of the debate may induce readers to wrongly dismiss it as a “he said-she said” debate, or else to take sides in an unbending way that does not do justice to valid points raised by the other side. This paper aims at a more balanced reading. While Ziliak and McCloskey claim that a substantial majority of economists who use significance tests confuse statistical with substantive significance, or commit the logical error of the transposed conditional, I argue that such errors are much less frequent than they claim, though still much too pervasive. They also argue that since significance tests focus on the existence of an effect rather than on its size, the tests do not answer scientific questions. I respond with counter-examples. Ziliak and McCloskey also complain that significance tests ignore loss functions. I argue that loss functions should be introduced only at a later stage. Ziliak and McCloskey are correct, however, that confidence intervals deserve much more emphasis. The most valuable message of their work is that significance tests should be treated less mechanically.

Suggested Citation

  • Thomas Mayer, 2012. "Ziliak and McCloskey's Criticisms of Significance Tests: An Assessment," Econ Journal Watch, Econ Journal Watch, vol. 9(3), pages 256-297, September.
  • Handle: RePEc:ejw:journl:v:9:y:2012:i:3:p:256-297
    as

    Download full text from publisher

    File URL: https://econjwatch.org/File+download/588/MayerSept2012.pdf?mimetype=pdf
    Download Restriction: no

    File URL: https://econjwatch.org/824
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Stephen T. Ziliak & Deirdre N. McCloskey, 2004. "Size Matters: The Standard Error of Regressions in the American Economic Review," Econ Journal Watch, Econ Journal Watch, vol. 1(2), pages 331-358, August.
    2. Brainard, S Lael, 1997. "An Empirical Assessment of the Proximity-Concentration Trade-off between Multinational Sales and Trade," American Economic Review, American Economic Association, vol. 87(4), pages 520-544, September.
    3. Hoover, Kevin D & Sheffrin, Steven M, 1992. "Causation, Spending, and Taxes: Sand in the Sandbox or Tax Collector for the Welfare State?," American Economic Review, American Economic Association, vol. 82(1), pages 225-248, March.
    4. Martha J. Bailey, 2010. ""Momma's Got the Pill": How Anthony Comstock and Griswold v. Connecticut Shaped US Childbearing," American Economic Review, American Economic Association, vol. 100(1), pages 98-129, March.
    5. LaLonde, Robert J, 1986. "Evaluating the Econometric Evaluations of Training Programs with Experimental Data," American Economic Review, American Economic Association, vol. 76(4), pages 604-620, September.
    6. Darby, Michael R, 1982. "The Price of Oil and World Inflation and Recession," American Economic Review, American Economic Association, vol. 72(4), pages 738-751, September.
    7. Abel Brodeur & Mathias Lé & Marc Sangnier & Yanos Zylberberg, 2016. "Star Wars: The Empirics Strike Back," American Economic Journal: Applied Economics, American Economic Association, vol. 8(1), pages 1-32, January.
    8. David Colander, 2018. "CREATING HUMBLE ECONOMISTS: A Code of Ethics for Economists," Chapters, in: How Economics Should Be Done, chapter 17, pages 240-252, Edward Elgar Publishing.
    9. Romer, Christina D, 1986. "Is the Stabilization of the Postwar Economy a Figment of the Data?," American Economic Review, American Economic Association, vol. 76(3), pages 314-334, June.
    10. Dewald, William G & Thursby, Jerry G & Anderson, Richard G, 1986. "Replication in Empirical Economics: The Journal of Money, Credit and Banking Project," American Economic Review, American Economic Association, vol. 76(4), pages 587-603, September.
    11. Angrist, Joshua D & Evans, William N, 1998. "Children and Their Parents' Labor Supply: Evidence from Exogenous Variation in Family Size," American Economic Review, American Economic Association, vol. 88(3), pages 450-477, June.
    12. Kevin Hoover & Mark Siegler, 2008. "The rhetoric of 'Signifying nothing': a rejoinder to Ziliak and McCloskey," Journal of Economic Methodology, Taylor & Francis Journals, vol. 15(1), pages 57-68.
    13. Keuzenkamp, Hugo A. & Magnus, Jan R., 1995. "On tests and significance in econometrics," Journal of Econometrics, Elsevier, vol. 67(1), pages 5-24, May.
    14. Sauer, Raymond D & Leffler, Keith B, 1990. "Did the Federal Trade Commission's Advertising Substantiation Program Promote More Credible Advertising?," American Economic Review, American Economic Association, vol. 80(1), pages 191-203, March.
    15. Ayres, Ian & Siegelman, Peter, 1995. "Race and Gender Discrimination in Bargaining for a New Car," American Economic Review, American Economic Association, vol. 85(3), pages 304-321, June.
    16. Carol Corrado & Charles Hulten & Daniel Sichel, 2009. "Intangible Capital And U.S. Economic Growth," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 55(3), pages 661-685, September.
    17. Zoltan J. Acs & David B. Audretsch, 2008. "Innovation in Large and Small Firms: An Empirical Analysis," Chapters, in: Entrepreneurship, Growth and Public Policy, chapter 1, pages 3-15, Edward Elgar Publishing.
    18. Garber, Peter M, 1986. "Nominal Contracts in a Bimetallic Standard," American Economic Review, American Economic Association, vol. 76(5), pages 1012-1030, December.
    19. Woodbury, Stephen A & Spiegelman, Robert G, 1987. "Bonuses to Workers and Employers to Reduce Unemployment: Randomized Trials in Illinois," American Economic Review, American Economic Association, vol. 77(4), pages 513-530, September.
    20. Fuhrer, Jeffrey C & Moore, George R, 1995. "Monetary Policy Trade-offs and the Correlation between Nominal Interest Rates and Real Output," American Economic Review, American Economic Association, vol. 85(1), pages 219-239, March.
    21. Alesina, Alberto & Perotti, Roberto, 1997. "The Welfare State and Competitiveness," American Economic Review, American Economic Association, vol. 87(5), pages 921-939, December.
    22. Milton Friedman, 1957. "Introduction to "A Theory of the Consumption Function"," NBER Chapters, in: A Theory of the Consumption Function, pages 1-6, National Bureau of Economic Research, Inc.
    23. Christina D. Romer & David H. Romer, 2010. "The Macroeconomic Effects of Tax Changes: Estimates Based on a New Measure of Fiscal Shocks," American Economic Review, American Economic Association, vol. 100(3), pages 763-801, June.
    24. Søren Leth-Petersen, 2010. "Intertemporal Consumption and Credit Constraints: Does Total Expenditure Respond to an Exogenous Shock to Credit?," American Economic Review, American Economic Association, vol. 100(3), pages 1080-1103, June.
    25. Kevin Hoover & Mark Siegler, 2008. "Sound and fury: McCloskey and significance testing in economics," Journal of Economic Methodology, Taylor & Francis Journals, vol. 15(1), pages 1-37.
    26. Feenstra, Robert C, 1994. "New Product Varieties and the Measurement of International Prices," American Economic Review, American Economic Association, vol. 84(1), pages 157-177, March.
    27. Gigerenzer, Gerd, 2004. "Mindless statistics," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 33(5), pages 587-606, November.
    28. Pontiff, Jeffrey, 1997. "Excess Volatility and Closed-End Funds," American Economic Review, American Economic Association, vol. 87(1), pages 155-169, March.
    29. Glenn Ellison & Edward L. Glaeser & William R. Kerr, 2010. "What Causes Industry Agglomeration? Evidence from Coagglomeration Patterns," American Economic Review, American Economic Association, vol. 100(3), pages 1195-1213, June.
    30. Ham, John C & Svejnar, Jan & Terrell, Katherine, 1998. "Unemployment and the Social Safety Net during Transitions to a Market Economy: Evidence from the Czech and Slovak Republics," American Economic Review, American Economic Association, vol. 88(5), pages 1117-1142, December.
    31. Ann Harrison & Jason Scorse, 2022. "Multinationals and Anti-Sweatshop Activism," World Scientific Book Chapters, in: Globalization, Firms, and Workers, chapter 13, pages 291-317, World Scientific Publishing Co. Pte. Ltd..
    32. H. D. Vinod & B. D. McCullough, 1999. "The Numerical Reliability of Econometric Software," Journal of Economic Literature, American Economic Association, vol. 37(2), pages 633-665, June.
    33. Froyen, Richard T & Waud, Roger N, 1980. "Further International Evidence of Output-Inflation Tradeoffs," American Economic Review, American Economic Association, vol. 70(3), pages 409-421, June.
    34. Milton Friedman, 1957. "A Theory of the Consumption Function," NBER Books, National Bureau of Economic Research, Inc, number frie57-1.
    35. Tom Engsted, 2009. "Statistical vs. Economic Significance in Economics and Econometrics: Further comments on McCloskey & Ziliak," CREATES Research Papers 2009-17, Department of Economics and Business Economics, Aarhus University.
    36. Evans, David S & Heckman, James J, 1984. "A Test for Subadditivity of the Cost Function with an Application to the Bell System," American Economic Review, American Economic Association, vol. 74(4), pages 615-623, September.
    37. Zellner, Arnold, 2004. "To test or not to test and if so, how?: Comments on "size matters"," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 33(5), pages 581-586, November.
    38. Mishkin, Frederic S, 1982. "Does Anticipated Aggregate Demand Policy Matter? Further Econometric Results," American Economic Review, American Economic Association, vol. 72(4), pages 788-802, September.
    39. George J. Borjas, 2021. "Ethnicity, Neighborhoods, and Human-Capital Externalities," World Scientific Book Chapters, in: Foundational Essays in Immigration Economics, chapter 7, pages 135-160, World Scientific Publishing Co. Pte. Ltd..
    40. Josh Lerner & Ulrike Malmendier, 2010. "Contractibility and the Design of Research Agreements," American Economic Review, American Economic Association, vol. 100(1), pages 214-246, March.
    41. Joskow, Paul L, 1987. "Contract Duration and Relationship-Specific Investments: Empirical Evidence from Coal Markets," American Economic Review, American Economic Association, vol. 77(1), pages 168-185, March.
    42. Atif Mian & Amir Sufi & Francesco Trebbi, 2010. "The Political Economy of the US Mortgage Default Crisis," American Economic Review, American Economic Association, vol. 100(5), pages 1967-1998, December.
    43. David D. Hale, 1986. "Analysis," Challenge, Taylor & Francis Journals, vol. 29(5), pages 52-56, November.
    44. Deirdre McCloskey & Stephen Ziliak, 2008. "Signifying nothing: reply to Hoover and Siegler," Journal of Economic Methodology, Taylor & Francis Journals, vol. 15(1), pages 39-55.
    45. Berg, Nathan, 2004. "No-decision classification: an alternative to testing for statistical significance," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 33(5), pages 631-650, November.
    46. Hendricks, Kenneth & Porter, Robert H, 1996. "The Timing and Incidence of Exploratory Drilling on Offshore Wildcat Tracts," American Economic Review, American Economic Association, vol. 86(3), pages 388-407, June.
    47. Erhan Artuç & Shubham Chaudhuri & John McLaren, 2010. "Trade Shocks and Labor Adjustment: A Structural Empirical Approach," American Economic Review, American Economic Association, vol. 100(3), pages 1008-1045, June.
    48. Walter Krämer, 2011. "The Cult of Statistical Significance – What Economists Should and Should Not Do to Make their Data Talk," Schmollers Jahrbuch : Journal of Applied Social Science Studies / Zeitschrift für Wirtschafts- und Sozialwissenschaften, Duncker & Humblot, Berlin, vol. 131(3), pages 455-468.
    49. Bloom, David E & Cavanagh, Christopher L, 1986. "An Analysis of the Selection of Arbitrators," American Economic Review, American Economic Association, vol. 76(3), pages 408-422, June.
    50. Amitabh Chandra & Jonathan Gruber & Robin McKnight, 2010. "Patient Cost-Sharing and Hospitalization Offsets in the Elderly," American Economic Review, American Economic Association, vol. 100(1), pages 193-213, March.
    51. Mendelsohn, Robert & Nordhaus, William D & Shaw, Daigee, 1994. "The Impact of Global Warming on Agriculture: A Ricardian Analysis," American Economic Review, American Economic Association, vol. 84(4), pages 753-771, September.
    52. Tom Engsted, 2009. "Statistical vs. economic significance in economics and econometrics: further comments on McCloskey and Ziliak," Journal of Economic Methodology, Taylor & Francis Journals, vol. 16(4), pages 393-408.
    53. Stekler, H.O., 2007. "Significance tests harm progress in forecasting: Comment," International Journal of Forecasting, Elsevier, vol. 23(2), pages 329-330.
    54. Wolff, Edward N, 1991. "Capital Formation and Productivity Convergence over the Long Term," American Economic Review, American Economic Association, vol. 81(3), pages 565-579, June.
    55. Trejo, Stephen J, 1991. "The Effects of Overtime Pay Regulation on Worker Compensation," American Economic Review, American Economic Association, vol. 81(4), pages 719-740, September.
    56. Sachs, Jeffrey, 1980. "The Changing Cyclical Behavior of Wages and Prices: 1890-1976," American Economic Review, American Economic Association, vol. 70(1), pages 78-90, March.
    57. Mayer, Thomas, 1980. "Economics as a Hard Science: Realistic Goal or Wishful Thinking?," Economic Inquiry, Western Economic Association International, vol. 18(2), pages 165-178, April.
    58. O'Brien, Anthony Patrick, 2004. "Why is the standard error of regression so low using historical data?: Comments on "size matters"," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 33(5), pages 565-570, November.
    59. David Colander, 2001. "The Lost Art of Economics," Books, Edward Elgar Publishing, number 2415.
    60. George J. Borjas, 2021. "Self-Selection and the Earnings of Immigrants," World Scientific Book Chapters, in: Foundational Essays in Immigration Economics, chapter 4, pages 69-91, World Scientific Publishing Co. Pte. Ltd..
    61. Meredith Fowlie, 2010. "Emissions Trading, Electricity Restructuring, and Investment in Pollution Abatement," American Economic Review, American Economic Association, vol. 100(3), pages 837-869, June.
    62. Pranab Bardhan & Dilip Mookherjee, 2010. "Determinants of Redistributive Politics: An Empirical Analysis of Land Reforms in West Bengal, India," American Economic Review, American Economic Association, vol. 100(4), pages 1572-1600, September.
    63. Johnson, William R & Skinner, Jonathan, 1986. "Labor Supply and Marital Separation," American Economic Review, American Economic Association, vol. 76(3), pages 455-469, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Alexander Libman & Joachim Zweynert, 2014. "Ceremonial Science: The State of Russian Economics Seen Through the Lens of the Work of ‘Doctor of Science’ Candidates," Working Papers 337, Leibniz Institut für Ost- und Südosteuropaforschung (Institute for East and Southeast European Studies).
    2. Nektarios A. Michail & Constantinos I. Massouras, 2014. "Back to Basics: Is Statistical Significance all that Matters?," Working Papers 2014-3, Central Bank of Cyprus.
    3. Libman, Alexander & Zweynert, Joachim, 2014. "Ceremonial science: The state of Russian economics seen through the lens of the work of ‘Doctor of Science’ candidates," Economic Systems, Elsevier, vol. 38(3), pages 360-378.
    4. Thomas Mayer, 2013. "Reply to Deirdre McCloskey and Stephen Ziliak on Statistical Significance," Econ Journal Watch, Econ Journal Watch, vol. 10(1), pages 87-96, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Thomas Mayer, 2012. "Ziliak and McClosky?s Criticisms of Significance Tests: A Damage Assessment," Working Papers 126, University of California, Davis, Department of Economics.
    2. Thomas Mayer, 2012. "Ziliak and McClosky?s Criticisms of Significance Tests: A Damage Assessment," Working Papers 61, University of California, Davis, Department of Economics.
    3. Peter J. Veazie, 2015. "Understanding Statistical Testing," SAGE Open, , vol. 5(1), pages 21582440145, January.
    4. Thomas Mayer, 2013. "Reply to Deirdre McCloskey and Stephen Ziliak on Statistical Significance," Econ Journal Watch, Econ Journal Watch, vol. 10(1), pages 87-96, January.
    5. Stephen T. Ziliak & Deirdre N. McCloskey, 2013. "We Agree That Statistical Significance Proves Essentially Nothing: A Rejoinder to Thomas Mayer," Econ Journal Watch, Econ Journal Watch, vol. 10(1), pages 97-107, January.
    6. Kim, Jae H. & Ji, Philip Inyeob, 2015. "Significance testing in empirical finance: A critical review and assessment," Journal of Empirical Finance, Elsevier, vol. 34(C), pages 1-14.
    7. Alexander Libman & Joachim Zweynert, 2014. "Ceremonial Science: The State of Russian Economics Seen Through the Lens of the Work of ‘Doctor of Science’ Candidates," Working Papers 337, Leibniz Institut für Ost- und Südosteuropaforschung (Institute for East and Southeast European Studies).
    8. Thomas Mayer, 2006. "The Empirical Significance of Econometric Models," Working Papers 620, University of California, Davis, Department of Economics.
    9. Kevin Hoover & Mark Siegler, 2008. "Sound and fury: McCloskey and significance testing in economics," Journal of Economic Methodology, Taylor & Francis Journals, vol. 15(1), pages 1-37.
    10. Fuchs-Schündeln, N. & Hassan, T.A., 2016. "Natural Experiments in Macroeconomics," Handbook of Macroeconomics, in: J. B. Taylor & Harald Uhlig (ed.), Handbook of Macroeconomics, edition 1, volume 2, chapter 0, pages 923-1012, Elsevier.
    11. Kim, Jae, 2015. "How to Choose the Level of Significance: A Pedagogical Note," MPRA Paper 66373, University Library of Munich, Germany.
    12. Garret Christensen & Edward Miguel, 2018. "Transparency, Reproducibility, and the Credibility of Economics Research," Journal of Economic Literature, American Economic Association, vol. 56(3), pages 920-980, September.
    13. Libman, Alexander & Zweynert, Joachim, 2014. "Ceremonial science: The state of Russian economics seen through the lens of the work of ‘Doctor of Science’ candidates," Economic Systems, Elsevier, vol. 38(3), pages 360-378.
    14. Tom Engsted, 2009. "Statistical vs. Economic Significance in Economics and Econometrics: Further comments on McCloskey & Ziliak," CREATES Research Papers 2009-17, Department of Economics and Business Economics, Aarhus University.
    15. Deirdre N. McCloskey & Stephen T. Ziliak, 2012. "Statistical Significance in the New Tom and the Old Tom: A Reply to Thomas Mayer," Econ Journal Watch, Econ Journal Watch, vol. 9(3), pages 298-308, September.
    16. Ricardo Barradas & Ines Tomas, 2023. "Household indebtedness in the European Union countries: Going beyond the mainstream interpretation," PSL Quarterly Review, Economia civile, vol. 76(304), pages 21-49.
    17. Klos, Alexander & Rottke, Simon, 2013. "Saving and Consumption When Children Move Out," VfS Annual Conference 2013 (Duesseldorf): Competition Policy and Regulation in a Global Economic Order 79786, Verein für Socialpolitik / German Economic Association.
    18. Bernd Hayo & Matthias Uhl, 2017. "Taxation and consumption: evidence from a representative survey of the German population," Applied Economics, Taylor & Francis Journals, vol. 49(53), pages 5477-5490, November.
    19. Twum-Barima, Asare, 2015. "Household Consumption Response to Demographic Changes: An Analysis using a Demographic Model," 2015 AAEA & WAEA Joint Annual Meeting, July 26-28, San Francisco, California 205881, Agricultural and Applied Economics Association.
    20. Bernasconi, Michele & Kirchkamp, Oliver & Paruolo, Paolo, 2009. "Do fiscal variables affect fiscal expectations? Experiments with real world and lab data," Journal of Economic Behavior & Organization, Elsevier, vol. 70(1-2), pages 253-265, May.

    More about this item

    Keywords

    Significance tests; t’s; p’s; confidence intervals; Ziliak; McCloskey; oomph;
    All these keywords.

    JEL classification:

    • C12 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Hypothesis Testing: General
    • B4 - Schools of Economic Thought and Methodology - - Economic Methodology

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ejw:journl:v:9:y:2012:i:3:p:256-297. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Jason Briggeman (email available below). General contact details of provider: https://edirc.repec.org/data/edgmuus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.