Statistical significance I. The lesson not learned

Piotr Wolski


Statistical significance I

In this essay, the first in a short series, the author reviews and comments on the problems that students and researchers have in properly understanding the basics of statistical inference. These difficulties seem to stem in part from the mixing of the opposing theoretical stances of Fisher and Neyman, which are briefly reviewed. The author believes that inconsistent standards of statistical inference particularly afflict the teaching of methodology.

Keywords: statistical inference, null hypothesis significance testing, NHST, p-value

Brzeziński, J.M. (2012). Kontekst teorii psychologicznej a kontekst analizy statystycznej. Roczniki Psychologiczne, 15(3), 75–81.

Button, K.S., Ioannidis, J.P., Mokrysz, C., Nosek, B., Flint, J., Robinson, E.S., Munafò, M.R. (2013). Power failure: Why small sample size undermines the reliability of neuroscience. Nature Reviews Neuroscience, 14(5), 365–376. doi:10.1038/nrn3475

Cohen, J. (1962). The statistical power of abnormal-social psychological research: A review. Journal of Abnormal and Social Psychology, 65, 145–153.

Cohen, J. (1994). The earth is round (p < .05). American Psychologist, 49(12), 997. doi:10.1037/0003-066X.49.12.997

Cumming, G. (2014). The new statistics: Why and how. Psychological Science, 25(1), 7–29. doi:10.1177/0956797613504966

Falk, R., Greenbaum, W. (1995). Significance tests die hard: The amazing persistence of a probabilistic misconception. Theory & Psychology, 5(1), 75–98. doi:10.1177/0959354395051004

Finch, S., Cumming, G., Williams, J., Palmer, L., Griffith, E., Alders, C., Goodman, O. (2004). Reform of statistical inference in psychology: The case of Memory & Cognition. Behavior Research Methods, Instruments & Computers, 36(2), 312–324.

Finch, S., Thomason, N., Cumming, G. (2002). Past and future American Psychological Association guidelines for statistical practice. Theory & Psychology, 12, 825–853.

Fisher, R.A. (1971). The Design of Experiments (ed. 8). New York: Hafner Publishing Company.

Gigerenzer, G. (1989). The Empire of Chance: How Probability Changed Science and Everyday Life. Cambridge–New York: Cambridge University Press.

Gigerenzer, G. (1998). We need statistical thinking, not statistical rituals. Behavioral and Brain Sciences, 21, 199–200.

Gigerenzer, G. (2004). Mindless statistics. Journal of Socio-Economics, 33(5), 587–606.

Gliner, J.A., Leech, N.L., Morgan, G.A. (2002). Problems with null hypothesis significance testing (NHST): What do the textbooks say? The Journal of Experimental Education, 71(1), 83–92.

Hald, A. (2003). A History of Probability and Statistics and Their Applications before 1750. Hoboken, NJ: John Wiley & Sons, Inc.

Haller, H., Krauss, S. (2002). Misinterpretations of significance: A problem students share with their teachers? Methods of Psychological Research Online, 7(1).

Inman, H.F. (1994). Karl Pearson and RA Fisher on statistical tests: A 1935 exchange from Nature. The American Statistician, 48(1), 2–11.

Ioannidis, J.P. (2005). Why most published research findings are false. PLoS Medicine, 2(8), 696–701. doi:10.1371/journal.pmed.0020124

Kirk, R.E. (2001). Promoting good statistical practices: Some suggestions. Educational and Psychological Measurement, 61(2), 213–218.

Meehl, P.E. (1978). Theoretical risks and tabular asterisks: Sir Karl, Sir Ronald, and the slow progress of soft psychology. Journal of Consulting and Clinical Psychology, 46(4), 806–834.

Neyman, J., Pearson, E.S. (1928a). On the use and interpretation of certain test criteria for purposes of statistical inference: Part II. Biometrika, 263–294.

Neyman, J., Pearson, E.S. (1928b). On the use and interpretation of certain test criteria for purposes of statistical inference: Part I. Biometrika, 175–240.

Neyman, J., Pearson, E.S. (1933). On the problem of the most efficient tests of statistical hypotheses. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 231(694–706), 289–337. doi:10.1098/rsta.1933.0009

Oakes, M.W. (1986). Statistical Inference: A Commentary for the Social and Behavioural Sciences. New York: Wiley.

Salsburg, D. (2013). The Lady Tasting Tea: How Statistics Revolutionized Science in the Twentieth Century. New York: Henry Holt and Company.

Sedlmeier, P., Gigerenzer, G. (1989). Do studies of statistical power have an effect on the power of studies? Psychological Bulletin, 105(2), 309.

Vankov, I., Bowers, J., Munafò, M.R. (2014). On the persistence of low power in psychological science. Quarterly Journal of Experimental Psychology (Hove), 67(5), 1037–1040. doi:10.1080/17470218.2014.885986

Westover, M.B., Westover, K.D., Bianchi, M.T. (2011). Significance testing as perverse probabilistic reasoning. BMC Medicine, 9, 20. doi:10.1186/1741-7015-9-20

Wilkinson, L., APA Task Force on Statistical Inference (1999). Statistical methods in psychology journals: Guidelines and explanations. American Psychologist, 54(8), 594–604.

Zieliński, R. (2009). Przedział ufności dla frakcji. Matematyka Stosowana, 10, 1–17.

The primary version of the journal is the electronic version published on the internet.
The journal is published continuously online.