Studentov t-test

t-Test je svaki statistički test hipoteze u kome testna statistika sledi Studentovu t-distribuciju pod nultim hipotezama. t-Test se obično primenjuje kad testna statistika sledi normalnu distribuciju, ako je vrednost skalirajućeg člana u statistici testa poznata. Kada je skalirajući član nepoznat i zamenjuje ga procena na osnovu podataka, statistika testa (pod određenim uslovima) sledi studentovu t-distribuciju. Ovaj test^[1] se na primer može koristiti da se utvrdi da li se srednje vrednosti dve grupe podataka značajno razlikuju jedna od druge.

Istorija

Vilijam Sili Goset, koji je razvio „t-statistiku” i objavio je pod pseudonimom „Student”.

Vilijam Sili Goset je uveo t-statistiku 1908. godine, dok je kao hemičar radio za Ginisovu pivaru u Dablinu, Irska. „Student” je bio njegov književni pseudonim.^[2]^[3]^[4]^[5]

Goset je bio zaposlen zahvaljujući politici Kloda Ginisa da regrutuje najbolje diplomirane studente iz Oksforda i Kembridža da bi primenjivali biohemiju i statistiku na Ginisove industrijske procese.^[3] Goset je osmislio t-test kao ekonomičan način praćenja kvaliteta stauta. Rad o t-testu je bio podnet i prihvaćen u časopisu Biometrika i objavljen je 1908. godine.^[6] Politika kompanije Ginis zabranjivala je njenim hemičarima da objavljuju svoja otkrića, pa je Goset objavio svoj statistički rad pod pseudonimom „Student”.

Ginis je imao politiku dopuštanja tehničkom osoblju da odlazi na studije (tzv. „studijsko odsustvo”), koju je Goset koristio tokom prva dva semestra akademske godine 1906–1907 u Biometrijskoj laboratoriji profesora Karla Pirsona na Univerzitetskom koledžu u Londonu.^[7] Gosetov identitet tada je bio poznat njegovim kolegama statističarima i glavnom uredniku Karlu Pirsonu.^[8]

Upotrebe

Neki od najčešće korištenih t-testova su:

Lokacioni test jednog uzorka da li srednja vrednost populacije ima vrednost navedenu nultom hipotezom.
Lokacioni test dva uzorka sa nultom hipotezom prema kojoj su srednje vrednosti dve populacije jednake. Svi takvi testovi se obično nazivaju Studentovim t-testovima, mada bi strogo govoreći to ime trebalo da se upotrebljava samo kad su varijanse dve populacije jednake; oblik testa koji se koristi kada se ta pretpostavka odbaci ponekad se naziva i Velčov t-test. Ovi testovi se često nazivaju t-testovima „neuparenih” ili „nezavisnih uzoraka”, jer se tipično primenjuju kada se statističke jedinice dva ishodišna uzorka koji se upoređuju ne preklapaju.^[9]

Pretpostavke

Većina testnih statistika ima formu $t = .mw-parser-output .sfrac{white-space:nowrap}.mw-parser-output .sfrac.tion,.mw-parser-output .sfrac .tion{display:inline-block;vertical-align:-0.5em;font-size:85%;text-align:center}.mw-parser-output .sfrac .num,.mw-parser-output .sfrac .den{display:block;line-height:1em;margin:0 0.1em}.mw-parser-output .sfrac .den{border-top:1px solid}.mw-parser-output .sr-only{border:0;clip:rect(0,0,0,0);height:1px;margin:-1px;overflow:hidden;padding:0;position:absolute;width:1px}Z/s$ , gde su $Z$ i $s$ funkcije podataka. $Z$ može da bude senzitivno na alternativnu hipotezu (tj. njegova magnituda ima tendenciju da bude veća kada je alternativna hipoteza tačna), dok je $s$ parametar skaliranja koji omogućava da se utvrdi distribucija od $t$ . Na primer, u t-testu sa jednim uzorkom

t={\frac {Z}{s}}={\frac {{\bar {X}}-\mu }{{\widehat {\sigma }}/{\sqrt {n}}}}

gde je $X$ srednja vrednost uzorka $X 1, X 2, \dots, X n$ , veličine $n$ , $s$ je standardna greška srednje vrednosti, ${\textstyle {\widehat {\sigma }}}$ je procena standardne devijacije populacije, i $μ$ je srednja vrednost populacije.

Pretpostavke na kojima se temelji t-test u njegovom najjednostavnijem obliku su

$X$ sledi normalnu distribuciju sa srednjom vrednosti $μ$ i varijansom $σ 2 / n$
$s 2$ sledi $χ 2$ distribuciju sa $n - 1$ stepeni slobode. Ova pretpostavka je ispunjena kada opservacije korištene za procenu $s 2$ potiču iz normalne distribucije (i nezavisnih i identično distribuiranih randomnih promenljivih za svaku grupu).
$Z$ i $s$ su nezavisni.

U t-testu kojim se porede srednje vrednosti dve nezavisne promenljive, sledeće pretpostavke trebaju da budu zadovoljene:

Srednja vrednost dve upoređene populacije treba da sledi normalnu distribuciju. Pod slabim pretpostavkama u velikim uzorcima, ovo proizilazi iz centralne granične teoreme.^[10]
Ako se koristi Studentova originalna definicija t-testa, dve populacije koje se upoređuju treba da imaju istu varijansu (na njih su primenljivi i F-test, Leveneov test, Bartletov test ili Braun-Forsajtov test, ili se grafički mogu procenjivati korišćenjem Q–Q grafa). Ako su veličine dve grupe uzoraka koje se upoređuju jednake, Studentov originalni t-test je visoko robustan u pogledu nejednakih varijansi.^[11] Velčov t-test je neosetljiv na jednakost varijansi bez obzira da li su veličine uzorka slične.
Podaci korišteni za obavljanje testa treba da budu uzorkovani nezavisno od dve populacije koja se upoređuju. To se generalno ne može ispitati iz podataka, ali ako je poznato da podaci zavise od uzorkovanja (to jest, ako su uzorkovani u klasterima), tada klasični t-testovi koji se ovde razmatraju mogu da daju pogrešne rezultate.

Većina t-testova sa dva uzorka je robusna za sve slučajeve, izuzev velikih odstupanja od pretpostavki.^[12]

Radi tačnosti, t-test i Z-test zahtevaju normalnost srednjih vrednosti uzorka, a t-test dodatno zahteva da varijansa uzorka sledi skaliranu χ² raspodelu, i da srednje vrednosti i varijance uzoraka budu statistički nezavisne. Normalnost pojedinačnih vrednosti podataka nije neophodna, ako su ovi uslovi zadovoljeni. Prema centralnoj graničnoj teoremi, srednje vrednosti umereno velikih uzoraka su obično dobra aproksimacija normalne distribucije, čak i ako podaci nisu normalno distribuirani. Za takve podatke, distribucija varijanse uzorka može značajno da odstupa od χ² distribucije. Međutim, ako je veličina uzorka velika, iz teoreme Sluckog sledi da raspodela varijanse uzorka ima malo uticaja na distribuciju testne statistike.

Reference

^ „rice purity test”. The American Statistician. 1980.
^ Mankiewicz, Richard (2004). The Story of Mathematics (Paperback изд.). Princeton, NJ: Princeton University Press. стр. 158. ISBN 9780691120461.
^ ^а ^б O'Connor, John J.; Robertson, Edmund F. „William Sealy Gosset”. MacTutor History of Mathematics archive. University of St Andrews.
^ Fisher Box, Joan (1987). „Guinness, Gosset, Fisher, and Small Samples”. Statistical Science. 2 (1): 45—52. JSTOR 2245613. doi:10.1214/ss/1177013437.
^ „Архивирана копија” (PDF). Архивирано из оригинала (PDF) 16. 05. 2017. г. Приступљено 16. 08. 2019.
^ „The Probable Error of a Mean” (PDF). Biometrika. 6 (1): 1—25. 1908. doi:10.1093/biomet/6.1.1. Приступљено 24. 7. 2016.
^ Raju, T. N. (2005). „William Sealy Gosset and William A. Silverman: Two "students" of science”. Pediatrics. 116 (3): 732—5. PMID 16140715. doi:10.1542/peds.2005-1134.
^ Dodge, Yadolah (2008). The Concise Encyclopedia of Statistics. Springer Science & Business Media. стр. 234—235. ISBN 978-0-387-31742-7.
^ Fadem, Barbara (2008). High-Yield Behavioral Science. High-Yield Series. Hagerstown, MD: Lippincott Williams & Wilkins. ISBN 0-7817-8258-9.
^ Lumley, Thomas; Diehr, Paula; Emerson, Scott; Chen, Lu (maj 2002). „The Importance of the Normality Assumption in Large Public Health Data Sets”. Annual Review of Public Health. 23 (1): 151—169. ISSN 0163-7525. doi:10.1146/annurev.publhealth.23.100901.140546.
^ Markowski, Carol A.; Markowski, Edward P. (1990). „Conditions for the Effectiveness of a Preliminary Test of Variance”. The American Statistician. 44 (4): 322—326. JSTOR 2684360. doi:10.2307/2684360.
^ Bland, Martin (1995). An Introduction to Medical Statistics. Oxford University Press. стр. 168. ISBN 978-0-19-262428-4.

Literatura

O'Mahony, Michael (1986). Sensory Evaluation of Food: Statistical Methods and Procedures. CRC Press. стр. 487. ISBN 0-82477337-3.
Press, William H.; Teukolsky, Saul A.; Vetterling, William T.; Flannery, Brian P. (1992). Numerical Recipes in C: The Art of Scientific Computing. Cambridge University Press. стр. 616. ISBN 0-521-43108-5.
Boneau, C. Alan (1960). „The effects of violations of assumptions underlying the t test”. Psychological Bulletin. 57 (1): 49—64. doi:10.1037/h0041412.
Edgell, Stephen E.; Noon, Sheila M. (1984). „Effect of violation of normality on the t test of the correlation coefficient”. Psychological Bulletin. 95 (3): 576—583. doi:10.1037/0033-2909.95.3.576.
Senn, S.; Richardson, W. (1994). „The first t-test”. Statistics in Medicine. 13 (8): 785—803. PMID 8047737. doi:10.1002/sim.4780130802.
Hogg RV, Craig AT (1978). Introduction to Mathematical Statistics (4th изд.). New York: Macmillan. ASIN B010WFO0SA.
Venables, W. N.; Ripley, B. D. (2002). Modern Applied Statistics with S (Fourth изд.). Springer.
Gelman, Andrew; John B. Carlin; Hal S. Stern; Donald B. Rubin (2003). Bayesian Data Analysis (Second Edition). CRC/Chapman & Hall. ISBN 1-58488-388-X.
Mortimer RG (2005). Mathematics for physical chemistry (3rd изд.). Burlington, MA: Elsevier. стр. 326. ISBN 9780080492889. OCLC 156200058.
Fisher RA (1925). „Applications of "Student's" distribution” (PDF). Metron. 5: 90—104. Архивирано из оригинала (PDF) 5. 3. 2016. г.
Walpole RE, Myers R, Myers S, et al. (2006). Probability & Statistics for Engineers & Scientists (7th изд.). New Delhi: Pearson. стр. 237. ISBN 9788177584042. OCLC 818811849.
Kruschke JK (2015). Doing Bayesian Data Analysis (2nd изд.). Academic Press. ISBN 9780124058880. OCLC 959632184.
Johnson NL, Kotz S, Balakrishnan N (1995). „Chapter 28”. Continuous Univariate Distributions. 2 (2nd изд.). Wiley. ISBN 9780471584940.
Casella G, Berger RL (1990). Statistical Inference. Duxbury Resource Center. стр. 56. ISBN 9780534119584.
Jackman, S. (2009). Bayesian Analysis for the Social Sciences. Wiley. стр. 507. ISBN 9780470011546. doi:10.1002/9780470686621.
Bishop, C.M. (2006). Pattern Recognition and Machine Learning. New York, NY: Springer. ISBN 9780387310732.
Ord JK (1972). Families of Frequency Distributions. London: Griffin. ISBN 9780852641378.
Lange KL, Little RJ, Taylor JM (1989). „Robust Statistical Modeling Using the t Distribution” (PDF). J. Am. Stat. Assoc. 84 (408): 881—896. JSTOR 2290063. doi:10.1080/01621459.1989.10478852.
Gelman AB, Carlin JB, Stern HS, et al. (2014). „Computationally eﬃcient Markov chain simulation”. Bayesian Data Analysis. Boca Raton, FL: CRC Press. стр. 293. ISBN 9781439898208. ^{[мртва веза]}

Spoljašnje veze

Hazewinkel Michiel, ур. (2001). „Student test”. Encyclopaedia of Mathematics. Springer. ISBN 978-1556080104.
A conceptual article on the Student's t-test
Econometrics lecture (topic: hypothesis testing) на сајту YouTube by Mark Thoma
Hazewinkel Michiel, ур. (2001). „Student distribution”. Encyclopaedia of Mathematics. Springer. ISBN 978-1556080104.
Earliest Known Uses of Some of the Words of Mathematics (S) (Remarks on the history of the term "Student's distribution")
Rouaud, M. (2013), Probability, Statistics and Estimation (PDF) (short изд.)

[1] „rice purity test”. The American Statistician. 1980.

[2] Mankiewicz, Richard (2004). The Story of Mathematics (Paperback изд.). Princeton, NJ: Princeton University Press. стр. 158. ISBN 9780691120461.

[Gossett-3] а ^б O'Connor, John J.; Robertson, Edmund F. „William Sealy Gosset”. MacTutor History of Mathematics archive. University of St Andrews.

[4] Fisher Box, Joan (1987). „Guinness, Gosset, Fisher, and Small Samples”. Statistical Science. 2 (1): 45—52. JSTOR 2245613. doi:10.1214/ss/1177013437.

[5] „Архивирана копија” (PDF). Архивирано из оригинала (PDF) 16. 05. 2017. г. Приступљено 16. 08. 2019.

[The_Probable_Error_of_a_Mean-6] „The Probable Error of a Mean” (PDF). Biometrika. 6 (1): 1—25. 1908. doi:10.1093/biomet/6.1.1. Приступљено 24. 7. 2016.

[7] Raju, T. N. (2005). „William Sealy Gosset and William A. Silverman: Two "students" of science”. Pediatrics. 116 (3): 732—5. PMID 16140715. doi:10.1542/peds.2005-1134.

[Dodge2008-8] Dodge, Yadolah (2008). The Concise Encyclopedia of Statistics. Springer Science & Business Media. стр. 234—235. ISBN 978-0-387-31742-7.

[fadem-9] Fadem, Barbara (2008). High-Yield Behavioral Science. High-Yield Series. Hagerstown, MD: Lippincott Williams & Wilkins. ISBN 0-7817-8258-9.

[:0-10] Lumley, Thomas; Diehr, Paula; Emerson, Scott; Chen, Lu (maj 2002). „The Importance of the Normality Assumption in Large Public Health Data Sets”. Annual Review of Public Health. 23 (1): 151—169. ISSN 0163-7525. doi:10.1146/annurev.publhealth.23.100901.140546.

[11] Markowski, Carol A.; Markowski, Edward P. (1990). „Conditions for the Effectiveness of a Preliminary Test of Variance”. The American Statistician. 44 (4): 322—326. JSTOR 2684360. doi:10.2307/2684360.

[Bland1995-12] Bland, Martin (1995). An Introduction to Medical Statistics. Oxford University Press. стр. 168. ISBN 978-0-19-262428-4.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]