[Discussion] statement of ASA on p-values

Time: Thu Mar 10 23:57:08 2016

The ASA recently issued a statement on statistical significance and p-values. Here are the summary and the full text first (the links point to my personal Dropbox and can be viewed without logging in):

Short summary: http://tinyurl.com/zswny43
The ASA's statement on p-values: context, process, and purpose: http://tinyurl.com/zw5yyum

The most important part is probably the following six principles:

1. P-values can indicate how incompatible the data are with a specified statistical model.
2. P-values do not measure the probability that the studied hypothesis is true, or the probability that the data were produced by random chance alone.
3. Scientific conclusions and business or policy decisions should not be based only on whether a p-value passes a specific threshold.
4. Proper inference requires full reporting and transparency.
5. A p-value, or statistical significance, does not measure the size of an effect or the importance of a result.
6. By itself, a p-value does not provide a good measure of evidence regarding a model or hypothesis.

A few personal thoughts:

Take the first point, "P-values can indicate how incompatible the data are with a specified statistical model." In other words, a p-value measures how far the data depart from the statistical distribution specified by the null hypothesis. To me this reads as a criticism of tests for whether data follow a particular distribution, which are clearly being misused. (This is also raised on page 9 of the full text, under the second point: "Researchers often wish to turn a p-value into a statement about the truth of a null hypothesis, or about the probability that random chance produced the observed data. The p-value is neither.")

The most common case is normality testing: someone runs a normality test, gets a p-value > 0.1, and declares that the data come from a normal distribution. The null hypothesis there is that the data are normal. By the first point, what the test measures is the degree of incompatibility with normality, not whether the data are normal, so the conclusion has to be worded very carefully.

I came across a paper title that I found quite interesting and want to share: Absence of evidence is not evidence of absence. This is really one of the key points of the ASA statement: a lack of evidence against the null hypothesis does not make the null hypothesis true. It is the same with normality tests: when the p-value > 0.1, the conclusion is that you have no evidence the data are non-normal, which does not mean the data are normal.
(Absence of evidence: no evidence of non-normality)
(evidence of absence: evidence of normality)
(The explanation of the second point also says: "It is a statement about data in relation to a specified hypothetical explanation, and is not a statement about the explanation itself.")

The fifth point is also important: "A p-value, or statistical significance, does not measure the size of an effect or the importance of a result." P-values cannot be used to compare degrees of importance, and a smaller p-value does not mean a more important result. The ASA lists other approaches that address the fifth point, such as confidence, credibility, or prediction intervals; Bayesian methods; alternative measures of evidence, such as likelihood ratios or Bayes Factors. The full passage:

In view of the prevalent misuses of and misconceptions concerning p-values, some statisticians prefer to supplement or even replace p-values with other approaches. These include methods that emphasize estimation over testing, such as confidence, credibility, or prediction intervals; Bayesian methods; alternative measures of evidence, such as likelihood ratios or Bayes Factors; and other approaches such as decision-theoretic modeling and false discovery rates. All these measures and approaches rely on further assumptions, but they may more directly address the size of an effect (and its associated uncertainty) or whether the hypothesis is correct.

Does anyone have thoughts on this ASA statement?

A blog post I saw on the morning of 3/11 that lays out some of the value of p-values: http://tinyurl.com/jebjua6

--

※ Origin: PTT (批踢踢实业坊, ptt.cc), From: 180.218.152.118
※ Article URL: https://webptt.com/cn.aspx?n=bbs/Statistics/M.1457625431.A.E49.html

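1^F: push allen1985: I think this is a fairly "balanced" piece, worth a read 03/11 03:21

2^F: → allen1985: There are a lot of anti-p-value people in recent years, but some of the criticism goes too far; after all 03/11 03:22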

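Over the past few days I saw an R blogger post titled "ASA says NO to p-values".... That is really over the top ~"~. I lean toward reading the statement as the ASA laying out the value of p-values and correcting misconceptions.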

3^F: → allen1985: statistics still needs some way to reach a conclusion 03/11 03:22

4^F: → allen1985: A question I've always wanted to ask: do p-value = 0.8 and p-value = 0.6 03/11 03:23

5^F: → allen1985: differ, and do p-value = 0.01 and p-value = 0.00001 03/11 03:24

6^F: → allen1985: differ 03/11 03:24

Comparing p-values with each other is already meaningless in itself, so asking whether they differ is even more beside the point.

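7^F: → andrew43: Replying to the comment above: I don't think comparing them on their own means much; you still need to look at 03/11 07:27

8^F: → andrew43: other measures, such as effect size or Bayes factor. 03/11 07:28

9^F: → andrew43: Otherwise you could say there's a difference, but when drawing conclusions you could just as well say there isn't. 03/11 07:31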
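A minimal sketch of why a p-value and an effect size can point in different directions. It assumes a two-sample z-test with known unit variance and equal group sizes, a setup chosen purely for illustration; nobody in the thread specifies one. The function name `two_sided_p` and both scenarios are hypothetical.

```python
import math

def two_sided_p(d, n_per_group):
    """Two-sided p-value of a two-sample z-test.

    Assumes known unit variance in both groups, so an observed
    standardized mean difference d gives z = d * sqrt(n / 2).
    """
    z = d * math.sqrt(n_per_group / 2)
    # erfc(|z| / sqrt(2)) equals 2 * (1 - Phi(|z|)) and stays accurate in the far tail
    return math.erfc(abs(z) / math.sqrt(2))

# Hypothetical scenario A: negligible effect, huge sample
p_tiny_effect = two_sided_p(d=0.02, n_per_group=1_000_000)
# Hypothetical scenario B: large effect, small sample
p_large_effect = two_sided_p(d=0.8, n_per_group=10)

print(f"d=0.02, n=1,000,000 per group: p = {p_tiny_effect:.3g}")
print(f"d=0.80, n=10 per group:        p = {p_large_effect:.3g}")
```

Under these assumptions the negligible effect gets the far smaller p-value, because the p-value mixes effect size with sample size. That is one reason to report the effect size (and an interval for it) next to the p-value.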

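I'm actually quite curious about the likelihood ratios the statement mentions. Is a likelihood ratio better suited as a measure of evidence because it is the ratio of the likelihoods under two hypotheses? That is unlike an ordinary hypothesis test, where the null and the alternative are complements of each other.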
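For reference, a minimal sketch of a likelihood ratio between two simple hypotheses, assuming binomial data. The counts (60 successes in 100 trials) and the two candidate success probabilities are made up for illustration, and `binomial_likelihood_ratio` is a hypothetical helper, not something from the ASA statement.

```python
import math

def binomial_likelihood_ratio(k, n, p_a, p_b):
    """Likelihood of p_a relative to p_b given k successes in n trials.

    The binomial coefficient is the same under both hypotheses and cancels,
    so only the p^k * (1 - p)^(n - k) parts are needed.
    """
    log_lr = k * math.log(p_a / p_b) + (n - k) * math.log((1 - p_a) / (1 - p_b))
    return math.exp(log_lr)

# Hypothetical data: 60 successes in 100 trials
lr = binomial_likelihood_ratio(k=60, n=100, p_a=0.6, p_b=0.5)
print(f"LR for p=0.6 vs p=0.5: {lr:.2f}")
```

With these numbers the ratio comes out at roughly 7.5 in favor of p = 0.6 over p = 0.5. The number is always relative to the two hypotheses that were named, so it says how the data favor one stated explanation over another rather than whether either is true.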

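10^F: → allen1985: What I mainly mean is that most papers now hold that non-significant is just 03/11 08:11

11^F: → allen1985: non-significant, and two non-significant p-values can't be compared directly, yet 03/11 08:11

12^F: → allen1985: quite a lot of people still compare them 03/11 08:11

13^F: → allen1985: I absolutely agree that many things can't be judged by the p-value alone; you need other measures and plots 03/11 08:12

14^F: → allen1985: before you can draw a conclusion 03/11 08:12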

That is the third point: p-values should not be used for all-or-nothing inference, and in fact a p-value cannot do that job. An excerpt from the third point: Pragmatic considerations often require binary, “yes-no” decisions, but this does not mean that p-values alone can ensure that a decision is correct or incorrect. The widespread use of “statistical significance” (generally interpreted as “p <= 0.05”) as a license for making a claim of a scientific finding (or implied truth) leads to considerable distortion of the scientific process.

15^F: → KirinGuess: So the OP disagrees with the statement's first point? 03/12 19:22

16^F: → KirinGuess: And thinks the first point conflicts with the rest of the statement? 03/12 19:22

I think the first point is very well put XD. Normality testing is simply a common misuse.
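A rough sketch of the normality-test misuse, under stated assumptions: the thread names no particular test, so this uses the Jarque-Bera statistic with its asymptotic chi-square(2) reference distribution only because it needs nothing beyond the standard library. The sample size, trial count, seed, and function names are all hypothetical choices. Every sample is drawn from Uniform(0, 1), which is clearly not normal.

```python
import math
import random

def jarque_bera_pvalue(xs):
    """Asymptotic p-value of the Jarque-Bera normality test."""
    n = len(xs)
    mean = sum(xs) / n
    m2 = sum((x - mean) ** 2 for x in xs) / n
    m3 = sum((x - mean) ** 3 for x in xs) / n
    m4 = sum((x - mean) ** 4 for x in xs) / n
    skew = m3 / m2 ** 1.5
    kurt = m4 / m2 ** 2
    jb = n / 6 * (skew ** 2 + (kurt - 3) ** 2 / 4)
    # Survival function of a chi-square with 2 degrees of freedom is exp(-x / 2)
    return math.exp(-jb / 2)

def share_not_rejected(n=20, trials=2000, alpha=0.1, seed=0):
    """Fraction of small uniform samples for which normality is NOT rejected."""
    rng = random.Random(seed)
    kept = 0
    for _ in range(trials):
        sample = [rng.random() for _ in range(n)]  # Uniform(0, 1), not normal
        if jarque_bera_pvalue(sample) > alpha:
            kept += 1
    return kept / trials

print(f"share with p > 0.1: {share_not_rejected():.2f}")
```

In this setup most samples give p > 0.1 even though none of them come from a normal distribution. The asymptotic approximation is itself rough at n = 20, which is part of why the test has so little power here, and a different test or a larger sample would behave differently. The point is only that "not rejected" cannot be read as "normal".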

17^F: push milk0925: This post helped me a lot, thanks for sharing! 03/12 22:18

You're welcome
※ Edited: celestialgod (180.218.152.118), 03/12/2016 22:31:09
※ Edited: celestialgod (140.109.74.87), 04/19/2016 15:23:49
