TY - JOUR
T1 - The p-Value Requires Context, Not a Threshold
AU - Betensky, Rebecca A.
N1 - Funding Information:
This work was supported in part by NIH Grant UL1TR001102.
Publisher Copyright:
© 2019, © 2019 The Author(s). Published with license by Taylor & Francis Group, LLC.
PY - 2019/3/29
Y1 - 2019/3/29
N2 - It is widely recognized by statisticians, though not as widely by other researchers, that the p-value cannot be interpreted in isolation, but rather must be considered in the context of certain features of the design and substantive application, such as sample size and meaningful effect size. I consider the setting of the normal mean and highlight the information contained in the p-value in conjunction with the sample size and meaningful effect size. The p-value and sample size jointly yield 95% confidence bounds for the effect of interest, which can be compared to the predetermined meaningful effect size to make inferences about the true effect. I provide simple examples to demonstrate that although the p-value is calculated under the null hypothesis, and thus seemingly may be divorced from the features of the study from which it arises, its interpretation as a measure of evidence requires its contextualization within the study. This implies that any proposal for improved use of the p-value as a measure of the strength of evidence cannot simply be a change to the threshold for significance.
AB - It is widely recognized by statisticians, though not as widely by other researchers, that the p-value cannot be interpreted in isolation, but rather must be considered in the context of certain features of the design and substantive application, such as sample size and meaningful effect size. I consider the setting of the normal mean and highlight the information contained in the p-value in conjunction with the sample size and meaningful effect size. The p-value and sample size jointly yield 95% confidence bounds for the effect of interest, which can be compared to the predetermined meaningful effect size to make inferences about the true effect. I provide simple examples to demonstrate that although the p-value is calculated under the null hypothesis, and thus seemingly may be divorced from the features of the study from which it arises, its interpretation as a measure of evidence requires its contextualization within the study. This implies that any proposal for improved use of the p-value as a measure of the strength of evidence cannot simply be a change to the threshold for significance.
KW - Effect size
KW - Sample size
KW - Statistical significance
UR - http://www.scopus.com/inward/record.url?scp=85063215388&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85063215388&partnerID=8YFLogxK
U2 - 10.1080/00031305.2018.1529624
DO - 10.1080/00031305.2018.1529624
M3 - Article
AN - SCOPUS:85063215388
SN - 0003-1305
VL - 73
SP - 115
EP - 117
JO - American Statistician
JF - American Statistician
IS - sup1
ER -