Unmasking P-values
20 May 2020
By Laurence Collette, PhD, Scientific Development Leader, EORTC HQ, Brussels, Belgium
P-values are ubiquitous in the medical literature. Unfortunately, there is a general tendency to put too much emphasis on them. Too often, one sees the scientific community overemphasize or disregarded results of medical experiments based on solitary P-value. That P-value being “significant” determines if a trial was “a success”.
P-values are effectively an extremely simplified statistical summary of the results of otherwise very complex scientific experiments, be they clinical trials or biomarker association studies. Their simplicity makes them attractive but is also the cause of their abuse.
P-values and the notion of statistical significance are very often misinterpreted and misunderstood. For this reason and because of their effective shortcomings, (which we will address below), the past years has seen a number of scientists rage against their use in provocative editorials “Scientists rise up against statistical significance” (Amrhein V et al Nature 2019, Sucz and Ioannidis 2017), or propose to proscribe statistical significance (Ioannidis 2018, Ioannidis 2019). Statisticians themselves have dedicated an entire journal volume to the debate « Moving to a world beyond P<0.05” (Wasserstein 2019).
So, what are P-values exactly and why are they so often abused?
The P-value is a concept invented by Sr. Ronald Fisher in 1925 (Fisher 1925) to test a null hypothesis ((H0) concerning the magnitude of a parameter of interest (e.g. mean of measurements, difference in response rates, correlation between two biomarker measurements, hazard ratio in a comparative survival study). The P-value is the probability to observe a result, at least, as extreme as the one observed in the experiment, under the assumption that the null hypothesis (that the parameter is zero) was actually true.
The P-value measures how compatible the observed results are with the stated null hypothesis. If one assumes that the data follow the stated imaginary distribution and that there is no bias in trial conduct or data collection, then the data should exhibit only random fluctuations around the hypothesised (null) value of the parameter of interest.
Related News
EORTC participation at ISPOR Europe 2025 conference
7 Nov 2025
EORTC Quality of Life Group participating at the ISOQOL 2025 Conference
22 Oct 2025
New results and forthcoming trials presented at ESMO 2025 underline EORTC’s commitment to patient-centred research
21 Oct 2025
EORTC celebrates the 100th anniversary of p-value on World Statistics Day
20 Oct 2025
EORTC Announces Final Overall Survival Results from the PEACE-3 Trial
19 Oct 2025
Results from EORTC trial define new standard of care for aggressive brain tumours
18 Oct 2025
EORTC’s presence at ESMO 2025
17 Oct 2025
EORTC’s presence at EANO 2025
16 Oct 2025
EORTC celebrates Pink October: shaping the future of breast cancer care
15 Oct 2025
EORTC shines a light on advancements in radioligand therapy
8 Oct 2025
