The interplay between PROM score distributions and treatment effect detection likelihood in randomized controlled trials–a metaepidemiologic study
Panula, Valtteri; Saarinen, Antti; Vaajala, Matias; Liukkonen, Rasmus; Pakarinen, Oskari; Laaksonen, Juho; Ponkilainen, Ville; Kuitunen, Ilari; Uimonen, Mikko (2025-03)
Panula, Valtteri
Saarinen, Antti
Vaajala, Matias
Liukkonen, Rasmus
Pakarinen, Oskari
Laaksonen, Juho
Ponkilainen, Ville
Kuitunen, Ilari
Uimonen, Mikko
03 / 2025
Journal of Clinical Epidemiology
112114
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi:tuni-202601231821
https://urn.fi/URN:NBN:fi:tuni-202601231821
Kuvaus
Peer reviewed
Tiivistelmä
Objectives We hypothesized that, in musculoskeletal randomized controlled trials (RCTs) using patient-reported outcome measures (PROMs), higher baseline scores and the clustering of follow-up scores near the upper bound (ie, ceiling effect) compress variability and attenuate measurable between-group differences, thereby lowering the likelihood of observing a statistically significant effect. We therefore examined how score distributions at pretreatment and follow-up influence the likelihood of detecting between-group differences. Study Design and Setting We conducted a metaepidemiologic study of RCTs, published between 2015 and 2024, that compared treatment effects on musculoskeletal disorders between two study groups using PROMs. The observed distributions of the PROM scores at baseline and follow-up were collected from the included studies. All PROM scores were rescaled to 0-100 with higher scores indicating better health. The likelihood of observing a statistically significant difference in PROM scores between the study groups was examined by calculating the score difference required to achieve a P value <.05. Results A total of 255 RCTs were included. PROM scores improved from baseline to follow-up in most studies (98%), with a mean change of +28 points. The correlation coefficient between the mean baseline score and mean score change was −0.66 (95% CI -0.72 to −0.59) indicating that higher baseline scores were associated with lower score change. In addition, there was a moderate correlation between the mean and SD of PROM scores at follow-up (−0.39; 95% CI -0.48 to −0.28). The mean likelihood of detecting a difference was 65% (SD 11%) at baseline and 65% (SD 11%) at follow-up. The likelihood reached the 80% benchmark in only 8.5% and 8.1% of the studies at baseline and follow-up, respectively. Conclusion The concentration of PROM score distributions toward the high end of the scale, especially when higher baseline scores are present, diminishes the likelihood of detecting significant differences between study groups, particularly at follow-up assessments in studies analyzing musculoskeletal complaints. This underscores the importance of critically evaluating the conclusions drawn from these studies.
Kokoelmat
- TUNICRIS-julkaisut [24210]
