Reverse Bayesian Implications of p-Values Reported in Critical Care Randomized Trials

1. Ioannidis, JPA . Why most published research findings are false. PLoS Med. 2005;2(8):e124.
Google Scholar | Crossref | Medline | ISI2. Goodman, SN . Toward evidence-based medical statistics. 1: the p value fallacy. Ann Intern Med. 1999;130(12):995-1004.
Google Scholar | Crossref | Medline | ISI3. Greenland, S, Senn, SJ, Rothman, KJ, et al. Statistical tests, P-values, confidence intervals, and power: a guide to misinterpretations. Eur J Epidemiol. 2016;31:337-350.
Google Scholar | Crossref | Medline | ISI4. American Statistical Association Board of Directors . ASA Statement on statistical significance and p—values. Am Stat. 2016;70(2):131-133.
Google Scholar5. Wasserstein, RL, Schirm, AL, Lazar, NA. Moving to a world beyond “p<.05”. Am Stat. 2019;73(Suppl1):1-19.
Google Scholar | Crossref6. Price, R, Bethune, R, Massey, L. Problem with p values: why p values do not tell you if your treatment is likely to work. Postgrad Med J. 2020;96(1131):1–3.
Google Scholar | Crossref | Medline7. Goodman, SN . Toward evidence-based medical statistics. 2: the Bayes factor. Ann Intern Med. 1999;130(12):1005-1013.
Google Scholar | Crossref | Medline | ISI8. Colquhoun, D . The false positive risk: a proposal concerning what to do about p-values. Am Stat. 2019;73(Suppl1):192-201.
Google Scholar | Crossref9. Colquhoun, D . The reproducibility of research and the misinterpretation of p-values. R Soc Open Sci. 2017;4:171085.
Google Scholar | Crossref | Medline10. Benjamin, DJ, Berger, JO. Three recommendations for improving the use of p-values. Am Stat. 2019;73(S1):186-191.
Google Scholar | Crossref11. Sellke, T, Bayarri, MJ, Berger, JO. Calibration of p values for testing precise null hypotheses. Am Stat. 2001;55(1):62-71.
Google Scholar | Crossref | ISI12. Held, L, Matthews, R, Ott, M, Pawel, S. Reverse-Bayes methods for evidence assessment and research synthesis. arXiv.org 2021 (preprint). DOI: 2102.13443.v2. Available at: https://arxiv.org/abs/2102.13443 (accessed July 30, 2021).
Google Scholar13. Allmark, P . Bayes and health care research. Med Health Care Philos. 2004;7(3):321-332.
Google Scholar | Crossref | Medline14. Held, L . A nomogram for P values. BMC Med Res Methodol. 2010;10:21.
Google Scholar | Crossref | Medline | ISI15. Matthews, RAJ . Beyond ‘significance’: principles and practice of the analysis of credibility. R Soc Open ci. 2018;5:171047.
Google Scholar | Crossref | Medline16. Matthews, RAJ . Moving towards the post p<.05 era via the analysis of credibility. Am Stat. 2019;73(Suppl1):202-212.
Google Scholar | Crossref17. Santacruz, CA, Pereira, AJ, Celis, E, Vincent, JL. Which multicenter randomized controlled trials in critical care medicine have shown reduced mortality? A systematic review. Crit Care Med. 2019;47(12):1680-1691.
Google Scholar | Crossref | Medline18. Duffett, M, Choong, K, Hartling, L, Menon, K, Thabane, L, Cook, DJ. Randomized controlled trials in pediatric critical care: a scoping review. Critical Care. 2013;17(5):R256.
Google Scholar | Crossref | Medline19. Benjamin, DJ, Berger, JO, Johannesson, M, et al. Redefine statistical significance. Nat Hum Behav. 2018;2(1):6-10.
Google Scholar | Crossref | Medline20. Ioannidis, JPA . The proposal to lower P-value thresholds to .005. JAMA. 2018;319(4):1429-1430.
Google Scholar | Crossref | Medline21. Lazzeroni, LC, Lu, Y, Belitsakaya-Levy, I. Solutions for quantifying p-value uncertainty and replication power. Nat Methods. 2016;13(2):107-108.
Google Scholar | Crossref | Medline22. Cumming, G . Replication and p intervals. P values predict the future only vaguely, but confidence intervals do much better. Perspectives on Psychological Science. 2018;3(4):286-300.
Google Scholar | SAGE Journals23. Young, NS, Ioannidis, JPA, Al-Ubaydii, O. Why current publication practices may distort science. PLoS Med. 2008;5(10):e201.
Google Scholar | Crossref | Medline | ISI24. Altman, N, Krzywinski, M. Interpreting P values. Nat Methods. 2017;14(3):213-214.
Google Scholar | Crossref25. Halsey, LG . The reign of the p-value is over: what alternative analyses could we employ to fill the power vacuum? Biol Lett. 2019;15:20190174.
Google Scholar | Crossref | Medline26. Johnson, N, Lilford, RJ, Brazier, W. At what level of collective equipoise does a clinical trial become ethical? J Med Ethics. 1991;17(1):30-34.
Google Scholar | Crossref | Medline | ISI27. Abrams, D, Montesi, SB, Moore, SKL, et al. Powering bias and clinically important treatment effects in randomized trials of critical illness. Crit Care Med. 2020;48(12):1710-1719.
Google Scholar | Crossref | Medline28. Joffe, AR, Bara, M, Anton, N, Nobis, N. Expectations for the methodology and translation of animal research: a survey of the general public, medical students and animal researchers in North America. Altern Lab Anim. 2016;44(4):361-381.
Google Scholar | SAGE Journals29. Pippin, JJ . Animal research in medical sciences: seeking a convergence of science, medicine, and animal law. S Tex L Rev. 2013;54:469.
Google Scholar30. Ranieri, VM, Thompson, BT, Barie, PS, et al. Williams MD, for the PROWESS-SHOCK study group. Drotrecogin alfa (activated) in adults with septic shock. NEJM. 2012;366(22):2055-2064.
Google Scholar31. National Heart, Lung, and Blood Institute PETAL Clinical Trials Network ; Moss, M, Huang, DT, Brower, RG, et al. Early neuromuscular blockade in the acute respiratory distress syndrome. NEJM. 2019;380(21):1997-2008.
Google Scholar | Crossref | Medline32. Mouncey, PR, Osborn, TM, Power, GS, et al. for the ProMISe Trial Investigators . Trial of early, goal-directed resuscitation for septic shock. NEJM. 2015;372(14):1301-1311.
Google Scholar | Crossref | Medline | ISI33. The NICE-SUGAR Study Investigators . Intensive versus conventional glucose control in critically ill patients. NEJM. 2009;360(13):1283-1297.
Google Scholar | Crossref | Medline | ISI34. Held, L . Reverse-Bayes analysis of two common misinterpretations of significance tests. Clinical Trials. 2013;10(2):236-242.
Google Scholar | SAGE Journals | ISI35. Ridgeon, EE, Young, PJ, Bellomo, R, Muchetti, M, Lembo, R, Landoni, G. The fragility index in multicenter randomized controlled critical care trials. Crit Care Med. 2016;44(7):1278-1284.
Google Scholar | Crossref | Medline | ISI36. Grolleau, F, Collins, GS, Smarandache, A, et al. The fragility and reliability of conclusions of anesthesia and critical care randomized trials with statistically significant findings: a systematic review. Crit Care Med. 2019;47(3):456-462.
Google Scholar | Crossref | Medline37. Vargas, M, Buonano, P, Marra, A, Iacovazzo, C, Servillo, G. Fragility index in multicenter randomized controlled trials in critical care medicine that have shown reduced mortality. Crit Care Med. 2020;48(3):e250-e251.
Google Scholar | Crossref | Medline38. Matics, TJ, Khan, N, Jani, P, Kane, JM. The fragility of statistically significant findings in pediatric critical care randomized controlled trials. Pediatr Crit Care Med. 2019;20(6):e258-e262.
Google Scholar | Crossref | Medline39. Carter, RE, McKie, PM, Storlie, CB. The fragility index: a P-value in sheep's Clothing? Eur Heart J. 2017;38(5):346-348.
Google Scholar | Medline40. Forstmeier, W, Wagenmakers, E-J, Parker, TH. Detecting and avoiding likely false-positive findings - a practical guide. Biol Rev. 2017;92(4):1941-1968.
Google Scholar | Crossref | Medline41. Munafo, MR, Nosek, BA, Bishop, DVM, et al. A manifesto for reproducible science. Nat Hum Behav. 2017;1:0021.
Google Scholar | Crossref | Medline | ISI42. Higginson, AD, Munafo, MR. Current incentives for scientists lead to underpowered studies with erroneous conclusions. PLoS Biol. 2016;14(11):e2000995.
Google Scholar | Crossref | Medline43. Smaldino, PE, McElreath, R. The natural selection of bad science. R Soc Open Sci. 2016;3:160384.
Google Scholar | Crossref | Medline44. Simmons, JP, Nelson, LD, Simonsohn, U. False-positive psychology: undisclosed flexibility in data collection and analysis allows presenting anything as significant. Psych Sci. 2011;22(11):1359-1366.
Google Scholar | SAGE Journals45. Szucs, D . A tutorial on hunting statistical significance by chasing n. Front Psychol. 2016;7:1444.
Google Scholar | Crossref | Medline

留言 (0)

沒有登入
gif