Carta Acesso aberto Revisado por pares

The quality of the reported sample size calculation in clinical trials on COVID-19 patients indexed in PubMed

2020; Elsevier BV; Volume: 77; Linguagem: Inglês

10.1016/j.ejim.2020.04.057

ISSN

1879-0828

Autores

Paul H. Lee,

Tópico(s)

COVID-19 and healthcare impacts

Resumo

Given the utmost priority of COVID-19 research, many medical journals, especially the leading ones, expedited the review process of these papers. It was expected that the amount of submitted papers for peer review would raise sharply and the deadline of the review period at the COVID-19 outbreak will be tightened. Therefore, sample size calculation of these papers, a component that was being omitted in more than 40% of the published randomized controlled trials,1Lee PH Tse ACY. The Diagnosis Checking of Statistical Analysis in Randomised Controlled Trials Indexed in PubMed.Eur J Clin Invest. 2017; 47: 847-852Crossref PubMed Scopus (2) Google Scholar might be neglected during the peer review process. Most importantly, authors of these papers were also rushing to conduct their COVID-19 research and they might not have seek necessary statistical consultation regarding sample size calculation. The CONsolidated Standards Of Reporting Trials (CONSORT) statement recommends trial reports to provide all essential information on the determination of sample size, including the level of statistical significance, the desired level of power, and the estimated effect size of the treatment. However, the degree of compliance to the CONSORT statement regarding sample size calculation of the newly-published COVID-19 trial papers is unknown. Therefore, we reviewed all clinical trials on COVID-19 patients published from 1st January 2020 to 4th April 2020 indexed in PubMed. We adopted the revised Cochrane highly sensitive search strategy 2Robinson KA Dickersin K. Development of a highly sensitive search strategy for the retrieval of reports of controlled trials using PubMed.Int J Epidemiol. 2002; 31: 150-153Crossref PubMed Scopus (564) Google Scholar to search PubMed for papers reporting clinical trial involving COVID-19 patients, either randomized or non-randomized, indexed by 4th April 2020. The search term can be found in the supplementary material. A total of 521 papers were identified. Papers were excluded when they were 1) case studies or case series (n=25), 2) commentaries, editorials, or letters to the editor (n=189), 3) guidelines (n=61), 4) observational studies (n=108), 5) reviews (n=69), 6) studies on the virus instead of the patients (n=15), and 7) not about COVID-19 (n=50). Finally, four studies were included in this study. Details of these papers and the reason of exclusion are listed in Supplementary Table 2. The following information, together with the relevant texts in the papers, were extracted: 1) whether the group allocation of patients was randomized, 2) the measurement type of the primary outcome that the sample size calculation based on, 3) the level of significance used, 4) the desired power, and 5) the estimated effect size and the relevant statistics to calculate this effect size. For papers that provide all essential information to replicate their sample size calculation, including the level of significance, the desired power, and the estimated effect size, we calculated the sample size required to achieve the reported desired power given the level of significance suggested by the authors of these papers. Since all included studies were superiority trials, we will assume that the test was two-tailed if it was not specified in the text. We referred to the standard textbook formulas for calculating the sample size.3Donner A. Approaches to sample size estimation in the design of clinical trials–a review.Stat Med. 1984; 3: 199-214Crossref PubMed Scopus (232) Google Scholar Supplementary Table 1 shows the information of the four included papers.4Gautret P Lagier JC Parola P et al.Hydroxychloroquine and azithromycin as a treatment of COVID-19: results of an open-label non-randomized clinical trial.Int J Antimicrob Agents. 2020; (article in press)Crossref PubMed Scopus (3763) Google Scholar, 5Wu CN Xia LZ Li KH et al.High-flow nasal-oxygenation-assisted fibreoptic tracheal intubation in critically ill patients with COVID-19 pneumonia: a prospective randomised controlled trial.Br J Anaesth. 2020; (article in press)Abstract Full Text Full Text PDF Scopus (34) Google Scholar, 6Cao B Wang Y Wen D et al.A Trial of Lopinavir-Ritonavir in Adults Hospitalized with Severe Covid-19.New Engl J Med. 2020; (article in press)Crossref Scopus (3870) Google Scholar, 7Zhou YH Qin YY Lu YQ et al.Effectiveness of glucocorticoid therapy in patients with severe novel coronavirus pneumonia: protocol of a randomized controlled trial.Chin Med J. 2020; (article in press)Google Scholar Three of them were drug trials (hydroxychloroquine and azithromycin,4Gautret P Lagier JC Parola P et al.Hydroxychloroquine and azithromycin as a treatment of COVID-19: results of an open-label non-randomized clinical trial.Int J Antimicrob Agents. 2020; (article in press)Crossref PubMed Scopus (3763) Google Scholar lopinavir and ritonavir,6Cao B Wang Y Wen D et al.A Trial of Lopinavir-Ritonavir in Adults Hospitalized with Severe Covid-19.New Engl J Med. 2020; (article in press)Crossref Scopus (3870) Google Scholar and methylprednisolone7Zhou YH Qin YY Lu YQ et al.Effectiveness of glucocorticoid therapy in patients with severe novel coronavirus pneumonia: protocol of a randomized controlled trial.Chin Med J. 2020; (article in press)Google Scholar) and the remaining one examined the effectiveness of high-flow nasal-oxygenation-assisted fibreoptic tracheal intubation.5Wu CN Xia LZ Li KH et al.High-flow nasal-oxygenation-assisted fibreoptic tracheal intubation in critically ill patients with COVID-19 pneumonia: a prospective randomised controlled trial.Br J Anaesth. 2020; (article in press)Abstract Full Text Full Text PDF Scopus (34) Google Scholar All four studies were a parallel trial. Three studies reported the level of significance, the desired power, and the estimated effect size,4Gautret P Lagier JC Parola P et al.Hydroxychloroquine and azithromycin as a treatment of COVID-19: results of an open-label non-randomized clinical trial.Int J Antimicrob Agents. 2020; (article in press)Crossref PubMed Scopus (3763) Google Scholar,6Cao B Wang Y Wen D et al.A Trial of Lopinavir-Ritonavir in Adults Hospitalized with Severe Covid-19.New Engl J Med. 2020; (article in press)Crossref Scopus (3870) Google Scholar,7Zhou YH Qin YY Lu YQ et al.Effectiveness of glucocorticoid therapy in patients with severe novel coronavirus pneumonia: protocol of a randomized controlled trial.Chin Med J. 2020; (article in press)Google Scholar and we replicated the sample size calculation for these three studies step-by-step in the following paragraphs. In two trials,4Gautret P Lagier JC Parola P et al.Hydroxychloroquine and azithromycin as a treatment of COVID-19: results of an open-label non-randomized clinical trial.Int J Antimicrob Agents. 2020; (article in press)Crossref PubMed Scopus (3763) Google Scholar,7Zhou YH Qin YY Lu YQ et al.Effectiveness of glucocorticoid therapy in patients with severe novel coronavirus pneumonia: protocol of a randomized controlled trial.Chin Med J. 2020; (article in press)Google Scholar the targeted level of significance ("a type I error rate of 5%" 4Gautret P Lagier JC Parola P et al.Hydroxychloroquine and azithromycin as a treatment of COVID-19: results of an open-label non-randomized clinical trial.Int J Antimicrob Agents. 2020; (article in press)Crossref PubMed Scopus (3763) Google Scholar and "a level of 95% confidence" 7Zhou YH Qin YY Lu YQ et al.Effectiveness of glucocorticoid therapy in patients with severe novel coronavirus pneumonia: protocol of a randomized controlled trial.Chin Med J. 2020; (article in press)Google Scholar) and the desired power ("a 85% power"4Gautret P Lagier JC Parola P et al.Hydroxychloroquine and azithromycin as a treatment of COVID-19: results of an open-label non-randomized clinical trial.Int J Antimicrob Agents. 2020; (article in press)Crossref PubMed Scopus (3763) Google Scholar and "a power of 80%" 7Zhou YH Qin YY Lu YQ et al.Effectiveness of glucocorticoid therapy in patients with severe novel coronavirus pneumonia: protocol of a randomized controlled trial.Chin Med J. 2020; (article in press)Google Scholar) were reported. However, some information for calculating the treatment effect size were missing in these two papers. Zhou et al. (2020)7Zhou YH Qin YY Lu YQ et al.Effectiveness of glucocorticoid therapy in patients with severe novel coronavirus pneumonia: protocol of a randomized controlled trial.Chin Med J. 2020; (article in press)Google Scholar stated that "detecting a reduction of 40% in SOFA scores between the treatment and nontreatment groups",7Zhou YH Qin YY Lu YQ et al.Effectiveness of glucocorticoid therapy in patients with severe novel coronavirus pneumonia: protocol of a randomized controlled trial.Chin Med J. 2020; (article in press)Google Scholar however the SOFA (or the Sequential Organ Failure Assessment) score is a continuous variable. To calculate the sample size, the post-treatment between-group mean difference and the pooled standard deviation of the primary outcome variable, or the Cohen's d effect size which equals the between-group mean difference divided by the pooled standard deviation, should be estimated. The sample size of 23 per group as stated by the authors ("Considering a dropout rate of 5%, the sample size is estimated to be 24 cases in each group") implied that they estimated the Cohen's d to be 0.83 (= (z0.025+z0.2)×√(2/n) = (1.96+0.84)×√(2/23)). In Gautret et al. (2020),4Gautret P Lagier JC Parola P et al.Hydroxychloroquine and azithromycin as a treatment of COVID-19: results of an open-label non-randomized clinical trial.Int J Antimicrob Agents. 2020; (article in press)Crossref PubMed Scopus (3763) Google Scholar the authors only reported the effect of the treatment (hydroxychloroquine) group ("Assuming a 50% efficacy of hydroxychloroquine in reducing the viral load at day 7" 4Gautret P Lagier JC Parola P et al.Hydroxychloroquine and azithromycin as a treatment of COVID-19: results of an open-label non-randomized clinical trial.Int J Antimicrob Agents. 2020; (article in press)Crossref PubMed Scopus (3763) Google Scholar) but the effect of the control group was missing. Note that if we assume the effect of the control group is 0%, which is a common assumption in effect size estimation, the required sample size calculated using Fleiss' formula 8Fleiss JL Tytun A Ury HK A Simple Approximation for Calculating Sample Sizes for Comparing Independent Proportions.Biometrics. 1980; 36: 343-346Crossref PubMed Scopus (281) Google Scholar would be (z0.025 √(2(p1+p2) (p1-p2))) + z0.2 √(p1(1-p1)+p2(1-p2)))2/(p1-p2)2Robinson KA Dickersin K. Development of a highly sensitive search strategy for the retrieval of reports of controlled trials using PubMed.Int J Epidemiol. 2002; 31: 150-153Crossref PubMed Scopus (564) Google Scholar = (1.96√(2(0.5)(0.25))) + 0.84√(0.5(1-0.5))2/(0.5)2Robinson KA Dickersin K. Development of a highly sensitive search strategy for the retrieval of reports of controlled trials using PubMed.Int J Epidemiol. 2002; 31: 150-153Crossref PubMed Scopus (564) Google Scholar = 12 per group, and the required sample size using Fleiss' formula with continuity correction 8Fleiss JL Tytun A Ury HK A Simple Approximation for Calculating Sample Sizes for Comparing Independent Proportions.Biometrics. 1980; 36: 343-346Crossref PubMed Scopus (281) Google Scholar would be n(1+√(1+4/n(p1-p2)))2/4 = 12(1+√(1+4/12(0.5)))2/4 = 16 per group. The authors, claiming that Fleiss' formula with continuity correction was used, calculated that 21 subjects per group were required ("assume …10% loss to follow-up, we calculated that a total of 48 COVID-19 patients (ie, 24 cases in the hydroxychloroquine group and 24 in the control group) would be required for the analysis (Fleiss with CC)" 4Gautret P Lagier JC Parola P et al.Hydroxychloroquine and azithromycin as a treatment of COVID-19: results of an open-label non-randomized clinical trial.Int J Antimicrob Agents. 2020; (article in press)Crossref PubMed Scopus (3763) Google Scholar), which implied that they assumed the effect of the control group would be around 5%, i.e., 1 out of the 24 in the control group would achieve a viral load clearance. This assumption deviated with the usual assumption that the control group has no effect, and the authors should have specify this assumption in the main text. In the only trial that the all essential information for sample size calculation was provided,6Cao B Wang Y Wen D et al.A Trial of Lopinavir-Ritonavir in Adults Hospitalized with Severe Covid-19.New Engl J Med. 2020; (article in press)Crossref Scopus (3870) Google Scholar the sample size calculated in the paper could not be reproduced. The paper reported that "8 days in the median time to clinical improvement between the two groups, assuming that the median time in the standard-care group was 20 days".6Cao B Wang Y Wen D et al.A Trial of Lopinavir-Ritonavir in Adults Hospitalized with Severe Covid-19.New Engl J Med. 2020; (article in press)Crossref Scopus (3870) Google Scholar Using the reported statistics, the hazard of the treatment (lopinavir-ritonavir) group and control group would be 0.0577 and 0.0347, respectively (hazard = ln(2) / median time of survival). Given the censoring time of 28 days ("The time to clinical improvement was assessed after all patients had reached day 28, with failure to reach clinical improvement or death before day 28 considered as right-censored at day 28."),6Cao B Wang Y Wen D et al.A Trial of Lopinavir-Ritonavir in Adults Hospitalized with Severe Covid-19.New Engl J Med. 2020; (article in press)Crossref Scopus (3870) Google Scholar the percentage of patients reaching clinical improvement should be the average of the percentage of the treatment group (1-e(-28×0.0577) = 80.2%) and the percentage of the control group (1-e(-28×0.0347) = 62.1%), that is, 71.1%. Note that this percentage was overestimated in the paper (75%).6Cao B Wang Y Wen D et al.A Trial of Lopinavir-Ritonavir in Adults Hospitalized with Severe Covid-19.New Engl J Med. 2020; (article in press)Crossref Scopus (3870) Google Scholar According to the standard formula, the number of events required to achieve a significance level of 0.05 and a power of 0.8 would be 4× (z0.025+z0.2)2/(ln(HR))2Robinson KA Dickersin K. Development of a highly sensitive search strategy for the retrieval of reports of controlled trials using PubMed.Int J Epidemiol. 2002; 31: 150-153Crossref PubMed Scopus (564) Google Scholar = 4 × (1.96+0.84)2/ln(0.0577/0.0347)2Robinson KA Dickersin K. Development of a highly sensitive search strategy for the retrieval of reports of controlled trials using PubMed.Int J Epidemiol. 2002; 31: 150-153Crossref PubMed Scopus (564) Google Scholar = 121.3Donner A. Approaches to sample size estimation in the design of clinical trials–a review.Stat Med. 1984; 3: 199-214Crossref PubMed Scopus (232) Google Scholar To accumulate 121 events, the total sample size required would be 121 / 71.1% = 170, i.e., 85 per group. The estimated sample size reported by the authors was 160, which had a 6% deviation with our calculated sample size, and the power of this sample size would be 78% instead of the desired 80%. To conclude, the quality of sample size calculation in clinical trials on COVID-19 patients indexed in PubMed was not acceptable. Inappropriate sample size calculation will give a negative impression to its readers and leads to suspicion on the existence of other methodological flaws in the study. Some believed that studies with any sample size could still contribute to science 9Bacchetti P McCulloch C Segal MR Being 'underpowered' does not make a study unethical.Stat Med. 2012; 31: 4138-4139Crossref PubMed Scopus (7) Google Scholar as the published results allow other readers to conduct a meta-analysis to obtain a more accurate estimate of treatment effect when further studies are available. However, in the time of rapid disease outbreak such as COVID-19, researchers are working around the clock to examine the effectiveness of potential treatments, and inappropriate sample size calculation will lead to adverse consequences. If the sample size is overestimated, researchers need to spend unnecessary, additional time to recruit study and clean the data, so that the release of the results would be delayed. On the other hand, if the sample size was underestimated, the probability of committing a type II error, that is, unable to reject the null hypothesis while the treatment is effective, will be high. A null find will discourage clinicians to use an effective treatment, and further studies are required to support its effectiveness, again resulting in a delay. Many existing, effective treatments on infections with coronavirus and other RNA-virus have been proposed to test for its effectiveness on COVID-19 patients. Given the fast-growing number of undergoing trials,10Zhang Q Wang Y Qi C Shen L Li J Clinical trial analysis of 2019‐nCoV therapy registered in China.J Med Virol. 2020; (accepted)Crossref Scopus (112) Google Scholar we expect that many papers on trials for COVID-19 patients will be published very soon. Since the sample size should be determined a priori before the recruitment of the first patient, we strongly suggest all research teams include a statistician as a team member or invite a statistician to evaluate the appropriateness of the sample size calculation. It would be irreversible if flaws in sample size calculation are identified when the trial has been completed or the reported has been submitted for peer review. None. Download .docx (.05 MB) Help with docx files Download .docx (.01 MB) Help with docx files

Referência(s)