Revisão Acesso aberto Revisado por pares

Novel Methodologic Approaches to Phase I, II, and III Trials

2013; Lippincott Williams & Wilkins; Volume: 44; Issue: 6_suppl_1 Linguagem: Inglês

10.1161/strokeaha.111.000031

ISSN

1524-4628

Autores

Sharon D. Yeatts,

Tópico(s)

Meta-analysis and systematic reviews

Resumo

HomeStrokeVol. 44, No. 6_suppl_1Novel Methodologic Approaches to Phase I, II, and III Trials Free AccessResearch ArticlePDF/EPUBAboutView PDFView EPUBSections ToolsAdd to favoritesDownload citationsTrack citationsPermissions ShareShare onFacebookTwitterLinked InMendeleyReddit Jump toFree AccessResearch ArticlePDF/EPUBNovel Methodologic Approaches to Phase I, II, and III Trials Sharon D. Yeatts, PhD Sharon D. YeattsSharon D. Yeatts From the Department of Public Health Sciences and Data Coordination Unit, Medical University of South Carolina, Charleston, SC. Originally published1 Jun 2013https://doi.org/10.1161/STROKEAHA.111.000031Stroke. 2013;44:S116–S118Among the future directions of stroke research identified by the Stroke Progress Review Group was a call for improved trial design, conduct, and outcome assessment. This demand is underscored by recent trial findings. Of the >100 clinical trials in ischemic stroke published during the first decade of the 21st century, only 5 of 31 phase III trials demonstrated efficacy on the primary outcome.1 As the number of experimental therapies undergoing clinical investigation in stroke increases, there is a rising need for more efficient statistical designs to (1) determine the optimal dose, (2) identify interventions with therapeutic potential, and (3) evaluate clinically relevant treatment effects.The correct selection of the optimal dose, via a thorough understanding of the dose–toxicity and dose–efficacy relationships, is critical for establishing the efficacy of a potential therapeutic agent.2 Traditional rule–based dose-escalation designs,3 such as the 3+3 design and its variations, have suboptimal statistical operating characteristics. The inefficiency in the 3+3 design is that it stems from the decision to escalate or de-escalate, a decision is based solely on event data from the current dose, without considering event information available from neighboring doses. An alternative design called the Continual Reassessment Method4 is an adaptive dose-finding algorithm, wherein escalation or de-escalation through the dose region is determined by continuous re-estimation of the dose–toxicity curve, and each cohort of subjects is treated at the dose currently believed to be the maximum tolerated dose. Although computationally more intensive than the 3+3 design, the Continual Reassessment Method and its variations use all available toxicity data in the estimation of the maximum tolerated dose, and thus are statistically more efficient. Ethical considerations also favor the Continual Reassessment Method, because the Continual Reassessment Method typically treats fewer patients at subtherapeutic doses.Once the appropriate dose has been determined through a phase I trial, the next step is to evaluate efficacy potential in a phase II trial. The traditional concurrently controlled phase II design, intended to simultaneously estimate treatment effect and assess variability, is often criticized as an underpowered phase III comparative clinical trial. Proposed alternative designs, such as the selection design and the futility design, are instead intended to weed out ineffective or mediocre therapies.The objective of the futility design, which has been successfully implemented in stroke,5,6 is to discard treatments that do not show promise. Statistical hypotheses are stated such that the goal is to demonstrate that the intervention is futile. Failure to conclude futility would be considered evidence in favor of a need for a definitive phase III clinical trial. In the single arm futility design,7 the experimental treatment arm is compared with a target response rate, π1, defined as the minimum proportion of successes in the treated group which would warrant further study. If the true success proportion π is less than π1, the intervention is declared futile. Comparison of the experimental arm with a target response rate, which is fixed and has no variability, results in a smaller sample size than would be required for direct comparison with a concurrent control arm. The target response rate can be determined based on a clinically relevant treatment effect and historical control data.Concerns over the use of historical control data include temporal changes in outcome associated with improvements in patient management, as well as variations in eligibility criteria, protocol adherence, and primary outcome measures across clinical trials. If the historical control data are outdated and therefore no longer relevant, trial results may not reflect an accurate assessment of the futility of the experimental treatment. Inclusion of a small cohort of control subjects for calibration of the threshold value has been suggested.8 If this cohort is too small, its usefulness in terms of calibration is quite limited; if this cohort is too large, the trial begins to resemble an underpowered phase III.The inclusion of a concurrent control arm in the futility design avoids the drawbacks of historical control data and allows for a direct comparison of treatment arms. The futility hypothesis is based on a direct comparison of the randomized treatment arms, such that futility would be declared if the absolute treatment effect is less than δ, a prespecified clinically meaningful futility margin, in favor of the experimental treatment. The sample size is increased over the single-arm design, but the concurrently controlled futility design is not an alternative to phase III efficacy testing. The objective remains to establish futility, rather than to demonstrate efficacy, of the active treatment. The National Institute of Neurologic Disorders and Stroke-funded phase II trial of deferoxamine mesylate in intracerebral hemorrhage (Hi-Def in ICH [intracerebral hemorrhage], clinicaltrials.gov NCT01662895) uses this design.Selection designs9 can be used to prioritize candidate interventions, such that resources are allocated to the most promising of candidates. In a selection design, the objective is to select the best among K interventions (or K interventions and a control) for further testing. In the selection design, subjects would be randomized to one of the K interventions. The best intervention is defined as the intervention with the numerically, rather than statistically, highest response rate. The sample size is determined to ensure that, if the best treatment is superior by at least some margin D, then the best treatment will be selected with high probability.The selection design can be combined with a futility or superiority test in a sequential 2-stage design, as described in the trial of Co-Q10 in ALS (amyotrophic lateral sclerosis).10 At the conclusion of stage 1, a treatment would be selected, and the statistical hypothesis tested at the end of stage 2. Inclusion of stage 1 subjects in the statistical hypothesis test has the advantage of using all available outcome data but introduces bias, which must be accounted for in the test statistic. If the stage 1 subjects are excluded from the stage 2 hypothesis test, the parameter estimate is unbiased, but the overall sample size is increased.Adaptive designs promise increased flexibility of the trial to respond to accumulating information, a promise which has generated both enthusiasm and confusion. According to the Food and Drug Administration draft guidance,11 an adaptive design "includes… a prospectively planned opportunity for modification of one or more specified aspects of the study design and hypotheses based on analysis of data (usually interim data) from subjects in the study." It is important to emphasize that the potential issues, and in what manner the trial will adapt to each, must be prespecified in the design stage to maintain trial validity.Although certainly not novel, group sequential methods are adaptive according to this definition, in that they allow the trial to be stopped early, based on accumulating data, in the face of overwhelming efficacy or futility. Other valid mechanisms for adaptation in phase III trials include blinded sample size re-estimation12 and covariate adaptive randomization.13 Adaptations based on an unblinded assessment of interim data, including sample size re-estimation and response adaptive randomization, may be more enticing but may also be more controversial in the confirmatory setting. Early phase designs allow for more flexibility with regard to adaptation, and adaptive designs have gained greater acceptance in this exploratory setting. Whether exploratory or confirmatory, Bayesian or Frequentist, each trial must demonstrate that the statistical operating characteristics remain sound in the face of the chosen adaptation(s).Even a well-designed trial of an effective therapy can fail based on an inappropriate primary outcome or primary analysis. The selection of valid, reliable, and efficient outcome measures and analytic strategies for stroke clinical trials is receiving much attention in the stroke literature.14 The modified Rankin Scale is an ordinal disability measure, which is commonly used as the primary efficacy outcome measure in stroke clinical trials. Traditional analysis focuses on dichotomization of the ordinal scale into a binary response, where success or failure is determined by comparing the result of each subject with a fixed threshold. Responder analysis (also referred to as the sliding dichotomy or the stratified dichotomy) tailors the threshold for success on the basis of the baseline prognosis of each subject.15 In the responder analysis setting, a mild stroke would have a more stringent definition of success than a severe stroke. This approach is thought to reflect more accurately the clinical perspective of outcome and is currently being implemented in the SHINE (Stroke Hyperglycemia Insulin Network Effort) trial (clinicaltrials.gov NCT01369069). Although these dichotomization approaches have the advantage of a relatively straightforward clinical interpretation, reduction of an ordinal measure to a binary outcome results in a loss of information.Analytic approaches that maintain the ordinal nature of the scale, sometimes referred to as shift analysis,16 are statistically more powerful than dichotomized analyses; however, the clinical interpretation is also less intuitive. In addition, some of these ordinal approaches require distributional assumptions which may not be supported by the trial data. Ordinal regression, for example, requires the assumption of proportional odds, which means that the estimated odds ratio is constant across all possible cut points of the ordinal scale. Alternative methods must be carefully considered in the design stage; a clinical trial powered for ordinal regression may be underpowered for the logistic regression required if the proportional odds assumption is violated.The modified Rankin Scale reflects a global assessment of function, but a patient may consider many other life aspects in describing outcome after a stroke. Patient-reported outcomes, including the NeuroQOL17 and PROMIS18 tools, may provide important information regarding quality of life, cognition, and social functioning. Consideration of these varying aspects of global outcome may provide a more complete understanding of the evolution of stroke and allow a more sensitive estimate of treatment effect.These are just a sampling of the novel approaches being considered, and in some cases implemented, in current stroke trials. Continued development of innovative trial designs, outcome assessments, and analytic approaches is an essential component of stroke research.DisclosuresDr Yeatts is the SDMC PI (Statistics and Data Management Center Principal Investigator) for the phase II trial of deferoxamine in ICH (Hi-Def; U01 NS074425). She is the unblinded statistician for the IMS 3 (U01 NS077304) and ProTECT (U01 NS062778; Bio-ProTECT R01 NS071867) phase III clinical trials. The other authors have no conflicts to report.FootnotesCorrespondence to Sharon D. Yeatts, PhD, Department of Public Health Sciences, Medical University of South Carolina, 135 Cannon St, Suite 303, MSC 835, Charleston, SC 29425. E-mail [email protected]References1. Hong KS, Lee SJ, Hao Q, Liebeskind DS, Saver JL. Acute stroke trials in the 1st decade of the 21st century.Stroke. 2011; 42:e314.Google Scholar2. Fisher Mfor the Stroke Therapy Academic Industry Roundtable IV. Enhancing the development and approval of acute stroke therapies.Stroke. 2005; 36:1808–1813.LinkGoogle Scholar3. Storer BE. Design and analysis of phase I clinical trials.Biometrics. 1989; 45:925–937.CrossrefMedlineGoogle Scholar4. O'Quigley J, Pepe M, Fisher L. Continual reassessment method: a practical design for phase 1 clinical trials in cancer.Biometrics. 1990; 46:33–48.CrossrefMedlineGoogle Scholar5. The IMS Trial Investigators. Combined intravenous and intra-arterial recanalization for acute ischemic stroke: the Interventional Management of Stroke study.Stroke. 2004; 35:904–912.LinkGoogle Scholar6. The IMS II Trial Investigators. The Interventional Management of Stroke (IMS) II Study.Stroke. 2007; 38:2127–2135.LinkGoogle Scholar7. Palesch YY, Tilley BC, Sackett DL, Johnston KC, Woolson R. Applying a phase II futility study design to therapeutic stroke trials.Stroke. 2005; 36:2410–2414.LinkGoogle Scholar8. Herson J, Carter SK. Calibrated phase II clinical trials in oncology.Stat Med. 1986; 5:441–447.CrossrefMedlineGoogle Scholar9. Simon R, Thall PF, Ellenberg SS. New designs for the selection of treatments to be tested in randomized clinical trials.Stat Med. 1994; 13:417–429.CrossrefMedlineGoogle Scholar10. Levy G, Kaufmann P, Buchsbaum R, Montes J, Barsdorf A, Arbing R, et al. A two-stage design for a phase II clinical trial of coenzyme Q10 in ALS.Neurology. 2006; 66:660–663.CrossrefMedlineGoogle Scholar11. US Food and Drug Administration. Draft Guidance for Industry: Adaptive Design Clinical Trials for Drugs and Biologics.2010; http://www.fda.gov/downloads/Drugs/GuidanceComplianceRegulatoryInformation/Guidances/UCM201790.pdf Accessed October 31, 2012.Google Scholar12. Chow SC, Chang M. Adaptive Sample Size Adjustment. In: , Chow SC, ed. Adaptive Design Methods in Clinical Trials. Boca Raton, CL: Chapman & Hall; 2007: 137–159.Google Scholar13. Chow SC, Chang M. Adaptive Randomization. In: , Chow SC, ed. Adaptive Design Methods in Clinical Trials. Boca Raton, CL: Chapman & Hall; 2007: 47–73.Google Scholar14. Saver JL. Optimal end points for acute stroke therapy trials: best ways to measure treatment effects of drugs and devices.Stroke. 2011; 42:2356–2362.LinkGoogle Scholar15. Adams HP, Leclerc JR, Bluhmki E, Clarke W, Hansen MD, Hacke W. Measuring outcomes as a function of baseline severity of ischemic stroke.Cerebrovasc Dis. 2004; 18:124–129.CrossrefMedlineGoogle Scholar16. Saver JL. Novel end point analytic techniques and interpreting shifts across the entire range of outcome scales in acute stroke trials.Stroke. 2007; 38:3055–3062.LinkGoogle Scholar17. Cella D, Nowinski C, Peterman A, Victorson D, Miller D, Lai JS, et al. The neurology quality-of-life measurement initiative.Arch Phys Med Rehabil. 2011; 92:S28–S36.CrossrefMedlineGoogle Scholar18. Fries JF, Bruce B, Cella D. The promise of PROMIS: using item response theory to improve assessment of patient-reported outcomes.Clin Exp Rheumatol. 2005; 23:S53–S57.MedlineGoogle Scholar Previous Back to top Next FiguresReferencesRelatedDetailsCited By Qureshi A, Lobanova I, Huang W, Ishfaq M, Broderick J, Cassarly C, Martin R, Macdonald R and Suarez J (2021) Lessons Learned from Phase II and Phase III Trials Investigating Therapeutic Agents for Cerebral Ischemia Associated with Aneurysmal Subarachnoid Hemorrhage, Neurocritical Care, 10.1007/s12028-021-01372-4, 36:2, (662-681), Online publication date: 1-Apr-2022. James G, Symeonides S, Marshall J, Young J and Clack G (2021) Assessment of various continual reassessment method models for dose-escalation phase 1 oncology clinical trials: using real clinical data and simulation studies, BMC Cancer, 10.1186/s12885-020-07703-6, 21:1, Online publication date: 1-Dec-2021. Schneider R, Auinger P, Tarolli C, Iourinets J, Gil-Díaz M and Richard I (2020) A trial of buspirone for anxiety in Parkinson's disease: Safety and tolerability, Parkinsonism & Related Disorders, 10.1016/j.parkreldis.2020.10.020, 81, (69-74), Online publication date: 1-Dec-2020. Bornstein N, Saver J, Diener H, Gorelick P, Shuaib A, Solberg Y, Devlin T, Leung T, Molina C, Skoloudik D, Fiksa J, Krieger D, Andersen G, Berrouschot J, Hobohm C, Schneider D, Griewing B, Endres M, Hausler K, Kimmig H, Ringleb P, Weimar C, Schilling M, Kohrmann M, Hetzel A, Kaps M, Cheung R, Sobolewski P, Nyke W, Czlonkowska A, Stepien A, Waldemar B, Słowik A, Zbigniewem S, Lubiński I, Portela P, Segure T, Marti-Fabregas J, Alonso M, Nunez A, Miguel M, Campello A, Arenillas J, Marshall N, Chiu D, Shownkeen H, Rymer M, Sen S, Roubec M, Kuliha M, Lakomý C, Tyl D, Kemlink D, Doležal O, Rekova P, Krejčí V, Christensen A, Belhage B, Maschmann C, Kruse Larsen C, Pott F, Christensen H, Marstrand J, Nielsen J, Meden P, Prytz S, Rosenbaum S, Hedemann Sorensen J, Stenhoj Meier K, Schmift Ettrup K, Dupont Hougaard K, Von Wietzel P, Stoll A, Schwetlick H, Pradel H, Hemprich A, Schulz A, Frerich B, Hobohm C, Weise C, Michalski D, Schaller F, Schiefke F, Helmrich J, Pelz J, Schnieder M, Schneider M, Matzen P, Langos R, Müller-Duerwald S, Lukhaup S, Bauer U, Kloppig W, Hiermann E, Mucha G, Soda H, Weinhardt R, Mucha T, Ziegler V, Abbushi A, Hotter B, Winter B, Anthofer B, Noack C, Laubisch D, Heldge Schneider G, Jan Jungehulsing G, Mueller H, Dreier J, Fiebach J, Flechsenhar J, Villringer K, Ebinger M, Rozanski M, Vajkoczy P, Klingebiel R, Steinicke R, Pittl S, Hoffmann S, Maul S, Krause T, Liman T, Plath T, Nowe T, Schmidt W, Fritzsch C, Haas C, Will H, Haußmann-Betz K, Bayat M, Pordzik T, Hug A, Staff C, Lichy C, Eggers G, Kloss M, Bendszus M, Herrmann O, Seeberger R, Schwarting S, Rhode S, Rizos T, Hacke W, Frank B, Bozkurt B, Holle D, Mueller D, Koch D, Shanib H, Sudendey J, Brenck J, Busch K, Gartzen K, Gasser T, Hagenacker T, Buerke B, Prigge G, Minnerup J, Albers J, Wermker K, Schwindt W, Kallmünzer R, Hauer E, Breuer L, Schellinger P, Kollmar R, Sauer R, Schwab S, Struffert T, Funfack A, Stechmann A, Schlaeger A, Laeppchen C, Schuchardt F, Klingler J, Reis J, Lambeck J, Friedrich M, Laible M, Wellermeyer P, Beck S, Rutsch S, Niesen W, Tanislav C, Schaaf H, Kerkmann H, Schirotzek I, Allendörfer J, Wolff S, Yuk-Lun Lau A, Yin Yan Chan A, Siu D, Wong E, Chu Wong G, Leung H, Wong L, Zhu X, Yan Soo Y, Ting Tse A, Kit Leung G, Leung K, Ngai Hung K, Wai Mei Kwan M, Man Yu Tse M, Tse P, Hon Chan P, Lee R, Shek Kwan Chang R, Yin Yu Pang S, Fong Kwong Hon S, Cheng T, Lui W, Wo Mak W, Sobota A, Wiater B, Loch B, Wolak G, Łabudzka I, Dabal J, Grzesik M, Sledzinska M, Hatalska-Żerebiec R, Szczuchniak W, Gójska A, Nałęcz D, Gasecki D, Kozera G, Dylewicz Ł, Niekra M, Kwarciany M, Chomik P, Skowron P, Kobayashi A, Chabik G, Makowicz G, Bembenek J, Jędrzejewska J, Karlinski M, Czepiel W, Brodacki B, Staszewski J, Kosek J, Jadczak M, Durka-Kęsy M, Kaluzny K, Ziomek M, Fudala M, Sosnowski Z, Ferens A, Szczygieł E, Banaszkiewicz K, Ziomek M, Wnuk M, Szczepańska-Szerej A, Jach E, Maslanko G, Wojczal J, Luchowski P, Kowalczyk A, Jakubiak J, Kopcewicz J, Gajda M, Wichlinska-Lubinska M, Rodriguez D, Santamarin E, Pagola J, Lorente Guerrero J, Ribo M, Rubiera M, Maisterra O, Pinero S, Catalina Iglesias V, Plans G, Quesada H, Aparicio Caballero M, Portela P, De Diego A, Garay D, García Rodriguez M, Martin O, Braña S, Garcia J, Hernandez F, Catala I, Marti-Vilalta J, Delgado Mederos R, de Quintana S, Martinez-Ramirez S, Valcarcel Gonzalez J, Masjuan Vallejo J, Diamantopoulus J, Del Alamo M, Poveda P, Pastor A, Carballal C, Diaz F, Garcia Leal R, Juretschke R, Echabe E, Sanchez J, Yanez M, Garcia R, Muino R, Rivas S, Lopez Gonzalez D, Cuadrado E, Giralt E, Villalba G, Roquer J, Angel O, Jimenez M, Cedeño R, Salinas R, Lejarreta S, Silva Y, Fraile A, Calleja A, Cepeda Landínez G, Tellez N, Garcia Bermejo P, Santos P, Herranz R, Hunt P, Browning D, Violette M, Hoddeson R, Rose J, Zhang J, Mazumdar A, Echiverri H, Chow J, Lovick D, Coleman M, Akhtar N, Sugg R, Zanation A, Germanwala A, Senior B, Huang D, Aucutt-Walter N, Kasner S, LeRoux P, von Kummer R and Palesch Y (2019) Sphenopalatine Ganglion Stimulation to Augment Cerebral Blood Flow, Stroke, 50:8, (2108-2117), Online publication date: 1-Aug-2019. James G, Symeonides S, Marshall J, Young J and Clack G (2016) Continual reassessment method for dose escalation clinical trials in oncology: a comparison of prior skeleton approaches using AZD3514 data, BMC Cancer, 10.1186/s12885-016-2702-6, 16:1, Online publication date: 1-Dec-2016. Margulies S, Anderson G, Atif F, Badaut J, Clark R, Empey P, Guseva M, Hoane M, Huh J, Pauly J, Raghupathi R, Scheff S, Stein D, Tang H and Hicks M (2016) Combination Therapies for Traumatic Brain Injury: Retrospective Considerations, Journal of Neurotrauma, 10.1089/neu.2014.3855, 33:1, (101-112), Online publication date: 1-Jan-2016. Levin B (2015) The futility study—Progress over the last decade, Contemporary Clinical Trials, 10.1016/j.cct.2015.06.013, 45, (69-75), Online publication date: 1-Nov-2015. Smith C, Denes A, Tyrrell P and Di Napoli M (2015) Phase II anti-inflammatory and immune-modulating drugs for acute ischaemic stroke, Expert Opinion on Investigational Drugs, 10.1517/13543784.2015.1020110, 24:5, (623-643), Online publication date: 4-May-2015. June 2013Vol 44, Issue 6_suppl_1 Advertisement Article InformationMetrics © 2013 American Heart Association, Inc.https://doi.org/10.1161/STROKEAHA.111.000031PMID: 23709704 Manuscript receivedNovember 5, 2012Manuscript acceptedMarch 4, 2013Originally publishedJune 1, 2013 Keywordsselection designclinical trial designfutility designadaptive designcontinual reassessment methodPDF download Advertisement SubjectsTreatment

Referência(s)