Actual and Actuarial Probabilities of Competing Risks: Apples and Lemons
2007; Elsevier BV; Volume: 83; Issue: 5 Linguagem: Inglês
10.1016/j.athoracsur.2006.11.044
ISSN1552-6259
AutoresGary L. Grunkemeier, Ruyun Jin, Marinus J.C. Eijkemans, Johanna J.M. Takkenberg,
Tópico(s)Statistical Methods in Clinical Trials
ResumoThe probability of a type of failure that is not inevitable, but can be precluded by other events such as death, is given by the cumulative incidence function. In cardiac research articles, it has become known as the actual probability, in contrast to the actuarial methods of estimation, usually implemented by the Kaplan-Meier (KM) estimate. Unlike cumulative incidence, KM attempts to predict what the latent failure probability would be if death were eliminated. To do this, the KM method assumes that the risk of dying and the risk of failure are independent. But this assumption is not true for many cardiac applications in which the risks of failure and death are negatively correlated (ie, patients with a higher risk of dying have a lower risk of failure, and patients with a lower risk of death have a higher risk of failure, which is a condition called informative censoring).Recent editorials in two cardiac journals have promoted the use of the KM method (actuarial estimate) for competing risk events (specifically for heart valve performance) and criticized the use of the cumulative incidence (actual) estimates. This report has two aims: to explain the difference between these two estimates and to show why the KM is generally not appropriate. In the process we will rely on alternative representations of the KM estimator (using redistribution to the right and inverse probability weighting) to explain the difference between the two estimates and to show how it may be possible to adjust KM to overcome the informative censoring. The probability of a type of failure that is not inevitable, but can be precluded by other events such as death, is given by the cumulative incidence function. In cardiac research articles, it has become known as the actual probability, in contrast to the actuarial methods of estimation, usually implemented by the Kaplan-Meier (KM) estimate. Unlike cumulative incidence, KM attempts to predict what the latent failure probability would be if death were eliminated. To do this, the KM method assumes that the risk of dying and the risk of failure are independent. But this assumption is not true for many cardiac applications in which the risks of failure and death are negatively correlated (ie, patients with a higher risk of dying have a lower risk of failure, and patients with a lower risk of death have a higher risk of failure, which is a condition called informative censoring). Recent editorials in two cardiac journals have promoted the use of the KM method (actuarial estimate) for competing risk events (specifically for heart valve performance) and criticized the use of the cumulative incidence (actual) estimates. This report has two aims: to explain the difference between these two estimates and to show why the KM is generally not appropriate. In the process we will rely on alternative representations of the KM estimator (using redistribution to the right and inverse probability weighting) to explain the difference between the two estimates and to show how it may be possible to adjust KM to overcome the informative censoring. “Tissue valves suffer from structural failure, with an average lifetime of 10 to 12 years before replacement is needed [1Yoganathan A.P. Chandran K.B. Sotiropoulos F. Flow in prosthetic heart valves: state-of-the-art and future directions.Ann Biomed Eng. 2005; 33: 1689-1694Crossref PubMed Scopus (145) Google Scholar].” This quote seems like a reasonable answer to the question, “What is the average lifetime of a tissue valve” … or is it? The average lifetime of a group of patients can always be determined exactly by someone who outlives the last patient. But a fundamental difference exists between the lifetime of a patient and the lifetime of a tissue valve. The probability that a patient will die is 100%, but the probability that her tissue valve will fail is far less than 100%, because in most cases it will still be functioning when she dies, say from an accident or cancer. In a recent simulation study of aortic tissue valves, estimates of the percentage of valves that fail ranged from 40% to 50% in the youngest age group of patients to less than 10% in the oldest age group [2Puvimanasinghe J.P. Takkenberg J.J. Eijkemans M.J. et al.Comparison of Carpentier-Edwards pericardial and supraannular bioprostheses in aortic valve replacement.Eur J Cardiothorac Surg. 2006; 29: 374-379Crossref PubMed Scopus (27) Google Scholar]. This complicates the estimation or even the meaning of the average lifetime of a tissue valve. How does one estimate the average lifetime of the valves in a series if more than half never reach their end of life and are thus not available to even contribute to this average? To do this, one would have to eliminate deaths that occur before valve failure, since they censor (hide) the valve’s latent failure time (see Appendix 1). One cannot do this in reality, so attempts are made to virtually eliminate the intervening deaths by statistical means, using the valves that do fail to predict the latent valve failure times for the patients who die with functioning valves, and thus complete the censored lifetimes statistically. This is what the Kaplan-Meier (KM) estimate for death does. The KM method completes the survival experience of the censored valves by assuming that they ultimately die at a future time, and at the same rate as those who have already died. However, this last assumption, of noninformative or ignorable censoring cannot be proven to be true [3Tsiatis A. A nonidentifiability aspect of the problem of competing risks.Proceedings of the National Academy of Sciences. 1975; 72: 20-22Crossref PubMed Scopus (528) Google Scholar, 4Crowder M. Identifiability crises in competing risks.International Statistical Review. 1994; 62: 379-391Crossref Google Scholar], and it is in fact false in many cases including the valve failure example. Patients who die before valve failure are generally older than those who do not, whereas the rate of tissue valve failure is lower in older patients. Thus the (counterfactual) future failure rate in deceased (censored) patients would be lower than that of the (generally younger) patients who did live to experience valve failure. The probability of valve failure in a particular patient population is given by a function called the cumulative incidence, also known as the actual probability in the cardiac literature, because it estimates the percentage that will actually fail from each of the competing risks. The most common use of actual analysis in the cardiac literature is for structural valve deterioration (SVD) of tissue valves [5Mahoney C.B. Miller D.C. Khan S.S. Hill J.D. Cohn L.H. Twenty-year, three-institution evaluation of the Hancock Modified Orifice aortic valve durability Comparison of actual and actuarial estimates.Circulation. 1998; 98 (discussion II93–4): II88-II93PubMed Google Scholar, 6Grossi E.A. Galloway A.C. Zakow P.K. et al.Choice of mitral prosthesis in the elderly An analysis of actual outcome.Circulation. 1998; 98: II116-II119PubMed Google Scholar, 7Khan S. Trento A. Kass R. DeRobertis M. Sandhu M. Nessim S. Actual failure rates: a method of assessing tissue valve reoperation rates.Am Heart J. 1999; 138: 108-113Abstract Full Text Full Text PDF PubMed Scopus (6) Google Scholar, 8Jamieson W.R. Burr L.H. Miyagishima R.T. Germann E. Anderson W.N. Actuarial versus actual freedom from structural valve deterioration with the Carpentier-Edwards porcine bioprostheses.Can J Cardiol. 1999; 15: 973-978PubMed Google Scholar, 9Miller 3rd, C.C. Safi H.J. Winnerkvist A. Baldwin J.C. Actual versus actuarial analysis for cardiac valve complications: the problem of competing risks.Curr Opin Cardiol. 1999; 14: 79-83Crossref PubMed Scopus (10) Google Scholar, 10Kaempchen S. Guenther T. Toschke M. Grunkemeier G.L. Wottke M. Lange R. Assessing the benefit of biological valve prostheses: cumulative incidence (actual) vs. Kaplan-Meier (actuarial) analysis.Eur J Cardiothorac Surg. 2003; 23 (discussion 713–4): 710-713Crossref PubMed Scopus (17) Google Scholar], but it has also been used for other events, including heart transplantation listing [11McGiffin D.C. Naftel D.C. Kirklin J.K. et al.Pediatric Heart Transplant Study GroupPredicting outcome after listing for heart transplantation in children: comparison of Kaplan-Meier and parametric competing risk analysis.J Heart Lung Transplant. 1997; 16: 713-722PubMed Google Scholar], coronary artery reintervention [12Blackstone E.H. Lytle B.W. Competing risks after coronary bypass surgery: the influence of death on reintervention.J Thorac Cardiovasc Surg. 2000; 119: 1221-1230Abstract Full Text Full Text PDF PubMed Scopus (35) Google Scholar], reoperation in patients with aortic dissection [13Lai D.T. Robbins R.C. Mitchell R.S. et al.Does profound hypothermic circulatory arrest improve survival in patients with acute type A aortic dissection?.Circulation. 2002; 106: I218-I228PubMed Google Scholar], and complications of combined carotid and coronary artery surgery [14Akins C.W. Hilgenberg A.D. Vlahakes G.J. et al.Late results of combined carotid and coronary surgery using actual versus actuarial methodology.Ann Thorac Surg. 2005; 80: 2091-2097Abstract Full Text Full Text PDF PubMed Scopus (33) Google Scholar]. The general setting in all of these examples is that of competing risks [15Putter H. Fiocco M. Geskus R.B. Tutorial in biostatistics: competing risks and multi-state models.Stat Med. 2006; 10: 10Google Scholar]. In the case of valve replacement, two processes are “competing” to terminate the function of the heart valve (ie, the wearing out of the valve [SVD] and the wearing out of the patient [death]). Similarly, in assessing the durability of coronary artery bypass grafting surgery (CABG), reintervention (REDO) ends the success of the original intervention, or the patient may die of other causes before having to undergo REDO. The setting of tissue valve SVD has been used before to compare the actual and actuarial methods [16Starr A. Grunkemeier G.L. The expected lifetime of porcine valves.Ann Thorac Surg. 1989; 48: 317-318Abstract Full Text PDF PubMed Scopus (27) Google Scholar, 17Grunkemeier G.L. Jamieson W.R. Miller D.C. Starr A. Actuarial versus actual risk of porcine structural valve deterioration.J Thorac Cardiovasc Surg. 1994; 108: 709-718PubMed Google Scholar, 18Grunkemeier G.L. Anderson R.P. Miller D.C. Starr A. Time-related analysis of nonfatal heart valve complications: cumulative incidence (actual) versus Kaplan-Meier (actuarial).Circulation. 1997; 96 (II-70–4; discussion II-74–5)PubMed Google Scholar]. Thus in this report we will use instead our CABG surgery data to demonstrate: (1) the cumulative incidence (actual probability) of REDO; (2) the KM estimate of REDO, based on the incorrect assumption; and (3) a method of correcting the KM estimate. From 1968 through 2003, 20,835 patients underwent at least one isolated CABG procedure on our service. For this exposition, we excluded patients who died in hospital or within 30 days (2.7%) and patients who survived the operation but were followed for fewer than 30 days (8%). The remaining 18,596 patients have been followed for 171,959 patient-years, during which 16% have had REDO, 35% have died without REDO, either surgical or percutaneous, and 49% were last known to be alive and REDO-free at the time of analysis (Fig 1). The maximum follow-up is 35 years, but we truncated the figures at 30 years, beyond which only 50 patients had follow-up. For characterizing time-related events in mathematics, the most fundamental function is the distribution function or curve, which plots the cumulative percentage of events occurring with time. In survival analysis, the complement of this curve is used (ie, the percentage event-free at each point in time). Herein we will keep the death (event), rather than the survival (event-free) perspective, because it is more fundamental (and is consistent with the ubiquitous use of operative mortality rather than operative survival). Thus the KM estimate we use is the complement of the usual KM estimate. (One could always invert these curves, physically or in one’s mind’s eye, if the event-free perspective is desired.) The probability of REDO is given by the cumulative incidence function or curve (actual failure), which can be derived from the data in Figure 1. If the series were completed (ie, if no patients were still alive), then the probability of REDO is obtained by simply adding up the number of REDOs in time and dividing by the total number of patients. How can we estimate this curve when approximately half of these patients (Fig 1) have not yet reached their endpoint (death or REDO)? In this case, the calculation can be accomplished by distributing patients who are still alive without REDO proportionally into the other two groups. The proportions are based on the assumption of noninformative censoring (ie, patients alive without REDO will experience either future REDO or death before REDO at the same rate as those who have already done so). This assumption is usually reasonable because the majority of these patients are censored, simply because they have not been in the study long enough to reach one of the endpoints (ie, death or REDO). However, if there has been a change in patients recruited during the study timeframe, or if a large percentage of the censored patients are lost to follow-up, then this assumption may be violated. This process can be demonstrated graphically by showing the modification to the current status data of Figure 1. We can gradually develop the cumulative incidence estimate by distributing the patients who are alive in each bar of Figure 1 out across time to “complete” their experience, ending in either the REDO or the dead status. Panels A and B in Figure 2 show this process of redistribution to the right [19Efron B. The two sample problem with censored data. Vol. 4. University of California Press, Berkeley, CA1967: 831-853Google Scholar] partially completed, and in panel C it is complete. To approximate the cumulative incidence curves, the bars in panel C are simply cumulated in time (Fig 2D). Figure 3 shows the cumulative incidence curves, which are smooth versions of the curves in Figure 2D, divided by the total number of patients to normalize the maximum to 100%. Note that the probability of REDO levels off to a percentage (30%) that estimates the percentage of CABG surgeries that will actually undergo REDO. For this reason, the cumulative incidence is called a subdistribution function [4Crowder M. Identifiability crises in competing risks.International Statistical Review. 1994; 62: 379-391Crossref Google Scholar], which emphasizes that it is an “improper” distribution function that does not reach 100%, because (unlike death) not all patients will experience the event. For the event death, the KM method works the same way as previously mentioned, and gives an estimate of the percent mortality at each time by distributing the patients who are still alive (censored) into future deaths at similar rates as those who have already died. But for events that are not necessarily fatal, such as REDO after CABG, KM estimates the probabilities by treating both the patients who are alive and those who have already died as censored. The KM method makes the assumption that both subgroups will eventually have REDOs and that their rates will be the same as that of those who have already had REDO. The way that KM redistributes both the dead and alive patients from Figure 1 is shown in Figure 4. The final KM estimate of REDO is about twice as high as the actual REDO curve, which is shown as a dashed line in Figure 3. The KM method attempts to predict what the REDO rate would be if no patient ever died. Of course, this value would be higher than the actual value, but to do this, KM relies on the assumption of noninformative censoring for not only the patients who are alive, but also for the patients who have died, an assumption that is not true in this situation nor in many other medical applications. The cumulative hazard of death before REDO increases with patient age (Fig 5), whereas the cumulative hazard of REDO decreases with patient age (Fig 6). This informative censoring causes the KM REDO curve to be higher than it should be; if the patients who died before REDO were revived and allowed to continue on to REDO, they would lower the estimate given by the KM curve. We can estimate the amount of this lowering using a technique called inverse probability weighting.Fig 6Cumulative hazard of reintervention (REDO) by age group. (CABG = coronary artery bypass grafting.)View Large Image Figure ViewerDownload (PPT) Some methods to overcome the limitations of informative censoring have been suggested [20Slud E.V. Rubinstein L.V. Dependent competing risks and summary survival curves.Biometrika. 1983; 70: 643-649Crossref Scopus (63) Google Scholar, 21Moeschberger M.L. Klein J.P. Statistical methods for dependent competing risks.Lifetime Data Anal. 1995; 1: 195-204Crossref PubMed Scopus (54) Google Scholar, 22Zheng M. Klein J.P. Estimates of marginal survival for dependent competing risks based on an assumed copula.Biometrika. 1995; 82: 127-138Crossref Scopus (184) Google Scholar], but they are rarely used in the cardiac literature, and the incorrect KM method is used instead. One method of adjusting for informative censoring, called inverse probability weighting (IPW) [23Satten G.A. Datta S. The Kaplan–Meier estimator as an inverse-probability-of-censoring weighted average.The American Statistician. 2001; 55: 207-210Crossref Scopus (114) Google Scholar, 24Xie J. Liu C. Adjusted Kaplan-Meier estimator and log-rank test with inverse probability of treatment weighting for survival data.Stat Med. 2005; 24: 3089-3110Crossref PubMed Scopus (326) Google Scholar], can be used to illustrate and attempt to correct the KM estimate. The IPW method is based on an alternative but equivalent expression for the KM estimator. The KM estimator is usually expressed as a product of incremental survival probabilities computed at each event time (it is called the product limit estimator), but it can also be derived in other ways [25Peterson A.V.J. Expressing the Kaplan-Meier estimator as a function of empirical subsurvival functions.J Am Stat Assoc. 1977; 72: 854-858Google Scholar, 26Gooley T.A. Leisenring W. Crowley J. Storer B.E. Estimation of failure probabilities in the presence of competing risks: new representations of old estimators.Stat Med. 1999; 18: 695-706Crossref PubMed Scopus (2296) Google Scholar], including the redistribution to the right technique demonstrated in Fig 2, Fig 4. A useful alternative representation is as a sum of weighted death probabilities, computed at each event time in which the weight is the inverse of the probability of that event being observed [23Satten G.A. Datta S. The Kaplan–Meier estimator as an inverse-probability-of-censoring weighted average.The American Statistician. 2001; 55: 207-210Crossref Scopus (114) Google Scholar, 27Robins J.M. Finkelstein D.M. Correcting for noncompliance and dependent censoring in an AIDS Clinical Trial with inverse probability of censoring weighted (IPCW) log-rank tests.Biometrics. 2000; 56: 779-788Crossref PubMed Scopus (513) Google Scholar]. Thus, the weight (IPW) assigned to each observed event increases as the probability that similar events were not (yet) observed increases (see Appendix 2). The probability that each REDO event was observed (ie, not censored) was estimated by a Cox regression, based on age, diabetes and previous coronary intervention, and was used to (inversely) weight that REDO. (This Cox model was not intended as a clinically serious model, but just as an example to demonstrate the IPW technique. In practice, one would have to have the appropriate clinical information and to thoroughly evaluate the resulting Cox model to insure that it correctly estimated the probability of each REDO event being observed.) The resulting IPW-corrected curve is lower than the original KM curve (Fig 7). Note that if the correlation between REDO and censoring were positive instead of negative, the IPW-corrected curve would be higher than the original KM curve. The question posed at the beginning of this report was, “What is the average lifetime of a tissue valve?” We presented data on CABG REDO instead of heart valve failure, so we should translate the question to its analog: “What is the average durability (lifetime) of a CABG operation?” From the cumulative incidence curve in Figure 7, the actual answer has 2 parts (similar to the analogous answer for heart valve SVD): “Only about 30% of patients undergoing CABG surgery are expected to have a REDO (ie, the lifetime risk of REDO for a patient in this population is 30%), and for these patients, the median time to REDO is about 11 years.” This method answers the durability question for a patient in a particular population: “Doctor, what is my risk of reoperation?” This answer can be extended to other patient populations as long as the death rates between the populations do not differ. An alternative to the “actual” computation method is offered by simulation techniques using Markov models [28Birkmeyer N.J. Birkmeyer J.D. Tosteson A.N. Grunkemeier G.L. Marrin C.A. O’Connor G.T. Prosthetic valve type for patients undergoing aortic valve replacement: a decision analysis.Ann Thorac Surg. 2000; 70: 1946-1952Abstract Full Text Full Text PDF PubMed Scopus (63) Google Scholar, 29Takkenberg J.J. Puvimanasinghe J.P. Grunkemeier G.L. Simulation models to predict outcome after aortic valve replacement.Ann Thorac Surg. 2003; 75: 1372-1376Abstract Full Text Full Text PDF PubMed Scopus (31) Google Scholar]. Using the uncorrected KM (Fig 7), the answer would be: “The median time to REDO is about 23 years.” This is obviously a fictitious answer; why would anyone want to use it? The reason is that the percentage of REDO is affected by the death rate. There may be interest in determining what the REDO curve would look like if every patient lived long enough to undergo REDO. That is, if the area “died before REDO” were somehow eliminated from Figure 3, what would be the shape of the “REDO” area? Using KM, this would be the area below the dashed line in Figure 3. Although this may not seem clinically relevant in the case of CABG REDO, when this method is used for tissue valve failure the hope is to extract the latent failure probability to assess the intrinsic durability of the prosthesis itself in a death-free environment. An engineer designing a heart valve would not want the durability results to be contaminated by the death of a patient, nor would a cardiac surgeon who is trying to find the most durable tissue valve for his and her patients. Thus the KM estimate is an attempt to produce an indicator in a biological system of the true durability of the valve. This is an important clinical question for which the KM estimate, if it can be properly corrected for informative censoring, may be the preferred estimate. Recent editorials called for the banning of “actual” analysis and the preferential use of KM for reporting heart valve performance [30Bodnar E. Blackstone E.H. Editorial: An “actual” problem another issue of apples and oranges.J Heart Valve Dis. 2005; 14: 706-708PubMed Google Scholar, 31Bodnar E. Blackstone E.H. An “actual” problem: another issue of apples and oranges.J Thorac Cardiovasc Surg. 2006; 131: 1-3Abstract Full Text Full Text PDF Scopus (8) Google Scholar]. However, as we have seen, the KM estimate has a fundamental though underappreciated problem: KM does not provide the desired latent probability when there is dependence between the competing risks of death (and hence censoring) and the event of interest. For tissue valve SVD, just as in the CABG REDO example, informative censoring would exist and need to be corrected. Heart valve patients censored by death would not have the same risk of eventual SVD if they lived as those who actually did live to experience SVD, because the risk of death is high in patients in which the risk of SVD is low (eg, the elderly), and vice versa. So the risks of SVD and censoring (by death before SVD) are negatively correlated, not independent. It is for this reason that the KM method has been repeatedly criticized as having in general no valid probability interpretation in the competing risks setting [26Gooley T.A. Leisenring W. Crowley J. Storer B.E. Estimation of failure probabilities in the presence of competing risks: new representations of old estimators.Stat Med. 1999; 18: 695-706Crossref PubMed Scopus (2296) Google Scholar, 32Pepe M.S. Longton G. Pettinger M. Mori M. Fisher L.D. Storb R. Summarizing data on survival, relapse, and chronic graft-versus-host disease after bone marrow transplantation: motivation for and description of new methods.Br J Haematol. 1993; 83: 602-607Crossref PubMed Scopus (75) Google Scholar, 33Lin D.Y. Non-parametric inference for cumulative incidence functions in competing risks studies.Stat Med. 1997; 16: 901-910Crossref PubMed Scopus (369) Google Scholar, 34Andersen P.K. Abildstrom S.Z. Rosthoj S. Competing risks as a multi-state model.Stat Methods Med Res. 2002; 11: 203-215Crossref PubMed Scopus (209) Google Scholar, 35Klein J.P. Andersen P.K. Regression modeling of competing risks data based on pseudovalues of the cumulative incidence function.Biometrics. 2005; 61: 223-229Crossref PubMed Scopus (265) Google Scholar]. These editorials also took exception with the use of the term actual (probability, failure, analysis) [30Bodnar E. Blackstone E.H. Editorial: An “actual” problem another issue of apples and oranges.J Heart Valve Dis. 2005; 14: 706-708PubMed Google Scholar, 31Bodnar E. Blackstone E.H. An “actual” problem: another issue of apples and oranges.J Thorac Cardiovasc Surg. 2006; 131: 1-3Abstract Full Text Full Text PDF Scopus (8) Google Scholar]. We originally proposed this term [16Starr A. Grunkemeier G.L. The expected lifetime of porcine valves.Ann Thorac Surg. 1989; 48: 317-318Abstract Full Text PDF PubMed Scopus (27) Google Scholar] because it seemed appropriate, and because there was no consensus in the technical literature as to the best terminology to use. Potential terms, among others [18Grunkemeier G.L. Anderson R.P. Miller D.C. Starr A. Time-related analysis of nonfatal heart valve complications: cumulative incidence (actual) versus Kaplan-Meier (actuarial).Circulation. 1997; 96 (II-70–4; discussion II-74–5)PubMed Google Scholar, 36Fine J.P. Gray R.J. A proportional hazards model for the subdistribution of a competing risk.J Am Stat Assoc. 1999; 94: 496-509Crossref Scopus (9493) Google Scholar, 37Grunkemeier G.L. Takkenberg J.J. Jamieson W.R. Miller D.C. Editorial: an “actual” problem: another issue of apples and oranges.J Heart Valve Dis. 2005; 14 (In response to: Bodnar E, Blackstone EH) (J Heart Valve Dis 2006;15:305–6; author reply, 306–7): 706-708PubMed Google Scholar], included: cumulative incidence, crude probability, crude incidence, cause-specific failure probability, absolute cause-specific risk, and subdistribution function. Subsequently the actual terminology has been accepted by other investigators [5Mahoney C.B. Miller D.C. Khan S.S. Hill J.D. Cohn L.H. Twenty-year, three-institution evaluation of the Hancock Modified Orifice aortic valve durability Comparison of actual and actuarial estimates.Circulation. 1998; 98 (discussion II93–4): II88-II93PubMed Google Scholar, 6Grossi E.A. Galloway A.C. Zakow P.K. et al.Choice of mitral prosthesis in the elderly An analysis of actual outcome.Circulation. 1998; 98: II116-II119PubMed Google Scholar, 7Khan S. Trento A. Kass R. DeRobertis M. Sandhu M. Nessim S. Actual failure rates: a method of assessing tissue valve reoperation rates.Am Heart J. 1999; 138: 108-113Abstract Full Text Full Text PDF PubMed Scopus (6) Google Scholar, 8Jamieson W.R. Burr L.H. Miyagishima R.T. Germann E. Anderson W.N. Actuarial versus actual freedom from structural valve deterioration with the Carpentier-Edwards porcine bioprostheses.Can J Cardiol. 1999; 15: 973-978PubMed Google Scholar, 9Miller 3rd, C.C. Safi H.J. Winnerkvist A. Baldwin J.C. Actual versus actuarial analysis for cardiac valve complications: the problem of competing risks.Curr Opin Cardiol. 1999; 14: 79-83Crossref PubMed Scopus (10) Google Scholar, 10Kaempchen S. Guenther T. Toschke M. Grunkemeier G.L. Wottke M. Lange R. Assessing the benefit of biological valve prostheses: cumulative incidence (actual) vs. Kaplan-Meier (actuarial) analysis.Eur J Cardiothorac Surg. 2003; 23 (discussion 713–4): 710-713Crossref PubMed Scopus (17) Google Scholar, 11McGiffin D.C. Naftel D.C. Kirklin J.K. et al.Pediatric Heart Transplant Study GroupPredicting outcome after listing for heart transplantation in children: comparison of Kaplan-Meier and parametric competing risk analysis.J Heart Lung Transplant. 1997; 16: 713-722PubMed Google Scholar, 12Blackstone E.H. Lytle B.W. Competing risks after coronary bypass surgery: the influence of death on reintervention.J Thorac Cardiovasc Surg. 2000; 119: 1221-1230Abstract Full Text Full Text PDF PubMed Scopus (35) Google Scholar, 13Lai D.T. Robbins R.C. Mitchell R.S. et al.Does profound hypothermic circulatory arrest improve survival in patients with acute type A aortic dissection?.Circulation. 2002; 106: I218-I228PubMed Google Scholar, 14Akins C.W. Hilgenberg A.D. Vlahakes G.J. et al.Late results of combined carotid and coronary surgery using actual versus actuarial methodology.Ann Thorac Surg. 2005; 80: 2091-2097Abstract Full Text Full Text PDF PubMed Scopus (33) Google Scholar]. In conclusion, there are three methods for estimating the probabilities of time-related events, such as SVD or REDO, when death is a competing risk that can preclude their occurrence:1The actual (cumulative incidence) method estimates the probability that a patient will experience the event. This is the correct probability that should be used for clinical predictions, patient management decisions, cost-effectiveness studies, and so forth.2The usual KM estimate, which includes pre-event deaths as noninformative censoring, attempts to estimate the probability if no one died. This does not pertain to patient predictions, but could be useful in trying to elicit the latent or intrinsic risk of tissue valve failure. However, this use of KM is a bitter fruit: it is only valid if death is independent of the event of interest, which is usually not the case in cardiac applications, including SVD and REDO.3When there is dependence between the competing risks of death and the event of interest, the IPW technique could be used to correct KM for the resulting informative censoring, provided that a model for the censoring can be produced. This is the estimate that could be used by those interested in the hypothetical durability of a valve or surgical procedure, rather then its real durability in an actual patient. The authors sincerely appreciate the constructive comments from the reviewers at The Annals of Thoracic Surgery, who improved the presentation of these concepts. If we know that a certain patient who was recruited into our interventional study 5 years ago is still alive, we have partial, but not complete, information on her post-intervention survival time (ie, we know that it is longer than 5 years). We say that her post-intervention survival time is (administratively) censored at 5 years. Censoring is an essential characteristic of event times. An artificial analogy could be constructed by using, say, a 5-foot ruler to measure height. For any patient whose height is less than 5 feet, we measure it exactly, but for patients taller than 5 feet, we can only say that their height is at least, or is censored at, 5 feet. The inverse probability weighting (IPW) provides another way of obtaining probability estimates from censored survival data, and in the case of noninformative censoring it provides the same answer as the Kaplan-Meier (KM) method. It begins with the idea that if there were no censoring, then the mortality curve (the complement of the survival curve) would be given by the percentages of cumulative deaths with time, obtained by a simple summation. However, suppose that an event has occurred at time (t), and that exactly half of the patients are currently censored before that time. That is, only half of the patients have passed that point so far in time, and have been exposed to the risk of an event at time t. Then, on average, one could expect that eventually one more event will happen at that time (ie, after the second half of the patients have experienced the risk associated with that point in time). Therefore, the total number of expected events is 2. In general, if P is the proportion of patients who have passed through that point in time, then the expected number of events (E) is E = 1/P. However, P is the proportion of patients who are not censored, so if PC is the proportion of people who are censored (P + PC = 1), then E = 1/(1 − PC). For example, if there is no censoring, then PC = 0 and E = 1/(1 – 0) = 1, that is, the expected number of events stays at 1 because there are no more patients left to increase the number of events at time t. If there is mild censoring, say PC = 0.1, then 1 observed event is expected to increase to E = 1/0.9 = 1.1, just a slight increase because there will be only a small percentage of patients in the future with the potential to increase the number of observed events at time t. On the other hand, if there is severe censoring, say PC = 0.9, then the observed event is expected to increase to E = 1/0.1 = 10 events. The previous examples assumed a homogeneous event risk. If there is informative censoring, then PC could vary from patient to patient, and it would have to be estimated individually according the risk factors of the patient experiencing the event. This is what was done in the coronary artery bypass grafting (CABG) data example, using a Cox regression to estimate the PC values at each event time, according to the characteristics of the patient who suffered that event.
Referência(s)