Multiple Comparisons in Studies of Gene × Gene and Gene × Environment Interaction

Carta Acesso aberto Revisado por pares

Multiple Comparisons in Studies of Gene × Gene and Gene × Environment Interaction

2004; Elsevier BV; Volume: 74; Issue: 3 Linguagem: Inglês

10.1086/382051

ISSN

1537-6605

Autores

Peter Kraft,

Tópico(s)

Helicobacter pylori-related gastroenterology studies

Resumo

To the Editor: Complex diseases are (by definition) influenced by multiple genes, environmental factors, and their interactions. There is currently a strong interest in studies testing for association between combinations of these factors and disease, in part because genes that affect the risk of disease only in the presence of another genetic variant or particular environment may not be detected in a marginal (gene-by-gene) analysis (Culverhouse et al. Culverhouse et al., 2002Culverhouse R Suarez BK Lin J Reich T A perspective on epistasis: limits of models displaying no main effect.Am J Hum Genet. 2002; 70: 461-471Abstract Full Text Full Text PDF PubMed Scopus (295) Google Scholar). Such studies raise the problem of multiple comparisons. Even when a small number of candidate genes and environmental factors is examined, a large number of possible interactions may need to be tested, as illustrated by a recent article in The American Journal of Human Genetics (Bugawan et al. Bugawan et al., 2003Bugawan TL Mirel DB Valdes AM Panelo A Pozzili P Erlich HA Association and interaction of the IL4R, IL4, and IL13 loci with Type 1 diabetes among Filipinos.Am J Hum Genet. 2003; 72: 1505-1514Abstract Full Text Full Text PDF PubMed Scopus (62) Google Scholar). Bugawan et al. (Bugawan et al., 2003Bugawan TL Mirel DB Valdes AM Panelo A Pozzili P Erlich HA Association and interaction of the IL4R, IL4, and IL13 loci with Type 1 diabetes among Filipinos.Am J Hum Genet. 2003; 72: 1505-1514Abstract Full Text Full Text PDF PubMed Scopus (62) Google Scholar) investigated potential interaction between the IL4R locus and five tightly linked SNPs in the IL4 and IL13 loci on chromosome 5, through use of a sample of 90 patients with type I diabetes and 94 population-based controls. They independently tested each of the chromosome 5 SNPs for interaction with IL4R, through use of logistic regression (cf. their table 7), and corrected for multiple comparisons through use of a permutation procedure. They concluded that there is statistically significant evidence for an epistatic interaction between at least one of the chromosome 5 SNPs and the IL4R locus. However, the authors’ permutation procedure does not have the desired statistical property—that is, it rejects the global null hypothesis of no interaction too often when none of the estimated interaction parameters differ from their null value. In this letter, I discuss why their procedure fails, present several alternatives, and compare the performance of these alternatives in a small simulation study. The procedure presented by Bugawan et al. (Bugawan et al., 2003Bugawan TL Mirel DB Valdes AM Panelo A Pozzili P Erlich HA Association and interaction of the IL4R, IL4, and IL13 loci with Type 1 diabetes among Filipinos.Am J Hum Genet. 2003; 72: 1505-1514Abstract Full Text Full Text PDF PubMed Scopus (62) Google Scholar) amounts to plugging the order statistics for the observed p values, p(1),…,p(5), into their joint cumulative distribution function under the null: p=F0(p(1),…,p(5))=Pr(P(1)≤p(1),…,P(5)≤p(5)). (Here, italicized uppercase letters refer to random variables, and lowercase letters refer to observed values of the corresponding variables. This differs from the notation in the Bugawan et al. [Bugawan et al., 2003Bugawan TL Mirel DB Valdes AM Panelo A Pozzili P Erlich HA Association and interaction of the IL4R, IL4, and IL13 loci with Type 1 diabetes among Filipinos.Am J Hum Genet. 2003; 72: 1505-1514Abstract Full Text Full Text PDF PubMed Scopus (62) Google Scholar] article.) The authors estimate F0 by permuting case-control labels 200 times and calculating the ordered p values for each permutation. A simple example shows that this approach is inappropriate. Consider the p values from two independent tests, P1 and P2. If we assume a large enough sample size, P1 and P2 are independently uniform on (0,1) under the null, and, hence, the cumulative distribution function for the associated order statistics, F0(p(1),p(2)), is P(1)(2p(2)−p(1)) (Bickel and Doksum Bickel and Doksum, 1977Bickel PJ Doksum KA Mathematical statistics: basic ideas and selected topics. Prentice Hall, Englewood Cliffs, New Jersey1977Google Scholar). The distribution of P=F0(P(1),P(2)) under the global null is shown in figure 1a. P does not have a uniform distribution under the null, as we expect for a p value. In this case, a test that rejects the global null hypothesis that both tests are null when P<.05 would have a type I error rate between 10% and 15%. As shown in figure 1b, the magnitude of the type I error rate increases as the number of independent tests increases. There are several alternative, theoretically justified and simple procedures that correct for multiple comparisons, besides the notoriously conservative Bonferroni correction. Simes’s test (Simes Simes, 1986Simes RJ An improved Bonferroni procedure for multiple tests of significance.Biometrika. 1986; 73: 751-754Crossref Scopus (1423) Google Scholar), for example, controls the overall significance level (also known as the “familywise error rate”) when the tests are independent or exhibit a special type of dependence (Sarkar Sarkar, 1998Sarkar S Some probability inequalities for ordered MTP2 random variables: a proof of the Simes conjecture.Ann Stat. 1998; 26: 494-504Crossref Google Scholar). Simes’s test rejects the global null hypothesis that all K test-specific null hypotheses are true if p(k)≤αk/K for any k in 1,…,K. Simulation results reported in table 1 suggest that Simes’s test has the appropriate false-positive rate, even when the tests are correlated.Table 1Observed False-Positive Rates (False-Discovery Rates) for Procedures with Nominal 5% Rates in the Context of Testing Five Possible Gene × Gene Interactions, Calculated from 500 Simulated Data SetsFalse-Positive Rate under ModelProcedurea“CDF” denotes the cumulative distribution function procedure used by Bugawan et al. (2003); “Simes” is the standard Simes’s test; “RSimes” is Simes’s test applied to p values calculated by comparing the observed p values to the distribution of p values generated by permuting the outcome variable 200 times; “BHD” is the Benjamini and Hochberg step-up procedure corrected for general dependency (Benjamini and Yekutieli 2001) (the usual step-up procedure is identical to Simes’s test in this case); and “DRW” is the related procedure proposed by Devlin et al. (2003).Null INull IICDF.194.214Simes.032.036RSimes.048.058False-Discovery Rate under ModelNull INull IIBHD.014.014DRW.050.070Note.—Six SNPs were simulated for 100 cases and 100 controls. The first SNP had mutant-allele frequency of .2; the other five SNPs were generated independently of the first by sampling five-SNP haplotypes with frequencies similar to those given in table 5 of Bugawan et al. (Bugawan et al., 2003Bugawan TL Mirel DB Valdes AM Panelo A Pozzili P Erlich HA Association and interaction of the IL4R, IL4, and IL13 loci with Type 1 diabetes among Filipinos.Am J Hum Genet. 2003; 72: 1505-1514Abstract Full Text Full Text PDF PubMed Scopus (62) Google Scholar). Under model Null I, none of the SNPs were associated with disease. Under Null II, each mutant allele for the first SNP doubles disease risk, but the remaining five SNPs are not associated with disease. The multiple-comparisons procedures are applied to the p values from five Wald tests for interaction based on the logistic model Pr(disease)=α+β1SNP1+βSNPi+βintSNP*1SNPi, analogous to that of Bugawan et al. (Bugawan et al., 2003Bugawan TL Mirel DB Valdes AM Panelo A Pozzili P Erlich HA Association and interaction of the IL4R, IL4, and IL13 loci with Type 1 diabetes among Filipinos.Am J Hum Genet. 2003; 72: 1505-1514Abstract Full Text Full Text PDF PubMed Scopus (62) Google Scholar).a “CDF” denotes the cumulative distribution function procedure used by Bugawan et al. (Bugawan et al., 2003Bugawan TL Mirel DB Valdes AM Panelo A Pozzili P Erlich HA Association and interaction of the IL4R, IL4, and IL13 loci with Type 1 diabetes among Filipinos.Am J Hum Genet. 2003; 72: 1505-1514Abstract Full Text Full Text PDF PubMed Scopus (62) Google Scholar); “Simes” is the standard Simes’s test; “RSimes” is Simes’s test applied to p values calculated by comparing the observed p values to the distribution of p values generated by permuting the outcome variable 200 times; “BHD” is the Benjamini and Hochberg step-up procedure corrected for general dependency (Benjamini and Yekutieli Benjamini and Yekutieli, 2001Benjamini Y Yekutieli D The control of the false discovery rate in multiple testing under dependency.Ann Stat. 2001; 29: 1165-1188Crossref Scopus (5998) Google Scholar) (the usual step-up procedure is identical to Simes’s test in this case); and “DRW” is the related procedure proposed by Devlin et al. (Devlin et al., 2003Devlin B Roeder K Wasserman L Analysis of multilocus models of association.Genet Epidemiol. 2003; 25: 36-47Crossref PubMed Scopus (53) Google Scholar). Open table in a new tab Note.— Six SNPs were simulated for 100 cases and 100 controls. The first SNP had mutant-allele frequency of .2; the other five SNPs were generated independently of the first by sampling five-SNP haplotypes with frequencies similar to those given in table 5 of Bugawan et al. (Bugawan et al., 2003Bugawan TL Mirel DB Valdes AM Panelo A Pozzili P Erlich HA Association and interaction of the IL4R, IL4, and IL13 loci with Type 1 diabetes among Filipinos.Am J Hum Genet. 2003; 72: 1505-1514Abstract Full Text Full Text PDF PubMed Scopus (62) Google Scholar). Under model Null I, none of the SNPs were associated with disease. Under Null II, each mutant allele for the first SNP doubles disease risk, but the remaining five SNPs are not associated with disease. The multiple-comparisons procedures are applied to the p values from five Wald tests for interaction based on the logistic model Pr(disease)=α+β1SNP1+βSNPi+βintSNP*1SNPi, analogous to that of Bugawan et al. (Bugawan et al., 2003Bugawan TL Mirel DB Valdes AM Panelo A Pozzili P Erlich HA Association and interaction of the IL4R, IL4, and IL13 loci with Type 1 diabetes among Filipinos.Am J Hum Genet. 2003; 72: 1505-1514Abstract Full Text Full Text PDF PubMed Scopus (62) Google Scholar). Other approaches with particular appeal in the context of multiple-gene and multiple-environmental-factor studies aim to control the false-discovery rate—that is, the expected proportion of rejected null hypotheses that are falsely rejected. This approach is particularly useful when a portion of the null hypotheses can be assumed false, as in microarray studies. Devlin et al. (Devlin et al., 2003Devlin B Roeder K Wasserman L Analysis of multilocus models of association.Genet Epidemiol. 2003; 25: 36-47Crossref PubMed Scopus (53) Google Scholar) recently proposed a variant of the Benjamini and Hochberg (Benjamini and Hochberg, 1995Benjamini Y Hochberg Y Controlling the false discovery rate: a practical and powerful approach to multiple testing.J R Stat Soc Ser B. 1995; 57: 289-300Google Scholar) step-up procedure that controls the false-discovery rate when testing a large number of possible gene × gene interactions in multilocus association studies. The Benjamini and Hochberg procedure is related to Simes’s test; setting k*=maxk such that p(k)≤αk/K, it rejects all k* null hypotheses corresponding to p(1),…,p(k*). In fact, the Benjamini and Hochberg procedure reduces to Simes’s test when all null hypotheses are true (Benjamini and Yekutieli Benjamini and Yekutieli, 2001Benjamini Y Yekutieli D The control of the false discovery rate in multiple testing under dependency.Ann Stat. 2001; 29: 1165-1188Crossref Scopus (5998) Google Scholar). Devlin et al.’s (Devlin et al., 2003Devlin B Roeder K Wasserman L Analysis of multilocus models of association.Genet Epidemiol. 2003; 25: 36-47Crossref PubMed Scopus (53) Google Scholar) proof for the validity of their false-discovery-rate procedure requires that the analyzed genes be statistically independent. This is not the case for the IL4 and IL13 SNPs studied by Bugawan et al. (Bugawan et al., 2003Bugawan TL Mirel DB Valdes AM Panelo A Pozzili P Erlich HA Association and interaction of the IL4R, IL4, and IL13 loci with Type 1 diabetes among Filipinos.Am J Hum Genet. 2003; 72: 1505-1514Abstract Full Text Full Text PDF PubMed Scopus (62) Google Scholar), but the simulation results in table 1 suggest that Devlin et al.’s (Devlin et al., 2003Devlin B Roeder K Wasserman L Analysis of multilocus models of association.Genet Epidemiol. 2003; 25: 36-47Crossref PubMed Scopus (53) Google Scholar) procedure controls the false-discovery rate even when the analyzed genes are correlated. The p values reported in table 7 of Bugawan et al. (Bugawan et al., 2003Bugawan TL Mirel DB Valdes AM Panelo A Pozzili P Erlich HA Association and interaction of the IL4R, IL4, and IL13 loci with Type 1 diabetes among Filipinos.Am J Hum Genet. 2003; 72: 1505-1514Abstract Full Text Full Text PDF PubMed Scopus (62) Google Scholar) do not lead to any significant results at the .05 level when any of the alternative procedures discussed here are used. Clearly, effective methods are needed for adjusting for multiple comparisons when testing for association between multiple factors and complex disease. On the one hand, blithely reporting any results marginally “significant” at the .05 level or relying on outdated and ill-performing stepwise model-building procedures (see, e.g., Burnham and Anderson [Burnham and Anderson, 2002Burnham KP Anderson DR Model selection and multimodel inference: a practical information-theoretic approach. Springer, New York2002Google Scholar] and Devlin et al. [Devlin et al., 2003Devlin B Roeder K Wasserman L Analysis of multilocus models of association.Genet Epidemiol. 2003; 25: 36-47Crossref PubMed Scopus (53) Google Scholar]) will lead to spurious results, expensive follow-up studies with little chance of replication, and confusion. On the other hand, overly conservative procedures will create missed opportunities. Although the procedures discussed here are known to control the familywise error rate or false-discovery rate in particular situations (e.g., independent covariates), their performance in more general situations needs further investigation.

Ver no editor

Altmetric

PlumX

Entrar

Lembrar minha senha

Receber meu e-mail de confirmação

Multiple Comparisons in Studies of Gene × Gene and Gene × Environment Interaction