Genetics and Genomics
2008; Lippincott Williams & Wilkins; Volume: 118; Issue: 15 Linguagem: Inglês
10.1161/circulationaha.107.714600
ISSN1524-4539
Autores Tópico(s)Genetics, Bioinformatics, and Biomedical Research
ResumoHomeCirculationVol. 118, No. 15Genetics and Genomics Free AccessReview ArticlePDF/EPUBAboutView PDFView EPUBSections ToolsAdd to favoritesDownload citationsTrack citationsPermissions ShareShare onFacebookTwitterLinked InMendeleyReddit Jump toFree AccessReview ArticlePDF/EPUBGenetics and GenomicsGene Expression Microarrays Karla V. Ballman, PhD Karla V. BallmanKarla V. Ballman From the Division of Biostatistics, Mayo Clinic, Rochester, Minn. Originally published7 Oct 2008https://doi.org/10.1161/CIRCULATIONAHA.107.714600Circulation. 2008;118:1593–1597Since its development almost 2 decades ago, gene expression microarray technology has generated excitement and raised expectations for dramatic discoveries of underlying disease mechanisms. The general concept of a gene expression microarray is relatively straightforward. Messenger RNA is extracted from a biospecimen. Examples of biospecimens include cells grown in culture, tissue from animal models, and human tissue. The messenger RNAs are converted into DNA, labeled with a fluorescent dye, and hybridized to a slide containing probes corresponding to target genes of interest arranged in a matrix format. The level of gene expression is estimated from the raw intensities of the fluorescence emitted by the labeled sequence bound to probes representing genes from which messenger RNAs were transcribed. Various different platforms are available to measure gene expression levels. Some are single-channel arrays in which 1 sample is hybridized on each array; others are 2-channel arrays in which 2 samples are hybridized simultaneously on a single array with 2 different fluorescent dyes. Some platforms are all-purpose standard arrays manufactured on a large scale that typically interrogate the expression level of all known genes. There also are custom arrays available that measure the expression levels of genes known to be associated with a particular disease or genes of interest to an individual researcher. Throughout this article, the term microarray is used to refer to a gene expression microarray.The intent of the present article is to present an overview of the statistical components related to the design and analysis of a microarray experiment. In general, the usual statistical standards of scientific research apply to microarray studies. This article reviews common study objectives and designs, describes the typical analytical techniques, and concludes with a discussion of threats to the validity of a microarray study. Because articles in this statistics series have word-count restrictions that necessarily limit their breadth and depth, readers wanting more detail are directed to more comprehensive reviews by Allison et al,1 Dobbin and Simon,2 and Dupuy and Simon.3Study ObjectivesEffective use of microarray experiments requires clear objectives and effective study design. Clearly stated study objectives are needed to determine the appropriate type of specimens, an adequate sample size, and suitable analysis plans. Medical research studies using microarrays encompass a wide variety of disease-related objectives that generally fall into 1 of 3 broad categories: class comparison, class prediction, and class discovery.Class ComparisonClass comparison is also commonly known as differential analysis or analysis of differential expression. It involves the comparison of gene expression profiles of samples from distinct, predefined groups to identify the genes that are differentially expressed among the groups. Examples include comparing treated and untreated cell lines to ascertain the effect of a new drug on gene expression levels or a comparison of healthy tissue with diseased tissue to identify genes with altered expression. Such studies yield lists of genes that are significantly altered among the groups. The purpose of the lists is to provide insight into the underlying biological mechanisms and perhaps to identify potential therapeutic targets.Class PredictionA class prediction study develops a classifier that uses the gene expression level of multiple genes that can be applied to the expression profile of a newly acquired biospecimen to predict its (unknown) class. As in class comparison studies, the focus is to identify genes that differ across predefined classes. However, in class prediction studies, the gene expression values are explanatory variables rather than outcome variables. Furthermore, the goal of the analysis of class prediction studies is to identify a small set of genes able to accurately discern among the distinct classes; it is not to identify all genes that differ. There are many different possibilities for classes of interest. Class prediction studies could be yield classifiers that are (1) diagnostic, eg, a classifier that distinguishes between 2 different disease states; (2) prognostic, eg, a classifier that distinguishes short-term survivors from long-term survivors; or (3) predictive, eg, a classifier that predicts whether a patient will respond to a particular drug.Class DiscoveryClass discovery differs from class comparison and class prediction studies in that the classes are not predefined. Typically, the objective of these studies is to determine whether subsets of samples with seemingly homogeneous phenotypes can be discerned on the basis of differences in their gene expression profiles. For example, there are many diseases for which individuals with seemingly similar phenotypes have a large variability in outcomes such as survival. The hypothesis is that differences at a molecular level are the cause of the observed variability in outcome. Class discovery studies also may uncover biological features of the disease that provide insight into the disease pathogenesis; in particular, identifying molecular differences that define disease subgroups may lead to improved therapeutic agents.Study DesignThere have been numerous claims of gene expression profiles that can accurately diagnose disease or make an accurate prognosis, and some prominent results have not been reproduced.4–7 There are 3 main reasons for this: bias, overfitting, and generalizability. Bias poses a substantial threat to the validity of a gene expression microarray study and is the most difficult to avoid. Study bias is addressed primarily through proper study design. Another important aspect of study design is determining an adequate sample size to address the question of interest.Study Design IssuesGood study design minimizes potential bias, defined here as a systematic erroneous association of a variable with a group in a way that distorts the comparison between groups. As an example of bias, suppose, in a study designed to assess the ability of a multigene expression test to discern between groups of patients with and without left ventricular dysfunction, that the samples from individuals in the left ventricular dysfunction group have been stored an average of 7 years and those in the comparison group have an average storage time of 2 years. If the test is affected by tissue storage time, the bias could account for the ability of the test to distinguish between the groups.To avoid bias, study groups should be created that differ only with respect to the variable of interest; they should be "equal" with respect to all other variables. An experimental method such as a randomized controlled trial provides the most effective way to create equal groups and to minimize bias. Often, the experimental method cannot be used to address research questions of interest in microarray studies such as diagnosis and prognosis. However, the principle of experimental method can be applied to these observational studies, which can minimize some common biases that arise in microarray studies.8 Note that the issue of bias is a relatively new challenge for laboratory researchers who are used to tightly controlled conditions of experimental research. Designs for microarray studies should strive to create homogeneity across groups on all variables but the one of interest. In particular, consideration should be given to the equality of characteristics of individuals, specimen collection, handling, and storage. A mechanism to facilitate obtaining subject (biospecimen) homogeneity is the use of inclusion/exclusion criteria. Likewise, there should be similar protocols used for the collection, storage, and handling of specimens for all groups. Applying similar protocols across groups poses a challenge because disease tissues often undergo different collection, handling, and storage procedures compared with normal tissues.Sample SizeMicroarray studies tend to be underpowered, meaning they have too few samples for detecting features of interest. The reason cited is often the large cost of the arrays. However, from a scientific perspective, it is essential to have adequate sample sizes. One simple, common approach in the setting of class comparison is to perform power and sample size calculations based on a 2-sample t test. Appropriately large samples are required to avoid missing important expression differentials. Other approaches to sample size estimation would be used for prediction or discovery settings.The approach to sample size planning for a microarray class comparison study is the same as for any other medical research study and is based on 4 parameters: α, the level of significance or probability of a type I error; 1−β, the statistical power (β is the probability of a type II error); δ, the minimum detectable difference in gene expression levels (usually on a log base 2 scale); and σ, the SD of a gene expression level (usually on a log base 2 scale). The investigator sets the values of the first 3 parameters. Recommended values for α and β for gene expression microarray studies are α=0.001 and β=0.05. The small α value is to protect against multiple comparisons (more below); the small β value is to provide good statistical power for identifying genes that really are differentially expressed. The value of δ differs for each gene, and investigators often set δ=1, which corresponds to a 2-fold difference on a log base 2 scale. Finally, estimates are needed for the variation of expression values among samples. This variability depends on the heterogeneity of the biospecimens, with cell line samples being the least heterogeneous, human samples being the most heterogeneous, and animal model samples falling somewhere in between. Cell line samples are derived from multiple cultures of the same cell line, 1 per culture. Furthermore, the amount of variability depends on a particular gene.Simon et al9 report that the average within-class expression level variability (log 2 scale) ranges from 0.10 for cell lines to 0.25 for animal model tissue to 0.5 to 0.75 for human tissues. The Table gives general guidelines for minimum sample sizes according to the numbers above and using a 2-sided t test. As microarray data become more abundant, estimates of σ should be based on previous experiments using similar biospecimens from similar populations. Table. Sample Size per Group for Class Comparison in a Gene Expression Study as a Function of the SDBiospecimenσNo. per Groupα=0.001; β=0.95; 2-sided t test; δ=1.*Multiple samples arise from different cultures of the same cell line, 1 sample per culture.Cell line*0.14Animal model tissue0.256Human tissue, small variability0.5015Human tissue, large variability0.7531Objectives of microarray experiments also include class prediction and class discovery. At the present time, there are no generally accepted methods for determining sample sizes for class prediction models; however, methods do exist in the literature such as that of Dobbin and Simon.10 The development of sample size determination for class discovery is currently an active area of research. It is generally acknowledged that these types of studies require larger sample sizes than a class comparison or class prediction study.AnalysisBefore analysis, microarray data often are transformed to the log 2 base scale. There are 2 reasons. First, gene expression levels span orders of magnitude, and using a log base 2 scale reduces the magnitude of the range of the data and often makes it more normally distributed. Second, biologists are comfortable interpreting changes on a log 2 scale in which a 1-unit change is equivalent to a doubling in the original scale. There are typically 2 steps in the analysis of microarray data. The first involves low-level preprocessing of the data; the second is a higher-order analysis that depends on the nature of the research question.Low-Level Analysis: Data PreprocessingData preprocessing (low-level analysis) consists of background subtraction and normalization. The purpose of background subtraction is to make the foreground intensity more properly proportional to the abundance of bound material (ie, cDNA). There is a tradeoff between performing a background subtraction or not in terms of variance and bias. If no background subtraction is done, the variance in expression levels is the smallest, but the bias in terms of the measured intensity reflecting the amount of bound material is greatest. Performing background subtraction reduces the bias but increases the amount of variability in gene expression level. Some investigators choose not to perform any background correction because currently little is known of the biological effects of a given level of gene expression. Hence, having bias in the estimate in the absolute level of gene expression is of little consequence, and the gain is less variability, which translates to greater power. Global background correction techniques11,12 use the same constant to represent the background for all spots on the array. It is easy to perform but does not take into account background variation that exists across the array. Regional background correction13–15 provides greater flexibility than global correction in terms of accounting for variation in background across the array.In addition to the biological variability among samples, other sources of variability or noise are introduced through the processes of biospecimen handling, RNA extraction, fluorescent labeling, hybridization to the array, and laser scanning to measure the intensities. Before the gene expression values between arrays are compared, the arrays typically are normalized to remove unwanted/systematic variability in expression levels for a gene across arrays induced by the process of RNA extraction, hybridization, and scanning while preserving the true biological variability. Many normalization procedures are based on the assumption that the expression levels of most genes do not change among the groups being compared. This is a reasonable assumption for most microarray experiments that use general-purpose arrays designed to interrogate all genes. There are 2 general types of normalization techniques: linear normalization16 and nonlinear normalization.17–19 Linear normalization uses the same normalization factor for all genes on the array. Nonlinear normalization determines the normalization factor for a spot on the basis of its level of expression. It is generally agreed that nonlinear normalization techniques are superior to linear normalization techniques.Class ComparisonThe goal of a class comparison study is to identify genes that are differentially expressed between prespecified classes. Tests used to identify genes as differentially expressed between groups need to account for variability. In early microarray studies, investigators selected genes with mean values that differed between groups by a predefined magnitude such as a ≥2-fold difference. This completely ignored the amount of variability in gene expression levels, greatly increasing the probability of declaring a gene differentially expressed when it truly is not.20 Methods that account for the magnitude of the difference in mean expression between groups and level of variability of gene expression within a group include the t test, permutation test, and Wilcoxon rank-sum test. The last 2 are used when there is concern that the gene expression levels are not normally distributed. If there are >2 groups, there are natural extensions of these tests: ANOVA, permutation F tests, and the Kruskal-Wallis test, respectively.Class PredictionThere are numerous methods for class comparison analyses, and the number keeps increasing. Methods include logistic regression,21 linear and quadratic discriminant analysis,22 nearest neighbor classifiers,23 decision trees,24 shrunken centroids,25 neural networks,26 random forests,27 support vector machines,28 and many more. It has been shown that simpler methods tend to work better,29 likely because many of the more sophisticated methods that uncover even slight structure in the data do not work well when the number of cases is small compared with the number of variables because they then tend to model the noise in the data in addition to, or rather than, the true structure.Class DiscoveryThe idea of class discovery is to discover patterns in microarray data. There are 2 types of problems in this category: identifying groups of coexpressed genes and finding patterns in expression profiles of different specimens that do not have predefined profiles from among a set of samples with similar phenotypes. The techniques used for class discovery are called unsupervised methods. A primary tool used for class discovery is clustering algorithms; their goal is to form subgroups so that the objects (specimens or genes) within a subgroup are more similar to one another than objects in different groups. There are 2 main types of clustering algorithms: hierarchical methods and partitional methods. Hierarchical algorithms derive a nested series of partitions of data points either by merging 2 clusters at each step (agglomerative hierarchical clustering) or by splitting a cluster into 2 clusters at each step (divisive hierarchical clustering). Partitional methods aim to produce a single partition of the items; there is no nesting. K-means clustering30 and self-organizing maps31 are both partitional methods.The results of clustering algorithms are typically displayed graphically. In the case of hierarchical clustering, a tree structure or dendrogram is generated (see the Figure). Another popular display for hierarchical clustering results in a heat map (also called a color image plot). A heat map is a rectangular array of colored blocks with the block color representing the expression level of 1 gene on 1 array (specimen). Each column of boxes represents a specimen, and each row of boxes represents a gene. The columns are ordered according to how they are ordered in a hierarchical clustering dendogram. Typically, the rows also are ordered according to the dendogram that resulted from a cluster analysis of the genes (see the Figure). The results are patches of 2 different colors (typically red and green) indicating combinations of genes and specimens that exhibit high or low expression. Download figureDownload PowerPointFigure. Example of a dendogram and heat map reprinted from Chen et al32 with permission from Lippincott Williams & Wilkins.DiscussionBecause of the nature of microarray data (orders-of-magnitude more variables than samples), chance often threatens the validity of the results by leading to erroneous conclusions. In the case of class comparisons, tens of thousands of hypothesis tests are performed simultaneously, 1 for each gene. Suppose that no genes truly are differentially expressed between 2 groups. If a microarray measures 30 000 genes and a significance level of 0.05 is used, 1500 genes would be falsely declared as being differentially expressed as a result of chance alone. Several approaches are used to control the number of false positives in a study. The first method is to choose a smaller level of significance, typically 0.001. In the example, this would reduce the number of false positives from 1500 to 30. Other methods used to address the multiple comparison problems are the use of Bonferroni-adjusted probability values and multivariate permutations methods. Another approach is to control the false discovery proportion, ie, of those genes that are identified as being differentially expressed, control the proportion of false positives. This average false discovery proportion is sometimes called the false discovery rate.33A problem caused by chance that is often less familiar to investigators performing gene expression microarray studies is overfitting.34 Overfitting occurs when a multivariable model designed to generate a classifier that discriminates between groups of patients is made to fit the data perfectly. A problem occurs when a pattern is found that fits perfectly by chance. Given that thousands of genes are typically considered when developing a multigene classifier, there is a large likelihood that this will occur. Overfitting is not inherent in microarray studies; it can occur whenever a multivariable analysis is done to assess associations between a large number of explanatory variables and an outcome with a relatively small sample size (especially when the number of explanatory variables exceeds the sample size). Overfitting can easily be checked for by assessing reproducibility in a completely independent group of individuals. Overfitting remains a problem in microarray studies mainly because of failure to adequately carry out such checking.ConclusionsThe design and analysis of microarray studies should adhere to proven principles for all clinical research studies. Problems caused by chance and bias must be minimized at every step of a study. Bias presents the greatest difficulty; many biases are subtle. Unfortunately, in current microarray research, problems related to bias are widely ignored by investigators, reviewers, and editors, often leading to erroneous conclusions. Chance also threatens the validity of many microarray results. To lessen this threat, the possibility of overfitting should be explicitly assessed. Specifically, in microarray studies, claims of discrimination of classifiers should prompt the question of whether reproducibility has been assessed in an independent validation set of adequate size.35 If it has not, then the results should be viewed as inconclusive.DisclosuresNone.FootnotesCorrespondence to Karla V. Ballman, PhD, Division of Biostatistics, Mayo Clinic, Harwick 8, 200 First St SW, Rochester, MN 55905. E-mail [email protected] References 1 Allison DB, Cui X, Page GP, Sabripour M. Microarray data analysis: from disarray to consolidation and consensus. Nat Rev Genet. 2006; 7: 55–65.CrossrefMedlineGoogle Scholar2 Dobbin K, Simon R. Experimental design of DNA microarray studies. In: Jorde L, Little P, Dunn M, Subramaniam S, eds. Encyclopedia of Genetics, Genomics, Proteomics, and Bioinformatics. New York, NY: Wiley; 2005.Google Scholar3 Dupuy A, Simon R. Critical review of published microarray studies for cancer outcome and guidelines on statistical analysis and reporting. J Natl Cancer Inst. 2007; 99: 147–157.CrossrefMedlineGoogle Scholar4 Ransohoff DF. Gene-expression signatures in breast cancer. N Engl J Med. 2003; 348: 1715–1717.CrossrefGoogle Scholar5 Mehta T, Tanik M, Allison DB. Towards sound epistemological foundations of statistical methods for high-dimensional biology. Nat Genet. 2004; 36: 943–947.CrossrefMedlineGoogle Scholar6 Marshall E. Getting the noise out of gene arrays. Science. 2004; 306: 630–631.CrossrefMedlineGoogle Scholar7 Ambroise C, McLachlan GJ. Selection bias in gene extraction on the basis of microarray gene-expression data. Proc Natl Acad Sci U S A. 2002; 99: 6562–6566.CrossrefMedlineGoogle Scholar8 Ransohoff DF. Bias as a threat to the validity of cancer molecular-marker research. Nat Rev Cancer. 2005; 5: 142–149.CrossrefMedlineGoogle Scholar9 Simon R, Radmacher MD, Dobbin K. Design of studies using DNA microarrays. Genet Epidemiol. 2002; 23: 21–36.CrossrefMedlineGoogle Scholar10 Dobbin K, Simon R. Sample size planning for developing classifiers using high dimension DNA microarray data. Biostatistics. 2007; 8: 101–117.CrossrefMedlineGoogle Scholar11 Brown CS, Goodwine PC, Sorger PK. Image metrics in the statistical analysis of DNA microarray data. Proc Natl Acad Sci U S A. 2001; 98: 8944–8949.CrossrefMedlineGoogle Scholar12 Statistical Algorithms Reference Guide. Santa Clara, Calif: Affymetrix; 2001.Google Scholar13 Yang YH, Buckley MJ, Dudoit S, Speed TP. Analysis of cDNA microarray images. Brief Bioinform. 2001; 2: 341–349.CrossrefMedlineGoogle Scholar14 Jain AK, Tokuyasu TA, Snijders AM, Segraves R, Albertson DG, Pinkel D. Fully automatic quantification of microarray image data. Genome Res. 2002; 12: 325–332.CrossrefMedlineGoogle Scholar15 GenePix Pro 4.0 User's Guide. Foster City, Calif: Axon Instruments Inc; 2001.Google Scholar16 Affymetrix Microarray Suite User Giude, Version 5. Santa Clara, Calif: Affymetrix; 2001.Google Scholar17 Yang YH, Dudoit S, Luu P, Lin DM, Peng V, Ngai J, Speed TP. Normalization for cDNA microarray data: a robust composite method addressing signe and multiple slide systematic variation. Nucleic Acids Res. 2002; 30: e15.CrossrefMedlineGoogle Scholar18 Bolstad BM, Irizarry RA, Astrand M, Speed TP. A comparison of normalization methods of high density array data based on variance and bias. Bioinformatics. 2003; 19: 185–193.CrossrefMedlineGoogle Scholar19 Ballman KV, Grill DE, Oberg AL, Therneau TM. Faster cyclic loess: normalizing RNA arrays via linear models. Bioinformatics. 2004; 20: 2778–2786.CrossrefMedlineGoogle Scholar20 Miller RA, Galecki A, Shmookler-Reis RJ. Interpretation, design, and analysis of gene array expression experiments. J Gerontol. 2001; 56A: B52–B57.Google Scholar21 Cox DR. Analysis of Binary Data. London, UK: Methuen; 1970.Google Scholar22 Johnson RA, Wichern DW. Applied Multivariate Statistical Analysis. 4th ed. Upper Saddle River, NJ: Prentice-Hall; 1999.Google Scholar23 Hastie T, Tibshirani R, Friedman J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. New York, NY: Springer; 2001.Google Scholar24 Brieman L, Friedman J, Stone C, Olshen R. Classification and Regression Trees. Belmont, Calif: Wadsworth; 1984.Google Scholar25 Tibshirani R, Hastie T, Narasimhan B, Chu G. Diagnosis of multiple cancer types by shrunken centroids of gene expression. Proc Natl Acad Sci U S A. 2002; 99: 6567–6572.CrossrefMedlineGoogle Scholar26 Ripley BD. Pattern Recognition and Neural Networks. Cambridge, UK: Cambridge University Press; 1996.Google Scholar27 Brieman L. Random forests. Machine Learning. 2001; 45: 5–32.CrossrefGoogle Scholar28 Vapnik V. Statistical Learning Theory. New York, NY: Wiley; 1998.Google Scholar29 Dudoit S, Fridlyand J, Speed TP. Comparison of discrimination methods for classification of tumors using DNA microarrays. J Am Stat Assoc. 2002; 97: 77–87.CrossrefGoogle Scholar30 MacQueen J. Some methods for classification and analysis of multivariate observations. In: Proceedings of Fifth Berkeley Symposium on Mathematical Statistics and Probability. Berkeley, Calif; University of Berkeley; 1967; 281–297.Google Scholar31 Kohonen T. Self-Organizing Maps. Berlin, Germany: Springer; 1997.Google Scholar32 Chen MM, Ashley EA, Deng DXF, Tsalenko A, Deng A, Tabibiazar R, Ben-Dor A, Fenster B, Yang E, King JY, Fowler M, Robbins R, Johnson FL, Bruhn L, McDonagh T. Novel role for the potent endogenous inotrope apelin in human cardiac dysfunction. Circulation. 2003; 108: 1432–1439.LinkGoogle Scholar33 Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Roy Stat Soc Ser B. 1995; 57: 289–300.Google Scholar34 Ransohoff DF. Rules of evidence for cancer molecular-marker discovery and validation. Nat Rev Cancer. 2004; 4: 309–314.CrossrefMedlineGoogle Scholar35 Simon R, Radmacher MD, Dobbin K, McShane LM. Pitfalls in the analysis of DNA microarray data for diagnostic and prognostic classification. J Natl Cancer Inst. 2003; 95: 14–18.CrossrefMedlineGoogle Scholar Previous Back to top Next FiguresReferencesRelatedDetailsCited By Kanamori F, Yokoyama K, Ota A, Yoshikawa K, Karnan S, Maruwaka M, Shimizu K, Ota S, Uda K, Araki Y, Okamoto S, Maesawa S, Wakabayashi T and Natsume A Transcriptome-wide analysis of intracranial artery in patients with moyamoya disease showing upregulation of immune response, and downregulation of oxidative phosphorylation and DNA repair, Neurosurgical Focus, 10.3171/2021.6.FOCUS20870, 51:3, (E3) Kok M, de Ronde M, Moerland P, Ruijter J, Creemers E and Pinto-Sietsma S (2018) Small sample sizes in high-throughput miRNA screens: A common pitfall for the identification of miRNA biomarkers, Biomolecular Detection and Quantification, 10.1016/j.bdq.2017.11.002, 15, (1-5), Online publication date: 1-May-2018. de Ronde M, Ruijter J, Moerland P, Creemers E and Pinto-Sietsma S (2018) Study Design and qPCR Data Analysis Guidelines for Reliable Circulating miRNA Biomarker Experiments: A Review, Clinical Chemistry, 10.1373/clinchem.2017.285288, 64:9, (1308-1318), Online publication date: 1-Sep-2018. Zhang W, Zhu M, Zhang G, Liu F, Wang H, Guo X and Xu B (2016) Molecular cloning, expression, and stress response of the estrogen-related receptor gene (AccERR) from Apis cerana cerana, The Science of Nature, 10.1007/s00114-016-1340-9, 103:3-4, Online publication date: 1-Apr-2016. Lareau C, White B, Oberg A and McKinney B (2015) Differential co-expression network centrality and machine learning feature selection for identifying susceptibility hubs in networks with scale-free structure, BioData Mining, 10.1186/s13040-015-0040-x, 8:1, Online publication date: 1-Jun-2015. Shah S, Katz D and Deo R (2014) Phenotypic Spectrum of Heart Failure with Preserved Ejection Fraction, Heart Failure Clinics, 10.1016/j.hfc.2014.04.008, 10:3, (407-418), Online publication date: 1-Jul-2014. Williams-DeVane C, Reif D, Cohen Hubal E, Bushel P, Hudgens E, Gallagher J and Edwards S (2013) Decision tree-based method for integrating gene expression, demographic, and clinical data to determine disease endotypes, BMC Systems Biology, 10.1186/1752-0509-7-119, 7:1, (119), . Mahoney D, Therneau T, Heppelmann C, Higgins L, Benson L, Zenka R, Jagtap P, Nelsestuen G, Bergen H and Oberg A (2011) Relative Quantification: Characterization of Bias, Variability and Fold Changes in Mass Spectrometry Data from iTRAQ-Labeled Peptides, Journal of Proteome Research, 10.1021/pr2001308, 10:9, (4325-4333), Online publication date: 2-Sep-2011. Gallagher J, Hubal E and Edwards S (2010) Biomarkers for Environmental Exposure Biomarkers, 10.1002/9780470918562.ch20, (517-547), Online publication date: 20-Sep-2010. Vasikova A, Belickova M, Budinska E and Cermak J (2010) A distinct expression of various gene subsets in CD34+ cells from patients with early and advanced myelodysplastic syndrome, Leukemia Research, 10.1016/j.leukres.2010.02.021, 34:12, (1566-1572), Online publication date: 1-Dec-2010. HASDEMIR C, AYDIN H, CELIK H, SIMSEK E, PAYZIN S, KAYIKCIOGLU M, AYDIN M, KULTURSAY H and CAN L (2010) Transcriptional Profiling of Septal Wall of the Right Ventricular Outflow Tract in Patients with Idiopathic Ventricular Arrhythmias, Pacing and Clinical Electrophysiology, 10.1111/j.1540-8159.2009.02606.x, 33:2, (159-167), Online publication date: 1-Feb-2010. Ma H, Kao C, Lee Y, Chiou G, Tai L, Lu K, Huang C, Chen Y, Chiou S, Cheng I and Wong T (2009) Differential expression profiling between atypical teratoid/rhabdoid and medulloblastoma tumor in vitro and in vivo using microarray analysis, Child's Nervous System, 10.1007/s00381-009-1016-2, 26:3, (293-303), Online publication date: 1-Mar-2010. Heitner S, Hollenberg S and Colilla S (2010) Heat maps, random forests, and nearest neighbors: A peek into the new molecular diagnostic world*, Critical Care Medicine, 10.1097/CCM.0b013e3181c545ed, 38:1, (296-298), Online publication date: 1-Jan-2010. Shah S (2009) Genetics of systemic sclerosis-associated pulmonary arterial hypertension: Recent progress and current concepts, Current Rheumatology Reports, 10.1007/s11926-009-0013-5, 11:2, (89-96), Online publication date: 1-Apr-2009. Tromp G and Kuivaniemi H (2009) Developments in Genomics to Improve Understanding, Diagnosis and Management of Aneurysms and Peripheral Artery Disease, European Journal of Vascular and Endovascular Surgery, 10.1016/j.ejvs.2009.08.010, 38:6, (676-682), Online publication date: 1-Dec-2009. Nunes F, Ihle K, Mutti N, Simões Z and Amdam G (2013) The gene vitellogenin affects microRNA regulation in honey bee ( Apis mellifera ) fat body and brain , Journal of Experimental Biology, 10.1242/jeb.089243 October 7, 2008Vol 118, Issue 15 Advertisement Article InformationMetrics https://doi.org/10.1161/CIRCULATIONAHA.107.714600PMID: 18838575 Originally publishedOctober 7, 2008 Keywordsgenesgene expression microarraysstatisticsPDF download Advertisement SubjectsGene Expression and Regulation
Referência(s)