Phylogeny of East Asian Mitochondrial DNA Lineages Inferred from Complete Sequences
2003; Elsevier BV; Volume: 73; Issue: 3 Linguagem: Inglês
10.1086/377718
ISSN1537-6605
AutoresQing‐Peng Kong, Yong‐Gang Yao, Chang Sun, Hans‐Jürgen Bandelt, Chunling Zhu, Ya‐Ping Zhang,
Tópico(s)Genomics and Phylogenetic Studies
ResumoThe now-emerging mitochondrial DNA (mtDNA) population genomics provides information for reconstructing a well-resolved mtDNA phylogeny and for discerning the phylogenetic status of the subcontinentally specific haplogroups. Although several major East Asian mtDNA haplogroups have been identified in studies elsewhere, some of the most basal haplogroups, as well as numerous minor subhaplogroups, were not yet determined or fully characterized. To fill the lacunae, we selected 48 mtDNAs from >2,000 samples across China for complete sequencing that cover virtually all (sub)haplogroups discernible to date in East Asia. This East Asian mtDNA phylogeny can henceforth serve as a solid basis for phylogeographic analyses of mtDNAs, as well as for studies of mitochondrial diseases in East and Southeast Asia. The now-emerging mitochondrial DNA (mtDNA) population genomics provides information for reconstructing a well-resolved mtDNA phylogeny and for discerning the phylogenetic status of the subcontinentally specific haplogroups. Although several major East Asian mtDNA haplogroups have been identified in studies elsewhere, some of the most basal haplogroups, as well as numerous minor subhaplogroups, were not yet determined or fully characterized. To fill the lacunae, we selected 48 mtDNAs from >2,000 samples across China for complete sequencing that cover virtually all (sub)haplogroups discernible to date in East Asia. This East Asian mtDNA phylogeny can henceforth serve as a solid basis for phylogeographic analyses of mtDNAs, as well as for studies of mitochondrial diseases in East and Southeast Asia. Recent progress in the analysis of complete or nearly complete mtDNA sequences has provided new insights into the origin and spread of modern humans and the phylogeny of the major African, European, Asian, and Native American mtDNA lineages (Ingman et al. Ingman et al., 2000Ingman M Kaessmann H Pääbo S Gyllensten U Mitochondrial genome variation and the origin of modern humans.Nature. 2000; 408: 708-713Crossref PubMed Scopus (1023) Google Scholar; Finnilä et al. Finnilä et al., 2001Finnilä S Lehtonen MS Majamaa K Phylogenetic network for European mtDNA.Am J Hum Genet. 2001; 68: 1475-1484Abstract Full Text Full Text PDF PubMed Scopus (289) Google Scholar; Maca-Meyer et al. Maca-Meyer et al., 2001Maca-Meyer N González AM Larruga JM Flores C Cabrera VC Major genomic mitochondrial lineages delineate early human expansions.BMC Genetics. 2001; 2: 13Crossref PubMed Scopus (265) Google Scholar; Torroni et al. Torroni et al., 2001Torroni A Rengo C Guida V Cruciani F Sellitto D Coppa A Luna Calderon F Simionati B Valle G Richards M Macaulay V Scozzari R Do the four clades of the mtDNA haplogroup L2 evolve at different rates?.Am J Hum Genet. 2001; 69: 1348-1356Abstract Full Text Full Text PDF PubMed Scopus (169) Google Scholar; Derbeneva et al. Derbeneva et al., 2002Derbeneva OA Sukernik RI Volodko NV Hosseini SH Lott MT Wallace DC Analysis of mitochondrial DNA diversity in the Aleuts of the Commander Islands and its implications for the genetic history of Beringia.Am J Hum Genet. 2002; 71: 415-421Abstract Full Text Full Text PDF PubMed Scopus (64) Google Scholar; Herrnstadt et al. Herrnstadt et al., 2002Herrnstadt C Elson JL Fahy E Preston G Turnbull DM Anderson C Ghosh SS Olefsky JM Beal MF Davis RE Howell N Reduced-median-network analysis of complete mitochondrial DNA coding-region sequences for the major African, Asian, and European haplogroups.Am J Hum Genet. 2002; 70 (erratum 71:448–449): 1152-1171Abstract Full Text Full Text PDF PubMed Scopus (446) Google Scholar; Mishmar et al. Mishmar et al., 2003Mishmar D Ruiz-Pesini E Golik P Macaulay V Clark AG Hosseini S Brandon M Easley K Chen E Brown MD Sukernik RI Olckers A Wallace DC Natural selection shaped regional mtDNA variation in humans.Proc Natl Acad Sci USA. 2003; 100: 171-176Crossref PubMed Scopus (784) Google Scholar). These studies differ in regard to sequencing technique (viz., Maca-Meyer et al. [Maca-Meyer et al., 2001Maca-Meyer N González AM Larruga JM Flores C Cabrera VC Major genomic mitochondrial lineages delineate early human expansions.BMC Genetics. 2001; 2: 13Crossref PubMed Scopus (265) Google Scholar] employed manual sequencing), inclusion of the control region (which was not disclosed by Herrnstadt et al. [Herrnstadt et al., 2002Herrnstadt C Elson JL Fahy E Preston G Turnbull DM Anderson C Ghosh SS Olefsky JM Beal MF Davis RE Howell N Reduced-median-network analysis of complete mitochondrial DNA coding-region sequences for the major African, Asian, and European haplogroups.Am J Hum Genet. 2002; 70 (erratum 71:448–449): 1152-1171Abstract Full Text Full Text PDF PubMed Scopus (446) Google Scholar]), and sampling scheme: mtDNAs either were chosen according to the language spoken by their bearers (Ingman et al. Ingman et al., 2000Ingman M Kaessmann H Pääbo S Gyllensten U Mitochondrial genome variation and the origin of modern humans.Nature. 2000; 408: 708-713Crossref PubMed Scopus (1023) Google Scholar), were randomly selected from a certain geographic range (Finnilä et al. Finnilä et al., 2001Finnilä S Lehtonen MS Majamaa K Phylogenetic network for European mtDNA.Am J Hum Genet. 2001; 68: 1475-1484Abstract Full Text Full Text PDF PubMed Scopus (289) Google Scholar; Herrnstadt et al. Herrnstadt et al., 2002Herrnstadt C Elson JL Fahy E Preston G Turnbull DM Anderson C Ghosh SS Olefsky JM Beal MF Davis RE Howell N Reduced-median-network analysis of complete mitochondrial DNA coding-region sequences for the major African, Asian, and European haplogroups.Am J Hum Genet. 2002; 70 (erratum 71:448–449): 1152-1171Abstract Full Text Full Text PDF PubMed Scopus (446) Google Scholar), or were preselected according to RFLP haplogroup status (Maca-Meyer et al. Maca-Meyer et al., 2001Maca-Meyer N González AM Larruga JM Flores C Cabrera VC Major genomic mitochondrial lineages delineate early human expansions.BMC Genetics. 2001; 2: 13Crossref PubMed Scopus (265) Google Scholar; Torroni et al. Torroni et al., 2001Torroni A Rengo C Guida V Cruciani F Sellitto D Coppa A Luna Calderon F Simionati B Valle G Richards M Macaulay V Scozzari R Do the four clades of the mtDNA haplogroup L2 evolve at different rates?.Am J Hum Genet. 2001; 69: 1348-1356Abstract Full Text Full Text PDF PubMed Scopus (169) Google Scholar; Derbeneva et al. Derbeneva et al., 2002Derbeneva OA Sukernik RI Volodko NV Hosseini SH Lott MT Wallace DC Analysis of mitochondrial DNA diversity in the Aleuts of the Commander Islands and its implications for the genetic history of Beringia.Am J Hum Genet. 2002; 71: 415-421Abstract Full Text Full Text PDF PubMed Scopus (64) Google Scholar; Mishmar et al. Mishmar et al., 2003Mishmar D Ruiz-Pesini E Golik P Macaulay V Clark AG Hosseini S Brandon M Easley K Chen E Brown MD Sukernik RI Olckers A Wallace DC Natural selection shaped regional mtDNA variation in humans.Proc Natl Acad Sci USA. 2003; 100: 171-176Crossref PubMed Scopus (784) Google Scholar). Prior to these systematic sequencing efforts, a number of single, nearly complete mtDNA sequences, mainly sampled from Europe and Japan, were published in the field of medical genetics. All available mtDNAs of Asian and Native American (as well as of Papuan and Australian) origin that were published before the year 2002 were summarized in a tree by Kivisild et al. (Kivisild et al., 2002Kivisild T Tolk H-V Parik J Wang Y Papiha SS Bandelt H-J Villems R The emerging limbs and twigs of the East Asian mtDNA tree.Mol Biol Evol. 2002; 19: 1737-1751Crossref PubMed Scopus (336) Google Scholar). On the basis of the information provided by additional screening of a particular fragment (10171–10659) and other sites of the coding region, Yao et al. (Yao et al., 2002aYao Y-G Kong Q-P Bandelt H-J Kivisild T Zhang Y-P Phylogeographic differentiation of mitochondrial DNA in Han Chinese.Am J Hum Genet. 2002a; 70: 635-651Abstract Full Text Full Text PDF PubMed Scopus (464) Google Scholar) devised a classification tree of East Asian mtDNA haplogroups that highlights some diagnostic motifs from the control region and particular parts of the coding region. Can this tree stand the test from complete sequence data and fully reflect the relationships among the mtDNA lineages observed in East Asia? To answer these questions, a fully resolved phylogeny of complete mtDNA sequences covering all major East Asian haplogroups is indispensable. In the present study, we selected 48 mtDNAs for complete sequencing from >2,000 samples across China that belong to different subhaplogroups and also include previously unclassified haplotypes (Yao et al. Yao et al., 2002aYao Y-G Kong Q-P Bandelt H-J Kivisild T Zhang Y-P Phylogeographic differentiation of mitochondrial DNA in Han Chinese.Am J Hum Genet. 2002a; 70: 635-651Abstract Full Text Full Text PDF PubMed Scopus (464) Google Scholar, Yao et al., 2002cYao Y-G Nie L Harpending H Fu Y-X Yuan Z-G Zhang Y-P Genetic relationship of Chinese ethnic populations revealed by mtDNA sequence diversity.Am J Phys Anthropol. 2002c; 118: 63-76Crossref PubMed Scopus (150) Google Scholar, Yao et al., 2003aYao Y-G Kong Q-P Man X-Y Bandelt H-J Zhang Y-P Reconstructing the evolutionary history of China: a caveat about inferences drawn from ancient DNA.Mol Biol Evol. 2003a; 20: 214-219Crossref PubMed Scopus (77) Google Scholar; Yao and Zhang Yao and Zhang, 2002Yao Y-G Zhang Y-P Phylogeographic analysis of mtDNA variation in four ethnic populations from Yunnan Province: new data and a reappraisal.J Hum Genet. 2002; 47: 311-318Crossref PubMed Scopus (67) Google Scholar; authors' unpublished data). With the exception of the mtDNA haplogroup M7a (prominent in Japan), which has four representatives in the compilation of Kivisild et al. (Kivisild et al., 2002Kivisild T Tolk H-V Parik J Wang Y Papiha SS Bandelt H-J Villems R The emerging limbs and twigs of the East Asian mtDNA tree.Mol Biol Evol. 2002; 19: 1737-1751Crossref PubMed Scopus (336) Google Scholar), each of the (sub)haplogroups in the classification tree of Yao et al. (Yao et al., 2002aYao Y-G Kong Q-P Bandelt H-J Kivisild T Zhang Y-P Phylogeographic differentiation of mitochondrial DNA in Han Chinese.Am J Hum Genet. 2002a; 70: 635-651Abstract Full Text Full Text PDF PubMed Scopus (464) Google Scholar) is represented by at least 1 of these 48 mtDNAs. Several mtDNAs that were only roughly classified as B4*, F2*, D*, G*, R*, and M* in Yao et al. (Yao et al., 2002aYao Y-G Kong Q-P Bandelt H-J Kivisild T Zhang Y-P Phylogeographic differentiation of mitochondrial DNA in Han Chinese.Am J Hum Genet. 2002a; 70: 635-651Abstract Full Text Full Text PDF PubMed Scopus (464) Google Scholar, Yao et al., 2003aYao Y-G Kong Q-P Man X-Y Bandelt H-J Zhang Y-P Reconstructing the evolutionary history of China: a caveat about inferences drawn from ancient DNA.Mol Biol Evol. 2003a; 20: 214-219Crossref PubMed Scopus (77) Google Scholar) were also selected for sequencing to better understand their phylogenetic status. The complete mtDNA sequences were amplified by use of 15 pairs of primers (available from the authors on request). After being purified on spin columns (Watson BioTechnologies), each of the 15 overlapping fragments was sequenced for both strands by use of the BigDye Terminator Cycle Sequence Kit (ABI Applied Biosystems) and was run on an ABI 377 and an ABI 3700 DNA sequencer (ABI Applied Biosystems). The primers used for sequencing are composed of the PCR primers and a set of 47 internal primers (available from the authors on request). The sequences were edited and aligned by use of the DNASTAR software, and the mutations were scored relative to the revised reference sequence (rCRS) (Andrews et al. Andrews et al., 1999Andrews RM Kubacka I Chinnery PF Lightowlers RN Turnbull DM Howell N Reanalysis and revision of the Cambridge reference sequence for human mitochondrial DNA.Nat Genet. 1999; 23: 147Crossref PubMed Scopus (2551) Google Scholar). The length variation of the A and C stretches in region 16180–16193 was disregarded in the analysis. To avoid the five major types of errors observed in published mtDNA data (viz., base shifts, reference bias, phantom mutations, base misscoring, and artificial recombination), as classified by Bandelt et al. (Bandelt et al., 2001Bandelt H-J Lahermo P Richards M Macaulay V Detecting errors in mtDNA data by phylogenetic analysis.Int J Legal Med. 2001; 115: 64-69Crossref PubMed Scopus (167) Google Scholar), we took the following quality-control measures in the course of data generation and handling. First, every mtDNA was sequenced at least twice. Second, all mutations recorded in the phylogenetic tree (fig. 1) were confirmed by rechecking the sequence electropherograms or the original references. Third, all of the insertions and deletions (indels) in the samples, and some potentially misrecorded or seemingly recurrent polymorphisms compared with the reported data (Derbeneva et al. Derbeneva et al., 2002Derbeneva OA Sukernik RI Volodko NV Hosseini SH Lott MT Wallace DC Analysis of mitochondrial DNA diversity in the Aleuts of the Commander Islands and its implications for the genetic history of Beringia.Am J Hum Genet. 2002; 71: 415-421Abstract Full Text Full Text PDF PubMed Scopus (64) Google Scholar; Herrnstadt et al. Herrnstadt et al., 2002Herrnstadt C Elson JL Fahy E Preston G Turnbull DM Anderson C Ghosh SS Olefsky JM Beal MF Davis RE Howell N Reduced-median-network analysis of complete mitochondrial DNA coding-region sequences for the major African, Asian, and European haplogroups.Am J Hum Genet. 2002; 70 (erratum 71:448–449): 1152-1171Abstract Full Text Full Text PDF PubMed Scopus (446) Google Scholar; Kivisild et al. Kivisild et al., 2002Kivisild T Tolk H-V Parik J Wang Y Papiha SS Bandelt H-J Villems R The emerging limbs and twigs of the East Asian mtDNA tree.Mol Biol Evol. 2002; 19: 1737-1751Crossref PubMed Scopus (336) Google Scholar; Mishmar et al. Mishmar et al., 2003Mishmar D Ruiz-Pesini E Golik P Macaulay V Clark AG Hosseini S Brandon M Easley K Chen E Brown MD Sukernik RI Olckers A Wallace DC Natural selection shaped regional mtDNA variation in humans.Proc Natl Acad Sci USA. 2003; 100: 171-176Crossref PubMed Scopus (784) Google Scholar)—such as site 4086 in samples GD7824 and GD7811, site 5108 in GD7812, sites 12358 and 12372 in GD7825, site 13928C in SD10313, site 8877 in EWK28, site 10658 in Miao271, site 10685 in SD10324; site 10801 in QD8147; and site 10810 in GD7809—were confirmed by independent amplification and sequencing. Moreover, the absence of mutations at sites 8450, 13827, 14180, 15217, and 15805 in GD7830; site 14340 in SD10324; site 4086 in QD8167; sites 5465, 9123, and 10238 in GD7812; and site 7933 in XJ8426 were also confirmed by at least two independent experiments. To test the quality of the complete mtDNA sequences obtained, all of the "private" polymorphisms in 15 of the 48 completely sequenced mtDNAs in the terminal branches of the phylogenetic tree (fig. 1) have been rechecked in another experiment together with some controls (authors' unpublished data). As a result, we detected discrepancies with the partial information published earlier in Yao et al. (Yao et al., 2002aYao Y-G Kong Q-P Bandelt H-J Kivisild T Zhang Y-P Phylogeographic differentiation of mitochondrial DNA in Han Chinese.Am J Hum Genet. 2002a; 70: 635-651Abstract Full Text Full Text PDF PubMed Scopus (464) Google Scholar) in three samples (QD8141 actually has the 16217 mutation; LN7595 has two "C" insertions at site 315 instead of three; and GD7842 carries the mutation at site 10586). The now-available information provided by the complete mtDNA sequences analyzed in this study (GenBank accession numbers AY255133–AY255180) and those sequences reported by other labs (Derbeneva et al. Derbeneva et al., 2002Derbeneva OA Sukernik RI Volodko NV Hosseini SH Lott MT Wallace DC Analysis of mitochondrial DNA diversity in the Aleuts of the Commander Islands and its implications for the genetic history of Beringia.Am J Hum Genet. 2002; 71: 415-421Abstract Full Text Full Text PDF PubMed Scopus (64) Google Scholar; Herrnstadt et al. Herrnstadt et al., 2002Herrnstadt C Elson JL Fahy E Preston G Turnbull DM Anderson C Ghosh SS Olefsky JM Beal MF Davis RE Howell N Reduced-median-network analysis of complete mitochondrial DNA coding-region sequences for the major African, Asian, and European haplogroups.Am J Hum Genet. 2002; 70 (erratum 71:448–449): 1152-1171Abstract Full Text Full Text PDF PubMed Scopus (446) Google Scholar; Kivisild et al. Kivisild et al., 2002Kivisild T Tolk H-V Parik J Wang Y Papiha SS Bandelt H-J Villems R The emerging limbs and twigs of the East Asian mtDNA tree.Mol Biol Evol. 2002; 19: 1737-1751Crossref PubMed Scopus (336) Google Scholar and references therein; Mishmar et al. Mishmar et al., 2003Mishmar D Ruiz-Pesini E Golik P Macaulay V Clark AG Hosseini S Brandon M Easley K Chen E Brown MD Sukernik RI Olckers A Wallace DC Natural selection shaped regional mtDNA variation in humans.Proc Natl Acad Sci USA. 2003; 100: 171-176Crossref PubMed Scopus (784) Google Scholar) offers a solid basis for discerning the phylogenetic relationship of the mtDNA haplogroups (fig. 1). The definition of haplogroups D, M7, C, A, and N9a, as well as macrohaplogroups M, R, and N, is confirmed and remains unchanged (Kivisild et al. Kivisild et al., 2002Kivisild T Tolk H-V Parik J Wang Y Papiha SS Bandelt H-J Villems R The emerging limbs and twigs of the East Asian mtDNA tree.Mol Biol Evol. 2002; 19: 1737-1751Crossref PubMed Scopus (336) Google Scholar). Our results also provide complementary information for the haplogroups that were defined only by control-region and/or partial coding-region information in Yao et al. (Yao et al., 2002aYao Y-G Kong Q-P Bandelt H-J Kivisild T Zhang Y-P Phylogeographic differentiation of mitochondrial DNA in Han Chinese.Am J Hum Genet. 2002a; 70: 635-651Abstract Full Text Full Text PDF PubMed Scopus (464) Google Scholar, Yao et al., 2003aYao Y-G Kong Q-P Man X-Y Bandelt H-J Zhang Y-P Reconstructing the evolutionary history of China: a caveat about inferences drawn from ancient DNA.Mol Biol Evol. 2003a; 20: 214-219Crossref PubMed Scopus (77) Google Scholar) and Kivisild et al. (Kivisild et al., 2002Kivisild T Tolk H-V Parik J Wang Y Papiha SS Bandelt H-J Villems R The emerging limbs and twigs of the East Asian mtDNA tree.Mol Biol Evol. 2002; 19: 1737-1751Crossref PubMed Scopus (336) Google Scholar): D5, G1, G2, M7b, M7c, M8, M8a, Z, M9a (originally named "M9" in Yao et al. [Yao et al., 2002aYao Y-G Kong Q-P Bandelt H-J Kivisild T Zhang Y-P Phylogeographic differentiation of mitochondrial DNA in Han Chinese.Am J Hum Genet. 2002a; 70: 635-651Abstract Full Text Full Text PDF PubMed Scopus (464) Google Scholar]), M10, F1, F1a, F1c, F2, B4c, B5, B5a, and B5b are all further characterized by additional mutations (fig. 1). Some haplogroups are redefined here. We follow Bandelt et al. (Bandelt et al., in pressBandelt H-J, Herrnstadt C, Yao Y-G, Kong Q-P, Kivisild T, Rengo C, Scozzari R, Richards M, Villems R, Macaulay V, Howell N, Torroni A, Zhang Y-P. Identification of Native American founder mtDNAs through the analysis of complete mtDNA sequences: some caveats. Ann Hum Genet (in press)Google Scholar) in broadening the definition of "G1" (Kivisild et al. Kivisild et al., 2002Kivisild T Tolk H-V Parik J Wang Y Papiha SS Bandelt H-J Villems R The emerging limbs and twigs of the East Asian mtDNA tree.Mol Biol Evol. 2002; 19: 1737-1751Crossref PubMed Scopus (336) Google Scholar) by requiring only three coding-region mutations (8200, 15323, and 15497) for G1 status. Haplogroup R9 is now broadened by requiring only two characteristic coding-region mutations (3970 and 13928C). It embraces two haplogroups, F (first introduced as "R9" in Yao et al. [Yao et al., 2002aYao Y-G Kong Q-P Bandelt H-J Kivisild T Zhang Y-P Phylogeographic differentiation of mitochondrial DNA in Han Chinese.Am J Hum Genet. 2002a; 70: 635-651Abstract Full Text Full Text PDF PubMed Scopus (464) Google Scholar]) and R9b (initially named "R10" in Yao and Zhang [Yao and Zhang, 2002Yao Y-G Zhang Y-P Phylogeographic analysis of mtDNA variation in four ethnic populations from Yunnan Province: new data and a reappraisal.J Hum Genet. 2002; 47: 311-318Crossref PubMed Scopus (67) Google Scholar]). Note that there are two equally parsimonious reconstructions for the evolution of 16304; here, we opt for the one that places a forward mutation at site 16304 on the way to haplogroup R9. This prompts yet another broadening of haplogroup F, which is now recognizable by an "A" deletion scored at site 249 and transitions at sites 6392 and 10310. Haplogroup F thus encompasses haplogroups F1, F2, and F3 (originally called "R9a" in Yao et al. [Yao et al., 2002aYao Y-G Kong Q-P Bandelt H-J Kivisild T Zhang Y-P Phylogeographic differentiation of mitochondrial DNA in Han Chinese.Am J Hum Genet. 2002a; 70: 635-651Abstract Full Text Full Text PDF PubMed Scopus (464) Google Scholar]). The definition of "haplogroup Y" is also broadened, since the mtDNAs with motif 16126-16261-16311 lack the 3834 mutation (authors' unpublished data). Furthermore, some mtDNA haplotypes that were previously not well classifiable relative to the employed classification tree (marked by a star after the corresponding haplogroup acronym in Yao et al. [Yao et al., 2002aYao Y-G Kong Q-P Bandelt H-J Kivisild T Zhang Y-P Phylogeographic differentiation of mitochondrial DNA in Han Chinese.Am J Hum Genet. 2002a; 70: 635-651Abstract Full Text Full Text PDF PubMed Scopus (464) Google Scholar, Yao et al., 2003aYao Y-G Kong Q-P Man X-Y Bandelt H-J Zhang Y-P Reconstructing the evolutionary history of China: a caveat about inferences drawn from ancient DNA.Mol Biol Evol. 2003a; 20: 214-219Crossref PubMed Scopus (77) Google Scholar]) can now be assigned to new (sub)haplogroups defined as follows. The three M* haplotypes, GD7817 (Yao et al. Yao et al., 2002aYao Y-G Kong Q-P Bandelt H-J Kivisild T Zhang Y-P Phylogeographic differentiation of mitochondrial DNA in Han Chinese.Am J Hum Genet. 2002a; 70: 635-651Abstract Full Text Full Text PDF PubMed Scopus (464) Google Scholar), SD10324 (Yao et al. Yao et al., 2003aYao Y-G Kong Q-P Man X-Y Bandelt H-J Zhang Y-P Reconstructing the evolutionary history of China: a caveat about inferences drawn from ancient DNA.Mol Biol Evol. 2003a; 20: 214-219Crossref PubMed Scopus (77) Google Scholar), and Miao271 (authors' unpublished data)—which share seven coding-region mutations (1095, 6531, 7642, 8108, 9950, 11969, and 13074) and four mutations in HVS-II (146, 215, 318, and 326)—form a new M branch, named "haplogroup M11." It is then evident that the mtDNAs QD8130 and XJ8436 (Yao et al. Yao et al., 2002aYao Y-G Kong Q-P Bandelt H-J Kivisild T Zhang Y-P Phylogeographic differentiation of mitochondrial DNA in Han Chinese.Am J Hum Genet. 2002a; 70: 635-651Abstract Full Text Full Text PDF PubMed Scopus (464) Google Scholar) also belong to this haplogroup. The two R* haplotypes, LN7595 and QD8168, which bear a motif similar to that of B5 but do not show the 9-bp deletion in the COII/tRNALys intergenic region (Yao et al. Yao et al., 2002aYao Y-G Kong Q-P Bandelt H-J Kivisild T Zhang Y-P Phylogeographic differentiation of mitochondrial DNA in Han Chinese.Am J Hum Genet. 2002a; 70: 635-651Abstract Full Text Full Text PDF PubMed Scopus (464) Google Scholar), form a new branch that has eight characteristic mutations (709, 8277, 8278+3C, 10031, 10398, 11061, 12950, and 13681) in the coding region and four mutations (185, 189, 16189, and 16311) in the control region. This new haplogroup is designated as "R11." The two B4* haplotypes, LN7589 and QD8141 (Yao et al. Yao et al., 2002aYao Y-G Kong Q-P Bandelt H-J Kivisild T Zhang Y-P Phylogeographic differentiation of mitochondrial DNA in Han Chinese.Am J Hum Genet. 2002a; 70: 635-651Abstract Full Text Full Text PDF PubMed Scopus (464) Google Scholar)—determined by three coding-region transitions at sites 11914, 13942, and 15930—form a new branch of B4 named "B4d," which is a sister group to B4b. Then, the smallest haplogroup (designated as "B4bd") that comprises both B4b and B4d is recognizable by two transitions (at sites 827 and 15535). In total, >70 named nested haplogroups are discerned that can be regarded as sufficiently supported by the complete sequence data (fig. 1), and most of them can be recognized by specific mutations in both the coding and the control regions. As a result, the identification of haplogroup status in future East Asian mtDNA studies could be simplified, since it requires only a few coding-region mutations to be typed according to a preliminary prediction of haplogroup status based on control-region motifs. Furthermore, our phylogenetic strategy for sample selection employed here is much more efficient and effective than random sampling or use of nonphylogenetic criteria and thus can be widely used in other mtDNA phylogeographic studies. For comparison with previous approaches and published data, we also estimate the ages of the major haplogroups, on the basis of our collection of 48 lineages. We adopt the mutation rate of one base substitution (i.e., one mutation other than indel) in the coding region per 5,140 years (Mishmar et al. Mishmar et al., 2003Mishmar D Ruiz-Pesini E Golik P Macaulay V Clark AG Hosseini S Brandon M Easley K Chen E Brown MD Sukernik RI Olckers A Wallace DC Natural selection shaped regional mtDNA variation in humans.Proc Natl Acad Sci USA. 2003; 100: 171-176Crossref PubMed Scopus (784) Google Scholar), which was calibrated on the basis of an assumed human-chimp split of 6.5 million years ago. This yields the following ages of those branches of the tree of figure 1 that carry more than five sampled lineages: 50.8±6.6 thousand years (ky) for B, 57.4±8.2 ky for D, 60.0±9.2 ky for F, 65.4±10.3 ky for R9, 62.3±6.3 ky for R, 64.6±6.8 ky for N, and 69.3±5.4 ky for M, where age ±SD is calculated as in Saillard et al. (Saillard et al., 2000Saillard J Forster P Lynnerup N Bandelt H-J Nørby S mtDNA variation among Greenland Eskimos: the edge of the Beringian expansion.Am J Hum Genet. 2000; 67: 718-726Abstract Full Text Full Text PDF PubMed Scopus (414) Google Scholar). The ages of the three macrohaplogroups M, N, and R are thus only slightly larger than those calculated by Mishmar et al. (Mishmar et al., 2003Mishmar D Ruiz-Pesini E Golik P Macaulay V Clark AG Hosseini S Brandon M Easley K Chen E Brown MD Sukernik RI Olckers A Wallace DC Natural selection shaped regional mtDNA variation in humans.Proc Natl Acad Sci USA. 2003; 100: 171-176Crossref PubMed Scopus (784) Google Scholar). Since there is no positive evidence yet that the East Asian haplogroups would share any mutations with the West Eurasian or South Asian haplogroups other than those defining M, N, and R (Kivisild et al. Kivisild et al., 2002Kivisild T Tolk H-V Parik J Wang Y Papiha SS Bandelt H-J Villems R The emerging limbs and twigs of the East Asian mtDNA tree.Mol Biol Evol. 2002; 19: 1737-1751Crossref PubMed Scopus (336) Google Scholar), it seems that the first modern humans carried exactly the root haplotypes of the three Eurasian macrohaplogroups into Southeast Asia. The founder age for the three root haplotypes that are based on the set of 48 coding-region sequences is then estimated as 65.4±3.8 ky. Incidentally, this value nearly equals the corresponding age of 66.0 ky, on the basis of the heuristic rate of one transition within 16090–16365 per 20,180 years (but see Saillard et al. [Saillard et al., 2000Saillard J Forster P Lynnerup N Bandelt H-J Nørby S mtDNA variation among Greenland Eskimos: the edge of the Beringian expansion.Am J Hum Genet. 2000; 67: 718-726Abstract Full Text Full Text PDF PubMed Scopus (414) Google Scholar] for a critical view on the calibration of this rate). Thus, we do not see the necessity yet that "conjectures about the timing of human migrations may need to be reassessed" (Mishmar et al. Mishmar et al., 2003Mishmar D Ruiz-Pesini E Golik P Macaulay V Clark AG Hosseini S Brandon M Easley K Chen E Brown MD Sukernik RI Olckers A Wallace DC Natural selection shaped regional mtDNA variation in humans.Proc Natl Acad Sci USA. 2003; 100: 171-176Crossref PubMed Scopus (784) Google Scholar). The detailed phylogeny of complete mtDNA sequences is particularly important for the study of mtDNA-related diseases. It allows us to allocate each identified mutation to a certain branch of the mtDNA phylogeny, so that pathogenic and/or disease-associated mutations can be clearly distinguished from haplogroup-specific mutations. For example, our previous analysis of the 5178A polymorphism, which is a basal mutation specific to haplogroup D, in different age samples, showed no evidence for association between this mutation and longevity (contra Tanaka et al. [Tanaka et al., 1998Tanaka M Gong JS Zhang J Yoneda M Yagi K Mitochondrial genotype associated with longevity.Lancet. 1998; 351: 185-186Abstract Full Text Full Text PDF PubMed Scopus (281) Google Scholar]). This highlights the importance of examining pathogenic mtDNA mutations from a phylogenetic point of view (Rocha et al. Rocha et al., 1999Rocha H Flores C Campos Y Arenas J Vilarinho L Santorelli FM Torroni A About the "pathological" role of the mtDNA T3308C mutation….Am J Hum Genet. 1999; 65: 1457-1459Abstract Full Text Full Text PDF PubMed Scopus (28) Google Scholar; Yao et al. Yao et al., 2002bYao Y-G Kong Q-P Zhang Y-P Mitochondrial DNA 5178A polymorphism and longevity.Hum Genet. 2002b; 111: 462-463Crossref PubMed Scopus (41) Google Scholar). Although the mtDNA phylogenetic background does not seem to make any contribution to the phenotypic presentation of the pathogenic mutation 3243 in patients with either MELAS syndrome or a wide array of disease phenotypes (Torroni et al. Torroni et al., 2003Torroni A Campos Y Rengo C Sellitto D Achilli A Magri C Semino O Garcia A Jara P Arenas J Scozzari R Mitochondrial DNA haplogroups do not play a role in the variable phenotypic presentation of the A3243G mutation.Am J Hum Genet. 2003; 72: 1005-1012Abstract Full Text Full Text PDF PubMed Scopus (52) Google Scholar), the observed association between certain mtDNA haplogroup(s) and either longevity (De Benedictis et al. De Benedictis et al., 1999De Benedictis G Rose G Carrieri G De Luca M Falcone E Passarino G Bonafé M Monti D Baggio G Bertolini S Mari D Mattace R Franceschi C Mitochondrial DNA inherited variants are associated with successful aging and longevity in humans.FASEB J. 1999; 13: 1532-1536PubMed Google Scholar; Niemi et al. Niemi et al., 2003Niemi AK Hervonen A Hurme M Karhunen PJ Jylha M Majamaa K Mitochondrial DNA polymorphisms associated with longevity in a Finnish population.Hum Genet. 2003; 112: 29-33Crossref PubMed Scopus (219) Google Scholar), Leber hereditary optic neuropathy (LHON [Brown et al. Brown et al., 1997Brown MD Sun F Wallace DC Clustering of Caucasian Leber hereditary optic neuropathy patients containing the 11778 or 14484 mutations on an mtDNA lineage.Am J Hum Genet. 1997; 60: 381-387Abstract Full Text PDF PubMed Scopus (93) Google Scholar; Torroni et al. Torroni et al., 1997Torroni A Petrozzi M D'Urbano L Sellitto D Zeviani M Carrara F Carducci C Leuzzi V Carelli V Barboni P De Negri A Scozzari R Haplotype and phylogenetic analyses suggest that one European-specific mtDNA background plays a role in the expression of Leber hereditary optic neuropathy by increasing the penetrance of the primary mutations 11778 and 14484.Am J Hum Genet. 1997; 60: 1107-1121PubMed Google Scholar]), or Parkinson disease (van der Walt et al. van der Walt et al., 2003van der Walt JM Nicodemus KK Martin ER Scott WK Nance MA Watts RL Hubble JP et al.Mitochondrial polymorphisms significantly reduce the risk of Parkinson disease.Am J Hum Genet. 2003; 72: 804-811Abstract Full Text Full Text PDF PubMed Scopus (479) Google Scholar) strongly suggests that this phylogenetic approach should be more widely used in mtDNA-related medical genetics. Moreover, tracing mtDNA mutations along phylogenetic pathways is helpful in pinpointing potential oversights and artificial recombination (e.g., as shown in Yao et al. [Yao et al., 2003bYao Y-G Macaulay V Kivisild T Zhang Y-P Bandelt H-J To trust or not to trust an idiosyncratic mitochondrial data set.Am J Hum Genet. 2003b; 72: 1341-1346Abstract Full Text Full Text PDF PubMed Scopus (34) Google Scholar] and Yao and Zhang [Yao and Zhang, 2003Yao Y-G Zhang Y-P Pitfalls in the analysis of ancient human mtDNA.Chinese Sci Bull. 2003; 48: 826-830Google Scholar]). The B5b sequence of a patient suffering from LHON and cardiomyopathy recently reported by Mimaki et al. (Mimaki et al., 2003Mimaki M Ikota A Sato A Komaki H Akanuma J Nonaka I Goto Y A double mutation (G11778A and G12192A) in mitochondrial DNA associated with Leber's hereditary optic neuropathy and cardiomyopathy.J Hum Genet. 2003; 48: 47-50Crossref PubMed Scopus (28) Google Scholar) evidently missed a batch of mutations relative to the rCRS (73, 204, 263, 1438, 8281–8289del, 8584, 10398, 15223, 16140, and 16189). The mutational pattern can also be studied in detail with a large complete mtDNA phylogeny at hand. For instance, transversions A→G or T→G are apparently rather rare in the coding region (cf. Herrnstadt et al. [Herrnstadt et al., 2003Herrnstadt C Preston G Howell N Errors, phantom and otherwise, in human mtDNA sequences.Am J Hum Genet. 2003; 72: 1585-1586Abstract Full Text Full Text PDF PubMed Scopus (42) Google Scholar]). The only shared transversions to G in the Eurasian mtDNA tree reported to date by more than one lab seem to be 961G in haplogroup H, 12083G in haplogroup I, and 12738G in haplogroup K1 (Ingman et al. Ingman et al., 2000Ingman M Kaessmann H Pääbo S Gyllensten U Mitochondrial genome variation and the origin of modern humans.Nature. 2000; 408: 708-713Crossref PubMed Scopus (1023) Google Scholar; Finnilä et al. Finnilä et al., 2001Finnilä S Lehtonen MS Majamaa K Phylogenetic network for European mtDNA.Am J Hum Genet. 2001; 68: 1475-1484Abstract Full Text Full Text PDF PubMed Scopus (289) Google Scholar; Maca-Meyer et al. Maca-Meyer et al., 2001Maca-Meyer N González AM Larruga JM Flores C Cabrera VC Major genomic mitochondrial lineages delineate early human expansions.BMC Genetics. 2001; 2: 13Crossref PubMed Scopus (265) Google Scholar; Herrnstadt et al. Herrnstadt et al., 2002Herrnstadt C Elson JL Fahy E Preston G Turnbull DM Anderson C Ghosh SS Olefsky JM Beal MF Davis RE Howell N Reduced-median-network analysis of complete mitochondrial DNA coding-region sequences for the major African, Asian, and European haplogroups.Am J Hum Genet. 2002; 70 (erratum 71:448–449): 1152-1171Abstract Full Text Full Text PDF PubMed Scopus (446) Google Scholar). Further transversions to G found in lineages from haplogroups J, T, W, and X by Mishmar et al. (Mishmar et al., 2003Mishmar D Ruiz-Pesini E Golik P Macaulay V Clark AG Hosseini S Brandon M Easley K Chen E Brown MD Sukernik RI Olckers A Wallace DC Natural selection shaped regional mtDNA variation in humans.Proc Natl Acad Sci USA. 2003; 100: 171-176Crossref PubMed Scopus (784) Google Scholar) may thus be problematic, at least 14974G (Herrnstadt et al. Herrnstadt et al., 2003Herrnstadt C Preston G Howell N Errors, phantom and otherwise, in human mtDNA sequences.Am J Hum Genet. 2003; 72: 1585-1586Abstract Full Text Full Text PDF PubMed Scopus (42) Google Scholar). On the other hand, indels in the coding region seem to occur at an absolute frequency comparable with that of transversions but might be missed occasionally, owing to conservative reading of ambiguous sequencer outputs. For example, only a single private coding-region indel (15944d in an African haplogroup L1c lineage) can be scored in the 53 complete mtDNA sequences of Ingman et al. (Ingman et al., 2000Ingman M Kaessmann H Pääbo S Gyllensten U Mitochondrial genome variation and the origin of modern humans.Nature. 2000; 408: 708-713Crossref PubMed Scopus (1023) Google Scholar) (contrast this to nine private indels and five shared ones detected in our 48 complete mtDNA sequences); moreover, their single haplogroup F sequence (closely related to the lineage XJ8440 of Yao et al. [Yao et al., 2002aYao Y-G Kong Q-P Bandelt H-J Kivisild T Zhang Y-P Phylogeographic differentiation of mitochondrial DNA in Han Chinese.Am J Hum Genet. 2002a; 70: 635-651Abstract Full Text Full Text PDF PubMed Scopus (464) Google Scholar]) misses the 249 deletion. We agree with Herrnstadt et al. (Herrnstadt et al., 2003Herrnstadt C Preston G Howell N Errors, phantom and otherwise, in human mtDNA sequences.Am J Hum Genet. 2003; 72: 1585-1586Abstract Full Text Full Text PDF PubMed Scopus (42) Google Scholar) that the solution to the problem of mtDNA databases containing errors "is further effort, both at the front end (the sequencing process itself) and at the back end (increased quality control), of mtDNA database construction." In short, the phylogenetic tree of East Asian mtDNAs obtained in the present study covers all of the major haplogroups in the region and testifies to the phylogenetic status of the newly identified haplogroups (Kivisild et al. Kivisild et al., 2002Kivisild T Tolk H-V Parik J Wang Y Papiha SS Bandelt H-J Villems R The emerging limbs and twigs of the East Asian mtDNA tree.Mol Biol Evol. 2002; 19: 1737-1751Crossref PubMed Scopus (336) Google Scholar; Yao et al. Yao et al., 2002aYao Y-G Kong Q-P Bandelt H-J Kivisild T Zhang Y-P Phylogeographic differentiation of mitochondrial DNA in Han Chinese.Am J Hum Genet. 2002a; 70: 635-651Abstract Full Text Full Text PDF PubMed Scopus (464) Google Scholar, Yao et al., 2003aYao Y-G Kong Q-P Man X-Y Bandelt H-J Zhang Y-P Reconstructing the evolutionary history of China: a caveat about inferences drawn from ancient DNA.Mol Biol Evol. 2003a; 20: 214-219Crossref PubMed Scopus (77) Google Scholar; authors' unpublished data) that were formerly defined on the basis of control-region and/or only partial coding-region information. This tree, then, can serve as a basis for haplogroup inferences in future studies of East Asian populations and for distinguishing pathogenic mutations from rare polymorphisms in mtDNA medical genetics. We thank Shi-Fang Wu for technical assistance. We also thank Dr. Vincent Macaulay for helpful comments on the manuscript. This study was supported by grants from Chinese Academy of Sciences (KSCX2-SW-2010), Natural Sciences Foundation of China, and Natural Sciences Foundation of Yunnan Province. Erratum et al.The American Journal of Human GeneticsJuly, 2004In BriefIn the September 2003 issue of the Journal, in the article entitled "Phylogeny of East Asian Mitochondrial DNA Lineages Inferred from Complete Sequences," by Kong et al., the variation in sample YN163 (displayed in fig. 1) was incorrectly reported. This prompted us to reanalyze all recurrent private mutations in the 48 mtDNAs by a second independent amplification and sequencing round, to reaffirm our earlier findings and to meet high-quality standards for ongoing complete sequencing of mtDNA. It turned out that sample YN163 does not bear the mutation at position 13269, as was claimed in our article; instead, this sample carries two other transitions—that is, at 13135 and 13152. Full-Text PDF Open Archive
Referência(s)