The Neandertal genome and ancient DNA authenticity
2009; Springer Nature; Volume: 28; Issue: 17 Linguagem: Inglês
10.1038/emboj.2009.222
ISSN1460-2075
AutoresRichard E. Green, Adrian W. Briggs, Johannes Krause, Kay Prüfer, Hernán A. Burbano, Michael Siebauer, Michael Lachmann, Svante Pääbo,
Tópico(s)Archaeology and ancient environmental studies
ResumoEMBO Member's Review6 August 2009free access The Neandertal genome and ancient DNA authenticity Richard E Green Richard E Green Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany Search for more papers by this author Adrian W Briggs Adrian W Briggs Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany Search for more papers by this author Johannes Krause Johannes Krause Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany Search for more papers by this author Kay Prüfer Kay Prüfer Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany Search for more papers by this author Hernán A Burbano Hernán A Burbano Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany Search for more papers by this author Michael Siebauer Michael Siebauer Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany Search for more papers by this author Michael Lachmann Michael Lachmann Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany Search for more papers by this author Svante Pääbo Corresponding Author Svante Pääbo Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany Search for more papers by this author Richard E Green Richard E Green Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany Search for more papers by this author Adrian W Briggs Adrian W Briggs Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany Search for more papers by this author Johannes Krause Johannes Krause Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany Search for more papers by this author Kay Prüfer Kay Prüfer Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany Search for more papers by this author Hernán A Burbano Hernán A Burbano Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany Search for more papers by this author Michael Siebauer Michael Siebauer Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany Search for more papers by this author Michael Lachmann Michael Lachmann Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany Search for more papers by this author Svante Pääbo Corresponding Author Svante Pääbo Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany Search for more papers by this author Author Information Richard E Green1, Adrian W Briggs1, Johannes Krause1, Kay Prüfer1, Hernán A Burbano1, Michael Siebauer1, Michael Lachmann1 and Svante Pääbo 1 1Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany *Corresponding author. MPI for Evolutionary Anthropology, MPI, Deutscher Platz 6, D-04103 Leipzig, Germany. Tel.: +49 341 3550 501; Fax: +49 341 3550 550; E-mail: [email protected] The EMBO Journal (2009)28:2494-2502https://doi.org/10.1038/emboj.2009.222 PDFDownload PDF of article text and main figures. ToolsAdd to favoritesDownload CitationsTrack CitationsPermissions ShareFacebookTwitterLinked InMendeleyWechatReddit Figures & Info Recent advances in high-thoughput DNA sequencing have made genome-scale analyses of genomes of extinct organisms possible. With these new opportunities come new difficulties in assessing the authenticity of the DNA sequences retrieved. We discuss how these difficulties can be addressed, particularly with regard to analyses of the Neandertal genome. We argue that only direct assays of DNA sequence positions in which Neandertals differ from all contemporary humans can serve as a reliable means to estimate human contamination. Indirect measures, such as the extent of DNA fragmentation, nucleotide misincorporations, or comparison of derived allele frequencies in different fragment size classes, are unreliable. Fortunately, interim approaches based on mtDNA differences between Neandertals and current humans, detection of male contamination through Y chromosomal sequences, and repeated sequencing from the same fossil to detect autosomal contamination allow initial large-scale sequencing of Neandertal genomes. This will result in the discovery of fixed differences in the nuclear genome between Neandertals and current humans that can serve as future direct assays for contamination. For analyses of other fossil hominins, which may become possible in the future, we suggest a similar ‘boot-strap’ approach in which interim approaches are applied until sufficient data for more definitive direct assays are acquired. Ancient DNA and authenticity The presence of DNA in ancient remains was initially shown by staining of DNA in histological samples of Egyptian mummies (Pääbo, 1984) and by extraction and cloning in bacterial plasmids of DNA from the extinct quagga and mummies (Higuchi et al, 1984; Pääbo, 1985). However, it was only the advent of the polymerase chain reaction (PCR) (Mullis and Faloona, 1987) that made it possible to reproduce results. Through PCR it became possible to ensure that the sequence determined did not contain errors (Pääbo and Wilson, 1988) and that the DNA sequences determined were indeed derived from the organism under study. Early results included the determination of DNA sequences from extinct mammals such as marsupial wolves (Thomas et al, 1989), moas (Cooper et al, 1992), and mammoths (Hagelberg et al, 1994; Höss et al, 1994; Krause et al, 2006) allowing resolution of the phylogenetic relationships between these extinct organisms and extant species. Along with these successes came the realization that the ability of the PCR to amplify few or even single template molecules meant that when ancient specimens contain little or no endogenous DNA, the DNA amplified could be partially or wholly derived from exogenous DNA contaminating a specimen and be mistaken for endogenous DNA (Pääbo et al, 1989). For example, reports of dinosaur DNA sequences (Woodward et al, 1994) proved to be derived from human DNA contaminating the fossil or performed experiments (Zischler et al, 1995). These and other similar experiences served as cautionary tales for the growing field. As a remedy, ‘criteria of authenticity’ for the study of ancient DNA were suggested (Pääbo et al, 1989) and the community converged on a set of laboratory practices to prevent contamination that have developed over time (Cooper and Poinar, 2000; Hofreiter et al, 2001b; Pääbo et al, 2004). They include, for example, strict spatial separation of ancient DNA extraction from other experiments and UV irradiation and bleach treatment of extraction areas to minimize the extent of contamination. Often, the ancient DNA extraction facilities fulfill clean room requirements in that they operate under positive pressure using filtered air, require personnel to wear sterile clothing and face shields, and to work in laminar flow hoods. The criteria for authenticity include independent replication of results within a laboratory and in many cases, such as when particularly noteworthy or surprising results are obtained, in an independent second laboratory. Such practices have generally served the field well. In the case of hominins, that is modern humans and their close relatives such as Cro-Magnons, Neandertals, and Homo floresiensis (Brown et al, 2004), the issue of authenticity is particularly acute because they can be expected to be identical to current humans for much or almost all of their genome. For example, although morphologically distinct, Neandertals were so closely related to people living today that most Neandertal DNA fragments retrieved from a fossil are expected to carry no sequence differences to the corresponding human sequences (Pääbo, 1999). As human DNA is a very common contaminant in fossils and laboratory experiments, this makes it particularly challenging to ascertain the authenticity of Neandertal DNA sequences. Indeed, for Cro-Magnons and other modern humans, the problems are so severe that over the past 15 years we have been pessimistic over the prospects of ever reliably determining such DNA sequences (Pääbo et al, 2004). For Neandertals, the situation is more tractable because their mitochondrial (mt) genome proved to be different from that of any modern human studied to date (Krings et al, 1997), making it possible to determine Neandertal mtDNA sequences confidently. Recently, high-throughput sequencing techniques have become available that allow large numbers of DNA sequences to be determined (Margulies et al, 2005; Bentley et al, 2008). They rely on the construction of sequencing libraries by the ligation of DNA adapters to the ends of DNA molecules in a sample. These adapters then serve as priming sites both for amplification and for sequencing, which occur either on beads or on a solid surface in which each bead or cluster on a surface represents an amplified copy of a single original template molecule. Currently, the most common application of these techniques in ancient DNA research is direct shot-gun sequencing of random DNA fragments extracted from a fossil (Green et al, 2006; Poinar et al, 2006; Stiller et al, 2006; Miller et al, 2008). The advent of the high-throughput approach to ancient DNA analyses makes it important to revisit the criteria of authenticity. For example, reproduction of results in the same or different laboratories as a prerequisite for publication is not practicable for large-scale DNA sequencing of random molecules because of constraints on time, costs, and sample materials. Nonetheless, there are means to control and assess the extent of contamination. Here, we discuss the measures taken to control contamination in the initial sequencing efforts aimed at showing the feasibility of sequencing the Neandertal genome. We describe technical improvements incorporated into the ongoing genome sequencing effort, and use published and unpublished Neandertal DNA sequence data to describe means of assessing contamination that we are convinced will allow the reliable determination of a useful Neandertal genome sequence. MtDNA as an inroad to the Neandertal genome The mitochondrial genome is only 1/200 000 the size of the nuclear genome, occurs in many copies per cell, is maternally inherited without recombination, and has been extensively used in PCR-based studies of ancient DNA, including Neandertals. These studies have shown that Neandertal mtDNAs fall outside the variation found among extant humans (Krings et al, 1997, 2000; Ovchinnikov et al, 2000; Serre et al, 2004; Orlando et al, 2006). Thus, there are substitutions seen in all or most Neandertal mtDNAs but not in current humans, and others seen in all or most current human mtDNAs but not in Neandertals. By PCR amplification of mtDNA regions containing such diagnostic positions and subsequent cloning and sequencing of multiple independent clones from the PCR products, such substitutions can be used to estimate the relative amounts of Neandertal versus human mtDNA in a Neandertal fossil extract (Figure 1). The level of contamination observed at diagnostic positions is thus assumed to represent the contamination level across the mitochondrial genome, allowing authenticity to be determined even for DNA fragments from regions in which no fixed differences between Neandertals and current humans occur. Thus, for the initial work addressing the feasibility of large-scale Neandertal sequencing, extracts of Neandertal bones were prepared under clean room conditions and analysed for contamination by amplification of mtDNA regions containing diagnostic positions. The extent of mtDNA contamination was estimated to be below 1% for two extracts from a ∼38 000-year-old Neandertal bone from Vindija Cave, Croatia (Serre et al, 2004). These were then sent to other laboratories to be directly sequenced by high-throughput technologies (Green et al, 2006) or cloned in plasmid vectors and subjected to sequencing (Noonan et al, 2006). Figure 1.Estimates of human mtDNA contamination in Neandertal extracts. DNA extracts of Neandertal bones contain a large excess of microbial DNA (brown), at most a few percent of Neandertal DNA (blue) and generally variable amounts of contaminating DNA from current humans (red). Traditionally, contamination has been assayed through PCR directly from DNA extract from fossil bone (left lower panel). Accumulation of large numbers of reads from high-throughput sequencing allows a direct estimate of mtDNA contamination in the sequencing library (right lower panel). Once human/Neandertal diagnostic nuclear genome positions are learned, this strategy can be extended to nuclear DNA sequences. Download figure Download PowerPoint However, analysis of contamination in DNA extracts cannot show contamination in subsequent laboratory steps, for example, in library construction and sequencing. A particular concern is that libraries from Neandertal DNA extracts contain at most a few percent of Neandertal DNA whereas the rest stems from microbes that have colonized bones after the death of the Neandertal. Therefore, contamination of a Neandertal library with even tiny quantities of a library that contains 100% human DNA will greatly affect the results. For the ongoing genome sequencing effort, novel sequencing adapters were therefore designed and used for library construction (Briggs et al, 2007) that contain a unique four-nucleotide tag (TGAC) at their 3′-ends. Thus, each sequence determined from a Neandertal library should start with these four bases. Indeed, subsequent work in our laboratory has shown that carry-over of low amounts of library molecules from one sequencing run to the next can occur, making the use of such tags particularly important. A further advantage of constructing sequencing libraries using project-specific tags is that this provides a ‘snapshot’ of the molecules that are present in the extract at the time of library construction. Thus, estimates of contamination and other parameters can be determined once and applied to the library as it is being used for further sequencing or other experiments. Unfortunately, because of logistical constraints associated with the locations of the emerging technologies, it was necessary during the initial exploratory work to take scrupulously prepared DNA extracts from our clean room and use them to construct libraries in laboratories elsewhere. Subsequently, once a Neandertal genome sequencing effort based on direct sequencing of DNA extracts was initiated, the construction of tagged libraries for high-throughput sequencing was established in the clean room environment in our laboratory (Briggs et al, 2007). Indeed, work by Wall and Kim (2007) confirmed the need to perform library construction under clean room conditions using tagged library adapters. They showed that the DNA sequences from the two extracts that were prepared in our clean room and shown to carry similar and low mtDNA levels of contamination before being sent to other laboratories, showed evidence of contamination by current human DNA in at least one (Green et al, 2006) of the two data sets. Once a tagged sequencing library is constructed, the next step is to estimate the level of contamination in the library. One way is to perform sequencing and identify DNA fragments indicative of contamination. The first, most obvious means of directly assessing contamination is by examining mtDNA sequences. However, until recently only about 600 bp of the Neandertal mtDNA were known, yielding only a small number of sites that could distinguish Neandertal mtDNA from current human mtDNA. Consequently, few DNA fragments among random sequence reads were informative with respect to contamination estimates. The later determination of the 16 565 nt that make up the complete mtDNA from this Neandertal individual (Green et al, 2008) dramatically increased the number of informative positions to 133. Using these positions, it is possible to identify enough informative fragments in data sets of reasonable size to estimate levels of mtDNA contamination. Table I shows estimates of contamination using these 133 sites from several Neandertal libraries as well as estimates directly from the extracts from which they were prepared. The corresponding extract and library estimates are generally in agreement. However, there is one notable exception: in the library constructed without the tagged adapters outside our clean room facility in 2006 (Green et al, 2006), 8 of 75 fragments carrying one or more of the 133 informative positions indicate current human contamination, giving an estimate of 11% contamination (CI 4.7–20%). This confirms that contamination was introduced into this dataset as suggested (Wall and Kim, 2007), presumably during library construction outside the clean room. Table 1. MtDNA contamination and mtDNA to nuclear DNA ratios in some DNA extracts and sequencing libraries used to study the Neandertal genome Extract N H Extract cont. Library N H Library cont. Nuclear-mtDNA ratio A 111 1 0.8% (0.0–4.9%) A.1 67 8 10.7% (4.7–19.9%) 375 A.2 4 0 0% (0.0–60.2%) 222 B 103 0 0.0% (0.0–3.5%) BC.1 22 0 0.0% (0.0–15.4%) 186 C 112 0 0.0% (0.0–3.2%) BC.2 1822 7 0.4% (0.2–0.8%) 157 D 152 8 5.0% (2.2–9.6%) DEF.1 30 1 3.2% (0.1–16.7%) 419 E 100 1 1.0% (0.0–5.4%) F 174 8 4.4% (1.9–8.5%) Six extracts of Neandertal bone Vindija Vi33.16 (A–F) were prepared and analysed with respect to mtDNA contamination using PCR. N and H refer to Neandertal- and current human-like clones of mtDNA amplification products, respectively. These extracts were used to construct libraries used for sequencing. Library A.1 was constructed outside the clean room facility using standard 454 sequencing adapters and is published in Green et al (2006). The other libraries were constructed in the clean room using tagged adapters. Library designations refer to the extracts used to construct them. N and H refer to Neandertal- and current human-like mtDNA fragments, respectively. For each library the mtDNA to nuclear DNA ratios are given. For contamination estimates, 95% confidence intervals are given in parentheses. Nuclear DNA contamination estimates A limitation inherent to extrapolation of mtDNA contamination estimates to the nuclear genome is that the ratio of mtDNA to nuclear DNA may differ among different tissues as well as between different bones (Schwarz et al, 2009). Thus, although analysis of mtDNA sequences yields reliable estimates of the extent of contamination of the mtDNA, the level of nuclear DNA contamination can be under- or over-estimated if the contaminating DNA source contains less or more mtDNA, respectively, than the endogenous DNA. For example, as we have pointed out (Green et al, 2006), if a contaminating source of DNA in the proof-of-principle data set contained low amounts of mtDNA relative to the endogenous bone DNA, the level of contamination can be higher than indicated by the mtDNA assay alone. In fact, though 375 nuclear DNA fragments were seen for each mtDNA fragment seen in the proof-of-principle data set, a subsequently produced bar-coded library from the same extract generated in our clean room has yielded 222 nuclear DNA fragments for each mtDNA fragment (Table I). If one takes the entire excess of nuclear fragments to represent contamination, that is, assumes that the contamination was exclusively of mtDNA-free nuclear DNA, this yields an estimate of contamination of 41% in that proof-of-principle data. Although this estimate relies on several tenuous assumptions it serves to show the limitation of extrapolating mtDNA contamination estimates to the nuclear genome. To achieve reliable nuclear DNA sequences from Neandertals, it is therefore necessary to develop direct nuclear estimates of contamination similar to those for mtDNA. This will become possible once large amounts of DNA sequences from the Neandertal nuclear genome become available because fixed differences between Neandertals and current humans will then be identifiable. On the road to this goal, however, other interim approaches are needed. Y chromosomal contamination estimates One such interim approach is available for bones derived from female Neandertals. As females contain no Y chromosome, any Y chromosome sequence from such a bone must be from contaminating DNA derived from a male individual. By comparing the number of such sequences to the total number of sequences that map elsewhere in the genome, it is possible to estimate the levels of male contamination in female bones. However, because the Y chromosome has many regions that are identical or highly similar to those on the X chromosome, it is imperative to avoid misidentifying sequences from the X chromosome as being derived from the Y chromosome. To this end, we have identified regions of Y-unique sequence, totaling 98 kilobases, each of which has ⩾10% sequence difference to all sequences in the human genome outside of the Y chromosome. Fortunately, the three Neandertal bones used for the bulk of the sequencing of the Neandertal genome derive from females and so this approach can be applied to the shotgun data currently being generated. For one of these bones (Vi33.16), 21 671 548 fragments have currently been identified as being derived from a hominin genome. Of these, two align to these Y-unique regions whereas 380 would be expected if the DNA derived from a male individual. This yields an estimate of 0.5% (CI 0.1–1.9%) for male human contamination. Thus, these bones seem to have levels of nuclear contamination as low as those suggested by the mtDNA assays. X chromosomal contamination estimates Assays that allow the detection of female contamination are obviously also desirable. In the case of male Neandertal bones, one such strategy has been suggested. As males are haploid for the X chromosome, heterozygosity should not be observed in overlapping X chromosomal DNA fragments from male DNA samples. In cases in which different alleles are observed, at least one allele must derive from a contaminant. Although this strategy is conceptually attractive, there are several limitations in practice. First, sequencing errors caused by nucleotide misincorporations, common in ancient DNA, or machine error, will appear as a heterozygous position and can be mistaken for contamination. Second, many contaminant molecules will not harbour a sequence difference between Neandertals and humans and thus evade detection. Third, in cases of mismapping of sequences, paralogous regions that have genuine sequence differences may be mistaken for contamination. This approach is therefore unlikely to yield realistic estimates of contamination. Autosomal contamination estimates The most direct approach to detect modern human contamination would be to identify autosomal sequence positions in which all Neandertals differ from all or almost all current humans. However, because for so much of their nuclear genome Neandertals share the variation still present in modern humans (Pääbo, 1999), such positions will be rare. Extensive sequence information from several Neandertals will therefore be required before a set of such positions is found. One interim solution is to use a two-stage approach for any particular Neandertal fossil under study. In the first stage, a targeted capture method such as PEC (Briggs et al, 2009) can be used to isolate one or more genomic region of putative Neandertal DNA from sequencing libraries. In these regions, some positions will show a human–chimpanzee difference in which outgroup comparison shows that a substitution occurred on the human lineage after the human–chimpanzee split 5–7 million years ago. At a proportion of these positions the Neandertal will carry the ancestral, ape-like state. These positions can be then genotyped in population samples of humans from around the world to identify the subset of these positions in which all or almost all humans are derived. This final subset represents positions in which this particular Neandertal differs from all or almost all extant humans. Then, in a second stage, these human–Neandertal diagnostic positions can be used to generate estimates of nuclear DNA contamination in sequencing of other, independent libraries from the same Neandertal individual. A limitation of this approach is that it requires libraries of relatively high genome coverage so that a reasonable number of informative positions can be identified in the first set of experiments, at sufficiently high coverage to be confident that the Neandertal is homozygous, and then independently retrieved in the second set of experiments. It also requires that a large amount of sequencing be performed before estimates can be generated. However, in conjunction with mtDNA and Y chromosomal estimates this approach is currently in our opinion the best way to arrive at realistic estimates of contamination in Neandertal genome sequence data. Eventually, once a first Neandertal genome sequence is available and information from further Neandertals accumulates, a set of positions in which fixed differences between Neandertals and current humans exist will be identified. These will then serve as future direct estimators of contamination, as is now possible for the Neandertal mitochondrial genome. Indirect estimates of contamination In contrast to the approaches above that rely on direct observation of DNA sequences that are indicative of contamination in Neandertal libraries, other approaches have been suggested that use global characteristics of large-scale datasets and compare them to what can be expected for ancient DNA. For example, fragmentation is a universal feature of ancient DNA (Pääbo, 1989). Therefore, bona fide ancient DNA sequences will generally be short. Likewise, nucleotide misincorporations resulting from deamination of cytosine residues result in many C to T and G to A substitutions in ancient DNA (Hofreiter et al, 2001a; Briggs et al, 2007; Brotherton et al, 2007). These approaches have the benefit of allowing contamination levels to be estimated from relatively small sets of random sequences. However, many pitfalls that are briefly discussed below make them unreliable as estimators of contamination. Estimates based on fragment size Variability in the distribution of fragment lengths in different fossils and even within extracts from a single fossil make the extent of DNA fragmentation a less attractive option for estimating contamination. For example, the fraction of endogenous mtDNA fragments that are above 80 bp in length is ∼11% in a Neandertal from El Sidron in Spain, ∼27% in a Neandertal from Vindija in Croatia, and ∼37% in a Neandertal from Feldhofer in Germany (Figure 2) (Briggs et al, 2009). Furthermore, this method is only reliable if the contaminating DNA is not fragmented to an extent similar to that of the endogenous ancient DNA. This is not always the case. For example, in DNA extracts of the El Sidron Neandertal, mtDNA fragments retrieved that are known to be contamination because they carry nucleotide substitutions typical of current humans are as short as the endogenous Neandertal mtDNA; in the Vindija Neandertal some contaminating fragments are longer whereas others are short; and even in the Feldhofer Neandertal, in which the contaminating mtDNA fragments are clearly on average longer than the endogenous ones, the two types of molecules overlap in size (Figure 2) (Briggs et al, 2009). Thus, fragment length per se is not a reliable estimator of contamination. Figure 2.Lengths of Neandertal and human mtDNA fragments. Distributions of mtDNA fragments carrying Neandertal diagnostic positions are shown in blue for three Neandertal fossils. Each red dot represents a single contaminating human mtDNA fragment of the indicated length (data from Briggs et al, 2009). Download figure Download PowerPoint Fragment size and divergence to humans One promising approach is to analyse fragment length in conjunction with other features of the DNA fragments. Wall and Kim (2007) showed that in one of the data sets published in 2006 (Green et al, 2006) longer sequences were less diverged from the human reference genome sequence and more often carried the derived allele at positions in which current humans are polymorphic than did shorter sequences. As these are features expected for contaminating human DNA, they suggested that longer sequences were enriched for current human DNA in these data. However, random DNA fragments sequenced from the extract of a Neandertal bone must first be identified as being of hominin rather than bacterial origin before they can be analysed. This step relies on recognizing sequence similarity between Neandertal sequences and either the human or chimpanzee genomes. This is difficult for short fragment, especially if they carry nucleotide misincorporations and sequencing errors, and as a consequence the estimates of the human–Neandertal divergence can be biased upwards for short fragments. To illustrate this, we analysed a data set of 3700 million base pairs determined from a Neandertal fossil on the Roche FLX platform in sequence bins of increasing length (Figure 3). For each length bin, we analysed the divergence using fragments identified using the same score cut-off for uniqueness in the human and chimpanzee genomes as was used in earlier analyses (Green et al, 2006; Noonan et al, 2006; Wall and Kim, 2007) as well as a more stringent cut-off. Strikingly, the divergence estimates for the fragments greater than 80 nucleotides in length is only minimally affected by increasing the stringency whereas the discrepancy becomes progressively larger for the shorter fragments. Thus, fragments of length 30–40 nucleotides identified with the lower, less strict cut-off score diverge on average 14.0% back in the past along the lineage to the human–chimpanzee common ancestor whereas the
Referência(s)