Genetic variation and the de novo assembly of human genomes
2015; Nature Portfolio; Volume: 16; Issue: 11 Linguagem: Inglês
10.1038/nrg3933
ISSN1471-0064
AutoresMark Chaisson, Richard K. Wilson, Evan E. Eichler,
Tópico(s)Chromosomal and Genetic Variations
ResumoThe wealth of existing and emerging DNA-sequencing data provides an opportunity for a comprehensive understanding of human genetic variation, including the discovery of disease-causing variants. This Review describes how the limitations of current reference-genome assemblies confound the characterization of genetic variation and how this can be mitigated by important advances in algorithms and sequencing technology that facilitate thede novoassembly of genomes. The discovery of genetic variation and the assembly of genome sequences are both inextricably linked to advances in DNA-sequencing technology. Short-read massively parallel sequencing has revolutionized our ability to discover genetic variation but is insufficient to generate high-quality genome assemblies or resolve most structural variation. Full resolution of variation is only guaranteed by complete de novo assembly of a genome. Here, we review approaches to genome assembly, the nature of gaps or missing sequences, and biases in the assembly process. We describe the challenges of generating a complete de novo genome assembly using current technologies and the impact that being able to perfectly sequence the genome would have on understanding human disease and evolution. Finally, we summarize recent technological advances that improve both contiguity and accuracy and emphasize the importance of complete de novo assembly as opposed to read mapping as the primary means to understanding the full range of human genetic variation.
Referência(s)