Artigo Acesso aberto Revisado por pares

Glocal alignment: finding rearrangements during alignment

2003; Oxford University Press; Volume: 19; Issue: suppl_1 Linguagem: Inglês

10.1093/bioinformatics/btg1005

ISSN

1367-4811

Autores

Michael Brudno, Sanket Malde, Alexander Poliakov, Joshua Shulman, Olivier Couronne, Inna Dubchak, Serafim Batzoglou,

Tópico(s)

Chromosomal and Genetic Variations

Resumo

Abstract Motivation: To compare entire genomes from different species, biologists increasingly need alignment methods that are efficient enough to handle long sequences, and accurate enough to correctly align the conserved biological features between distant species. The two main classes of pairwise alignments are global alignment, where one string is transformed into the other, and local alignment, where all locations of similarity between the two strings are returned. Global alignments are less prone to demonstrating false homology as each letter of one sequence is constrained to being aligned to only one letter of the other. Local alignments, on the other hand, can cope with rearrangements between non-syntenic, orthologous sequences by identifying similar regions in sequences; this, however, comes at the expense of a higher false positive rate due to the inability of local aligners to take into account overall conservation maps. Results: In this paper we introduce the notion of glocal alignment, a combination of global and local methods, where one creates a map that transforms one sequence into the other while allowing for rearrangement events. We present Shuffle-LAGAN, a glocal alignment algorithm that is based on the CHAOS local alignment algorithm and the LAGAN global aligner, and is able to align long genomic sequences. To test Shuffle-LAGAN we split the mouse genome into BAC-sized pieces, and aligned these pieces to the human genome. We demonstrate that Shuffle-LAGAN compares favorably in terms of sensitivity and specificity with standard local and global aligners. From the alignments we conclude that about 9% of human/mouse homology may be attributed to small rearrangements, 63% of which are duplications. Availability: Our systems, supplemental information, and the alignment of the human and mouse genomes using Shuffle-LAGAN are available at http://lagan.stanford.edu/glocal Contact: serafim@cs.stanford.edu *To whom correspondence should be addressed. †These authors contributed equally to the work.

Referência(s)
Altmetric
PlumX