selscan: An Efficient Multithreaded Program to Perform EHH-Based Scans for Positive Selection
2014; Oxford University Press; Volume: 31; Issue: 10 Linguagem: Inglês
10.1093/molbev/msu211
ISSN1537-1719
AutoresZachary A. Szpiech, Ryan D. Hernandez,
Tópico(s)Genetic Associations and Epidemiology
ResumoHaplotype-based scans to detect natural selection are useful to identify recent or ongoing positive selection in genomes. As both real and simulated genomic datasets grow larger, spanning thousands of samples and millions of markers, there is a need for a fast and efficient implementation of these scans for general use. Here we present selscan, an efficient multi-threaded application that implements Extended Haplotype Homozygosity (EHH), Integrated Haplotype Score (iHS), and Cross-population Extended Haplotype Homozygosity (XPEHH). selscan accepts phased genotypes in multiple formats, including TPED, and performs extremely well on both simulated and real data and over an order of magnitude faster than existing available implementations. It calculates iHS on chromosome 22 (22,147 loci) across 204 CEU haplotypes in 353s on one thread (33s on 16 threads) and calculates XPEHH for the same data relative to 210 YRI haplotypes in 578s on one thread (52s on 16 threads). Source code and binaries (Windows, OSX and Linux) are available at https://github.com/szpiech/selscan .
Referência(s)