Artigo Acesso aberto Revisado por pares

KATK: Fast genotyping of rare variants directly from unmapped sequencing reads

2021; Wiley; Volume: 42; Issue: 6 Linguagem: Inglês

10.1002/humu.24197

ISSN

1098-1004

Autores

Lauris Kaplinski, Märt Möls, Tarmo Puurand, Fanny‐Dhelia Pajuste, Maido Remm,

Tópico(s)

Biomedical Text Mining and Ontologies

Resumo

KATK is a fast and accurate software tool for calling variants directly from raw next-generation sequencing reads. It uses predefined k-mers to retrieve only the reads of interest from the FASTQ file and calls genotypes by aligning retrieved reads locally. KATK does not use data about known polymorphisms and has NC (no call) as the default genotype. The reference or variant allele is called only if there is sufficient evidence for their presence in data. Thus it is not biased against rare variants or de-novo mutations. With simulated datasets, we achieved a false-negative rate of 0.23% (sensitivity 99.77%) and a false discovery rate of 0.19%. Calling all human exonic regions with KATK requires 1–2 h, depending on sequencing coverage.

Referência(s)
Altmetric
PlumX