Artigo Acesso aberto Revisado por pares

CRISPRdisco: An Automated Pipeline for the Discovery and Analysis of CRISPR-Cas Systems

2018; Mary Ann Liebert, Inc.; Volume: 1; Issue: 2 Linguagem: Inglês

10.1089/crispr.2017.0022

ISSN

2573-1602

Autores

Alexandra B. Crawley, James R. Henriksen, Rodolphe Barrangou,

Tópico(s)

Innovation and Socioeconomic Development

Resumo

CRISPR-Cas adaptive immune systems of bacteria and archaea have catapulted into the scientific spotlight as genome editing tools. To aid researchers in the field, we have developed an automated pipeline, named CRISPRdisco (CRISPR discovery), to identify CRISPR repeats and cas genes in genome assemblies, determine type and subtype, and describe system completeness. All six major types and 23 currently recognized subtypes and novel putative V-U types are detected. Here, we use the pipeline to identify and classify putative CRISPR-Cas systems in 2,777 complete genomes from the NCBI RefSeq database. This allows comparison to previous publications and investigation of the occurrence and size of CRISPR-Cas systems. Software available at http://github.com/crisprlab/CRISPRdisco provides reproducible, standardized, accessible, transparent, and high-throughput analysis methods available to all researchers in and beyond the CRISPR-Cas research community. This tool opens new avenues to enable classification within a complex nomenclature and provides analytical methods in a field that has evolved rapidly.

Referência(s)