Article Open access Peer-reviewed

16S Classifier: A Tool for Fast and Accurate Taxonomic Classification of 16S rRNA Hypervariable Regions in Metagenomic Datasets

2015; Public Library of Science; Volume: 10; Issue: 2; Language: English

10.1371/journal.pone.0116106

ISSN

1932-6203

Authors

Nikhil Chaudhary, Ashok Sharma, P. C. Agarwal, Ankit Gupta, Vineet K. Sharma

Topic(s)

Microbial Community Ecology and Physiology

Abstract

The diversity of microbial species in a metagenomic study is commonly assessed using 16S rRNA gene sequencing. With rapid developments in genome sequencing technologies, the focus has shifted towards sequencing the hypervariable regions of the 16S rRNA gene instead of the full-length gene. 16S Classifier was therefore developed using a machine learning method, Random Forest, for fast and accurate taxonomic classification of short hypervariable regions of the 16S rRNA sequence. It displayed precision values of up to 0.91 on training datasets and up to 0.98 on the test dataset. On real metagenomic datasets, it showed up to 99.7% accuracy at the phylum level and up to 99.0% accuracy at the genus level. 16S Classifier is freely available at http://metagenomics.iiserb.ac.in/16Sclassifier and http://metabiosys.iiserb.ac.in/16Sclassifier.
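The abstract does not say how a short hypervariable-region read is turned into numeric input for Random Forest. One common choice, assumed here purely for illustration, is a fixed-length vector of k-mer frequencies. The function name `kmer_frequencies` and the default `k=4` are hypothetical and are not taken from the paper.

```python
from itertools import product

ALPHABET = "ACGT"


def kmer_frequencies(seq, k=4):
    """Return normalized k-mer frequencies in fixed lexicographic order.

    Illustrative assumption: the paper's actual feature set is not
    described in the abstract.
    """
    # Treat RNA input like DNA so both alphabets map to the same k-mers.
    seq = seq.upper().replace("U", "T")
    kmers = ["".join(p) for p in product(ALPHABET, repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        if window in counts:  # skip windows with ambiguous bases such as N
            counts[window] += 1
    total = sum(counts.values())
    # An all-zero vector is returned when no valid k-mer was found.
    return [counts[m] / total if total else 0.0 for m in kmers]


features = kmer_frequencies("ACGTACGTTGCA", k=2)
print(len(features))  # 4**2 positions, one per possible 2-mer
```

A vector of this kind has the same length for every read, which is what a tree-based classifier such as Random Forest needs as input; the taxonomic label at each rank would be the training target.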

Reference(s)