Artigo Acesso aberto Revisado por pares

Functional classification of long non-coding RNAs by k-mer content

2018; Nature Portfolio; Volume: 50; Issue: 10 Linguagem: Inglês

10.1038/s41588-018-0207-8

ISSN

1546-1718

Autores

Jessime M. Kirk, Susan O. Kim, Kaoru Inoue, Matthew J. Smola, David M. Lee, Megan D. Schertzer, Joshua S. Wooten, Allison R. Baker, Daniel J. Sprague, David W. Collins, Christopher R. Horning, Shuo Wang, Qidi Chen, Kevin M. Weeks, Peter J. Mucha, J. Mauro Calabrese,

Tópico(s)

RNA Research and Splicing

Resumo

The functions of most long non-coding RNAs (lncRNAs) are unknown. In contrast to proteins, lncRNAs with similar functions often lack linear sequence homology; thus, the identification of function in one lncRNA rarely informs the identification of function in others. We developed a sequence comparison method to deconstruct linear sequence relationships in lncRNAs and evaluate similarity based on the abundance of short motifs called k-mers. We found that lncRNAs of related function often had similar k-mer profiles despite lacking linear homology, and that k-mer profiles correlated with protein binding to lncRNAs and with their subcellular localization. Using a novel assay to quantify Xist-like regulatory potential, we directly demonstrated that evolutionarily unrelated lncRNAs can encode similar function through different spatial arrangements of related sequence motifs. K-mer-based classification is a powerful approach to detect recurrent relationships between sequence and function in lncRNAs. SEEKR is a method that deconstructs linear sequence relationships between lncRNAs and evaluates similarity on the basis of abundance of short motifs called k-mers. LncRNAs of related function often have similar k-mer profiles despite lacking linear homology.

Referência(s)