Artigo Acesso aberto Revisado por pares

A General Sequence Processing and Analysis Program for Protein Engineering

2014; American Chemical Society; Volume: 54; Issue: 10 Linguagem: Inglês

10.1021/ci500362s

ISSN

1549-960X

Autores

Ryan Stafford, Erik S. Zimmerman, Trevor J. Hallam, Aaron K. Sato,

Tópico(s)

Glycosylation and Glycoproteins Research

Resumo

Protein engineering projects often amass numerous raw DNA sequences, but no readily available software combines sequence processing and activity correlation required for efficient lead identification. XLibraryDisplay is an open source program integrated into Microsoft Excel for Windows that automates batch sequence processing via a simple step-by-step, menu-driven graphical user interface. XLibraryDisplay accepts any DNA template which is used as a basis for trimming, filtering, translating, and aligning hundreds to thousands of sequences (raw, FASTA, or Phred PHD file formats). Key steps for library characterization through lead discovery are available including library composition analysis, filtering by experimental data, graphing and correlating to experimental data, alignment to structural data extracted from PDB files, and generation of PyMOL visualization scripts. Though larger data sets can be handled, the program is best suited for analyzing approximately 10 000 or fewer leads or naïve clones which have been characterized using Sanger sequencing and other experimental approaches. XLibraryDisplay can be downloaded for free from sourceforge.net/projects/xlibrarydisplay/ .

Referência(s)