Article · Open access · Peer-reviewed

Polypeptide structure and encoding location of the adenovirus serotype 2 late, nonstructural 33K protein

1983; American Society for Microbiology; Volume: 45; Issue: 1; Language: English

10.1128/jvi.45.1.251-263.1983

ISSN

1098-5514

Authors

E. A. Oosterom-Dragon, Carl W. Anderson

Topic(s)

RNA and protein synthesis mechanisms

Abstract

Radiochemical microsequence analysis of selected tryptic peptides of the adenovirus type 2 33K nonstructural protein has revealed the precise region of the genomic nucleotide sequence that encodes this protein. The initiation codon for the 33K protein lies 606 nucleotides to the right of the EcoRI restriction site at 70.7 map units and 281 nucleotides to the left of the postulated carboxyterminal codon of the adenovirus 100K protein. The coding regions for these two proteins thus overlap; however, the 33K protein is derived from the +1 frame with respect to the postulated 100K reading frame. Our results contradict an earlier published report suggesting that these two proteins share extensive amino acid sequence homology (N. Axelrod, Virology 87:366-383, 1978). The published nucleotide sequence of the Ad2 EcoRI-F fragment (70.7 to 75.9 map units) cannot accommodate in a single reading frame the peptide sequences of the 33K protein that we have determined. Sequence analysis of DNA fragments derived from virus has confirmed the published nucleotide sequence in all critical regions with respect to the coding region for the 33K protein. Consequently, our data are only consistent with the existence of an mRNA splice within the coding region for 33K. Consensus donor and acceptor splice sequences have been located that would predict the removal of 202 nucleotides from the transcripts for the 33K protein. Removal of these nucleotides would explain the structure of a peptide that cannot otherwise be directly encoded by the EcoRI-F fragment. Identification of the precise splice points by peptide sequencing has permitted a prediction of the complete amino acid sequence for the 33K protein.
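The reading-frame argument in the abstract can be checked with simple modular arithmetic. The following is a minimal sketch, not the authors' method: the function names and the 0/1/2 frame labels are assumptions introduced here for illustration. It shows that removing a segment whose length is not a multiple of three forces the downstream coding sequence into a different genomic reading frame than the upstream one.

```python
# Hypothetical helpers (not from the paper): frames are labeled 0, 1, 2
# according to genomic position modulo 3.

def genomic_frame_after_splice(upstream_frame: int, removed_nt: int) -> int:
    """Genomic reading frame used downstream of a splice.

    If codons upstream of the splice start at genomic positions p with
    p % 3 == upstream_frame, then after removing `removed_nt` nucleotides
    the downstream codons start at positions with
    p % 3 == (upstream_frame + removed_nt) % 3.
    """
    return (upstream_frame + removed_nt) % 3


def splice_preserves_frame(removed_nt: int) -> bool:
    """True only when the removed segment is a whole number of codons."""
    return removed_nt % 3 == 0


# The abstract reports a predicted removal of 202 nucleotides.
REMOVED_NT = 202

if __name__ == "__main__":
    shift = REMOVED_NT % 3
    print(f"Removed nucleotides: {REMOVED_NT}, frame shift: {shift}")
    print(f"Frame preserved: {splice_preserves_frame(REMOVED_NT)}")
    print(f"Downstream genomic frame (upstream = 0): "
          f"{genomic_frame_after_splice(0, REMOVED_NT)}")
```

Because 202 leaves a remainder of 1 when divided by 3, the sequence on either side of the predicted splice is read in different genomic frames, which is consistent with the statement that the unspliced EcoRI-F sequence cannot accommodate all of the determined 33K peptides in a single reading frame.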
