Structure of Arterivirus nsp4
2002; Elsevier BV; Volume: 277; Issue: 42 Linguagem: Inglês
10.1074/jbc.m206978200
ISSN1083-351X
AutoresIsabelle H. Barrette‐Ng, Kenneth Ng, Brian L. Mark, Danny van Aken, M.M. Cherney, Craig R. Garen, Yuliya Kolodenko, Alexander E. Gorbalenya, Eric J. Snijder, Michael N.G. James,
Tópico(s)Viral gastroenteritis research and epidemiology
ResumoArteriviruses are enveloped, positive-stranded RNA viruses and include pathogens of major economic concern to the swine- and horse-breeding industries. The arterivirus replicase gene encodes two large precursor polyproteins that are processed by the viral main proteinase nonstructural protein 4 (nsp4). The three-dimensional structure of the 21-kDa nsp4 from the arterivirus prototype equine arteritis virus has been determined to 2.0 Å resolution. Nsp4 adopts the smallest known chymotrypsin-like fold with a canonical catalytic triad of Ser-120, His-39, and Asp-65, as well as a novel α/β C-terminal extension domain that may play a role in mediating protein-protein interactions. In different copies of nsp4 in the asymmetric unit, the oxyanion hole adopts either a collapsed inactive conformation or the standard active conformation, which may be a novel way of regulating proteolytic activity. Arteriviruses are enveloped, positive-stranded RNA viruses and include pathogens of major economic concern to the swine- and horse-breeding industries. The arterivirus replicase gene encodes two large precursor polyproteins that are processed by the viral main proteinase nonstructural protein 4 (nsp4). The three-dimensional structure of the 21-kDa nsp4 from the arterivirus prototype equine arteritis virus has been determined to 2.0 Å resolution. Nsp4 adopts the smallest known chymotrypsin-like fold with a canonical catalytic triad of Ser-120, His-39, and Asp-65, as well as a novel α/β C-terminal extension domain that may play a role in mediating protein-protein interactions. In different copies of nsp4 in the asymmetric unit, the oxyanion hole adopts either a collapsed inactive conformation or the standard active conformation, which may be a novel way of regulating proteolytic activity. porcine reproductive and respiratory syndrome virus equine arteritis virus open reading frame nonstructural protein Sindbis virus core protein Semliki forest virus core protein chymotrypsin-like proteinase root mean-squared difference human rhinovirus-2 S. griseusproteinase E maltose-binding protein The arteriviruses porcine reproductive and respiratory syndrome virus (PRRSV)1 (1Wensvoort G. Terpstra C. Pol J.M. ter Laak E.A. Bloemraad M. de Kluyver E.P. Kragten C. van Buiten L. den Besten A. Wagenaar F. Vet. Q. 1991; 13: 121-130Crossref PubMed Scopus (1111) Google Scholar) and equine arteritis virus (EAV) (2Doll E.R. Bryans J.T. McCollum W.H.M. Wallace M.E. Cornell Vet. 1957; 47: 3-41PubMed Google Scholar) are major economic concerns for the swine- and horse-breeding industries worldwide. Both viruses are widespread in the population, can establish persistent infections, and are easily transmitted via both respiratory and venereal routes. PRRSV is currently considered to be a major swine pathogen, causing reproductive failure and severe pneumonia in neonates. Horses infected with EAV are often asymptomatic, but persistently infected stallions ("shedding stallions") can infect mares through semen, regularly leading to spontaneous abortion of the fetus (3Timoney P.J. Lekeux P. Equine Respiratory Diseases. International Veterinary Information Service, Ithaca, NY2002Google Scholar). Apart from vaccinations that are costly and (given the use of live attenuated vaccines) not without risk (4Botner A. Strandbygaard B. Sorensen K.J. Have P. Madsen K.G. Madsen E.S. Alexandersen S. Vet. Rec. 1997; 141: 497-499Crossref PubMed Scopus (199) Google Scholar), there are currently no effective methods to combat arteriviral diseases. The family Arteriviridae (order Nidovirales) comprises enveloped, mammalian RNA viruses with a 12–16-kilobase positive stranded genome (5Snijder E.J. Meulenberg J.J.M. Knipe D.M. Howley P.M. 4th Ed. Fields Virology. 1. Lippincott Williams and Wilkins, Philadelphia, PA2001: 1205-1220Google Scholar, 6Snijder E.J. Meulenberg J.J. J. Gen. Virol. 1998; 79: 961-979Crossref PubMed Scopus (691) Google Scholar, 7Ziebuhr J. Snijder E.J. Gorbalenya A.E. J. Gen. Virol. 2000; 81: 853-879Crossref PubMed Scopus (776) Google Scholar). In addition to the prototype EAV and PRRSV, the family includes the lactate dehydrogenase-elevating virus of mice and simian hemorrhagic fever virus. Nidovirus genomes are polycistronic, containing a large 5′-terminal replicase gene, which is expressed from the viral genome RNA, and a downstream set of (largely) structural protein genes. The latter are expressed via the transcription of a nested set of subgenomic mRNAs from the 3′-terminal region of the genome (8Pasternak A.O. van den Born E. Spaan W.J. Snijder E.J. EMBO J. 2001; 20: 7220-7228Crossref PubMed Scopus (120) Google Scholar). The replicase gene is translated into two multidomain precursor proteins, which are cleaved into mature nonstructural proteins by multiple viral proteinases, a key regulatory mechanism in the nidovirus life cycle (7Ziebuhr J. Snijder E.J. Gorbalenya A.E. J. Gen. Virol. 2000; 81: 853-879Crossref PubMed Scopus (776) Google Scholar). In the arterivirus prototype EAV, the replicase gene is translated into open reading frame 1a (ORF1a) and ORF1ab polyproteins of 1,727 and 3,175 amino acids, respectively; the latter product resulting from a ribosomal frameshift that can occur just prior to termination of ORF1a translation. As illustrated in Fig. 1, the EAV ORF1a and ORF1ab proteins are cleaved by three different ORF1a-encoded proteinases, papain-like cysteine proteinases located in nonstructural protein 1 (nsp1) and nsp2, and the chymotrypsin-like serine proteinase nsp4 (9den Boon J.A. Snijder E.J. Chirnside E.D. de Vries A.A. Horzinek M.C. Spaan W.J. J. Virol. 1991; 65: 2910-2920Crossref PubMed Google Scholar). Following the rapid autocatalytic release of nsp1 and nsp2, the remainder of the polyproteins (nsp3-8 and nsp3-12) is processed by nsp4, the main viral proteinase (10Snijder E.J. Wassenaar A.L. Spaan W.J. J. Virol. 1994; 68: 5755-5764Crossref PubMed Google Scholar). Probably, internal hydrophobic domains target these proteins to intracellular membranes, which are modified to accommodate viral RNA synthesis (11Pedersen K.W. van der Meer Y. Roos N. Snijder E.J. J. Virol. 1999; 73: 2016-2026Crossref PubMed Google Scholar). The nsp4 main proteinase processes the remaining eight cleavage sites, five in the ORF1a protein (12Snijder E.J. Wassenaar A.L. van Dinten L.C. Spaan W.J. Gorbalenya A.E. J. Biol. Chem. 1996; 271: 4864-4871Abstract Full Text Full Text PDF PubMed Scopus (91) Google Scholar, 13Wassenaar A.L. Spaan W.J.M. Gorbalenya A.E. Snijder E.J. J. Virol. 1997; 71: 9313-9322Crossref PubMed Google Scholar) and three in ORF1b-encoded part of the ORF1ab protein (14van Dinten L.C. Rensen S. Gorbalenya A.E. Snijder E.J. J. Virol. 1999; 73: 2027-2037Crossref PubMed Google Scholar), which encodes (among other functions) the viral RNA-dependent RNA polymerase and helicase (9den Boon J.A. Snijder E.J. Chirnside E.D. de Vries A.A. Horzinek M.C. Spaan W.J. J. Virol. 1991; 65: 2910-2920Crossref PubMed Google Scholar). Alternative processing pathways are used during cleavage of the nsp3-8 intermediate (13Wassenaar A.L. Spaan W.J.M. Gorbalenya A.E. Snijder E.J. J. Virol. 1997; 71: 9313-9322Crossref PubMed Google Scholar). In the major pathway, nsp2 associates with nsp3-8 as a cofactor and triggers nsp4 to autocatalytically cleave its C terminus to yield the nsp3-4 and nsp5-8 intermediates. The relatively slow processing of the nsp3↓4 and nsp7↓8 sites ensues (10Snijder E.J. Wassenaar A.L. Spaan W.J. J. Virol. 1994; 68: 5755-5764Crossref PubMed Google Scholar), but the nsp5↓6 and nsp6↓7 bonds are not cleaved, and cleaved nsp6 and nsp7 are not produced. In the minor pathway, nsp2 does not associate with nsp3-8, and the nsp4↓5 junction is not cleaved (13Wassenaar A.L. Spaan W.J.M. Gorbalenya A.E. Snijder E.J. J. Virol. 1997; 71: 9313-9322Crossref PubMed Google Scholar). However, nsp4 does cleave the nsp3↓4, nsp5↓6, nsp6↓7, and nsp7↓8 sites of the nsp3-8 intermediate, indicating that the nsp4 proteinase is fully functional. To provide a structural framework for understanding the complexities of proteolysis in arteriviruses, we have determined the crystallographic structure of EAV nsp4 at 2.0 Å resolution. The structure reveals the smallest chymotrypsin-like fold known as well as a novel C-terminal α/β extension domain. This first structure of an arteriviral proteinase opens new avenues for understanding replicase maturation and establishes a basis for rationally developing antiviral agents. Nsp4 (nt 3417–4025 of the EAV genome; GenBankTM accession numberX53459) was cloned into pMAL-c2 (New England Biolabs, Inc.) and expressed as an MBP-nsp4 fusion in Escherichia coliBL21(DE3) (Promega). Expression was induced with 1 mmisopropyl-1-thio-β-d-galactopyranoside for 4 h at 37 °C, and cells were lysed using a French press. MBP-nsp4 was purified by amylose-affinity chromatography and cleaved with 1 unit of thrombin/mg of protein in 20 mm Tris, pH 8.1 and 150 mm NaCl. Following cleavage, nsp4 contained additional residues at its N and C terminus, Ser-Met and Leu-Ala-Ser, respectively. Further purification was achieved using Sepharose G-75 and Mono Q columns (Amersham Biosciences). MBP and uncleaved fusion protein were removed using amylose resin. The integrity of nsp4 was confirmed by mass spectrometry. Crystals were grown at room temperature by the hanging drop vapor diffusion method by mixing 2 μl of protein (25 mg/ml) and 2 μl of 23% (w/v) polyethylene glycol MME 2000, 100 mm Tris, pH 8.5, 10% (w/v) ethylene glycol, and 0.2m ammonium acetate. Crystals grew within 1–2 weeks and had typical dimensions of 0.5 × 0.2 × 0.2 mm. The crystals belonged to space group P1 with cell dimensions a = 56.6, b = 60.7, c = 62.8 Å, α = 74.0°, β = 63.2°, and γ = 66.2°. All data were collected on flash-frozen crystals at 100 K by transferring crystals from the mother liquor or heavy atom soaking solutions into a nitrogen gas stream. Data from the crystal soaked in mersalyl were measured at beamline BM-14-C (BioCARS-CAT, Advanced Photon Source) using an ADSC Quantum-4 detector. All other data were measured using CuKα radiation (Rigaku RU-300 generator, Osmic optics) and a R-AXIS IV++ image plate detector. Data were processed using the HKL suite (15Otwinowski Z. Minor W. Methods Enzymol. 1997; 276: 307-326Crossref PubMed Scopus (38617) Google Scholar). Experimental phases were calculated using SOLVE (16Terwilliger T.C. Berendzen J. Acta Crystallogr. D Biol. Crystallogr. 1999; 55: 849-861Crossref PubMed Scopus (3220) Google Scholar). RESOLVE (17Terwilliger T.C. Acta Crystallogr. D Biol. Crystallogr. 2001; 57: 1755-1762Crossref PubMed Scopus (166) Google Scholar) was used for density modification, improving the figure of merit from 0.57 to 0.69, and to generate a partial main-chain trace. The experimental electron density map was further improved by ARP/wARP (18Perrakis A. Morris R. Lamzin V.S. Nat. Struct. Biol. 1999; 6: 458-463Crossref PubMed Scopus (2565) Google Scholar). From this modified electron density map, a nearly complete trace of a single copy in the asymmetric unit was constructed using XFIT (19McRee D.E. J. Struct. Biol. 1999; 125: 156-165Crossref PubMed Scopus (2022) Google Scholar). A model for the complete asymmetric unit was generated by transposing the coordinates of the first copy onto partial models of the other copies and making manual adjustments. Refinement against experimental phase information was performed using CNS (20Brunger A.T. Adams P.D. Clore G.M. DeLano W.L. Gros P. Grosse-Kunstleve R.W. Jiang J.S. Kuszewski J. Nilges M. Pannu N.S. Read R.J. Rice L.M. Simonson T. Warren G.L. Acta Crystallogr. D Biol. Crystallogr. 1998; 54: 905-921Crossref PubMed Scopus (16979) Google Scholar). The model was further refined against the Native 1 data set using REFMAC version 5.0.36 (21Winn M.D. Isupov M.N. Murshudov G.N. Acta Crystallogr. D Biol. Crystallogr. 2001; 57: 122-133Crossref PubMed Scopus (1654) Google Scholar) with individual TLS parameters refined separately for the N-terminal barrel, C-terminal barrel, and the C-terminal domain. The Native 1 data set was used for refinement instead of the higher resolution mersalyl data set because of systematic measurement errors in the latter. Non-crystallographic symmetry restraints were not used at any point during refinement because of significant differences in conformation between different copies. Residues 1–6 are missing from all copies. In addition, the following residues are missing: 204 in copy A, 199–204 in copy B, 198–204 in copy C, and 8 and 204 in copy D. 88.6% of residues lie in the most favored regions of the Ramachandran plot and 10.7% lie in the additional allowed regions (regions defined by PROCHECK) (22Morris A.L. MacArthur M.W. Hutchinson E.G. Thornton J.M. Proteins. 1992; 12: 345-364Crossref PubMed Scopus (1420) Google Scholar). Additional checks on geometry were performed using WHATCHECK (23Hooft R.W. Vriend G. Sander C. Abola E.E. Nature. 1996; 381: 272Crossref PubMed Scopus (1816) Google Scholar). Data quality, phasing, and refinement statistics are given in Table I. Figures were prepared with MOLSCRIPT (24Kraulis P.J. J. Appl. Crystallogr. 1991; 24: 946-950Crossref Google Scholar), BOBSCRIPT (25Esnouf R.M. J. Mol. Graph. Model. 1997; 15: 132-134Crossref PubMed Scopus (1795) Google Scholar), and RASTER3D (26Merritt E.A. Bacon M. Methods Enzymol. 1997; 277: 505-524Crossref PubMed Scopus (3878) Google Scholar).Table IData collection, phasing, and refinement parametersData processing and heavy atom statisticsDatasetNative 1Native 2Pb(CH3CO2)2HgCl2UO2(NO3)2KPtCl4MersalylResolution (Å)16.2–2.0040.0–2.1040.0–2.0640.0–2.5040.0–1.9040.0–1.9040.0–1.60High-resolution range (Å)2.07–2.002.16–2.102.13–2.062.57–2.501.97–1.901.97–1.901.64–1.60Total reflectionsaValues for the outermost resolution shell are given in parentheses.169,296 (15,362)72,351 (5,208)76,161 (2,292)46,156 (3,558)186,601 (9,932)188,735 (10,212)313,166 (11,610)Unique reflectionsaValues for the outermost resolution shell are given in parentheses.43,171 (4,151)37,782 (2,942)36,920 (1,188)22,609 (1,834)48,233 (2,916)48,619 (2,934)82,476 (3,261)Completeness (%)aValues for the outermost resolution shell are given in parentheses.95.0 (91.5)94.5 (88.9)81.6 (31.3)96.6 (94.0)90.3 (55.0)90.3 (55.3)91.7 (54.4)I/ςaValues for the outermost resolution shell are given in parentheses.22.4 (6.1)11.2 (3.9)13.0 (7.2)12.6 (5.1)18.6 (2.9)18.5 (3.4)19.1 (2.8)R sym(%)aValues for the outermost resolution shell are given in parentheses.bR sym = Σ‖I − 〈I〉 ‖/ΣI, where I is the integrated intensity of a given reflection.5.0 (17.9)6.5 (16.2)5.3 (10.2)6.1 (15.5)6.6 (35.4)6.4 (28.5)5.2 (64.4)Heavy atomaValues for the outermost resolution shell are given in parentheses.bR sym = Σ‖I − 〈I〉 ‖/ΣI, where I is the integrated intensity of a given reflection.(mm)201.02.52.52.0Number of sites44354Phasing statistics to 2.4 ÅR iso(%)cR iso = Σ‖FPH − FP‖/ΣFP, where FRI and FP are the derivative and native structure factor amplitudes, respectively.33.7 (33.9)24.3 (25.6)15.6 (18.9)13.0 (12.9)27.1 (30.7)Phasing powerdPhasing power = 〈r.m.s. heavy atom structure factor〉/〈r.m.s. lack of closure〉.0.61 (0.41)0.86 (0.65)0.37 (0.29)0.74 (0.46)0.59 (0.40)Refinement statistics (numbers in parentheses refer to the outermost resolution bin (2.05–2.00 ÅResolution range (Å)16.2–2.00R workeR work = Σ‖ ‖Fo‖ − ‖Fc‖ ‖/Σ‖Fo‖ for the 90% of the reflection data used in refinement.0.204 (0.222)R freefR free = Σ ‖ Fo‖ − ‖Fc‖ ‖/Σ‖Fo‖ for the remaining 10% of the reflection data excluded from refinement.0.261 (0.283)Number of atoms or moleculesprotein5606water400R.m.s. deviation from ideal geometryBond lengths (Å)0.007Bond angles (°)1.271Average B-factor (Å2)15.6a Values for the outermost resolution shell are given in parentheses.b R sym = Σ‖I − 〈I〉 ‖/ΣI, where I is the integrated intensity of a given reflection.c R iso = Σ‖FPH − FP‖/ΣFP, where FRI and FP are the derivative and native structure factor amplitudes, respectively.d Phasing power = 〈r.m.s. heavy atom structure factor〉/〈r.m.s. lack of closure〉.e R work = Σ‖ ‖Fo‖ − ‖Fc‖ ‖/Σ‖Fo‖ for the 90% of the reflection data used in refinement.f R free = Σ ‖ Fo‖ − ‖Fc‖ ‖/Σ‖Fo‖ for the remaining 10% of the reflection data excluded from refinement. Open table in a new tab The structure of EAV nsp4 was determined using the multiple isomorphous replacement technique (Fig.2 A; refer to TableI for crystallographic parameters). There are four copies of nsp4 (designated A, B, C, and D) in the triclinic unit cell, each containing two β-barrels as well as a unique C-terminal domain. The N-terminal barrel consists of six β-strands (A1 to F1), while the C-terminal barrel is composed of seven (A2 to G2) (Fig. 2 B). The G2 β-strand is also found in Sindbis virus core protein (SCP) (27Tong L. Wengler G. Rossmann M.G. J. Mol. Biol. 1993; 230: 228-247Crossref PubMed Scopus (82) Google Scholar) and Semliki forest virus core protein (SFCP) (28Choi H.K., Lu, G. Lee S. Wengler G. Rossmann M.G. Proteins. 1997; 27: 345-359Crossref PubMed Scopus (84) Google Scholar) but is not present in most other chymotrypsin-like proteinases (CLPs). The core of both β-barrels is comprised of conserved hydrophobic residues (Fig. 2 B). In addition, Trp-114 is a conserved, fully solvent exposed hydrophobic residue that lies in a groove lined with other conserved residues near the substrate-binding pockets. This groove and Trp-114 may mediate protein-protein interactions. The most striking feature of the nsp4 structure is the presence of an additional C-terminal domain not found in most other CLPs. This third domain comprises residues 156–204 and consists of two short pairs of β-strands and two α-helices. The C-terminal domain interacts with the C-terminal barrel through an interface (buried surface area of 1638 Å2) consisting of conserved hydrophobic residues: Leu-105 and Leu-112 from the C-terminal β-barrel and Val-158, Leu-163, Phe-167, Ile-182, Leu-196, and Ile-197 from the C-terminal domain. There is also an exposed patch of conserved solvent-exposed hydrophobic residues (Fig. 2 C) that may form part of the interface with nsp5 in the nsp4-5 intermediate. This hydrophobic patch may also mediate interactions with nsp2, which associates with nsp3-8 to induce cleavage of the nsp4↓5 site by nsp4 (13Wassenaar A.L. Spaan W.J.M. Gorbalenya A.E. Snijder E.J. J. Virol. 1997; 71: 9313-9322Crossref PubMed Google Scholar). The overall conformation of the two β-barrels in all four copies of nsp4 is highly conserved, except for the loops between C1 and D1 and E2 and F2, both of which are flexible and have higher temperature factors (Fig. 2 D). The largest variation in conformation is seen in the position of the C-terminal domain relative to the β-barrels. The root mean square deviations when superimposing the entire molecule range from 0.40 to 0.77 Å and are 11–91% higher than when superimposing only the β-barrels. DYNDOM (29Hayward S. Berendsen H.J. Proteins. 1998; 30: 144-154Crossref PubMed Scopus (712) Google Scholar) reveals a rotation (residue 157 or 161 as the hinge) of the C-terminal domain relative to the proteinase domain of 11° when comparing copies A and C or C and D. The C-terminal domain can clearly adopt different orientations relative to the two β-barrels, which may facilitate substrate binding or autoproteolysis as discussed below. Despite having little sequence identity to other CLPs, nsp4 was proposed to have a CLP fold (7Ziebuhr J. Snijder E.J. Gorbalenya A.E. J. Gen. Virol. 2000; 81: 853-879Crossref PubMed Scopus (776) Google Scholar, 12Snijder E.J. Wassenaar A.L. van Dinten L.C. Spaan W.J. Gorbalenya A.E. J. Biol. Chem. 1996; 271: 4864-4871Abstract Full Text Full Text PDF PubMed Scopus (91) Google Scholar). The crystal structure confirms the existence of a CLP fold, which at 149 residues is the smallest serine proteinase domain known. Human rhinovirus 2 (HRV-2) 3C is a viral CLP (30Matthews D.A. Dragovich P.S. Webber S.E. Fuhrman S.A. Patick A.K. Zalman L.S. Hendrickson T.F. Love R.A. Prins T.J. Marakovits J.T. Zhou R. Tikhe J. Ford C.E. Meador J.W. Ferre R.A. Brown E.L. Binford S.L. Brothers M.A. DeLisle D.M. Worland S.T. Proc. Natl. Acad. Sci. U. S. A. 1999; 96: 11000-11007Crossref PubMed Scopus (266) Google Scholar) with a similar P1 specificity of substrates to nsp4. A least squares superposition with nsp4 gives an r.m.s.d. of 0.96 Å (40 Cα pairs, cut-off of 1.5 Å) (Fig.3 A). Apart from the additional C-terminal domain in nsp4, a major difference between the two structures is the length of the loop separating strands B2 and C2. In HRV-2 3C and in most other CLPs, this β-hairpin is 10–20 residues longer. The shorter β-hairpin found in nsp4 provides enough space for the seventh β-strand of the C-terminal barrel, G2. A short β-hairpin connecting B2 and C2 is also present in SCP and SFCP, both of which also have G2. The presence of this shorter β-hairpin in SCP may facilitate cis cleavage to produce the viral capsid protein (27Tong L. Wengler G. Rossmann M.G. J. Mol. Biol. 1993; 230: 228-247Crossref PubMed Scopus (82) Google Scholar). The structures of nsp4 and SCP align well, despite remote sequence similarity, with an r.m.s.d. of 0.95 Å (39 Cαpairs, cut-off of 1.5 Å) (Fig. 3 B). The overall shape of nsp4 is much like SCP as it is more compact than most other CLPs and results from shorter β-hairpins. Another example of a CLP with a specificity for glutamic acid in the P1 position is Streptomyces griseus proteinase E (SGPE) (31Nienaber V.L. Breddam K. Birktoft J.J. Biochemistry. 1993; 32: 11469-11475Crossref PubMed Scopus (74) Google Scholar) (Fig. 3 C). Nsp4 and SGPE align well with an r.m.s.d. of 0.86 Å (56 Cα pairs, cut-off of 1.5 Å). The S1 specificity pocket in nsp4, SGPE, and picornaviral 3C proteinases show remarkable structural similarities. A recently reported structure of a coronavirus chymotrypsin-like cysteine main proteinase from the transmissible gastroenteritis virus in pigs reveals the only other structure of a CLP with an additional C-terminal domain (32Anand K. Palm G.J. Mesters J.R. Siddell S.G. Ziebuhr J. Hilgenfeld R. EMBO J. 2002; 21: 3213-3224Crossref PubMed Scopus (479) Google Scholar). Coronaviruses comprise the distantly related second family in the order Nidovirales (7Ziebuhr J. Snijder E.J. Gorbalenya A.E. J. Gen. Virol. 2000; 81: 853-879Crossref PubMed Scopus (776) Google Scholar). Unlike nsp4, the structure of the C-terminal domain in the coronavirus proteinase is nearly twice as large at 110 residues and is comprised of only α-helices. In addition, the linker between the C-terminal domain and the C-terminal β-barrel differs markedly from that found in nsp4. Despite the lack of sequence and structural similarity, the C-terminal domains in both arteriviruses and coronaviruses may share a common functionality in facilitating substrate recognition (32Anand K. Palm G.J. Mesters J.R. Siddell S.G. Ziebuhr J. Hilgenfeld R. EMBO J. 2002; 21: 3213-3224Crossref PubMed Scopus (479) Google Scholar). The crystal structure of nsp4 reveals that His-39, Asp-65, and Ser-120 form a catalytic triad similar to that of other CLPs (Fig. 2 A), confirming the predictions from sequence analysis (9den Boon J.A. Snijder E.J. Chirnside E.D. de Vries A.A. Horzinek M.C. Spaan W.J. J. Virol. 1991; 65: 2910-2920Crossref PubMed Google Scholar) and site-directed mutagenesis (12Snijder E.J. Wassenaar A.L. van Dinten L.C. Spaan W.J. Gorbalenya A.E. J. Biol. Chem. 1996; 271: 4864-4871Abstract Full Text Full Text PDF PubMed Scopus (91) Google Scholar). The active sites in all four copies of nsp4 adopt a very similar conformation (Fig. 4 A), except for the peptide bond between residues 117 and 118. The distances between Ser-120 Oγ and His-39 Nε2 range from 2.8 to 3.5 Å, and the distances between His-39 Nδ1 and the two carboxylate oxygen atoms of Asp-65 range from 2.8 to 3.0 Å. These distances are similar to values seen in other CLPs. The rotamer adopted by Ser-120 is the same as in other CLPs (χ1 ∼ −80°). In CLPs, the oxyanion hole (in nsp4, main chain amides between residues 118 and 120) stabilizes the negative charge on the P1 carbonyl oxygen atom in the tetrahedral intermediate. Remarkably, in three of the four copies, the amide nitrogen of Gly-118 points away from Ser-120, causing the collapse of the oxyanion hole (Fig. 4, A andB). In contrast, the oxyanion hole is properly formed in copy B (Fig. 4 C), where the only major difference between the other copies is the conformation of the peptide bond between residues 117 and 118. This represents an interconversion of a type I (copies A, C, and D) to a type II (copy B) β-turn. The psi angle of Ser-117 in copy B is 136°, and for copies A, C, and D it ranges between −41 and −51°. The phi angle of Gly-118 in copy B is 116° whereas that for copies A, C, and D ranges between −59 and −64°. This is a common type of peptide flip observed in pairs of homologous protein structures, especially with a glycine residue in the i + 1 position of the flipped peptide bond (33Hayward S. Protein Sci. 2001; 10: 2219-2227Crossref PubMed Scopus (62) Google Scholar). Molecular orbital calculations indicate an energy barrier of ∼3 kcal/mol for this flip (34Gunasekaran K. Gomathi L. Ramakrishnan C. Chandrasekhar J. Balaram P. J. Mol. Biol. 1998; 284: 1505-1516Crossref PubMed Scopus (97) Google Scholar). Nsp4 is the first example of a wild-type serine proteinase in which alternate conformations of the oxyanion hole are observed. Previously, a collapsed oxyanion hole was observed in an active site cysteine to alanine mutant of HAV 3C proteinase (35Allaire M. Chernaia M.M. Malcolm B.A. James M.N. Nature. 1994; 369: 72-76Crossref PubMed Scopus (260) Google Scholar). The collapsed oxyanion hole in nsp4 may be part of a novel mechanism regulating proteolytic activity. An important structural feature stabilizing the collapsed conformation of the oxyanion hole is a water molecule that forms hydrogen bonds with Ser-120 Oγ and His-39 Nε2 (Fig.4 B). A second water molecule hydrogen bonds with His-134 Nε2 and the main-chain carbonyl oxygen atom of Thr-116. These hydrogen bonds can only form if the oxyanion hole adopts a collapsed conformation, and hence these water molecules are absent from copy B. Previous studies have shown that nsp4 has a specificity for Glu (and in one case Gln) at the P1 position (7Ziebuhr J. Snijder E.J. Gorbalenya A.E. J. Gen. Virol. 2000; 81: 853-879Crossref PubMed Scopus (776) Google Scholar, 12Snijder E.J. Wassenaar A.L. van Dinten L.C. Spaan W.J. Gorbalenya A.E. J. Biol. Chem. 1996; 271: 4864-4871Abstract Full Text Full Text PDF PubMed Scopus (91) Google Scholar,14van Dinten L.C. Rensen S. Gorbalenya A.E. Snijder E.J. J. Virol. 1999; 73: 2027-2037Crossref PubMed Google Scholar). As in the picornaviral 3C proteinases and SGPE, which have specificity for either Gln or Glu in the P1 position, nsp4 also has a conserved histidine residue (His-134 in nsp4, His-161 inHRV-2 3C, His-213 in SGPE) at the base of the S1 specificity pocket, and a conserved Ser/Thr residue lining one "wall" of the S1 pocket (Thr-115 in nsp4, Thr-142 in HRV-2 3C, Ser-192 in SGPE) (Fig.5 A). The histidine in the S1 pocket hydrogen bonds to the P1 side-chain carbonyl oxygen atom of an inhibitor bound to HRV-2 3C (30Matthews D.A. Dragovich P.S. Webber S.E. Fuhrman S.A. Patick A.K. Zalman L.S. Hendrickson T.F. Love R.A. Prins T.J. Marakovits J.T. Zhou R. Tikhe J. Ford C.E. Meador J.W. Ferre R.A. Brown E.L. Binford S.L. Brothers M.A. DeLisle D.M. Worland S.T. Proc. Natl. Acad. Sci. U. S. A. 1999; 96: 11000-11007Crossref PubMed Scopus (266) Google Scholar) and to one of the glutamate carboxylate oxygens of the P1 side chain of a peptide bound to SGPE (31Nienaber V.L. Breddam K. Birktoft J.J. Biochemistry. 1993; 32: 11469-11475Crossref PubMed Scopus (74) Google Scholar). Thr-142 in HRV-2 3C and Ser-192 in SGPE also hydrogen bond to the carbonyl or carboxylate oxygen atoms of the P1 side chain. Because the structures of the active site and S1 pocket are conserved, His-134 and Thr-115 in nsp4 could also donate hydrogen bonds to the carboxylate oxygen atom of the P1 side chain in substrates. Site-directed mutagenesis studies confirm that His-134 is essential for the efficient processing of the EAV polyprotein cleavage sites (12Snijder E.J. Wassenaar A.L. van Dinten L.C. Spaan W.J. Gorbalenya A.E. J. Biol. Chem. 1996; 271: 4864-4871Abstract Full Text Full Text PDF PubMed Scopus (91) Google Scholar). Thr-115 can be replaced by serine, glycine, and asparagine with variable effects on cleavage specificity and efficiency (12Snijder E.J. Wassenaar A.L. van Dinten L.C. Spaan W.J. Gorbalenya A.E. J. Biol. Chem. 1996; 271: 4864-4871Abstract Full Text Full Text PDF PubMed Scopus (91) Google Scholar). Both nsp4 and SGPE prefer glutamic acid over glutamine in the P1 position of substrates. In SGPE, Ser-216 hydrogen bonds with the carboxylate oxygen atom (not interacting with His-213 or Ser-192) of the P1 side chain. Ser-216 cannot hydrogen bond with the amide nitrogen of a P1 glutamine side chain, because the lone pair already accepts a hydrogen bond from the main-chain amide of Gly-219; this proposal is supported by site-directed mutagenesis studies (36Stennicke H.R. Birktoft J.J. Breddam K. Protein Sci. 1996; 5: 2266-2275Crossref PubMed Scopus (31) Google Scholar). In nsp4, Ser-137 occupies the same position as Ser-216 in SGPE and should also recognize a negatively charged P1 residue (Fig. 5 B). In picornaviral 3C proteinases, which prefer glutamine at the P1 position, a glycine is found in lieu of serine. Nsp4 is known to cleave its N and C termini in the nsp3-8 intermediate (7Ziebuhr J. Snijder E.J. Gorbalenya A.E. J. Gen. Virol. 2000; 81: 853-879Crossref PubMed Scopus (776) Google Scholar, 12Snijder E.J. Wassenaar A.L. van Dinten L.C. Spaan W.J. Gorbalenya A.E. J. Biol. Chem. 1996; 271: 4864-4871Abstract Full Text Full Text PDF PubMed Scopus (91) Google Scholar, 13Wassenaar A.L. Spaan W.J.M. Gorbalenya A.E. Snijder E.J. J. Virol. 1997; 71: 9313-9322Crossref PubMed Google Scholar, 14van Dinten L.C. Rensen S. Gorbalenya A.E. Snijder E.J. J. Virol. 1999; 73: 2027-2037Crossref PubMed Google Scholar). Although nsp4 can cleave several sites in trans (14van Dinten L.C. Rensen S. Gorbalenya A.E. Snijder E.J. J. Virol. 1999; 73: 2027-2037Crossref PubMed Google Scholar), self-processing may also be a cis event. To act as both an enzyme and a substrate in cis reactions and to act as an enzyme in trans reactions, nsp4 must adopt alternate conformations. The N or C termini must be either in or away from the substrate-binding pocket for cis and transcleavages, respectively. In the nsp4 crystal structure, the chain termini are remarkably close together and do not lie in the substrate-binding pocket. The positions adopted by the chain termini likely differ from that adopted during cis cleavage to prevent self-inhibition of proteolytic activity, as in picornaviral proteinases (35Allaire M. Chernaia M.M. Malcolm B.A. James M.N. Nature. 1994; 369: 72-76Crossref PubMed Scopus (260) Google Scholar, 37Petersen J.F. Cherney M.M. Liebig H.D. Skern T. Kuechler E. James M.N. EMBO J. 1999; 18: 5463-5475Crossref PubMed Scopus (98) Google Scholar). Each terminus can be brought into the appropriate position for cis cleavage by a conformational rearrangement of the ten N-terminal residues or part of the C-terminal domain. In support of this possibility, the N-terminal six residues are disordered and residues 198–203 have higher than average temperature factors (in copies A and D) or are completely disordered (in copies B and C). One of the few hydrogen-bonding interactions fixing the position of the C terminus is the Asp-119:Arg-203 ion pair. Although Asp-119 is conserved in arteriviral proteinases, Arg-203 is not. Interestingly, Arg-4, which is disordered in all four copies of nsp4, is conserved among arteriviral proteinases. Even though the N-terminal six residues are disordered, the position adopted by Lys-7 indicates that Arg-4 may be able to interact with Asp-119 in an alternate conformation. When the C terminus adopts the conformation required for ciscleavage, the Asp-119:Arg-203 ion pair must be broken, thereby allowing Arg-4 to interact with Asp-119. Our crystallographic analysis of EAV nsp4 elucidates the structural basis of proteolytic processing in arteriviruses. Nsp4 adopts alternate conformations of the oxyanion hole, which may be a novel means of regulating proteolytic activity. The structure reveals how the C-terminal extension interacts with the chymotrypsin-like domains and suggests how it may mediate the formation of multiprotein complexes to control proteolytic processing pathways critical to the viral life cycle. Understanding the structural details of proteolytic processing in arteriviruses and other nidoviruses will allow for the design of novel antiviral agents.
Referência(s)