Exceptional Disfavor for Proline at the P+1 Position among AGC and CAMK Kinases Establishes Reciprocal Specificity between Them and the Proline-directed Kinases
2005; Elsevier BV; Volume: 280; Issue: 11 Linguagem: Inglês
10.1074/jbc.m413159200
ISSN1083-351X
AutoresGuozhi Zhu, Koichi Fujii, Natalya Belkina, Yin Liu, Michael N.G. James, Juan J. Herrero, Stephen Shaw,
Tópico(s)Microtubule and mitosis dynamics
ResumoTo precisely regulate critical signaling pathways, two kinases that phosphorylate distinct sites on the same protein substrate must have mutually exclusive specificity. Evolution could assure this by designing families of kinase such as basophilic kinases and proline-directed kinase with distinct peptide specificity; their reciprocal peptide specificity would have to be very complete, since recruitment of substrate allows phosphorylation of even rather poor phosphorylation sites in a protein. Here we report a powerful evolutionary strategy that assures distinct substrates for basophilic kinases (PKA, PKG and PKC (AGC) and calmodulin-dependent protein kinase (CAMK)) and proline-directed kinase, namely by the presence or absence of proline at the P+1 position in substrates. Analysis of degenerate and non-degenerate peptides by in vitro kinase assays reveals that proline at the P+1 position in substrates functions as a "veto" residue in substrate recognition by AGC and CAMK kinases. Furthermore, analysis of reported substrates of two typical basophilic kinases, protein kinase C and protein kinase A, shows the lowest occurrence of proline at the P+1 position. Analysis of crystal structures and sequence conservation provides a molecular basis for this disfavor and illustrate its generality. To precisely regulate critical signaling pathways, two kinases that phosphorylate distinct sites on the same protein substrate must have mutually exclusive specificity. Evolution could assure this by designing families of kinase such as basophilic kinases and proline-directed kinase with distinct peptide specificity; their reciprocal peptide specificity would have to be very complete, since recruitment of substrate allows phosphorylation of even rather poor phosphorylation sites in a protein. Here we report a powerful evolutionary strategy that assures distinct substrates for basophilic kinases (PKA, PKG and PKC (AGC) and calmodulin-dependent protein kinase (CAMK)) and proline-directed kinase, namely by the presence or absence of proline at the P+1 position in substrates. Analysis of degenerate and non-degenerate peptides by in vitro kinase assays reveals that proline at the P+1 position in substrates functions as a "veto" residue in substrate recognition by AGC and CAMK kinases. Furthermore, analysis of reported substrates of two typical basophilic kinases, protein kinase C and protein kinase A, shows the lowest occurrence of proline at the P+1 position. Analysis of crystal structures and sequence conservation provides a molecular basis for this disfavor and illustrate its generality. Phosphorylation is a prevalent modification in cells that controls many functions such as signaling transduction, proliferation and apoptosis. It is estimated that at least one-third of all proteins in eukaryotic cell are phosphorylated at any given time (1Mann M. Ong S.E. Gronborg M. Steen H. Jensen O.N. Pandey A. Trends Biotechnol. 2002; 20: 261-268Abstract Full Text Full Text PDF PubMed Scopus (796) Google Scholar, 2Zolnierowicz S. Bollen M. EMBO J. 2000; 19: 483-488Crossref PubMed Scopus (155) Google Scholar). More than 500 human protein kinases have been identified so far (3Manning G. Whyte D.B. Martinez R. Hunter T. Sudarsanam S. Science. 2002; 298: 1912-1934Crossref PubMed Scopus (6256) Google Scholar). A high degree of selectivity in substrate phosphorylation is necessary to maintain functional integrity of this very complicated signaling environment. Precision in phosphorylation is particularly critical when a substrate protein is phosphorylated at two (or more) phosphorylation sites, and those sites 1) are phosphorylated by distinct upstream kinases and 2) confer distinct properties on that substrate. To assure fidelity of signaling in this common situation, each upstream kinase must show high specificity by phosphorylating only the relevant phosphorylation site and not the inappropriate site(s). This requirement poses a major challenge in evolutionary design of kinase peptide specificity, since those upstream kinases are usually recruited to the substrate, and such recruitment can overcome much of the barrier provided by peptide specificity (4Zhu G. Liu Y. Shaw S. Cell Cycle. 2005; 4: 52-56Crossref PubMed Scopus (55) Google Scholar). These considerations raise the important issue as to what elements in kinase peptide specificity confer the strongest reciprocal specificity between kinases, i.e. which prevent one kinase from phosphorylating substrates phosphorylated by another. For Ser/Thr kinases, we propose that much of this reciprocal specificity is provided by the evolution of three broad classes (5Kreegipuu A. Blom N. Brunak S. Jarv J. FEBS Lett. 1998; 430: 45-50Crossref PubMed Scopus (112) Google Scholar): basophilic kinases that phosphorylate sites with clustered positive charges, acidophilic kinases that phosphorylate sites with clustered negative charges, and proline-directed kinases that phosphorylate sites in which Ser/Thr is followed immediately by a proline (i.e. proline at the P+1 position). This classification by peptide specificity corresponds generally with classification of Ser/Thr kinases based on sequence similarity. Based on studies of a sampling of family members, two large families of kinases appear to be largely included in the basophilic group: the 61 gene AGC 1The abbreviations used are: AGC, PKA, PKG and PKC; CAMK, calmodulin-dependent protein kinase; PK, protein kinase; PDB, protein data base. (PKA, PKG and PKC) family (6Songyang Z. Blechner S. Hoagland N. Hoekstra M.F. Piwnica-Worms H. Cantley L.C. Curr. Biol. 1994; 4: 973-982Abstract Full Text Full Text PDF PubMed Scopus (537) Google Scholar, 7Nishikawa K. Toker A. Johannes F.J. Songyang Z. Cantley L.C. J. Biol. Chem. 1997; 272: 952-960Abstract Full Text Full Text PDF PubMed Scopus (493) Google Scholar, 8Shabb J.B. Chem. Rev. 2001; 101: 2381-2411Crossref PubMed Scopus (283) Google Scholar) and the 83 gene CAMK (calmodulin-dependent protein kinase) family (5Kreegipuu A. Blom N. Brunak S. Jarv J. FEBS Lett. 1998; 430: 45-50Crossref PubMed Scopus (112) Google Scholar, 9Hudmon A. Schulman H. Biochem. J. 2002; 364: 593-611Crossref PubMed Scopus (468) Google Scholar, 10White R.R. Kwon Y.G. Taing M. Lawrence D.S. Edelman A.M. J. Biol. Chem. 1998; 273: 3166-3172Abstract Full Text Full Text PDF PubMed Scopus (121) Google Scholar). Proline-directed kinases such as GSK, CDK, and Erk belong to CMGC (CDK, MAPK, GSK3, CLK kinases) superfamily of kinases, which consists of 61 kinases (6Songyang Z. Blechner S. Hoagland N. Hoekstra M.F. Piwnica-Worms H. Cantley L.C. Curr. Biol. 1994; 4: 973-982Abstract Full Text Full Text PDF PubMed Scopus (537) Google Scholar, 11Clark-Lewis I. Sanghera J.S. Pelech S.L. J. Biol. Chem. 1991; 266: 15180-15184Abstract Full Text PDF PubMed Google Scholar, 12Hall F.L. Braun R.K. Mihara K. Fung Y.K. Berndt N. Carbonaro-Hall D.A. Vulliet P.R. J. Biol. Chem. 1991; 266: 17430-17440Abstract Full Text PDF PubMed Google Scholar, 13Songyang Z. Lu K.P. Kwon Y.T. Tsai L.H. Filhol O. Cochet C. Brickey D.A. Soderling T.R. Bartleson C. Graves D.J. DeMaggio A.J. Hoekstra M.F. Blenis J. Hunter T. Cantley L.C. Mol. Cell. Biol. 1996; 16: 6486-6493Crossref PubMed Scopus (489) Google Scholar). The acidophilic kinase group is much smaller, including, for example, casein kinase I and casein kinase II. The basophilic kinases and acidophilic kinases would be expected to be largely non-overlapping in peptide specificity; the electrostatic interactions that promote phosphorylations of basic sites by basophilic kinases will create an aversion for acidic sites (and vice versa). Moreover, proline-directed kinases will eschew substrates of the other two which lack proline at P+1. However, a missing element in the foregoing paradigm is one that prevents basophilic kinases or acidophilic kinases from phosphorylating sites preferred by proline-directed kinases. Such an element may be greater importance for basophilic kinases, since their preference for basic residues is shared in a limited fashion by some proline-directed kinases (e.g. Cdk1, Cdk2, Cdk5, Dyrk, GSK3) (5Kreegipuu A. Blom N. Brunak S. Jarv J. FEBS Lett. 1998; 430: 45-50Crossref PubMed Scopus (112) Google Scholar, 6Songyang Z. Blechner S. Hoagland N. Hoekstra M.F. Piwnica-Worms H. Cantley L.C. Curr. Biol. 1994; 4: 973-982Abstract Full Text Full Text PDF PubMed Scopus (537) Google Scholar, 13Songyang Z. Lu K.P. Kwon Y.T. Tsai L.H. Filhol O. Cochet C. Brickey D.A. Soderling T.R. Bartleson C. Graves D.J. DeMaggio A.J. Hoekstra M.F. Blenis J. Hunter T. Cantley L.C. Mol. Cell. Biol. 1996; 16: 6486-6493Crossref PubMed Scopus (489) Google Scholar, 14Himpel S. Tegge W. Frank R. Leder S. Joost H.G. Becker W. J. Biol. Chem. 2000; 275: 2431-2438Abstract Full Text Full Text PDF PubMed Scopus (185) Google Scholar). We have identified an evolutionary strategy widely used by basophilic kinases that addresses that problem. We find a disfavor for proline at the P+1 position among AGC and CAMK kinases. That disfavor is stronger than for any other residues at positions around the phosphorylation site and is unusually consistent among members of these kinase families. As a result, proline at the P+1 position serves as a "veto residue" for phosphorylation by these kinases; this strategy established by kinase evolution provides tight reciprocal specificity between basophilic and proline-directed kinases. Degenerate Peptides and Kinases—Biotinylated single sequence peptides and degenerate peptides were synthesized as C-terminal amides on Mimotopes (Clayton, Australia) SynPhase Rink amide acrylic-grafted polypropylene solid support (loading 7.5 μmol) using conventional Fmoc (N-(9-fluorenyl)methoxycarbonyl) chemistry as described previously (15Fujii K. Zhu G. Liu Y. Hallam J. Chen L. Herrero J. Shaw S. Proc. Natl. Acad. Sci. U. S. A. 2004; 101: 13744-13749Crossref PubMed Scopus (83) Google Scholar). Seven catalytically active kinases were used. PKC-δ, PKC-ζ, PKA, and PKG were purchased from Calbiochem, and PKD2 and Cdk2 were purchased from Upstate Biotechnology. A 6 × His-tagged construct corresponding to residues 615–874 of human PRK1 (GenBank™ accession number BC040061) was expressed in 293T cell by calcium phosphate transfection and the corresponding protein purified by nickel affinity chromotography. In Vitro Kinase Assay—Peptides were phosphorylated by in vitro kinase assay as described previously (15Fujii K. Zhu G. Liu Y. Hallam J. Chen L. Herrero J. Shaw S. Proc. Natl. Acad. Sci. U. S. A. 2004; 101: 13744-13749Crossref PubMed Scopus (83) Google Scholar). In brief in vitro phosphorylation in the presence of γ-32P-labeled ATP (in 100 μm cold ATP) was performed in 50 μl of solution under standardized conditions resulting in stoichiometry of phosphorylation less than 10%. After reaction termination, 50 pmol of substrate was transferred to streptavidin-coated plates and emissions counted after extensive washing. Kinase buffer for PKC-δ, PKC-ζ, PKD2, PKG, PKA, and PRK1 is 100 mm HEPES, 0.05% Triton X-100, 1 mm CaCl2, 20 mm MgCl2. For kinase PKC-δ, PKC-ζ, and PKD2, 0.2 mg/ml phosphatidylserine (Avanti Polar Lipids) and 100 ng/ml phorbol 12-myristate 13-acetate were added to the kinase buffer immediately before the assay. Kinase buffer for CAMK II is 25 mm HEPES, 10 mm MgCl2, 1 mm CaCl2, 8 μg/ml calmodulin, and 1 mm dithiothreitol, which was minimally modified from Madhavan (34Madhavan R. Jarrett H.W. Biochim. Biophys. Acta. 1999; 1434: 260-274Crossref PubMed Scopus (27) Google Scholar). Kinase buffer for Cdk2 is HEPES, 10 mm MgCl2, 1 ml/ml bovine serum albumin, 10 mm dithiothreitol, and protease inhibitor, which was minimally modified from Kaldis et al. (35Kaldis P. Cheng A. Solomon M.J. J. Biol. Chem. 2000; 275: 32578-32584Abstract Full Text Full Text PDF PubMed Scopus (18) Google Scholar). Data Analysis—Data were analyzed as described previously (15Fujii K. Zhu G. Liu Y. Hallam J. Chen L. Herrero J. Shaw S. Proc. Natl. Acad. Sci. U. S. A. 2004; 101: 13744-13749Crossref PubMed Scopus (83) Google Scholar). All results shown represent at least duplicate assays, and usually quadruplicate assays. Data analysis was largely done in Microsoft Excel spreadsheets. Functionality of embedded spreadsheet formulas was augmented by automation using Microsoft Visual Basic and data storage using Microsoft SQL Server. The PSSM (position-specific scoring matrix) Logo was generated using postscript files generated by Visual Basic code; some postscript code was adapted from Tom Schneider's "makelogo" 8.69 (www.bio.cam.ac.uk/cgi-bin/seqlogo/logo.cgi). During studies of PKC isoform specificity (15Fujii K. Zhu G. Liu Y. Hallam J. Chen L. Herrero J. Shaw S. Proc. Natl. Acad. Sci. U. S. A. 2004; 101: 13744-13749Crossref PubMed Scopus (83) Google Scholar) a strong disfavor for proline at the P+1 position was noted. Given the potential biological importance of P+1 proline disfavor, analysis was expanded to other AGC family members such as PKG (Fig. 1A). Markedly different levels of phosphorylation by PKG are observed between peptides that differ only by a single amino acid at their P+1 position. Since each peptide is a pool of about 1 billion individual peptides, the preference (or disfavor) observed must reflect a preference (or disfavor) of that kinase for many peptides in that huge pool. When Pro is present at the P+1 position, the amount of phosphorylation is much lower than with any other residue. Specifically, phosphorylation of peptide #13 (Pro at P+1) is about 1/10 of the average for all peptide pools. Thus, Pro must be unfavorable at the P+1 position for most of the phosphorylatable peptides in this degenerate peptide pool #13. We investigated whether the strong disfavor for Pro at P+1 was a characteristic shared with other kinases. Four additional members of the AGC kinase family (PKA, PKC-δ, PKC-ζ, and PRK1) were tested and each showed strong disfavor for Pro at P+1. One way to visualize this comparison of kinases is with a PSSM Logo showing preferences of those kinases at the P+1 position (Fig. 1B). Consider first the results for PKG (Fig. 1B, left column, which corresponds to data shown in Fig. 1A). The parameter represented in the PSSM Logo is log2 of the ratio to mean for each residue; this mathematical conversion gives favored residue positive scores and disfavored residues negative scores. The PSSM Logo shows the residue most favored by PKG at the top (I) and the most disfavored residue (P) at the bottom. The columns corresponding to other kinases show that the strong disfavor for proline at the P+1 position is observed for each of the four other AGC kinases (PKA, PKC-δ, PKC-ζ, and PRK1). To broaden the analysis, we also examined two members of the CAMK family (CAMK II and PKD2); since they share the preference for Arg at P-3 (and P-2) (5Kreegipuu A. Blom N. Brunak S. Jarv J. FEBS Lett. 1998; 430: 45-50Crossref PubMed Scopus (112) Google Scholar, 6Songyang Z. Blechner S. Hoagland N. Hoekstra M.F. Piwnica-Worms H. Cantley L.C. Curr. Biol. 1994; 4: 973-982Abstract Full Text Full Text PDF PubMed Scopus (537) Google Scholar) they can be analyzed with the same sets of degenerate peptides. The results show that CAMK II and PKD2 also have strong disfavor for Pro at P+1 (Fig. 1B). One useful way to further characterize the proline disfavor at P+1 is to compare it with the preferences for residues in other positions near the phosphorylation site. Such a comparison is shown for the five basophilic kinases studied in further detail (Fig. 2). Consider first the results for PKG (Fig. 2, left panel). The distribution of scores is approximately centered on zero, and most residue preferences are close to zero. The outlier far to the left corresponds to Pro at P+1; its separation from the rest of the distribution demonstrates that this is a singular disfavor, i.e. distinctly stronger than the disfavor for any residue at any other position studied. At the opposite extreme, there are several outliers in the favored direction; these correspond to Arg and Lys at P-2 and P-3, as would be expected based on the known very strong PKG preference for those residues in those positions (5Kreegipuu A. Blom N. Brunak S. Jarv J. FEBS Lett. 1998; 430: 45-50Crossref PubMed Scopus (112) Google Scholar). Thus, disfavor for Pro at P+1 position is by far the single strongest disfavored residue for PKG. The frequency distribution of scores for these others kinases revealed that for each of them Pro at P+1 is the most disfavored residue among all residues at all positions tested. Scores for Pro at position P+1 range from -2.2 to -3.1; these scores are both more extreme than any other disfavored residue we have studied and more consistent between AGC/CAMK kinases than any other disfavored residue. The foregoing studies of in vitro peptide phosphorylation indicate that Pro at the P+1 position is a disfavored residue for many kinases of the AGC and CAMK family. If this characteristic is relevant not only to peptides but also to proteins, then the frequency of Pro at the P+1 position in reported substrates should be much less than the frequency of Pro in human proteins generally (6.5%). Such a comparison can be convincing only when a large number of substrates have been reported, as is the case for PKA and PKC. We have accumulated a list of 124 PKC substrates from review of the literature (15Fujii K. Zhu G. Liu Y. Hallam J. Chen L. Herrero J. Shaw S. Proc. Natl. Acad. Sci. U. S. A. 2004; 101: 13744-13749Crossref PubMed Scopus (83) Google Scholar); the list was not biased by our hypothesis of Pro disfavor, since it was accumulated before that hypothesis was formulated. Analysis of the number and position of Pro residues in these PKC substrates strongly supports the prediction. The overall frequency of Pro residues in these substrates (from position P-8 to P+8) was 4.8% and thus not very different from the proteome at large. Notably the frequency of Pro at P+1 was very low; only 1 of 124 reported substrates had Pro at P+1, which represents a frequency of only 0.8%. It is useful to convert results of residue frequency analysis in substrates into a log score by determining the log base 2 of (observed frequency/expected frequency); that log2 score for Pro at P+1 from reported PKC substrate analysis is -3.1, which is comparable with our score of -2.6 derived from degenerate peptide analysis. Analysis was expanded to determine proline favor/disfavor at residue position P-7 to P+6 (Fig. 3A). There is generally good agreement between the scores derived from residue frequency in reported substrates and scores derived from degenerate peptides. We performed a similar analysis on PKA sites compiled by Shabb (8Shabb J.B. Chem. Rev. 2001; 101: 2381-2411Crossref PubMed Scopus (283) Google Scholar); of those 136 were mammalian sites plausibly assigned to PKA based on the references cited. The frequency of Pro at P+1 in these substrates was 1.3%, which is again much lower than expected based on overall frequency in those substrates (5.5%) or human proteins generally (6.5%). As was seen with PKC, the position-specific scores for Pro frequency in substrates are in general agreement with scores from degenerate peptides (Fig. 3B). Thus, there is a very low Pro frequency at the P+1 position in reported PKC and PKA substrates and the P+1 position is the only one at which Pro disfavor occurs in both PKC and PKA. The potential importance of this disfavor is magnified when understood in the context of the virtual requirement for Pro at P+1 among proline-directed kinases (5Kreegipuu A. Blom N. Brunak S. Jarv J. FEBS Lett. 1998; 430: 45-50Crossref PubMed Scopus (112) Google Scholar, 11Clark-Lewis I. Sanghera J.S. Pelech S.L. J. Biol. Chem. 1991; 266: 15180-15184Abstract Full Text PDF PubMed Google Scholar, 12Hall F.L. Braun R.K. Mihara K. Fung Y.K. Berndt N. Carbonaro-Hall D.A. Vulliet P.R. J. Biol. Chem. 1991; 266: 17430-17440Abstract Full Text PDF PubMed Google Scholar), consisting primarily of the CMGC family of kinases, such as Cdk2. As noted in the introduction, at least five proline-directed kinases have a modest basophilic preference, which creates the potential risk of poor discrimination between basophilic AGC/CAMK kinases and proline-directed kinases. We therefore tested whether the disfavor for Pro at P+1 is an important element in discrimination of AGC/CAMK sites from sites recognized by proline-directed kinases. As a proof of principle we chose "proteomic" peptides corresponding to basic sites in the proteome that our algorithms suggested could be good substrates for basophilic kinases in all respects except for their Pro at P+1. We then synthesized peptides with wildtype sequence (Pro at P+1) or with substitutions to two alternative residues at the P+1 position. One proteomic peptide was particularly informative for comparison of PKC-δ versus Cdk2 (Fig. 4A). PKC-δ was efficient in phosphorylating the peptide whose P+1 residues was Phe (PKC-delta PSSM score 1.0, also see Fig. 1B); less efficient with an His at P+1 (PSSM score 0.1, also see Fig. 1B) and almost inactive with a Pro at P+1 (PSSM score -3.1, also see Fig. 1B). In contrast Cdk2 efficiently phosphorylated only the peptide with Pro at P+1 and neither of the others. Thus, the specificity differences between Cdk2 and PKC-δ are as much a result of the aversion of PKC-δ for Pro as the preference of Cdk2 for Pro. Analysis of the second proteomic peptide (and P+1 variations thereof) was particularly informative for comparison of CAMK II with Cdk2 (Fig. 4B). CAMK II can efficiently phosphorylate the peptide with Phe at P+1 but is very inefficient in phosphorylating the peptide with Pro at the P+1 position. Note that phosphorylation of the peptide with Pro at P+1 (the most disfavored residue at P+1 for CAMK II, Fig. 1B) is much less than the peptide having Gln at P+1 (the second most disfavored residue, Fig. 1B); thus Pro has an exceptional capacity to veto phosphorylation by CAMK II. Based on the foregoing studies, we introduce the term "proline-aversive" kinase to describe those with a strong disfavor for Pro at P+1 and thereby highlight the reciprocity of their specificity to that of proline-directed kinases. The structural basis for the difference between the proline-directed kinases and proline-aversive kinase can be understood based on differences between the solved structures of proline-directed kinases with bound peptide (e.g. PDB code 1QMZ) and AGC/CAMK kinases with bound peptide (e.g. PDB code 1ATP). One of the regions of the kinase domain that is most important for substrate recognition region is the activation loop, a flexible region of the kinase that 1) lies along the catalytic cleft, 2) is involved in binding C-terminal residues of substrate, and 3) is subject to extensive conformational regulation (16Cox S. Radzio-Andzelm E. Taylor S.S. Curr. Opin. Struct. Biol. 1994; 4: 893-901Crossref PubMed Scopus (128) Google Scholar, 17Johnson L.N. Noble M.E. Owen D.J. Cell. 1996; 85: 149-158Abstract Full Text Full Text PDF PubMed Scopus (1178) Google Scholar, 18Engh R.A. Bossemeyer D. Adv. Enzyme Regul. 2001; 41: 121-149Crossref PubMed Scopus (38) Google Scholar). In the activation loop there is a critical residue which we refer to as the toggle residue whose conformation differs dramatically between proline-aversive and proline-directed kinases. In the two proline-aversive kinases with solved structure (AGK family kinase PKA and the CAMK member PHK) the toggle residue is a glycine, whose backbone carbonyl is oriented toward the catalytic cleft (Gly200 (G200, Fig. 5, A and B); that carbonyl facilitates binding of substrate by the formation of a H-bond with the backbone amide of the substrate P+1 residue (19Knighton D.R. Zheng J.H. Ten Eyck L.F. Xuong N.H. Taylor S.S. Sowadski J.M. Science. 1991; 253: 414-420Crossref PubMed Scopus (812) Google Scholar, 20Lowe E.D. Noble M.E. Skamnaki V.T. Oikonomakos N.G. Owen D.J. Johnson L.N. EMBO J. 1997; 16: 6646-6658Crossref PubMed Scopus (175) Google Scholar). Substrates having Pro at P+1 cannot form that H-bond because the proline amide is not an H-bond donor. In contrast, in proline-directed kinases (Erk2, Cdk2, GSK3) the carbonyl group of their toggle residue is oriented away from the catalytic cleft (21Bax B. Carter P.S. Lewis C. Guy A.R. Bridges A. Tanner R. Pettman G. Mannix C. Culbert A.A. Brown M.J. Smith D.G. Reith A.D. Structure (Camb.). 2001; 9: 1143-1152Abstract Full Text Full Text PDF PubMed Scopus (184) Google Scholar, 22Bertrand J.A. Thieffine S. Vulpetti A. Cristiani C. Valsasina B. Knapp S. Kalisz H.M. Flocco M. J. Mol. Biol. 2003; 333: 393-407Crossref PubMed Scopus (274) Google Scholar, 23Brown N.R. Noble M.E. Endicott J.A. Johnson L.N. Nat Cell Biol. 1999; 1: 438-443Crossref PubMed Scopus (479) Google Scholar, 24Canagarajah B.J. Khokhlatchev A. Cobb M.H. Goldsmith E.J. Cell. 1997; 90: 859-869Abstract Full Text Full Text PDF PubMed Scopus (624) Google Scholar, 25Russo A.A. Jeffrey P.D. Pavletich N.P. Nat. Struct. Biol. 1996; 3: 696-700Crossref PubMed Scopus (505) Google Scholar) (Val164 (V164), Fig. 5, C and 5D). So binding of all residues except Pro at P+1 position will be disfavored because of an uncompensated H-bond from the main-chain P+1 amide of the substrate (23Brown N.R. Noble M.E. Endicott J.A. Johnson L.N. Nat Cell Biol. 1999; 1: 438-443Crossref PubMed Scopus (479) Google Scholar, 24Canagarajah B.J. Khokhlatchev A. Cobb M.H. Goldsmith E.J. Cell. 1997; 90: 859-869Abstract Full Text Full Text PDF PubMed Scopus (624) Google Scholar). The orientation of the toggle residue in proline-directed kinases is enforced by a strategically placed arginine (Arg169 (R169), Fig. 5D), which we refer to as a toggle-regulating residue. That arginine is located in the P+1 loop, which is a loop adjacent to the activation loop (that is firmly attached to the stable C-lobe of the kinase domain). That arginine forms an H-bond with the carbonyl of the toggle residue, thereby maintaining the usual conformation of that toggle residue (23Brown N.R. Noble M.E. Endicott J.A. Johnson L.N. Nat Cell Biol. 1999; 1: 438-443Crossref PubMed Scopus (479) Google Scholar, 24Canagarajah B.J. Khokhlatchev A. Cobb M.H. Goldsmith E.J. Cell. 1997; 90: 859-869Abstract Full Text Full Text PDF PubMed Scopus (624) Google Scholar) (Fig. 5D). In the foregoing model, the single residue most pivotal to creating the two distinct conformations of the toggle residue is this toggle-regulating arginine. Sequence analysis of human kinases reveals a striking pattern of conservation of the toggle-regulating residue (Table I). All 61 kinases of the CMGC family have arginine at that position. In contrast, none of the 324 kinases in the AGC, CAMK, STE, TK, and TKL families have arginine at that position; instead they have acyclic hydrophobic residues, which cannot provide an H-bond to the carbonyl of the toggle residue. Therefore, this amino acid position has the property of absolute conservation within the CMGC family and complete discordance from AGC/CAMK and other kinases. Recently Kannan and Neuwald (26Kannan N. Neuwald A.F. Protein Sci. 2004; 13: 2059-2077Crossref PubMed Scopus (115) Google Scholar) noted that this arginine residue, which they refer to as the "CMGC-arginine," is the most characteristic feature of the CMGC kinase family.Table 1Analysis of sequence conservation of the toggle-regulating residue and the toggle residue among human kinases Analysis is based on alignment of kinase domains of human kinases by Manning et al. (3Manning G. Whyte D.B. Martinez R. Hunter T. Sudarsanam S. Science. 2002; 298: 1912-1934Crossref PubMed Scopus (6256) Google Scholar).PositionsResidueKinase familyCMGCAGCCAMKSTETK + TKLToggle-regulatingILMV095599865Ala023003Thr001023Arg1000000ToggleAG2195879426Gln200020FILMV4634061No. of kinases61638148132 Open table in a new tab Analysis of sequence conservation of the toggle residue of the proline-aversive kinases is likewise informative. Nearly all CAMK and AGC kinases have A or G at this position (Table I). This makes sense from a structural perspective. Given the importance outlined above for hydrogen bond formation between the carbonyl group of the toggle residue and the P+1 backbone amide, it is essential that toggle residue have sufficient conformational flexibility to engage the backbone amide. This flexibility would be limited by spatial constraints imposed by a substantial side chain. To test these structural hypotheses we have mutationally swapped the toggle residue and toggle-regulating residues between Cdk2 and PKC-δ; however, both constructs are catalytically inactive (data not shown), indicating that these residues function normally only in the context of other kinase family-related residues. Nevertheless, these patterns of conservation of the toggle residue and toggle-regulating residue are fully consistent with their critical role in establishing a major difference between CMGC kinase and AGC/CAMK kinases. These results demonstrate that Pro at the P+1 position is a veto residue which virtually precludes phosphorylation by multiple AGC and CAMK kinases. As a result, Pro at P+1 provides tight control of reciprocal substrate specificity between proline-directed kinases and AGC/CAMK kinases. These findings for peptides are also relevant to proteins, since we find that Pro occurs at a much reduced frequency at the P+1 position among reported substrates for PKA and PKC. A structural explanation for this disfavor for proline is evident in crystal structures of AGC/CAMK in which a carbonyl group is strategically oriented toward the catalytic cleft. The ability of all substrate P+1 residues except proline to provide a backbone amide for H-bonding to that carbonyl gives such kinases a singular disfavor for proline. The foregoing analysis focuses on substrate binding but does not explicitly consider substrate positioning. A cardinal feature identified by structure/function analysis of protein kinases is that proper orientation of the activation loop and peptide is essential for catalysis (16Cox S. Radzio-Andzelm E. Taylor S.S. Curr. Opin. Struct. Biol. 1994; 4: 893-901Crossref PubMed Scopus (128) Google Scholar, 17Johnson L.N. Noble M.E. Owen D.J. Cell. 1996; 85: 149-158Abstract Full Text Full Text PDF PubMed Scopus (1178) Google Scholar, 18Engh R.A. Bossemeyer D. Adv. Enzyme Regul. 2001; 41: 121-149Crossref PubMed Scopus (38) Google Scholar). For proline-directed kinases substrate positioning is provided by docking of proline into a proline-selective pocket (Fig. 5C). Remarkably those residues (Val63 and Glu162) form this proline-selective pocket only when the activation loop is phosphorylated at Thr160 and the kinase domain is bound to cyclin A; in the unphosphorylated inactive complex the conformations of these two residues are very different, and there is no such tight and specific docking site for proline (27Jeffrey P.D. Russo A.A. Polyak K. Gibbs E. Hurwitz J. Massague J. Pavletich N.P. Nature. 1995; 376: 313-320Crossref PubMed Scopus (1216) Google Scholar). For proline-aversive kinases positioning is provided in a different manner; the H-bond between activation loop and the P+1 backbone amide positions the substrate into an orientation suitable for phosphorylation (Fig. 5A). This H-bond also stabilizes the P+1 residue in a preferred orientation in which its side chain is oriented into the hydrophobic P+1 pocket of PKA; this orientation is observed with isoleucine at P+1 in 1ATP and is consistent with the preference for hydrophobic residues at the P+1 position for PKA (6Songyang Z. Blechner S. Hoagland N. Hoekstra M.F. Piwnica-Worms H. Cantley L.C. Curr. Biol. 1994; 4: 973-982Abstract Full Text Full Text PDF PubMed Scopus (537) Google Scholar). But in the special case of peptides with proline at P+1, this positioning mechanism cannot operate for PKA (or other CAMK/AGC kinase). We hypothesize that proline at P+1, since it lacks the orientation/constraint of a H-bond, can find another binding site(s) in the wide hydrophobic pocket formed by Leu205, Leu198, or even Phe187 (Figs. 5A). Improper binding of the proline residue can be quite stable but fails to result in catalysis because of incorrect positioning of the substrate (and phosphorylatable residue) in the catalytic cleft or conformation changes resulting from ectopic binding of proline. Extensive in silico and experimental analysis will be required to properly test this hypothesis. Proline is unique among natural amino acids in having a substituted amide and resulting conformational rigidity. Those properties make it uniquely suited for particular structural roles. For example, it is an important element in the peptide sequences recognized by proline-specific binding domain such as SH3, WW, and EVH1 domains (28Bedford M.T. Sarbassova D. Xu J. Leder P. Yaffe M.B. J. Biol. Chem. 2000; 275: 10359-10369Abstract Full Text Full Text PDF PubMed Scopus (66) Google Scholar, 29Kay B.K. Williamson M.P. Sudol M. FASEB J. 2000; 14: 231-241Crossref PubMed Scopus (1044) Google Scholar, 30Zarrinpar A. Bhattacharyya R.P. Lim W.A. Science's STKE. 2003; 2003: RE8PubMed Google Scholar). The role of proline in favoring recognition by proline-directed kinases has been well established but not its reciprocal role as a veto residue at P+1 for basophilic kinases. Prior to the present study, a disfavor for proline at P+1 was noted for PKA (6Songyang Z. Blechner S. Hoagland N. Hoekstra M.F. Piwnica-Worms H. Cantley L.C. Curr. Biol. 1994; 4: 973-982Abstract Full Text Full Text PDF PubMed Scopus (537) Google Scholar) and γ-PAK (31Tuazon P.T. Spanos W.C. Gump E.L. Monnig C.A. Traugh J.A. Biochemistry. 1997; 36: 16059-16064Crossref PubMed Scopus (60) Google Scholar). The proline disfavor of γ-PAK is particularly interesting since it is a basophilic kinase that belongs to a distinct STE subfamily of kinase (3Manning G. Whyte D.B. Martinez R. Hunter T. Sudarsanam S. Science. 2002; 298: 1912-1934Crossref PubMed Scopus (6256) Google Scholar). It suggests that disfavor for proline at P+1 extends beyond the AGC and CAMK families, as is suggested by our sequence analysis tabulated in Table I. The present findings raise the possibility that disfavor for proline at other substrate positions may also contribute to fine peptide specificity of kinases. For example, PKC has a moderate disfavor for proline at the P+2 and P+3 positions evident both in our peptide specificity determination and in analyses of reported substrates (Fig. 3A). In contrast PKA shows disfavor for proline at P-2 by the same two criteria (Fig. 3B). Moreover, a strong disfavor for proline at P-4 has been reported for a plant kinase belonging to the SNRK subfamily of CAMK (32Huang J.Z. Huber S.C. Plant Cell Physiol. 2001; 42: 1079-1087Crossref PubMed Scopus (69) Google Scholar). These findings indicate that proline residues at substrate positions other than P+1 can contribute to fine peptide specificity differences among basophilic kinases. Detailed structural understanding of such preferences is not available; however, it is plausible that PKCs disfavor for prolines at P+2 and P+3 in substrates reflects disruption of anti-parallel β sheet formation between activation loop and substrate positions P+1 to P+3 that has been observed in the Ser/Thr kinase PHK (20Lowe E.D. Noble M.E. Skamnaki V.T. Oikonomakos N.G. Owen D.J. Johnson L.N. EMBO J. 1997; 16: 6646-6658Crossref PubMed Scopus (175) Google Scholar) and the tyrosine kinase IRK (33Hubbard S.R. EMBO J. 1997; 16: 5572-5581Crossref PubMed Scopus (784) Google Scholar). These results highlight the importance of disfavored residues in kinase peptide specificity. They demonstrate the singular importance of Pro at the P+1 position as a veto residue for diverse kinases of the AGC and CAMK families. The reciprocity of this pattern to proline-directed CMGC kinases constitutes a powerful evolutionary strategy to assure non-overlapping phosphorylation by these large and important families of kinases. We thank Oskar Fernandez-Capetillo, Larry Samelson, Jean-Baptiste Vincourt, and Ron Wange for discussion and critique of the manuscript. We thank Lin Chen, Holly Clark, Lubing Lian, and John Rodwell for facilitating these studies.
Referência(s)