Novel Allergen Structures with Tandem Amino Acid Repeats Derived from German and American Cockroach
1998; Elsevier BV; Volume: 273; Issue: 46 Linguagem: Inglês
10.1074/jbc.273.46.30801
ISSN1083-351X
AutoresAnna Pomés, Erik Melén, Lisa D. Vailes, Jacques Retief, L. Karla Arruda, Martin D. Chapman,
Tópico(s)Insect and Pesticide Research
ResumoCockroaches produce potent allergens that are an important cause of asthma. The two principal domiciliary cockroach species, Blattella germanica and Periplaneta americana, secrete major allergens, Bla g 1 and Per a 1. Here, we report the molecular cloning of three Bla g 1 cDNA clones, which showed 70% amino acid sequence identity with Per a 1. Plaque immunoassays with human IgE antibodies or murine monoclonal antibodies showed that these allergens were antigenically cross-reactive. The Bla g 1 sequences also showed homology to five previously undefined cockroach allergen sequences. An unusual feature of all these sequences was that they contained multiple tandem amino acid repeats of ∼100 amino acid residues. Between one and seven repeat units were identified by dot-plot matrix analysis. The sequences also showed homology to a mosquito protein involved in digestion (ANG12 precursor) and to mitochondrial energy transfer proteins. High levels of Bla g 1 were found in cockroach hindgut and proventriculus. Amino acid sequencing of natural Bla g 1 and Per a 1 suggested that these allergens are cleaved by trypsin-like enzymes following secretion into the digestive tract. The repeat sequences appear to have evolved by duplication of an ancestral amino acid domain, which may have arisen from the mitochondrial energy transfer proteins. Cockroaches produce potent allergens that are an important cause of asthma. The two principal domiciliary cockroach species, Blattella germanica and Periplaneta americana, secrete major allergens, Bla g 1 and Per a 1. Here, we report the molecular cloning of three Bla g 1 cDNA clones, which showed 70% amino acid sequence identity with Per a 1. Plaque immunoassays with human IgE antibodies or murine monoclonal antibodies showed that these allergens were antigenically cross-reactive. The Bla g 1 sequences also showed homology to five previously undefined cockroach allergen sequences. An unusual feature of all these sequences was that they contained multiple tandem amino acid repeats of ∼100 amino acid residues. Between one and seven repeat units were identified by dot-plot matrix analysis. The sequences also showed homology to a mosquito protein involved in digestion (ANG12 precursor) and to mitochondrial energy transfer proteins. High levels of Bla g 1 were found in cockroach hindgut and proventriculus. Amino acid sequencing of natural Bla g 1 and Per a 1 suggested that these allergens are cleaved by trypsin-like enzymes following secretion into the digestive tract. The repeat sequences appear to have evolved by duplication of an ancestral amino acid domain, which may have arisen from the mitochondrial energy transfer proteins. antibody monoclonal antibody 5-bromo-4-chloro-3-indolyl phosphate enzyme-linked immunosorbent assay endoplasmic reticulum. Inhalation of environmental allergens from pollens, mites, animal danders, insects, and fungi induces IgE antibody responses in ∼20% of the Western population. Immediate hypersensitivity reactions caused by IgE antibodies (Ab)1 occur in genetically predisposed individuals and may result in the development of allergic diseases such as rhinitis, asthma, and atopic dermatitis. Most allergens are soluble low molecular proteins or glycoproteins (5–50 kDa), which rapidly diffuse through mucosal surfaces where they cross-link IgE Ab bound to mast cells and basophils (1Chapman M.D. Roitt I.M. Delves P.J. Encyclopedia of Immunology. 2nd Ed. 1. Academic Press, London1998: 64-69Crossref Google Scholar). Over 200 allergen sequences have now been cloned. Allergens are usually single polypeptide chains and occasionally occur as homodimers (e.g. Alternaria, Alt a 1) or heterodimers (cat, Fel d 1). Sequence comparisons, structural studies, and assays of biological function have shown that allergens are a diverse group of proteins, including enzymes, ligand binding proteins, and structural and regulatory proteins (2Scheiner O. Kraft D. Allergy. 1995; 50: 384-391Crossref PubMed Scopus (133) Google Scholar, 3Stewart G.A. Thompson P.J. Clin. Exp. Allergy. 1996; 25: 1020-1044Crossref Scopus (128) Google Scholar). There is some evidence, especially for mite allergens, that proteolytic enzyme activity contributes to allergenicity (4Hewitt C.R.A. Brown A.P. Hart B.J. Pritchard D.I. J. Exp. Med. 1995; 182: 1537-1544Crossref PubMed Scopus (257) Google Scholar, 5Schultz O. Laing P. Sewell H.F. Shakib F. Eur. J. Immunol. 1995; 25: 3191-3194Crossref PubMed Scopus (144) Google Scholar). Aerodynamic particle size, allergen dose, and sensitization through the respiratory tract are also important determinants for IgE responses. Cockroaches produce potent allergens that are a common cause of asthma in United States cities (6Chapman M.D. Smith A.M. Vailes L.D. Arruda L.K. Allergy. 1997; 52: 374-379Crossref PubMed Scopus (30) Google Scholar, 7Call R.S. Smith T.F. Morris E. Chapman M.D. Platts-Mills T.A.E. J. Pediatr. 1992; 121: 862-866Abstract Full Text PDF PubMed Scopus (360) Google Scholar, 8Rosenstreich D.L. Eggleston P. Kattan M. Baker D. Slavin R.G. Gergen P. Mitchell H. McNiff-Mortimer K. Lynn H. Ownby D. Malveaux F. N. Engl. J. Med. 1997; 336: 1356-1363Crossref PubMed Scopus (1167) Google Scholar). Several allergens have been cloned from German cockroach (Blattella germanica), including Bla g 2 (aspartic protease); Bla g 4 (lipocalin); and Bla g 5 (glutathione transferase) (9Arruda L.K. Vailes L.D. Mann B.J. Shannon J. Fox J.W. Vedvick T.S. Hayden M.L. Chapman M.D. J. Biol. Chem. 1995; 270: 19563-19568Abstract Full Text Full Text PDF PubMed Scopus (157) Google Scholar, 10Arruda L.K. Vailes L.D. Hayden M.L. Benjamin D.C. Chapman M.D. J. Biol. Chem. 1995; 270: 31196-31201Abstract Full Text Full Text PDF PubMed Scopus (128) Google Scholar, 11Arruda L.K. Vailes L.D. Platts-Mills T.A.E. Hayden M.L. Chapman M.D. J. Biol. Chem. 1997; 272: 20907-20912Abstract Full Text Full Text PDF PubMed Scopus (132) Google Scholar). These allergens elicit IgE Ab responses in 50–80% of cockroach allergic patients and are B. germanica-specific (12Chapman M.D. Vailes L.D. Hayden M.L. Platts-Mills T.A.E. Arruda L.K. Kay A.B. Allergy and Allergic Diseases. Blackwell Science Ltd., Oxford1997: 942-951Google Scholar). Antigenically cross-reactive allergens have been identified in both German and American cockroach (Bla g 1 and Per a 1) but their structure was unknown (13Schou C. Lind P. Fernandez-Caldas E. Lockey R.F. Løwenstein H. J. Allergy Clin. Immunol. 1990; 86: 935-946Abstract Full Text PDF PubMed Scopus (92) Google Scholar, 14Pollart S.M. Mullins D.E. Vailes L.D. Hayden M.L. Platts-Mills T.A.E. Sutherland W.M. Chapman M.D. J. Allergy Clin. Immunol. 1991; 87: 511-521Abstract Full Text PDF PubMed Scopus (127) Google Scholar, 15Wu C.-H. Luo S.-F. Wong D.W. Allergy. 1997; 52: 411-416Crossref PubMed Scopus (22) Google Scholar). We recently cloned the cDNA encoding Per a 1 and here we report the sequences of three Bla g 1 cDNA clones (16Melén E. Vailes L.D. Pomés A. Arruda L.K. Chapman M.D. J. Allergy Clin. Immunol. 1998; 101: 156Google Scholar). A novel feature of these allergens, which has not previously been described among allergen structures, is that they are composed of several tandem repeats of ∼100 amino acid residues. The allergens show sequence homology to a mosquito protein associated with digestive function (ANG12 precursor), and the repeat sequences may have evolved from ancestral amino acid domains found in mitochondrial energy transfer proteins. A B. germanicacDNA library was prepared in the Uni-ZAPTM XR expression vector (Stratagene, La Jolla, CA) using 10 μg of mRNA isolated from adult cockroaches as described previously (9Arruda L.K. Vailes L.D. Mann B.J. Shannon J. Fox J.W. Vedvick T.S. Hayden M.L. Chapman M.D. J. Biol. Chem. 1995; 270: 19563-19568Abstract Full Text Full Text PDF PubMed Scopus (157) Google Scholar, 10Arruda L.K. Vailes L.D. Hayden M.L. Benjamin D.C. Chapman M.D. J. Biol. Chem. 1995; 270: 31196-31201Abstract Full Text Full Text PDF PubMed Scopus (128) Google Scholar). The library was screened with a murine monoclonal antibody (mAb) against Bla g 1, clone 10A6 (17Pollart S.M. Smith T.F. Morris E.C. Gelber L.E. Platts-Mills T.A.E. Chapman M.D. J. Allergy Clin. Immunol. 1991; 87: 505-510Abstract Full Text PDF PubMed Scopus (186) Google Scholar). Positive plaques were detected using alkaline phosphatase-labeled goat anti-mouse IgG (1:1000) (KPL, Gaithersburg, MD) followed by color development with BCIP/nitro blue tetrazolium (KPL, Gaithersburg, MD) (9Arruda L.K. Vailes L.D. Mann B.J. Shannon J. Fox J.W. Vedvick T.S. Hayden M.L. Chapman M.D. J. Biol. Chem. 1995; 270: 19563-19568Abstract Full Text Full Text PDF PubMed Scopus (157) Google Scholar, 10Arruda L.K. Vailes L.D. Hayden M.L. Benjamin D.C. Chapman M.D. J. Biol. Chem. 1995; 270: 31196-31201Abstract Full Text Full Text PDF PubMed Scopus (128) Google Scholar). Three positive clones (Bla g 1 clone 1, clone 2, and clone 3) were identified and further screened by plaque immunoassay for IgE Ab using 15 sera from cockroach-allergic patients. Nine sera were obtained from patients having detectable IgE Ab to Bla g 1 by radioimmunoassay; four patients had IgE Ab to cockroach, but not to Bla g 1; and two sera were from nonallergic controls. Positive plaques were detected using alkaline phosphatase-labeled goat anti-human IgE (1:500) (KPL). In addition, the Bla g 1 cDNA clones were screened against a panel of high performance liquid chromatography-purified mAb to cockroach allergens: Bla g 1 (mAb 10A6), Bla g 2 (mAb 8F4), and Per a 1 (mAb Per a 1–03, kindly provided by Drs. Henmar and Schou, ALK-ABELLO, Horsholm, Denmark) (17Pollart S.M. Smith T.F. Morris E.C. Gelber L.E. Platts-Mills T.A.E. Chapman M.D. J. Allergy Clin. Immunol. 1991; 87: 505-510Abstract Full Text PDF PubMed Scopus (186) Google Scholar). The mAb were used at 1–10 μg/ml for screening, and bound mAb was detected using alkaline phosphatase-labeled goat anti-mouse IgG Ab at 1/1000 dilution. The pBluescript phagemids containing cloned cDNA inserts were excised from the Uni-ZAPTM XR vector and plated with fresh Escherichia coli cells to produce colonies (ExAssist/SOLR System, Stratagene). The size of the inserts was assessed by double digestion with BamHI and KpnI. Double-stranded cDNA was isolated and sequenced using an ABI Prism 100 Model 377 automated DNA sequencer (Biomolecular Research Facility, University of Virginia). The nucleotide sequences of the three Bla g 1 cDNA clones were submitted to the GenBank with accession numbers AF072219 (Bla g 1.01 clone 1), AF072221(Bla g 1.01 clone 2), and AF072220(Bla g 1.02). 2These sequences have been scanned against the data base, and all sequences with significant relatedness to them identified and their accession numbers included. Sequences were analyzed using the GCG sequence analysis package (Wisconsin package, Oxford Molecular Group, Inc.). Protein sequences were compared with the GenBank, EBI, DDBJ, PDB, SwissProt, SPupdate, PIR data bases using FASTA and BLAST (18Pearson W.R. Lipman D.J. Proc. Natl. Acad. Sci. U. S. A. 1988; 85: 2444-2448Crossref PubMed Scopus (9328) Google Scholar,19Altschul S.F. Gish W. Miller W. Myers E.W. Lipman D.J. J. Mol. Biol. 1990; 215: 403-410Crossref PubMed Scopus (68368) Google Scholar). Sequence alignments were made with Clustal W (accessible via Internet site: http://www2.ebi.ac.uk/clustalw/) and GCG (Framealign, Gap) (20Thompson J.D. Higgins D.G. Gibson T.J. Nucleic Acids Res. 1994; 22: 4673-4680Crossref PubMed Scopus (54908) Google Scholar). The GCG package was used to search for motifs in the amino acid sequences and for dot-plot analysis of nucleotide or amino acid sequences. Transmembrane spanning domains in the amino acid sequences were predicted using the program TMpredict, which is based on a data base of membrane-spanning protein segments, TMbase (21Hofmann K. Stoffel W. Biol. Chem. Hoppe-Seyler. 1993; 374: 166Google Scholar). Prediction of protein localization sites was done using the program PSORT (22Nakai K. Kanehisa M. Genomics. 1992; 14: 897-911Crossref PubMed Scopus (1364) Google Scholar). Both programs are available at (http://www.isrec.isb-sib.ch/software/TMPRED_form.htmland http://psort.nibb.ac.jp/). Amino-terminal amino acid sequences of purified Bla g 1 and Per a 1 (kindly provided by Dr. Schou, ALK-ABELLO) were determined by Edman degradation using a gas phase sequencer (model 470-A, Applied Biosystems, Foster City, CA) in the Biomolecular Research Facility (University of Virginia). The first 10 and 47 N-terminal amino acids were obtained for Bla g 1 and Per a 1, respectively. Cockroaches were dissected and body parts identified according to the method of Bell (23Bell W.J. Chapman Hall The Laboratory Cockroach: Experiments in Cockroach Anatomy, Physiology, and Behavior. 1st Ed. University of Kansas, Lawrence, KS1981: 33-38Google Scholar). Tissues were homogenized with a Polytron homogenizer and extracted overnight in 0.5 ml of BBS at 4 °C. After centrifugation at 13,200 × g for 15 min, extracts were stored at –20 °C until assayed. Bla g 1 levels in cockroach tissues were measured by mAb ELISA (17Pollart S.M. Smith T.F. Morris E.C. Gelber L.E. Platts-Mills T.A.E. Chapman M.D. J. Allergy Clin. Immunol. 1991; 87: 505-510Abstract Full Text PDF PubMed Scopus (186) Google Scholar). Protein concentration was measured by Bradford assay using γ globulin as standard. Three cDNA clones encoding Bla g 1 were obtained by screening a B. germanica cDNA expression library with anti-Bla g 1 mAb, 10A6. The cDNA clones contained inserts of 1429, 1791, and 715 base pairs encoding 412, 492, and 188, respectively, amino acid fragments with calculated molecular masses of 45.8, 55.5, and 21.0 kDa, respectively, and isoelectric points of 4.31, 4.55, and 4.22, respectively. Bla g 1 clones 1 and 2 were isoallergens and showed 75% sequence identity (Fig. 1). Following the WHO/IUIS allergen nomenclature, the cDNAs were designated as Bla g 1.01 and Bla g 1.02. The third clone corresponded to a C-terminal fragment of clone 1, shown in bold in Fig. 1. IgE Ab reactivity to the three Bla g 1 clones, and to a recently isolated Per a 1 cDNA, was compared by plaque immunoassay (Fig. 2). The proportion of sera from nine patients allergic to Bla g 1 that recognized Bla g 1 clones 1, 2, and 3, and Per a 1 was 8/9 (89%), 9/9 (100%), 7/9 (78%), and 8/9 (89%), respectively. Four sera from patients allergic to Bla g 2, but not to Bla g 1, and two sera from nonallergic patients were used as controls and gave negative results. Monoclonal antibodies against Bla g 1 (10A6), Bla g 2 (8F4), and Per a 1 (Per a 1–03) were also tested; 10A6 and Per a 1–03 reacted strongly with Bla g 1 and Per a 1, whereas 8F4 gave no reaction (data not shown).Figure 2Plaque immunoassay showing the reactivity of IgE Ab from patients allergic to Bla g 1 (first 4 rows) and to cockroach allergens but not to Bla g 1 (last row), against fusion proteins produced by the three Bla g 1 cDNA clones: Bla g 1.01, Bla g 1.02, and Bla g 1.01 clone 2 (first three columns), and by Per a 1 clone (last column).View Large Image Figure ViewerDownload Hi-res image Download (PPT) Initially, we compared the sequence similarity of the three Bla g 1 clones with the antigenically related allergen fromP. americana, Per a 1 (16Melén E. Vailes L.D. Pomés A. Arruda L.K. Chapman M.D. J. Allergy Clin. Immunol. 1998; 101: 156Google Scholar). The results showed 70–72% sequence identity between Per a 1, Bla g 1.01, and Bla g 1.02 (Table I). The Per a 1 and the Bla g 1 sequences showed 32% homology to mosquito (Anopheles gambiae) ANG12 precursor protein (Fig. 3). The ANG12 precursor is found exclusively in the midgut of female mosquito and is induced following a blood meal.Table ISequence homology of Bla¦g¦1 cDNA clones with related cockroach allergens and ANG12 precursorBla¦g¦1.01Bla¦g¦1.01 (clone 2)Bla¦g¦1.02Bla¦g¦Bd90K (4058 base pairs)949574Per¦a¦1 (231 aa1-aaa, amino acids.)727070Cr-PII (274 aa)717071Cr-PII2 (446 aa)606460Cr-PII3 (395 aa)666764Cr-PII4 (228 aa)726970ANG12precursor (211 aa)3235321-a aa, amino acids. Open table in a new tab Further sequence comparisons revealed homology between the Bla g 1 sequences and other recently reported cockroach allergen sequences: Bla g Bd90K and four P. americana cDNA clones (24Helm R. Cockrell G. Stanley J.S. Brenner R.J. Burks W. Bannon G.A. J. Allergy Clin. Immunol. 1996; 98: 172-180Abstract Full Text Full Text PDF PubMed Scopus (55) Google Scholar, 25Wu C.-H. Wang N.M. Lee M.-F. Kao C.-Y.Y. Luo S.-F. J. Allergy Clin. Immunol. 1998; 101: 832-840Abstract Full Text Full Text PDF PubMed Scopus (37) Google Scholar) (Table I). A partial nucleotide sequence of Bla g Bd90K from B. germanica was previously published, and the full sequence is deposited in the GenBank. This long sequence (4058 base pairs) is untranslatable as a whole unit. However, Bla g 1.01 clearly aligns with the translatable part of Bla g Bd90K C terminus, including the sequence following the stop codon, and with other translated parts in different reading frames (Fig. 3). Translation of the N terminus in the third reading frame encodes for a methionine after the first five nucleotides, which presumably is the start of the open reading frame for Bla g Bd90K protein (Fig. 3). The four P. americana clones described as Cr-PII (GenBank accession numbers U69261, U69260, U69957, and U78970) encode for sequences of 274, 446, 395, and 228 amino acids with calculated molecular masses of 31, 51, 45 and 26 kDa, respectively. These clones show 60–72% amino acid sequence homology to Bla g 1 (Table I). One of the four cDNA clones (Cr-PII3) encodes for a protein which N terminus aligns with the N termini of Bla g Bd90K and ANG12 precursor proteins (Fig. 3, top panel). Together, the results of these sequence comparisons show an extended family of B. germanicaand P. americana allergen cDNAs encoding for proteins with deduced molecular masses of 20–90 kDa. A unique feature of the primary structure of Bla g 1.01 was the existence of multiple amino acid repeats, which have not previously been found in allergen sequences. Close inspection of the sequences revealed segments that were repeated every ∼100 amino acids and suggested that the sizes of the Bla g 1, Per a 1, and related cDNAs varied with the number of repeat sequences they contained. The repeats were compared using a dot-plot matrix analysis that plots a point file with the coordinates of the points in common between two sequences that are compared and a diagonal where two sequences are identical. Repeated fragments show as parallel lines above the diagonal. Dot-plot matrix analysis showed that there were four entire amino acid repeats in Bla g 1.01, five in Bla g 1.02, and two in Bla g 1.01 clone 2 and Per a 1 (Fig. 4). Each repeat is about 100 amino acids long. In Bla g 1.01, the first and third and the second and fourth repeats are identical (Fig. 5,panel A), and in Bla g 1.02, the first, third, and fifth and the second and fourth repeats are almost identical (Fig. 5,panel B).Figure 5Alignment of Bla g 1.01 (panel A) and Bla g 1.02 (panel B) entire repeats, and of Bla g 1.01 clone 2 with Bla g Bd90K translated repeat previously published by Helm et al. (panel C) (24Helm R. Cockrell G. Stanley J.S. Brenner R.J. Burks W. Bannon G.A. J. Allergy Clin. Immunol. 1996; 98: 172-180Abstract Full Text Full Text PDF PubMed Scopus (55) Google Scholar).View Large Image Figure ViewerDownload Hi-res image Download (PPT) The Bla g Bd90K sequence contained seven repeats of ∼576 nucleotides (192 amino acids) (Fig. 4, and Ref. 24Helm R. Cockrell G. Stanley J.S. Brenner R.J. Burks W. Bannon G.A. J. Allergy Clin. Immunol. 1996; 98: 172-180Abstract Full Text Full Text PDF PubMed Scopus (55) Google Scholar). A repeat in Bla g Bd90K corresponds to two different consecutive repeats of Bla g 1.01, which occur within Bla g 1.01 clone 2 (Fig. 5,panel C), and this clone encodes for the basic unit that constitutes the 90-kDa protein (Fig. 5, panel C). Dot-plot analysis for ANG12 precursor, a less closely related sequence, shows absence of the clear structure of 100 amino acids repeats. Remains of that structure are apparent as smaller repeated fragments (Fig. 4). The Bla g 1.01 and Bla g 1.02 sequences contained myristoylation, amidation, and phosphorylation sites, with no obvious N-linked glycosylation sites (Fig. 6). Myristoylation sites are localized at the beginning of each repeat. Bla g 1.01 also has a motif that is characteristic of mitochondrial energy transfer proteins (P-x-[DE]-x-[LIVAT]-[RK]-x-[LRH]-[LIVMFY]), which is present in Per a 1 and degenerates in Bla g 1.02, and the ANG12 precursor. Proteins belonging to this family are evolutionarily related membrane proteins with three repeated sequences, each containing two transmembrane domains. However, analysis of Bla g 1 using the program TMpredict showed that Bla g 1 is not a membrane protein. The program PSORT predicts that Bla g 1 and Bla g Bd90K proteins would be directed to the outside of the cell following the secretory pathway through the endoplasmic reticulum (ER). The N terminus of Bla g Bd90K contains an ER targeting signal (KLALIFLAFL) that would initiate transport across the ER (Fig. 6). Natural Bla g 1 and natural Per a 1 were partially sequenced by Edman degradation. The results showed that each natural allergen preparation contained three amino acid sequences. The N-terminal residues of each of these sequences were preceded by arginine residues (deduced from the cDNA sequence), suggesting that the natural allergen had been cleaved by trypsin-like enzymes (Fig. 6). The most abundant of the natural Bla g 1 sequences was located in the first and third Bla g 1.01 repeats, and the other two minority sequences were located in the second and third repeats. The results of these experiments suggest that Bla g 1 and Per a 1 are secreted in the cockroach digestive tract and cleaved by trypsin-like enzymes before being excreted. To localize sites of Bla g 1 production, cockroach tissues were dissected and assayed for Bla g 1 content by ELISA. The results showed that Bla g 1 was predominantly found in digestive organs, with highest amounts in the hindgut (633 units/mg protein), followed by the proventriculus (623 units/mg protein), and esophagus (334 units/mg protein). Previous studies established that Bla g 1 and Per a 1 are important allergens that elicit IgE Ab responses in ∼50% of cockroach allergic patients. They are the only allergens identified to date that show antigenic cross-reactivity between B. germanica and P. americana. In keeping with this, the cDNA cloning studies showed extensive amino acid sequence homology between Bla g 1 and Per a 1 (70–72% sequence identity), confirming that these are structurally related proteins. The three Bla g 1 cDNA clones encode the sequences of two isoallergens, Bla g 1.01 and Bla g 1.02, with 75% amino acid sequence identity. Antigenic cross-reactivity between bacterially expressed Bla g 1 and Per a 1 clones was confirmed by plaque immunoassay; IgE antibodies against Bla g 1 and the monoclonal anti-Bla g 1 antibody reacted strongly with Per a 1 fusion proteins, and, conversely, anti-Per a 1 mAb bound to Bla g 1 fusion proteins. The novel and unique feature of the Bla g 1 (and Per a 1) allergen sequences is that they contain multiple tandem repeats of ∼100 amino acid residues. This type of structure has not previously been described for any allergen but does occur in other proteins. An earlier report by Helm et al. described a long (4058 base pair) cockroach sequence with seven 576-nucleotide repeats (24Helm R. Cockrell G. Stanley J.S. Brenner R.J. Burks W. Bannon G.A. J. Allergy Clin. Immunol. 1996; 98: 172-180Abstract Full Text Full Text PDF PubMed Scopus (55) Google Scholar). However, the full sequence was untranslatable, and the identity of the putative ∼90 kDa protein encoded by the sequence could not be established. Our data clearly shows that Bla g 1, Per a 1, Bla g Bd90K, and the four recently described Cr-PII sequences (25Wu C.-H. Wang N.M. Lee M.-F. Kao C.-Y.Y. Luo S.-F. J. Allergy Clin. Immunol. 1998; 101: 832-840Abstract Full Text Full Text PDF PubMed Scopus (37) Google Scholar), form a family of structurally and antigenically related cockroach allergens, which differ in the number of amino acid repeats they contain. Collectively, these sequences belong to the "Group 1" allergens produced by B. germanica and P. americana. It also seems likely that these allergens occur in other cockroach species. Dot-plot analysis showed that two consecutive amino acid repeats from Bla g 1.01 corresponded to one of the Bla g Bd90K nucleotide repeats. Analysis of these repeat amino acid sequences suggests that they probably derived from an original domain of approximately 100 amino acids by duplication. Both repeats in the "duplex" would diverge in a way that they are recognized by dot-plot analysis as two amino acid repeats but the corresponding nucleotide sequence does not show any repeat. Despite their divergence during evolution, both repeats have conserved amino acids that may be important for biological function (26Bowie J.U. Reidhaar-Olson J.F. Lim W.A. Sauer R.T. Science. 1990; 247: 1306-1310Crossref PubMed Scopus (404) Google Scholar). The "duplex" would replicate up to seven times to form the long DNA with seven tandem repeats in the Bla g Bd90K nucleotide sequence. The related mosquito protein ANG12 precursor has lost the clear structure of 100 amino acid repeats, indicating that ANG12 precursor had degenerated from the original structure during evolution. This would be consistent with the fact that mosquitoes appeared later in evolution than cockroaches in which the repeat allergen structure was conserved. The similarity between Bla g 1 and mitochondrial energy transfer proteins is interesting, because these proteins share amino acid motifs in a structure involving repeat sequences. This raises the possibility that the allergen may have evolved from a primordial mitochondrial sequence. Several lines of evidence indicate that the Bla g Bd90K cDNA clone contains the open reading frame for Bla g Bd90K protein: (i) labeled Bla g Bd90K insert hybridized to a single-size mRNA of approximately the same size, 4kb (24Helm R. Cockrell G. Stanley J.S. Brenner R.J. Burks W. Bannon G.A. J. Allergy Clin. Immunol. 1996; 98: 172-180Abstract Full Text Full Text PDF PubMed Scopus (55) Google Scholar); (ii) there is a methionine in the third reading frame translation of the Bla g Bd90K N terminus; (iii) a 90-kDa protein was reported in cockroach extracts (24Helm R. Cockrell G. Stanley J.S. Brenner R.J. Burks W. Bannon G.A. J. Allergy Clin. Immunol. 1996; 98: 172-180Abstract Full Text Full Text PDF PubMed Scopus (55) Google Scholar); and (iv) the N terminus of the third reading frame translation of Bla g Bd90K aligned with the N termini of the related proteins Cr-PII3 and ANG12 precursor. Despite the evidence for production of a 90-kDa protein, a broad range of molecular masses (6–37 kDa) has previously been reported for purified Bla g 1 and Per a 1 and for cross-reacting IgE Ab binding bands in cockroach extracts (13Schou C. Lind P. Fernandez-Caldas E. Lockey R.F. Løwenstein H. J. Allergy Clin. Immunol. 1990; 86: 935-946Abstract Full Text PDF PubMed Scopus (92) Google Scholar, 14Pollart S.M. Mullins D.E. Vailes L.D. Hayden M.L. Platts-Mills T.A.E. Sutherland W.M. Chapman M.D. J. Allergy Clin. Immunol. 1991; 87: 511-521Abstract Full Text PDF PubMed Scopus (127) Google Scholar, 15Wu C.-H. Luo S.-F. Wong D.W. Allergy. 1997; 52: 411-416Crossref PubMed Scopus (22) Google Scholar, 24Helm R. Cockrell G. Stanley J.S. Brenner R.J. Burks W. Bannon G.A. J. Allergy Clin. Immunol. 1996; 98: 172-180Abstract Full Text Full Text PDF PubMed Scopus (55) Google Scholar). Natural Bla g 1 and Per a 1 appear to be produced following trypsin cleavage. A 90-kDa protein could be a precursor (like ANG12 precursor) that would be post-translationally modified and cleaved in certain sites such as myristoylation sites. The number of repeats contained in trypsin-cleaved fragments or in fragments derived from post-translational modifications would explain this broad range of molecular masses. If cockroach trypsin-like enzymes have the same characteristics as mosquito trypsin-like enzymes, they may have been co-purified with Bla g 1 and Per a 1 (27Graf R. Boehlen P. Briegel H. Experientia. 1991; 47: 603-609Crossref PubMed Scopus (31) Google Scholar). Then, the trypsin cleavage of the natural allergen could take place not only in the digestive tract but also during and after purification. Plaque immunoassay with Bla g 1.01 clone 2 also showed that a 21-kDa unit, which contained two amino acid repeats, is sufficient to be recognized by Bla g 1- and Per a 1-specific IgE antibodies. Therefore, cloning the "duplex" represents cloning the basic allergenic unit of Bla g 1. Despite the presence in Bla g 1 of a signature motif for mitochondrial energy transfer membrane proteins, Bla g 1 is not a membrane protein. Rather, Bla g 1 is predicted to follow a secretory pathway through the endoplasmic reticulum, for which it has a targeting signal, to the outside of the cell and into the digestive tract. Accordingly, high levels of Bla g 1 were measured by ELISA in the proventriculus and hindgut, in keeping with a previous report of localization of cockroach allergens in gastrointestinal cells using IgE Ab (28Zwick H. Popp W. Sertl K. Rauscher H. Wanke T. J. Allergy Clin. Immunol. 1991; 87: 626-630Abstract Full Text PDF PubMed Scopus (25) Google Scholar). From the digestive tract, Bla g 1 would be excreted into the feces, where it becomes a potential allergen if inhaled by humans. The homology between Bla g 1 and mosquito ANG 12 suggests that Bla g 1 may have a digestive function. Mosquitos (A. gambiae) also produce trypsin-like proteins after taking blood meals (29Müller H.-M. Catteruccia F. Vizioli J. della Torre A. Crisanti A. Exp. Parasitol. 1995; 81: 371-385Crossref PubMed Scopus (95) Google Scholar, 30Müller H.-M. Crampton J.M. della Torre A. Sinden R. Crisanti A. EMBO J. 1993; 12: 2891-2900Crossref PubMed Scopus (153) Google Scholar). We speculate that similar enzymes may be produced in the cockroach digestive tract and give rise to the different molecular forms of the Bla g 1 and Per a 1 allergens. Proteases are found in commercial cockroach allergen extracts and can reduce allergenic potency when mixed with other extracts (31Esch R.E. Klein R Regulatory Control and Standardization of Allergenic Extracts. Gustav Fischer Verlag, Stuttgart, Germany1991: 171-179Google Scholar, 32Nelson H.S. Iklé J.D. Buchmeier A. J. Allergy Clin. Immunol. 1996; 98: 382-388Abstract Full Text Full Text PDF PubMed Scopus (105) Google Scholar). The precise digestive function of Bla g 1 and ANG 12 remains to be determined. In conclusion, the molecular cloning of Bla g 1 has revealed a new family of "Group 1" cockroach allergens. The novel feature of these allergens is the occurrence of multiple tandem amino acid repeats, which has not previously been described among allergen structures. We have recently expressed Per a 1 in the yeast, Pichia pastoris. Manipulation and overexpression of the sequences in appropriate vectors will facilitate structural and immunologic studies and the production of recombinant allergens for diagnostic and therapeutic purposes. We are grateful to Drs. John Shannon and Jay Fox of the Biomolecular Research Facility, University of Virginia, for support with amino acid and nucleotide sequencing. We also thank Wanda Harvey for secretarial help and Tracy Glime for technical support.
Referência(s)