Artigo Acesso aberto Revisado por pares

A Chromosome Scale Assembly of an Australian Puccinia striiformis f. sp. tritici Isolate of the PstS1 Lineage

2022; American Phytopathological Society; Volume: 35; Issue: 3 Linguagem: Inglês

10.1094/mpmi-09-21-0236-a

ISSN

1943-7706

Autores

Benjamin Schwessinger, Ashley Jones, Mustafa Albekaa, Yiheng Hu, Amy R. MacKenzie, Rita Tam, Ramawatar Nagar, Andrew Milgate, John P. Rathjen, Sambasivam Periyannan,

Tópico(s)

Chromosomal and Genetic Variations

Resumo

HomeMolecular Plant-Microbe Interactions®Vol. 35, No. 3A Chromosome Scale Assembly of an Australian Puccinia striiformis f. sp. tritici Isolate of the PstS1 Lineage PreviousNext RESOURCE ANNOUNCEMENT OPENOpen Access licenseA Chromosome Scale Assembly of an Australian Puccinia striiformis f. sp. tritici Isolate of the PstS1 LineageBenjamin Schwessinger, Ashley Jones, Mustafa Albekaa, Yiheng Hu, Amy Mackenzie, Rita Tam, Ramawatar Nagar, Andrew Milgate, John P. Rathjen, and Sambasivam PeriyannanBenjamin Schwessinger†Corresponding authors: B. Schwessinger; E-mail Address: benjamin.schwessinger@anu.edu.au, J. P. Rathjen; E-mail Address: john.rathjen@anu.edu.au, and S. Periyannan; E-mail Address: sambasivam.periyannan@anu.edu.auhttps://orcid.org/0000-0002-7194-2922Research School of Biology, The Australian National University, Canberra, Australia, Ashley JonesResearch School of Biology, The Australian National University, Canberra, Australia, Mustafa AlbekaaResearch School of Biology, The Australian National University, Canberra, Australia, Yiheng Huhttps://orcid.org/0000-0002-6513-9834Research School of Biology, The Australian National University, Canberra, AustraliaDepartment of Microbial Interactions, IMIT/ZMBP, University of Tübingen, Tübingen, Germany, Amy MackenzieResearch School of Biology, The Australian National University, Canberra, AustraliaCSIRO Agriculture and Food, Canberra, Australia, Rita TamResearch School of Biology, The Australian National University, Canberra, Australia, Ramawatar NagarResearch School of Biology, The Australian National University, Canberra, AustraliaNational Institute for Plant Biotechnology, Indian Council of Agricultural Research, New Delhi, India, Andrew MilgateNew South Wales Department of Primary Industries, Wagga Wagga Agricultural Institute, Wagga Wagga, NSW, Australia, John P. Rathjen†Corresponding authors: B. Schwessinger; E-mail Address: benjamin.schwessinger@anu.edu.au, J. P. Rathjen; E-mail Address: john.rathjen@anu.edu.au, and S. Periyannan; E-mail Address: sambasivam.periyannan@anu.edu.auResearch School of Biology, The Australian National University, Canberra, Australia, and Sambasivam Periyannan†Corresponding authors: B. Schwessinger; E-mail Address: benjamin.schwessinger@anu.edu.au, J. P. Rathjen; E-mail Address: john.rathjen@anu.edu.au, and S. Periyannan; E-mail Address: sambasivam.periyannan@anu.edu.auResearch School of Biology, The Australian National University, Canberra, AustraliaCSIRO Agriculture and Food, Canberra, Australia AffiliationsAuthors and Affiliations Benjamin Schwessinger1 † Ashley Jones1 Mustafa Albekaa1 Yiheng Hu1 2 Amy Mackenzie1 3 Rita Tam1 Ramawatar Nagar1 4 Andrew Milgate5 John P. Rathjen1 † Sambasivam Periyannan1 3 † 1Research School of Biology, The Australian National University, Canberra, Australia 2Department of Microbial Interactions, IMIT/ZMBP, University of Tübingen, Tübingen, Germany 3CSIRO Agriculture and Food, Canberra, Australia 4National Institute for Plant Biotechnology, Indian Council of Agricultural Research, New Delhi, India 5New South Wales Department of Primary Industries, Wagga Wagga Agricultural Institute, Wagga Wagga, NSW, Australia Published Online:15 Feb 2022https://doi.org/10.1094/MPMI-09-21-0236-AAboutSectionsPDF ToolsAdd to favoritesDownload CitationsTrack Citations ShareShare onFacebookTwitterLinked InRedditEmailWechat Genome AnnouncementPuccinia striiformis f. sp. tritici is a globally important pathogen of wheat. P. striiformis f. sp. tritici causes wheat stripe rust (or yellow rust) disease, leading to global losses estimated at $1 billion annually. Stripe rust disease threatens global wheat cultivation and food security (Schwessinger 2017; Wellings 2011). In Australia, pathotypes belonging to 134 E16 A+ (PstS1) lineages have dominated wheat fields for nearly two decades, since their first detection in 2002 (Ding et al. 2021). Introduced accidentally from North America through international travel, the lineage spread rapidly from the port of entry in Western Australia to eastern states within a year, completely replacing the predecessor lineage 104 E137 A−. The P. striiformis f. sp. tritici 134 E16 A+ lineage evolved stepwise through the generation of derivative pathotypes with additional virulence to the Yr10, Yr17, Yr24, and Yr27 resistance genes deployed widely in commercial wheat varieties. Subsequently, some of these mutant pathotypes adapted to commercial triticale varieties through virulence to the YrJ and YrT genes present in these wheat × rye hybrids (Ding et al. 2021). Despite the dominance of this lineage over 20 years, we lack knowledge of its genome content and architecture. Here, we announce the genome of a recent derivative of 134 E16 A+ named 134 E16 A+ 17+ 33+, collected from Wagga Wagga, New South Wales, Australia, during 2017. The new isolate has virulence to the Yr2, Yr6, Yr7, Yr8, Yr9, Yr17, Yr25, Yr33, and YrA genes but is avirulent to Yr1, Yr3, Yr4, Yr5, Yr10, Yr15, Yr24, Yr26, Yr27, Yr32, YrJ, YrT, and YrSp.We extracted high-molecular weight DNA from freshly harvested uredinospores using a modified cetrimonium bromide (cetyltrimethylammonium bromide) extraction protocol based on Arseneau et al. (2017), which is described in detail on protocols.io (Jones 2021). The extracted DNA underwent size selection for high-molecular weight fragments ≥20 kb using a PippinHT (Sage Science), prior to sequencing with the Oxford Nanopore Technologies MinION platform. A long-read native DNA sequencing library was prepared according to the manufacturer's protocol 1D genomic DNA by ligation (SQK-LSK109), and sequencing was performed on a MinION Mk1B using a FLO-MIN106D R9.4.1 flow cell, according to the manufacturer's instructions. The fast5 reads were base called to fastq with Guppy version 4.0.11 (high accuracy). The DNA was also used to generate a short-read library, for highly accurate sequencing on the Illumina NovaSeq 6000 platform (150-bp paired-end reads) (Jones 2018). These datasets were complemented with HiC 3D conformation capture sequencing of in vivo crosslinked chromatin of ungerminated uredinospores using the PhaseGenomics Microbial HiC kit and 150-bp paired-end reads on the Illumina platform. In addition, we extracted RNA for Illumina-based RNA sequencing (RNA-seq) experiments for genome annotation at four distinct spore and infection stages, as follows: ungerminated spores, germinated spores, and 6 and 9 days postinfection of wheat variety Morocco. RNA was extracted with the Qiagen RNAeasy kit according to the manufacturer's instructions, followed by Illumina sequencing performed by GENEWIZ. All sequencing data are available under the BioProjects PRJNA749614 and PRJNA757545 of NCBI.We obtained, in total, 5.7 Gb of Nanopore long reads (L50 of reads = 35.02 kb), which represents approximately 30× coverage per haplotype. In addition, we obtained 8 Gb of 150-bp paired-end Illumina short reads for polishing and 60 Gb of 150-bp paired-end Illumina reads for the HiC dataset. We assembled, curated, preliminary phased, and scaffolded the genome as described recently using a hybrid approach (Duan et al. 2021). Briefly, the initial assembly was generated with Nanopore long reads followed by long-read base-error correction applying the same dataset using a combination of Canu 2.0 (Koren et al. 2017), Racon 1.4.13 (Vaser et al. 2017), and medaka 1.0.3. We polished this initial long-read assembly with Illumina short reads using Pilon 1.22 (Walker et al. 2014), as previously described (Duan et al. 2021). We curated assembly errors based on HiC contact maps generated with HiC-Pro 2.11.1 (Servant et al. 2015). We removed contigs identified as bacterial contaminations or mitochondrial sequences based on BLAST homology searches, GC content, and unusual genome coverage of remapped long reads (Altschul et al. 1990; Li 2018; Quinlan and Hall 2010). We tentatively phased the curated contigs into nuclear genome haplotypes with NuclearPhaser using previous P. striiformis f. sp. tritici 104E gene models as seeds (Duan et al. 2021; Schwessinger et al. 2018). However, we anticipate that the assembly might contain residual phase switches which currently cannot be addressed with Guppy v4 Nanopore sequencing data (Duan et al. 2021). Each resulting nuclear genome complement was scaffolded independently with Salsa 2.0 (Ghurye et al. 2019). Resulting chromosomes were further curated manually (Duan et al. 2021). We defined centromeric regions based on bowtie signatures in the HiC contact maps. Our final assembly resulted in two tentative nuclear assigned chromosome sets of 18 chromosomes of approximately 80 Mb each and 166 unplaced contigs of approximately 9 Mb, with 1,736 genes located on these contigs (Table 1). The karyotype and centromere locations are shown in Figure 1. These assembly statistics are a huge improvement over previous haploid or partially phased assemblies that contained the (primary) assembly in more than 100 contigs representing a single chimeric haplotype (Cantu et al. 2011, 2013; Cuomo et al. 2017; Li et al. 2019; Schwessinger et al. 2018, 2020; Vasquez-Gross et al. 2020; Xia et al. 2018; Zheng et al. 2013). We expect that our genome assembly has residual phase switching errors between haplotypes A and B, as described previously for a P. triticina hybrid assembly using equivalent sequencing and assembly approaches (Duan et al. 2021). In future, P. striiformis f. sp. tritici assemblies will be further improved by using high per-base accuracy long reads with accuracies >99% such as PacBio HiFi or Nanopore Q20+.Table 1. Genome assembly and quality statisticsFeaturesaPst134 combinedHaplotype AHaplotype BUnplaced contigsGenome size (Mb)167.6780.45 (0.017% gap)78.18 (0.016% gap)9.04 (0% gap)Number of scaffolds2021818166N50/L5016/4.70 Mb8/4.77 Mb8/4.65 Mb52/69.87 kbNumber of genes34,07516,27216,0641,739Complete BUSCOs (%)97.693.093.67.9Duplicated BUSCOs (%)88.810.011.20.6Fragmented BUSCOs (%)1.84.43.71.8Missing BUSCOs (%)0.62.62.790.3aBUSCO = benchmarking universal single-copy orthologs.Table 1. Genome assembly and quality statisticsView as image HTML Fig. 1. Karyotype plot of Puccinia striiformis f. sp. tritici chromosomes The figure depicts the two sets of 18 chromosomes, with haplotype A colored in red and haplotype B colored in blue. Bowties indicate the approximate locations of centromeres as defined by HiC interaction patters. The left scale bar indicates the size of each chromosome in megabase pairs.Download as PowerPointWe performed preliminary repeat prediction and soft masking with RepeatModeler v2 and RepeatMasker v4.1.2 to avoid the negative impact of repeat regions on downstream gene prediction processes (Flynn et al. 2020). In total, we predicted that approximately 40% of the genome comprises repeats which, overall, is in line with repeat annotations in previous P. striiformis f. sp. tritici genomes (Cantu et al. 2011, 2013; Schwessinger et al. 2018, 2020; Zheng et al. 2013). We annotated our soft-masked chromosome-scale assembly with the fungal-specific gene prediction and annotation pipeline funannotate v1.8.1 (Palmer and Stajich 2020) using previous high-quality P. striiformis f. sp. tritici proteome datasets (Schwessinger et al. 2018, 2020) and our strain-specific RNA-seq data as hints. The overall total and the haplotype-specific numbers of predicted genes (Table 1) were consistent with previous long-read P. striiformis f. sp. tritici genome assemblies. We used benchmarking universal single-copy orthologs (BUSCO, v.3.0.2) (Simão et al. 2015) to assess the completeness of our gene predictions using the BUSCO basidiomycete v9 dataset as reference (Table 1). In total, we predicted 97.6% of all BUSCOs, which is the highest completeness rate of any P. striiformis f. sp. tritici assembly to date. Finally, we predicted candidate effector proteins on the 4,904 secreted proteins using EffectorP3 and obtained 2,876 predicted effectors. Of these, 1,817 and 1,059 were predicted to be located in the cytoplasm and apoplasm, respectively.We anticipate that this first chromosome-scale assembly for P. striiformis f. sp. tritici will be a useful resource for the community studying P. striiformis f. sp. tritici effector function, host adaptation, and the evolution of wheat rust fungi. Future assemblies will address the residual issue of phase switching and incorporate a long-read RNA-seq dataset to further improve gene prediction.Data AvailabilityThe draft chromosome scale assembly and gene annotation are available at the NCBI GenBank database under the BioProjects PRJNA749614 and PRJNA757545. All raw read datasets are associated with BioProjects PRJNA749614.AcknowledgmentsWe thank J. Sperschneider, J. Stajich, and J. Palmer for their assistance in genome assembly, curation, and distribution.The author(s) declare no conflict of interest.Literature CitedAltschul, S. F., Gish, W., Miller, W., Myers, E. W., and Lipman, D. J. 1990. Basic local alignment search tool. J. Mol. Biol. 215:403-410. https://doi.org/10.1016/S0022-2836(05)80360-2 Crossref, Medline, ISI, Google ScholarArseneau, J.-R., Steeves, R., and Laflamme, M. 2017. Modified low-salt CTAB extraction of high-quality DNA from contaminant-rich tissues. Mol. Ecol. Resour. 17:686-693. https://doi.org/10.1111/1755-0998.12616 Crossref, Medline, ISI, Google ScholarCantu, D., Govindarajulu, M., Kozik, A., Wang, M., Chen, X., Kojima, K. K., Jurka, J., Michelmore, R. W., and Dubcovsky, J. 2011. Next generation sequencing provides rapid access to the genome of Puccinia striiformis f. sp. tritici, the causal agent of wheat stripe rust. PLoS One 6:e24230. https://doi.org/10.1371/journal.pone.0024230 Crossref, Medline, ISI, Google ScholarCantu, D., Segovia, V., MacLean, D., Bayles, R., Chen, X., Kamoun, S., Dubcovsky, J., Saunders, D. G. O., and Uauy, C. 2013. Genome analyses of the wheat yellow (stripe) rust pathogen Puccinia striiformis f. sp. tritici reveal polymorphic and haustorial expressed secreted proteins as candidate effectors. BMC Genomics 14:270. https://doi.org/10.1186/1471-2164-14-270 Crossref, Medline, ISI, Google ScholarCuomo, C. A., Bakkeren, G., Khalil, H. B., Panwar, V., Joly, D., Linning, R., Sakthikumar, S., Song, X., Adiconis, X., Fan, L., Goldberg, J. M., Levin, J. Z., Young, S., Zeng, Q., Anikster, Y., Bruce, M., Wang, M., Yin, C., McCallum, B., Szabo, L. J., Hulbert, S., Chen, X., and Fellers, J. P. 2017. Comparative analysis highlights variable genome content of wheat rusts and divergence of the mating loci. G3 Genes Genomes Genet. 7:361-376. https://doi.org/10.1534/g3.116.032797 Crossref, ISI, Google ScholarDing, Y., Cuddy, W. S., Wellings, C. R., Zhang, P., Thach, T., Hovmoller, M. S., Qutob, D., Brar, G. S., Kutcher, H. R., and Park, R. F. 2021. Incursions of divergent genotypes, evolution of virulence and host jumps shape a continental clonal population of the stripe rust pathogen Puccinia striiformis. Mol. Ecol. 30:6566-6584. https://doi.org/10.1111/mec.16182 Crossref, Medline, ISI, Google ScholarDuan, H., Jones, A. W., Hewitt, T., Mackenzie, A., Hu, Y., Sharp, A., Lewis, D., Mago, R., Upadhyaya, N. M., Rathjen, J. P., Stone, E. A., Schwessinger, B., Figueroa, M., Dodds, P. N., Periyannan, S., and Sperschneider, J. 2021. Identification and correction of phase switches with Hi-C data in the Nanopore and HiFi chromosome-scale assemblies of the dikaryotic leaf rust fungus Puccinia triticina. bioRxiv. https://doi.org/10.1101/2021.04.28.441890 Medline, Google ScholarFlynn, J. M., Hubley, R., Goubert, C., Rosen, J., Clark, A. G., Feschotte, C., and Smit, A. F. 2020. RepeatModeler2 for automated genomic discovery of transposable element families. Proc. Natl. Acad. Sci. U.S.A. 117:9451-9457. https://doi.org/10.1073/pnas.1921046117 Crossref, Medline, ISI, Google ScholarGhurye, J., Rhie, A., Walenz, B. P., Schmitt, A., Selvaraj, S., Pop, M., Phillippy, A. M., and Koren, S. 2019. Integrating Hi-C links with assembly graphs for chromosome-scale assembly. PLOS Comput. Biol. 15:e1007273. https://doi.org/10.1371/journal.pcbi.1007273 Crossref, Medline, ISI, Google ScholarJones, A. 2018. Cost-conscious generation of multiplexed short-read DNA libraries for whole genome sequencing. protocols.io. https://doi.org/10.17504/protocols.io.unbevan Google ScholarJones, A. 2021. High-molecular weight DNA extraction from challenging fungi using CTAB for lysis and precipitation. protocols.io. https://doi.org/10.17504/protocols.io.bwr8pd9w Google ScholarKoren, S., Walenz, B. P., Berlin, K., Miller, J. R., Bergman, N. H., and Phillippy, A. M. 2017. Canu: Scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 27:722-736. https://doi.org/10.1101/gr.215087.116 Crossref, Medline, ISI, Google ScholarLi, H. 2018. Minimap2: Pairwise alignment for nucleotide sequences. Bioinformatics 34:3094-3100. https://doi.org/10.1093/bioinformatics/bty191 Crossref, Medline, ISI, Google ScholarLi, Y., Xia, C., Wang, M., Yin, C., and Chen, X. 2019. Genome sequence resource of a Puccinia striiformis isolate infecting wheatgrass. Phytopathology 109:1509-1512. https://doi.org/10.1094/PHYTO-02-19-0054-A Link, ISI, Google ScholarPalmer, J. M., and Stajich, J. 2020. Funannotate v1.8.1: Eukaryotic genome annotation. Zenodo. https://zenodo.org/record/4054262 Google ScholarQuinlan, A. R., and Hall, I. M. 2010. BEDTools: A flexible suite of utilities for comparing genomic features. Bioinformatics 26:841-842. https://doi.org/10.1093/bioinformatics/btq033 Crossref, Medline, ISI, Google ScholarSchwessinger, B. 2017. Fundamental wheat stripe rust research in the 21(st) century. New Phytol. 213:1625-1631. Crossref, Medline, ISI, Google ScholarSchwessinger, B., Chen, Y.-J., Tien, R., Vogt, J. K., Sperschneider, J., Nagar, R., McMullan, M., Sicheritz-Ponten, T., Sørensen, C. K., Hovmøller, M. S., Rathjen, J. P., and Justesen, A. F. 2020. Distinct life histories impact dikaryotic genome evolution in the rust fungus Puccinia striiformis causing stripe rust in wheat. Genome Biol. Evol. 12:597-617. https://doi.org/10.1093/gbe/evaa071 Crossref, Medline, ISI, Google ScholarSchwessinger, B., Sperschneider, J., Cuddy, W. S., Garnica, D. P., Miller, M. E., Taylor, J. M., Dodds, P. N., Figueroa, M., Park, R. F., and Rathjen, J. P. 2018. A near-complete haplotype-phased genome of the dikaryotic wheat stripe rust fungus Puccinia striiformis f. sp. tritici reveals high interhaplotype diversity. MBio 9:e02275-17. https://doi.org/10.1128/mBio.02275-17 Crossref, Medline, ISI, Google ScholarServant, N., Varoquaux, N., Lajoie, B. R., Viara, E., Chen, C.-J., Vert, J.-P., Heard, E., Dekker, J., and Barillot, E. 2015. HiC-Pro: An optimized and flexible pipeline for Hi-C data processing. Genome Biol. 16:259. https://doi.org/10.1186/s13059-015-0831-x Crossref, Medline, ISI, Google ScholarSimão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V., and Zdobnov, E. M. 2015. BUSCO: Assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31:3210-3212. https://doi.org/10.1093/bioinformatics/btv351 Crossref, Medline, ISI, Google ScholarVaser, R., Sović, I., Nagarajan, N., and Šikić, M. 2017. Fast and accurate de novo genome assembly from long uncorrected reads. Genome Res. 27:737-746. https://doi.org/10.1101/gr.214270.116 Crossref, Medline, ISI, Google ScholarVasquez-Gross, H., Kaur, S., Epstein, L., and Dubcovsky, J. 2020. A haplotype-phased genome of wheat stripe rust pathogen Puccinia striiformis f. sp. tritici, race PST-130 from the Western USA. PLoS One 15:e0238611. https://doi.org/10.1371/journal.pone.0238611 Crossref, Medline, ISI, Google ScholarWalker, B. J., Abeel, T., Shea, T., Priest, M., Abouelliel, A., Sakthikumar, S., Cuomo, C. A., Zeng, Q., Wortman, J., Young, S. K., and Earl, A. M. 2014. Pilon: An integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9:e112963. https://doi.org/10.1371/journal.pone.0112963 Crossref, Medline, ISI, Google ScholarWellings, C. R. 2011. Global status of stripe rust: A review of historical and current threats. Euphytica 179:129-141. Crossref, ISI, Google ScholarXia, C., Wang, M., Yin, C., Cornejo, O. E., Hulbert, S. H., and Chen, X. 2018. Genome sequence resources for the wheat stripe rust pathogen (Puccinia striiformis f. sp. tritici) and the barley stripe rust pathogen (Puccinia striiformis f. sp. hordei). Mol. Plant-Microbe Interact. 31:1117-1120. https://doi.org/10.1094/MPMI-04-18-0107-A Link, ISI, Google ScholarZheng, W., Huang, L., Huang, J., Wang, X., Chen, X., Zhao, J., Guo, J., Zhuang, H., Qiu, C., Liu, J., Liu, H., Huang, X., Pei, G., Zhan, G., Tang, C., Cheng, Y., Liu, M., Zhang, J., Zhao, Z., Zhang, S., Han, Q., Han, D., Zhang, H., Zhao, J., Gao, X., Wang, J., Ni, P., Dong, W., Yang, L., Yang, H., Xu, J.-R., Zhang, G., and Kang, Z. 2013. High genome heterozygosity and endemic genetic recombination in the wheat stripe rust fungus. Nat. Commun. 4:2673. https://doi.org/10.1038/ncomms3673 Crossref, Medline, ISI, Google ScholarFunding: Support was provided to S. Periyannan by the Australian Research Council (ARC) with funding through the Discovery Early Career Researcher Award (DE17010015) and to B. Schwessinger by the ARC Future Fellowship (FT180100024).The author(s) declare no conflict of interest. Copyright © 2022 The Author(s). This is an open access article distributed under the CC BY 4.0 International license.DetailsFiguresLiterature CitedRelated Vol. 35, No. 3 March 2022ISSN:0894-0282e-ISSN:1943-7706 Download Metrics Downloaded 1,372 times Article History Issue Date: 30 Mar 2022Published: 15 Feb 2022Accepted: 22 Nov 2021 Pages: 293-296 InformationCopyright © 2022 The Author(s).This is an open access article distributed under the CC BY 4.0 International license.FundingAustralian Research Council (ARC)Grant/Award Number: DE17010015Grant/Award Number: FT180100024Keywordsfungus–plant interactionsgenomicsmetabolomicsproteomicsThe author(s) declare no conflict of interest.PDF download

Referência(s)