Source: UNIVERSITY OF FLORIDA submitted to NRP
CITRUS GENOME SEQUENCE RESOURCES: DEVELOPMENT AND UTILIZATION TO CREATE HLB RESISTANT CULTIVARS
Sponsoring Institution
National Institute of Food and Agriculture
Project Status
COMPLETE
Funding Source
Reporting Frequency
Annual
Accession No.
1026184
Grant No.
(N/A)
Cumulative Award Amt.
(N/A)
Proposal No.
(N/A)
Multistate No.
(N/A)
Project Start Date
Apr 26, 2021
Project End Date
Apr 26, 2024
Grant Year
(N/A)
Program Code
[(N/A)]- (N/A)
Recipient Organization
UNIVERSITY OF FLORIDA
G022 MCCARTY HALL
GAINESVILLE,FL 32611
Performing Department
Citrus Research and Education Center
Non Technical Summary
Huanglongbing (HLB), also known as citrus greening, is the most destructive, devastating disease of citrus in the world. HLB affects all citrus species, and commercial citrus cultivars such as sweet oranges, grapefruit, and many mandarins, are most susceptible. The best long-term strategy, recognized and recommended by the National Academy of Sciences report on HLB, is to develop citrus scion and rootstock cultivars that can tolerate or resist HLB. Traditional breeding and the use of genome editing technologies, such as the CRISPR system, may be able to create highly tolerant or even resistant new citrus cultivars. However, successful implementation to achieve the goals will be substantially enhanced by having the highest quality genome sequences of both sensitive and tolerant types, to identify the most critical genes associated with sensitivity, tolerance, and resistance. This information can be used in traditional breeding to enable application of new breeding approaches such as genomic prediction and selection.It is important to understand that most commercially important citrus in Florida (sweet orange and grapefruit, for example) are derived through serial somatic mutations, and therefore traditional breeding approaches to incorporate HLB-tolerance cannot preserve their exact genotypes, nor exactly replicate their unique quality attributes. For this reason, gene editing is an attractive option because it can be used to change only the specific genes involved in susceptibility or resistance, while leaving all other attributes of the cultivar unaffected and intact.The research will focus first on producing genome sequence assemblies of 5 commercially significant types of citrus, three of which are sensitive (sweet orange, grapefruit, and Clementine mandarin) and two considered tolerant (Lisbon lemon and LB8-9 Sugar Belle® mandarin hybrid). As sequencing technology has made tremendous advances in recent years, it has become possible to produce chromosome scale assemblies of nearly complete sequence, thereby improving the effectiveness of gene editing experiments. Comparisons of these genomes to identify differences among them in terms of HLB responses will identify the important genes upon which editing ought to be focused. The project will also develop a catalog of these genes and define sites within the sequences to be targeted by CRISPR gene editing technology, to develop improved tolerance or perhaps even full resistance in these specific commercially important citrus types. As sequencing costs continue to decline and the quality of the output increases, these techniques will be used to develop genome sequence assemblies of other citrus species and their relatives to expand the genomic knowledge basis of citrus in general, to access additional sources of resistance to HLB and other limiting diseases, as well as providing the platform to explore genetic control of other important traits for citrus improvement, such as fruit quality. Such information can be utilized not only for gene editing for cultivar improvement, but also it will facilitate and greatly enhance the efficiency of developing valuable new cultivars through traditional breeding approaches. This is fertile ground for future and essential research, to generate and exploit new citrus genome sequence resources to maximize the likelihood of developing HLB resistant citrus trees.
Animal Health Component
25%
Research Effort Categories
Basic
60%
Applied
25%
Developmental
15%
Classification

Knowledge Area (KA)Subject of Investigation (SOI)Field of Science (FOS)Percent
20109991081100%
Goals / Objectives
The goal of this project is to develop the highest quality citrus genome sequence assembly resources for commercially and biologically important citrus cultivars and accessions using breakthrough sequencing and assembly technologies, and to utilize these resources to tools to support development of HLB-resistant cultivars by genome editing or other techniques. The specific objectives of this proposed project are:Sequence appropriate citrus genomes using the PacBio platform and produce chromosome scale assemblies through proximity ligation technology. Because of the commercial significance and great potential value of engineered HLB resistance, the work will focus on citrus cultivars that are HLB-susceptible but of great commercial value, such as sweet orange and grapefruit, as well as other cultivars or accessions that are HLB-tolerant, to produce contrasting genome assemblies.Sequence RNA transcript libraries from the same genomes using PacBio or other evolving state of the art technologies that can yield full-length RNA sequences, to aid annotation, assembly, etc.Resulting genome assemblies will be annotated, haplotypes phased, structural genomics among the genomes will be compared, to develop a set of genomics-based tools to effectively utilize sequence resources for gene discovery, gene sequence retrieval, genome editing, and ultimately the creation of HLB-tolerant or -resistant citrus cultivars.Identify full length, putative HLB-responsive gene sequences; compare sequences, transcripts and protein structures that may relate to host-pathogen interactions and disease development or its suppression, and predict sites for guide RNA designing and gene editing.
Project Methods
1) Sequence genomesPreparation of high molecular weight genomic DNA: High molecular weight genomic DNA will be extracted and the integrity of the extracted citrus genomic DNA will be determined by performing pulsed-field electrophoresis, and its purity by OD260/OD280 ratio. PacBio HiFi sequencing: Genomes will be sequenced using Pacific Biosciences HIFi technology. Sequencing will be performed in commercial labs offering the service to a target depth of at least 40-50x coverage.Assembling the genomes: State-of-the-art methods will be used to assemble the genomes. Based on our previous experiences the following aspects will be taken into consideration to produce high quality reference genomes. a. Heterozygosity. Intraspecific heterozygosity of most citrus species is ~0.2-0.3%, and between species divergence ~2%, providing sufficient inter-haplotype variation to allow the two haplotypes of a diploid cultivar to be assembled into separate contigs. Such haplotype-specific assemblies are possible when sequencing depth is sufficient; 85x sequencing redundancy allows for 40+x coverage per haplotype and haplotype separation. b. Assembly methodology. Genomes will be assembled using custom pipelines developed by our collaborators at UC Berkeley and the US Department of Energy Joint Genome Institute c. Long-range linking data. A limitation of shotgun sequencing approaches is that contiguity achieved by assembling overlapping reads is usually limited to megabase-scale. Citrus chromosomes are ~30 Mbp. To bridge this gap, chromatin conformation capture ("HiC") has been shown to provide useful intra-chromosomal contig linkages and has been successfully applied to several plant genomes.d. Chromosome-scale assembly, curation, and polishing. Chromosome-scale assemblies will be made, using appropriate state of the art assembly techniques. These assemblies will be further curated by manual inspection and editing using the JuiceBox toolkit, to ensure the robustness of the chromatin-guided assembly. After these steps, attempts will be made to close any remaining gaps, using PBJelly. Finally, Illumina short-read sequence will be combined with the high depth PacBio sequence to "polish" the assembly, identifying residual errors (based on disagreements with aligned sequences) and correcting them. The resulting accuracy will have fewer than one error per megabase. This polishing step will also flag problematic poorly assembled regions that cannot be corrected with existing methods, for the benefit of downstream users.2) Sequence transcriptomesa. Preparation of high-quality RNAs and transcriptome sequencing: Total RNA from leaf samples and other relevant tissue types will be extracted using Trizol reagent. mRNA will be purified using the polyA Spin mRNA Isolation Kit and quantified using a Qubit fluorometer with Qubit RNA HS Assay Kit. Reverse transcription will be performed using SuperScript II with Oligo dT primer. cDNA will be amplified using the standard Phusion PCR protocol. The library preparation for amplicon sequencing will be done using the manufacturer's protocol, to provide full-length reads for nearly all transcripts captured as mRNA.b. Transcriptome assembly: Raw data will be uploaded automatically by Metrichor software for cloud-based base-calling and quality filtering. Poretools will be used to extract fasta reads from Basecalled fast5 files. The majority of cDNAs should be sequenced as full length. Error correction will be carried out using both hybrid and de novo correction methods. Hybrid error correction can be carried out by proovread using short-read (2 x 100 bp paired-end reads) sequencing data previously generated on the Illumina HiSeq platform. Nanocorrect will be used for de novo error correction. Canu will be used to assemble nanopore reads, and racon will be used for polishing canu assemblies.3) Annotate genomes and comparative genome analysisGenome annotation: The citrus genomes will be annotated using a previously developed annotation pipeline that includes methods to identify transposable elements and other repetitive elements, using a de novo repeat finding algorithm RepeatModeler v1.0.5,to produce a comprehensive library of transposable elements. Manual review will ensure that genes from large families (e.g., R-genes) are not incorrectly identified as repetitive elements. Each genome will be masked, producing a GFF-formatted file output by RepeatMasker, that can be made publicly available through JGI's Phytozome. To predict protein-coding gene structures (i.e., exon-intron organization) and alternate splice forms for each gene, the long-read transcriptome resources generated (see above), as well as a range of short-read RNA-Seq data that captures a wide variety of tissues and includes samples with and without HLB infection, will be used. Using the methods outlined by Wu et al. (2014) the long- and short-read transcriptome-data are aligned, and protein sequences from nearby species are aligned to identify regions of putative gene homology. For citrus, most genes have homology evidence, and with the new transcriptome resources generated in this project ~75% of citrus genes will be supported by expression data.Standard functional analysis with KOG, KEGG, ENZYME, Pathway and the InterPro family of protein analysis tools, as well as gene family information, will be computed and made available through Phytozome.4) Extract full-length sequences for HLB-responsive gene sequences and predict candidate sites for genome editingPrevious meta-analyses of citrus genes reported to be responsive to CLas inoculation and HLB disease development in 22 citrus gene profiling experiments have identified nearly 3500 microarray probe sets that can detect differential gene expressions before and after CLas inoculation, between HLB-susceptible and -tolerant citrus genotypes, and/or among different stages of HLB development. These RNA-seq results may be reanalyzed against the new genome assemblies, to determine whether some previously identified potential causal genes are structural variants absent in the new assemblies. As new data emerge from genome-wide association studies, genetic linkage mapping, transcriptome profiling, gene coexpression network analysis, etc. the list will be narrowed to about 100 genes that may consistently respond to CLas inoculation and HLB disease development. These genes would be top candidates for CRISPR-based knocking down or knocking out for HLB resistance.To extract full-length sequences and understand their structures, DNA and protein sequences will be extracted from citrus genomes and compared. They will be annotated with PSI-blast and hmmsearch. The sequences will be aligned using Clustal Omega, MAFFT, and phylogentic trees will be built with MEGA, PHYLIP. Gene structure and transcript isoforms will also be compared. The expression of selected responsive genes will be confirmed using qPCR.The assembled chromosome-scale genome assemblies and the list of HLB-responsive genes can be used to assess specificity and off-target potential of potential gRNA designs, and the cutting efficiency of Cas9 enzyme directed by these gRNAs. Online computational tools can predict candidate cutting sites within the responsive genes; these candidate sites can be scored with the assistance of available web tools that are pre-loaded with citrus genome sequence. Selected cutting sites can be manually checked then to ensure high specificity and no or minimal off-target cleavage potential. Priorities will be given to those sites in the CDS region and close to an ATG start codon. When choices for a specific gene are limited, we can increase the on-target effects by introducing more than one cutting site. Primers for making CRISPR guide RNA for the cutting sites will be designed according to the CRISPR plasmids currently used.

Progress 04/26/21 to 09/30/21

Outputs
Target Audience:Extension and outreach to the citrus grower community, who will benefit from applications of the research information we are generating. Changes/Problems: Nothing Reported What opportunities for training and professional development has the project provided?Three post-doctoral research associates and one graduate student have had the opportunity to receive advanced training in new skill sets through mentoring on the project. How have the results been disseminated to communities of interest?As goals and objectives are achieved, we are making genomic resources available to the broad research community through various public portals, including The Citrus Genome Database, Phytozome, and NCBI. Researchers can access these resources and apply them to gene editing-based solutions to HLB. Further, citrus growers, the ultimate beneficiaries of the work conducted, have been kept abreast of progress and developments through the course of the work. What do you plan to do during the next reporting period to accomplish the goals?We will proceed to follow the approaches described in the project document.

Impacts
What was accomplished under these goals? HLB, also known as citrus greening, is the most destructive, devastating disease of citrus in the world. It was found in Florida in August 2005, and it poses an existential threat to the Florida citrus industry. Citrus is the most important agricultural industry in Florida, formerly generating a $9.3 billion annual economic impact on the state and providing as many as 80,000 jobs. In Florida, HLB is specifically associated with the bacterium 'Candidatus Liberibacter asiaticus' (CaLas) and is vectored by the Asian citrus psyllid (ACP, Diaphorina citri). HLB and ACP have already spread to Georgia, South Carolina, Louisiana, Texas, and California. Thus, HLB is truly posing a great threat to the entire U.S. citrus industry. HLB affects all citrus species, and commercial cultivars such as sweet oranges, grapefruit, and many mandarins, are most susceptible. The best long-term strategy, recommended by the National Academy of Sciences report on HLB, is to develop citrus cultivars that can tolerate or resist HLB. While there are no HLB-resistant cultivars, there are some selections and cultivars that are considered tolerant. Traditional breeding approaches to incorporate tolerance or resistance cannot preserve the unique quality attributes of most citrus fruit. Therefore, genetic engineering has been considered to develop tolerant or resistant cultivars, and any GMO projects have been pursued in different labs to achieve this goal. The unwillingness of consumers to accept GMO foods, however, has led to strong reluctance of the major beverage and food companies to embrace GMO citrus as a solution to HLB; in fact, all the major OJ products in the US carry the non-GMO label. An alternative approach, gene editing using CRISPR technology, has potential to address the concern over GMO citrus. If properly done no foreign DNA will remain in the new lines. The CRISPR-Cas9 system has been used successfully to modify citrus genes, and it is not currently regulated by USDA's Animal and Plant Health Inspection Service. To achieve effective and precise CRISPR gene editing, however, very high-quality genome sequence assemblies are required. The overarching goal of this project is to produce the highest quality genome assemblies possible of commercially important citrus cultivars, to enable efficient and effective gene editing, to produce HLB resistant plants that can allow the US industry to overcome the threat. Objective 1. Previously, we generated raw sequence data for Valencia orange (S, sensitive), Ruby Red grapefruit (S), Clementine mandarin (S), LB8-9 Sugar Belle® mandarin hybrid (T, tolerant), and Lisbon lemon (T) and preliminary assemblies and analyses were carried out. Because of reduced sequencing costs, we were able to enter additional important genomes into the pipeline beyond those originally proposed, including Carrizo citrange, sour orange, and Shekwasha (an important breeding parent for HLB tolerance); these also have now been sequenced and assembled. Now, we have completed Hi-C sequencing of two genomes, and by incorporating these data with PacBio sequence assembly from two of our target genomes we have produced improved chromosome scale assemblies. Minor assembly errors in repetitive DNA regions have been repaired, by genome alignment and comparison to the Poncirus assembly we recently published, resulting in polished assemblies of these two accessions. Initial preliminary characterizations of the presence/absence variations among the four phased haploid genomes have been made. Notable variations among these four involve transposable elements including MITES with characteristic sizes. A genome graph-based approach is being pursued, with the goal of building a genome graph containing the different types of structural variations that are now being identified. The availability of high-quality assemblies for the 3 basic species (C. medica, reticulata, and maxima) will allow thorough and complete characterization of large-scale structural variations (SVs: deletions, insertions, etc.) in our target genomes of commercial interest. SVs are a driving force for phenotypic diversity especially among somatic mutants (e.g., different oranges, grapefruits, etc.), and this type of information will become more important as we test different sweet orange mutants exhibiting enhanced tolerance of HLB and attempt to determine the underlying causes of such tolerance. In a related effort, our team recently compared 69 new east Asian genomes and other mainland Asian citrus to reveal a previously unrecognized wild sexual species native to Japan's Ryukyu Islands: C. ryukyuensis, which hybridized with an ancient east Asian mandarin to produce Shekwasha (shiikuwasha) mandarin, a powerful source of HLB tolerance in rootstock breeding. Further, by studying the genomes of C. ryukyuensis-derived hybrids and other citrus. we traced the origin and spread of apomixis (nucellar embryony, a trait that is required for seed propagation of citrus rootstocks) from Mangshanyeju wild mandarins in China a few million years ago through most of the commonly known contemporary citrus types (orange, grapefruit, lemon, etc.). This work resulted in deeper understanding and new genome-based tools that can be exploited for two critically important traits in citrus genetic improvement, nucellar embryony and most importantly HLB tolerance; the research was published in Nature Communications in July 2021 (see https://doi.org/10.1038/s41467-021-24653-0). Objective 2. We now have some transcriptome data for two of our target genomes, using the Illumina sequencing platform, and genome annotation (i.e., identify all the genes within the genome) is in progress. Samples have been prepared and collected, and plans are in place to proceed with Hi-C sequencing of the target genomes not yet completed and polished, as well as to generate the transcriptome data required for genome annotation, and further characterization of large-scale structural variations within and among the genomes upon which we are focused. We used the PacBio Sequel platform to sequence full-length gene transcripts in the leaf tissues of sweet orange and trifoliate orange and reconstructed their leaf transcriptomes. We identified novel full-length transcripts that were not present in the published reference transcriptomes. Objective 3. We have phased the two parental chromosomes of the target genomes mentioned above using Illumina short reads from citrons, pummelos and mandarins. Objective 4. We found in the sweet orange and trifoliate orange full-length leaf transcriptomes that some NBS-encoding genes (nucleotide binding site genes, one typical class of disease resistance genes in plants) underwent alternative splicing. One alternatively spliced NBS transcript expressed in HLB symptomatic leaf and fruit of sweet orange, and another alternatively spliced NBS transcript was differentially expressed in CLas-infected trifoliate orange samples, suggesting that isoforms of some NBS-encoding genes may play an important role in HLB tolerance of trifoliate orange, or alternatively HLB susceptibility in sweet orange. The new transcriptomes will be useful to identify candidate genes for disease resistance that have been missed in the published citrus genomes and transcriptomes. Finally, we have collaborated with the USDA Germplasm Repository for Citrus and prepared genomic DNA from more than 120 citrus species, varieties, and relatives for enrichment and sequencing of specifically targeted genes. Sequencing libraries have been prepared and are in que for processing. Results from this effort will provide important insights for the evolution and domestication of select genes that are important for citrus resistance or susceptibility to Huanglongbing and other diseases. The results will also provide template gene sequences for genome editing for Huanglongbing resistance.

Publications

  • Type: Journal Articles Status: Accepted Year Published: 2021 Citation: Wu, G.A., Sugimoto, C., Kinjo, H. Azama, C., Mitsube, F., Talon, M., Gmitter Jr.,F.G., and Rokhsar, D.S. Diversification of mandarin citrus by hybrid speciation and apomixis. Nat Commun 12, 4377 (2021). https://doi.org/10.1038/s41467-021-24653-0