Source: UNIVERSITY OF TENNESSEE submitted to NRP
USING GENOMIC TOOLS TO PROMOTE SUSTAINABLE CACAO AGROFORESTRY
Sponsoring Institution
National Institute of Food and Agriculture
Project Status
ACTIVE
Funding Source
Reporting Frequency
Annual
Accession No.
1032585
Grant No.
2024-67011-42911
Cumulative Award Amt.
$120,000.00
Proposal No.
2023-11570
Multistate No.
(N/A)
Project Start Date
Aug 15, 2024
Project End Date
Aug 14, 2026
Grant Year
2024
Program Code
[A7101]- AFRI Predoctoral Fellowships
Recipient Organization
UNIVERSITY OF TENNESSEE
2621 MORGAN CIR
KNOXVILLE,TN 37996-4540
Performing Department
(N/A)
Non Technical Summary
Chocolate, a favorite food loved by many people worldwide, is more than just an indulgence for the 50 million people worldwide who depend on it for income. Unfortunately, chocolate has a "dark side," with the average cacao farmer living well below the poverty line. Farmers' limited resources often lead to poor farming practices, which in turn lead to broader issues such as deteriorated local ecosystems and poorly managed genetic diversity in cultivated cacao trees. These problems also increase the likelihood that devastating crop diseases and climate change will negatively impact farmers' livelihoods. Fortunately, we don't need to choose between chocolate and the environment because cacao cultivation can be uniquely capable of supporting both resilient cacao production and biodiversity. My project will explore underutilized genetic diversity of cacao and shed light on cacao pollination to help farmers grow robust and valuable cacao with fewer costly inputs.I will sequence the genomes of wild cacao trees from a population in Belize to survey the genetic diversity found in a wild cacao population. These trees are highly valued for their fine flavor but are rare in cultivation. Analyses of the tree genomes will reveal which trees are pollinating each other, which will reveal how insects are moving pollen in a wild cacao forest. Lastly, I will compare insect abundance between wild and cultivated cacao trees, with a special focus on identifying potential cacao pollinators and their preferred habitats. A better understanding of the behavior of pollinators will guide farmers in accessing and improving ecosystem services to increase pollination, and thus pod production, while limiting negative effects on biodiversity.
Animal Health Component
(N/A)
Research Effort Categories
Basic
100%
Applied
(N/A)
Developmental
(N/A)
Classification

Knowledge Area (KA)Subject of Investigation (SOI)Field of Science (FOS)Percent
1252233108025%
1363095107025%
2022233108025%
2113110113025%
Goals / Objectives
My long-term, overarching goal is to preserve valuable genetic diversity within cacao and minimize the impacts of cacao production on biodiversity. To accomplish this goal, it is imperative to discover and characterize natural genetic diversity maintained in wild cacao populations and understand the impact of cacao agroforestry on biodiversity and crucial ecosystem services, such as pollination.Objectives:Population genetic study of a rare, wild Criollo cacao population in Belize.Estimate gene flow, pollen transport, and pollinator movement in a wild cacao population.Quantify differences in insect/pollinator diversity between agroforestry-grown cacao and forest cacao.Develop research methods and protocols.
Project Methods
Obj. 1:To study the evolutionary history of the Criollo cacao population at BFREE, the entire 1,153-acre property was searched for cacao trees and leaves of 280 out of 291 trees were sampled for DNA sequencing. We will sample from off-site nature reserves to place the BFREE trees into the greater context of cacao genetic variation and population structure in Belize. Although a large cacao SNP chip is available, a.k.a. the "chocolate chip", expected low genetic diversity in Criollo make it likely that this SNP chip will not produce enough useful, variable markers. To sample enough markers, a novel method of quantitative reduced representation DNA sequencing, OmeSeq-qRRS, will be used to generate a dataset of thousands of markers. Preliminary data collected using this method demonstrates it captures sufficient genetic variation to distinguish between closely related trees at BFREE. Leaf DNA extraction and sequencing will follow previously optimized protocols. SNPs will be used to determine population structure by PCAand STRUCTURE. Estimates of linkage disequilibrium (non-random association of alleles), Wright'sF-statistics (allele frequency distribution), and expected vs. observed heterozygositywill be used to determine demographic history, selection levels, and inbreeding levels. All data will be analyzed in R.Obj. 2:Seeds from 1,143 open-pollinated crosses of forest trees were germinated resulting in 471 collected seedlings. Leaves will be extracted and sequenced following the same method as Objective 1. SEQUOIA parentage analysis software in Rwill identify putative relationships and parent-offspring pairs. Distances between parents will be plotted to visualize the distribution of pollination distances. The preliminary finding that Criollo at BFREE is a single, panmictic population makes it ideal for the proposed parentage analyses, because the patterns of gene flow within this population will better represent how insect pollinators move through the forest and not be biased by biological barriers to reproduction. This study will be the first to use SNP markers for a cacao parentage study.Obj. 3:Traps that are advantageous for collecting different insect families will be placed in cacao agroforest and placed in undisturbed forest near wild cacao trees. These include SLAM (Sea Land and Air Malaise), yellow pan, and sticky traps. To identify insects associated with flowers, aspirators will be used to capture flying insects on/near cacao flowers, and sticky traps will be placed next to flower cushions. Whole flowers will be collected in DNA preservative buffer for sequencing environmental or eDNA left by floral visitors. Insects will be visually sorted to order/family level (depending on taxa), and bulk genomic DNA from each set of sorted insects will be extracted. Amplification of the COX1 gene will use arthropod-specific primers and each reaction will be uniquely barcoded for next-generation MiSeq sequencing (ca. 44-50 million paired-end reads, ~500,000 reads/sample). Insects aspirated from flowers or collected on sticky traps will be sequenced individually to create a database of COX1 sequences linked to specific floral visitors at the site. Before extraction, insects collected on sticky traps will be visually inspected under a microscope for pollen, and if pollen is detected, will be sequenced for cacao DNA sequences. If cacao DNA is detected, the insect will be identified as a potential pollinator. eDNA sequencing of flowers is a novel method that has never been used for detection of cacao pollinators, but it has potential to easily screen forests and orchards for the presence of pollinators. Several extraction methods will be compared for best amplification of arthropod eDNA. Diversity indices (Shannon diversity index and Beta diversity) will be calculated within and among habitats.

Progress 08/15/24 to 08/14/25

Outputs
Target Audience:To reach my target audiences, I am communicating research discoveries and methods by publishing my results and presenting at scientific conferences as well as communicating results to conservation managers who can distribute results to farmers in a way that helps them grow higher-value cacao. As a teaching assistant I am educating undergraduate students in the U.S. about where their food, especially chocolate, comes from and the value of farmers and the food supply chain. Additionally, I am communicating my results to the people who manage the cacao research station where the research was performed. They are working to create practical agroforestry techniques to improve cacao cultivation. BFREE is interested in Criollo cacao because it is a valuable andlocally importantcacao found in Belize that farmers could grow to increase their income, while providing a highly sought after cacao for the craft chocolate industry. Changes/Problems:Unexpected outcome: Parentage analysis has been complicated due to the population having potentially clonal population structure or high levels of inbreeding. These relationships are confounding kinship analyses, and steps are being taken to ensure the data is analyzed correctly given this unexpected finding. Once the population structure is determined, further parentage analyses will be performed. If the parents are clonal or highly related, it will be difficult to confidently identify pollinating trees, thus making the parentage analysis impossible. Although this would be unfortunate when viewed from the original objectives of the project, it would be an interesting discovery to report and would provide crucial information for breeding efforts that involve Criollo cacao and provide useful information on the evolutionary history of and past human utilization of cacao. Although these outcomes have been unexpected, they have not substantially impacted the current rate of expenditure to date. Although some seedlings have been sequenced and will be used for population genetic analysis, further sequencing of seedlings has been paused until we are confident that the sequencing will provide useful information. Travel to collect more samples has been delayed until the genetic structure of the population is finalized to ensure that funds are spent wisely to further our understanding of the subject. Any substantial changes will be communicated to USDA-NIFA. What opportunities for training and professional development has the project provided?I wrote and was awarded an internal proposal to hire and mentor a University of Tennessee Institute of Agriculture undergraduate student to assist with insect DNA extraction and sequencing. The undergraduate student is studying horticulture in the Plant Sciences Department and is considering pursuing a graduate degree in plant science. Her work in the lab is providing her an opportunity to discover if this area of research sparks her interest. She is also gaining experience that will be crucial for successfully obtaining future work opportunities. Additionally, I am gaining additional mentoring experience which will prepare me for future roles, all while receiving help to complete the project in a timely manner. I participated as a TA for an undergraduate class on chocolate, EPP123, Chocolate: Bean to Bar. This class educates students on where their food comes fromand the importance of agricultural supply chains. I also participated in teaching a lecture for an RNA-seq short course to undergraduate and graduate students. Additionally, I helped mentor a fellow University of Tennessee graduate student with research, writing, and preparing to defend his thesis. The topic of his research was on the fungal communities found in the same Criollo cacao trees that are being studied as part of my USDA-NIFA pre-doctoral project. He successfully defended his thesis and has acquired full-time employment in agriculture as a growing specialist in Monroe, Georgia. His research and connections have helped to disseminate information about both his and my research to farmers currently growing cacao in Belize. I participated as a peer reviewer for three journals. These included Ecology and Evolution, Open Life Sciences, and the Brazilian Archives of Biology and Technology. Two of the articles focused on cacao, and one of the articles was on plant genetics more broadly. I assisted with planning and coordinating the Tennessee Plant Research Center's monthly colloquia held at the University of Tennessee. This includes coordinating invited researchers to present their research and collaborate with University of Tennessee research groups. How have the results been disseminated to communities of interest?Current progress and results weredisseminated atconferences. I attended two conferences this summer to present my research and collaborate with other researchers. I first attended Evolution in Athens, GA. I then attended Botany in Palm Springs, CA. Discussions resulting from my presentations have provided valuable direction to my project from outside perspectives. Preliminary results have been communicated to the directors of the BFREE field station on several occasions. They are motivated to use information generated to implement any improvements in their efforts to study and promote cacao agroforestry. I am also communicating with the Ya'axché Conservation Trust. They can provide information about cacao research to local farmers. To increase public awareness about the research and educate the public on the issues surrounding cacao production, an article about the project was published in the University of Tennessee's magazine Life, Land, & Science. What do you plan to do during the next reporting period to accomplish the goals?Work to complete sequencing and data analysis for all cacao trees, seedlings, and insects is underway. Once analyses are complete, publications will be drafted and submitted to appropriate journals. Mentoring of U.S. undergraduate students in research and scientific presentation will continue, and further updates to BFREE directors and other invested groups will occur. Committee meetings will occur each year. I will be able to travel to one additional conference to present, and results will be presented locally in departmental seminars. Travel to Central America to collect more samples will be coordinated and scheduled.

Impacts
What was accomplished under these goals? Although many consumers in the U.S. may not know it, the majority of cacao we consume is grown internationally. This is because cacao is unable to be cultivated in the continental U.S. The craft chocolate industry is a fast-growing market for U.S. chocolate makers, and requires cacao beans from a variety of unique and valuable kinds of cacao. This project focuses on Criollo cacao, which is one type of chocolate prized in the craft industry. The major issues this project addresses are gaps in the understanding of Criollo cacao genetic variation, pollination, andthe impact of habitat on insects and pollinators. We are working to discover heritable variation found in wild cacao populations of fine cacao and see how agroforestry impacts pollinator abundance in the various habitats these cacao trees are found. This project is intended to provide information to farmers and resource managers about the extent of heritable variation in specific cacao types. It is also meant to provide information about which insects and pollinators are most common in different cacao habitats, which may help people optimize pollination in their local environment. Additionally, this project provides opportunities to reach students in a variety of settings and educate them on agriculture, scientific research, and the food supply chain. For each major goal in this project, the following activities, data, statistics, and discussion of results are as follows: 1. For the population genetic study of a rare, wild Criollo cacao population in Belize, leaves were collected, DNA extracted, and the whole genome sequenced. The resulting DNA sequencing data was generated and compiled with phenotypic data collected for each tree. Chloroplast DNA was extracted from the sequencing files and aligned. Preliminary population genetic analyses indicate that the population truly is Criollo cacao, which was expected. Within population variation appears to indicate two genetic clusters of trees with mixing between those two clusters. Although there is moderate genetic differentiation between those two clusters, all the trees at the station share the exact same chloroplast genome. Several possible reasons for all trees sharing the same chloroplast haplotype could be they were introduced from the same source or Criollo cacao lacks chloroplast variationmore broadly. Additionally, any historical selection for fruit traits would likely bottleneck chloroplast genomes, leading to potentially only one genotype. Further work will explore these possibilities. 2. For the study on pollinator mediated gene flow, seedlings were sprouted, collected, DNA extracted, and whole genome sequenced. Genomic DNA data was generated for 117 seedlings so far and seedlings were grouped with their "siblings" from the same pod. Additional seedlings are ready to sequence after results are analyzed for the first set of seedlings. In total,1,143 open pollinated seeds were planted, and 479 seedlings germinated and were collected for DNA extraction. This germination ratio is typical for forest trees. 3. To quantify insect variation across cacao habitats, insects were collected both manually with aspirators and passively with traps. All insects have been DNA extracted. COX1 Sanger sequences from 226 manually caught insects have been compared to the Barcode of Life Database (BOLD) for identification and have been prepared as a database to compare the passively caught bulk insects to. Trap caught insects are being prepared for COX1 metabarcoding. Results from Sanger sequencing include 226 COX1 sequences 620 bp long. Eight individuals' sequencing did not match a taxa in the BOLD Library. Successful identifications include insects from 8 orders and 38 families. The most common family was the Cecidomyiidae, or gall midges, which are a well-known presumed pollinator of cacao. Second most common was Ceratopogonidae, or biting midges, which are also a presumed pollinator. The third most common was Formicidae, or ants. Ants have recently been suggested to be significant pollinators of cacao. Discovery of three families of presumed pollinators in cacao suggests that local pollinators are common and that these taxa are likely the major pollinators in the region. Each of these pollinators have different life history strategies. Cecidomyiidae reproduce by creating galls on specific plant species. In contrast, Ceratopogonidae require a blood meal to reproduce. Knowing which pollinators may be more important in the region can help farmers facilitate appropriate habitat for pollinators. 4. Methods developed during the project so far include cacao leaf DNA extractions, which required careful troubleshooting to acquire clean DNA. Several kits and methods were investigated and both the QIAcube HT plant DNA extraction kit and Omega Bio-Tek E.Z.N.A. Plant DNA kit were successful for DNA extractions. Insect collection methods were developed by the Moulton Lab and were optimized for bulk extractions. Insect field collection methods are being compared to determine which tools reliably capture presumed pollinators. For individual insects collected near flowers, manual aspiration of insects resulted in more counts of presumed pollinators (from the families Ceratopogonidae, Cecidomyiidae, and Formicidae) when compared to sticky traps placed in flower cushions. Most insects captured with sticky traps were from Drosophilidae, which isn't known to pollinate cacao. This suggests that sticky traps placed in flower cushions are not an adequate method for studying pollinators, but manual aspiration is a successful method for capturing pollinators. Significant progress was made towards generating a finalized whole-genome sequencing data of the standing cacao at the site. Results are preliminary and will be finalized and published within the coming year. Additionally, all insects have been extracted for DNA and are ready for COX1 metabarcoding. Preliminary results provided by individually trapped insects provide insights into pollinator abundance at the research station.

Publications

  • Type: Conference Papers and Presentations Status: Accepted Year Published: 2025 Citation: H Brabazon, JA Marlin, L Ash, E Tzalam, E Ac, M Canti, M Staton, D Shoemaker. (2025). Genomic diversity of a subspontaneous population of cacao in Belize. Evolution Conference. Athens, Georgia.
  • Type: Conference Papers and Presentations Status: Accepted Year Published: 2025 Citation: H Brabazon, JA Marlin, L Ash, E Tzalam, E Ac, M Canti, M Staton, D Shoemaker. (2025). Genomic diversity of a subspontaneous Criollo cacao population in Belize. Botany Conference. Palm Springs, California.