Progress 01/01/23 to 12/31/23
Outputs Target Audience: During this reporting period, our project specifically targeted breeders, geneticists, researchers, and graduate students with an interest or active involvement in polyploid genetics and breeding. More specifically, we directly interacted with groups working on alfalfa and roses, which are both tetraploids, as well as Koronivia grass (Urochloa humidicola) and sweetpotatoes, which are hexaploids. Also, the open-source nature of our implementations potentially benefits a wide array of users, including those in educational settings, smaller breeding programs, or regions with limited resources. Our target audience includes both domestic and international personnel and institutions. Changes/Problems:
Nothing Reported
What opportunities for training and professional development has the project provided?During the reporting period, we engaged with graduate students who learned to utilize and interact with MAPpoly2 and QTLpoly. As mentioned in the previous report, NCSU Ph.D. candidate Simon Fraher continued to employ our tools within sweetpotato populations. With the aid of our resources, he initiated a project aimed at developing sweetpotato germplasm for the NCSU breeding program, focusing on enhancing skin robustness to minimize injury and reduce postharvest losses due to disease and desiccation. Additionally, another Ph.D. candidate at NCSU, Amelia Loeb, has worked with MAPpoly2, gaining an understanding of genetic mapping in multi-parental breeding populations. She is keen on contributing to projects at the intersection of breeding and bioinformatics, and we are committed to supporting her in these pursuits. Furthermore, postdoctoral researcher Gabriel Gesteira is significantly contributing to our team and assisting in developing GGSpoly. How have the results been disseminated to communities of interest?Our efforts have been shared through conferences, workshops, and scholarly articles. We were active participants at the 31st Plant and Animal Genome Conference, where the Principal Investigator (PD) and Co-Principal Investigator (Co-PD) delivered presentations related to our project. The PD also showcased MAPpoly2 at the Tools for Polyploids workshop, which attracted around 250 attendees, including breeders, geneticists, plant pathologists, researchers, and students from both academia and industry. Recordings of the presentations from the Tools for Polyploids workshop are accessible as videos on their website (MAPpoly2 presentation showing the alfalfa consensus map construction: https://youtu.be/eOT2mUXZJgc). Moreover, all the source codes for our project are openly available on our GitHub repository at https://github.com/mmollina. What do you plan to do during the next reporting period to accomplish the goals?Objectives 1 and 2: We are committed to further refining the algorithms crafted for objectives 1 and 2. Our goal is to enhance their efficiency and effectiveness, followed by submitting our advancements to peer-reviewed journals for validation and dissemination. Objective 3: Our focus will be on advancing GGSpoly, particularly in integrating the backend tools from objectives 1 and 2 with the user-friendly web interface. We plan to engage closely with our breeding partners to collect feedback and tailor the tool to meet their specific needs, ensuring it is both practical and valuable for their work.
Impacts What was accomplished under these goals?
In progressing toward Objective One, we have made advancements in haplotype reconstruction methodologies. We introduced MAPpoly2, a genetic mapping and haplotyping software for multi-population structures, which is now accessible on GitHub (refer to the "Other Products" section). This software streamlines the user experience by minimizing the number of decisions users need to make, thereby enhancing both speed and usability. Our theoretical advancements include the development of algorithms capable of integrating various ploidy levels across populations. This feature is particularly important when obtainingseedless varieties through odd-ploidy-level offspring. We conducted simulations (rpubs.com/mmollin/multi_family_simulation) with mixed ploidy levels and found that the algorithms perform robustly. In a collaborative effort with USDA/Cornell Breeding Insights, we succeeded in building interconnected tetraploid maps in alfalfa and reconstructing offspring's haplotypes (refer to "Products," full analysis available at https://rpubs.com/mmollin/tutorial_mappoly2). We also construct acomplex hexaploid map and derive offspring haplotypes in a modest-sized biparental population (60-50) using the DartTag mid-density target genotype platform. A paper detailing this work is being prepared. This achievement aligns with the challenging nature of this objective, as outlined in our proposal. Our current focus is on applying this novel algorithm to a tetraploid potato population, which includes a diverse range of full-sib sizes from 1 to 68 individuals, encompassing 16 parents and 399 offspring. While we anticipate a certain degree of information overlap within full-sib offspring, our findings so far suggest that the construction of consensus maps and haplotypes in breeding populations is feasible and promising. To share these significant theoretical and practical advancements, we started preparing a scientific publication, which we plan to submit in 2024. Regarding objective two (2), we have already implemented a multi-population QTL mapping method that will serve as a base to extend the genetic models between genotypes and phenotypes for all scenarios presented in objective 1. Regarding objective three (3), we have initiated the preliminary implementation of GGSpoly, accessible at https://gesteira.statgen.ncsu.edu/shiny/ggspoly/. This initial version lays the foundation for a comprehensive web application currently in the developmental phase. It serves as a framework upon which we are constructing a user-friendly computational tool in Shiny-R language designed specifically to assist breeders in making informed decisions about their breeding programs.
Publications
- Type:
Journal Articles
Status:
Published
Year Published:
2023
Citation:
da Costa Lima Moraes, A., Mollinari, M., Ferreira, R.C.U. et al. Advances in genomic characterization of Urochloa humidicola: exploring polyploid inheritance and apomixis. Theor Appl Genet 136, 238 (2023). https://doi.org/10.1007/s00122-023-04485-w
- Type:
Journal Articles
Status:
Published
Year Published:
2023
Citation:
Zhao, D., Mejia-Guerra, K. M., Mollinari, M., Samac, D., Irish, B., Heller-Uszynska, K., Beil, C. T. and Sheehan, M. J. (2023) A public mid-density genotyping platform for alfalfa (Medicago sativa L.), Genetic Resources, 4(8), pp. 5563. doi: 10.46265/genresj.EMOR6509.
- Type:
Journal Articles
Status:
Published
Year Published:
2023
Citation:
Cristiane Hayumi Taniguti, Lucas Mitsuo Taniguti, Rodrigo Rampazo Amadeu, Jeekin Lau, Gabriel de Siqueira Gesteira, Thiago de Paula Oliveira, Getulio Caixeta Ferreira, Guilherme da Silva Pereira, David Byrne, Marcelo Mollinari, Oscar Riera-Lizarazu, Antonio Augusto Franco Garcia, Developing best practices for genotyping-by-sequencing analysis in the construction of linkage maps, GigaScience, Volume 12, 2023, giad092, https://doi.org/10.1093/gigascience/giad092
- Type:
Conference Papers and Presentations
Status:
Published
Year Published:
2024
Citation:
Mollinari M., Gesteira G.S., Taniguti C.H., Pereira G.D.S., Zhao D., Wu S., Garcia A.A.F., Fei Z., Sheehan M., Byrne D., Riera-Lizarazu O., Yencho C., Zeng Z-B. Genomic Challenges in Polyploid Crops: An Overview of Progress so Far. In: International Plant & Animal Genome Conference 31. (Oral Presentation - Presenter) Available at https://tinyurl.com/2dp7ejtb
|
Progress 01/01/22 to 12/31/22
Outputs Target Audience:We aimed to engage breeders and graduate students with an interest or active involvement in polyploid breeding. The diverse plant species studied by these groups spanned blueberries, potatoes, sweetpotatoes, roses, yams, ornamental flowers, and blackberries, among others. Our target audience included participants from both domestic and international personnel and institutions. Changes/Problems: In our initial proposal, we identified the Mwanga Diversity Population (MDP) as the main resource for providing a representative sample of genetic data to be used in developing and testing our models and implementations. However, despite our collaborator's efforts to obtain good-quality DNA from the MDP materials and several adjustments to the genotyping protocol, we were unable to obtain the genotype information of the MDP progeny. As a result, testing the efficiency of the multi-parental model in that scenario has not been possible. This work is ongoing, and we expect to acquire the necessary data in the next reporting cycle. In the meantime, we have established collaborations with other breeding groups, including the International Potato Center (CIP), which provided us with a population that allowed us to construct a consensus genetic map and reconstruct the haplotypes in the progeny. Additionally, we are collaborating with the Texas A&M University Rose Breeding Genetic program, which supplied an interconnected rose population with varying ploidy levels, enabling us to test our procedures. We also decided to change the name of our user-friendly down-stream tool from DecisionPoly to GGSpoly (GGS for Genetic and Genomic Selection) - to be implemented in the next reporting cycles. What opportunities for training and professional development has the project provided?We are working in collaboration with several graduate students on projects directly involved with this project. For the graduate students, we could point out Simon Fraher, who successfully used our tools to identify a single major QTL that explained 70% of the variation in resistance to the nematode Meloidogyne enterolobii and currently developing markers to perform assisted breeding selection at the NCSU sweetpotato breeding program. We are collaborating closely with Gabriel Gesteira, a postdoctoral researcher at NCSU specializing in genotype-phenotype associations, and Cristiani Tanigiuti, a postdoctoral fellow at Texas A&M University, who is involved in multi-population haplotype construction and the development of user-friendly tools. How have the results been disseminated to communities of interest? Formal classroom instruction: We presented our work in guest lectures at NCSU, including Plant Cytogenetics in Plant Breeding, Breeding Asexually Propagated Crops, and Quantitative Genetics Theory and Methods. Workshops and conferences: We participated in the following events to share our knowledge and expertise: a. American Society for Horticultural Science Conference b. 30th Plant and Animal Genome Conference c. Tools for Polyploids workshop, attended by 238 participants d. Advancing Computing Skills in Plant Breeding workshop, organized by the NCSU plant breeding consortium, with an average of 30 attendees Extension and outreach: The talks from the Tools for Polyploids workshop were made available in video format on the Tools for Polyploids webpage at https://www.polyploids.org/2023recordedpresentations. In addition, all source codes for this project are freely available in the git repository https://github.com/mmollina. What do you plan to do during the next reporting period to accomplish the goals?Objective1 and 2. a) Continue to improve our phasing algorithm and explore possibilities to include pedigree with small family sizes and multiple generations. b) Finish the implementation of MAPpoly 2.0 and improve the implementation of MAPpoly-MP and QTLpoly-MP. Objective3. Leveraging our VIEWpoly visualization framework for polyploid genetic analysis (https://cran.r-project.org/package=viewpoly), we plan to commence the development and implementation of GGSpoly in the upcoming reporting cycle. GGSpoly is a computational tool intended to aid breeders in making informed short and long-term breeding decisions based on gathered and processed data about their breeding populations for diverse objectives. We anticipate collaborating closely with breeding groups such as the NCSU Sweetpotato and Potato Breeding Team led by Craig Yencho and the Texas A&M Rose Breeding Group led by Oscar Riera-Lizarazu and David Byrne in order to optimize GGSpoly's effectiveness in real-world breeding situations.
Impacts What was accomplished under these goals?
Objective 1: We successfully implemented mapping and haplotype reconstruction algorithms for polyploid partially inter-connected families, including diploid (2x), tetraploid (4x), and hexaploid (6x) families. Our current implementation allows for the integration of families with varying ploidy levels, enabling the simultaneous reconstruction of genetic maps and offspring haplotypes in single-generation populations given a sufficiently large number of individuals within full-sib populations. The performance of these algorithms was verified using real-world populations, such as a tetraploid potato in a partial diallel 3 x 9 parental configuration and three interconnected hexaploid sweetpotato populations, as well as several in silico simulations. Furthermore, we have optimized the algorithms employed in creating individual genetic maps for individual parents. Our preliminary evaluations demonstrated a substantial decrease in processing time, exhibiting a 10-fold improvement for tetraploid families and a 100-fold improvement for hexaploid families when tested with real data. This refinement is crucial for future implementation. The analysis of markers in single parents will act as an initiation algorithm for merging genetic maps in multiparental families in subsequent phases. Currently, we are implementing these functions in a new version of our mapping software MAPpoly which will incorporate several significant updates designed to enhance its capabilities and efficiency. Objective 2. We also made considerable advancements in connecting the haplotypes obtained in Objective 1 to phenotypes of interconnected populations. Our algorithm is an extension of the algorithm previously developed by Pereira et al. (2020). It relies on a random effect model which is applied in the context of the Multiple Interval Mapping (MIM) procedure. To evaluate and demonstrate the utility of our approach, we applied it to the same three interconnected hexaploid sweetpotato populations described in Objective 1, and also used several in silico simulations. With the posterior haplotype probabilities, we performed multiple QTL mapping and detected significant QTL for beta-carotene, dry matter, and starch content, with consistent allele effects across sub-populations. Objective 3: Nothing to report during this period.
Publications
- Type:
Conference Papers and Presentations
Status:
Published
Year Published:
2023
Citation:
Mollinari M. Computational Tools for Genomic Analysis in Polyploids in: 2022 ASHS Annual Conference (presenter) https://ashs.confex.com/ashs/2022/meetingapp.cgi/Paper/38640
- Type:
Conference Papers and Presentations
Status:
Published
Year Published:
2023
Citation:
Gesteira G.S., Mollinari M., Pereira G.da S., Olukolu B.A., Oloka B.M., Yencho C., Zeng Z-B. Genetic Mapping in Interconnected Hexaploid Sweetpotato Populations in: International Plant & Animal Genome 30 Conference (presenter) https://plan.core-apps.com/pag_2023/abstract/d8c2bdba-5e59-4f12-ab32-9e01d221a7a0
- Type:
Conference Papers and Presentations
Status:
Published
Year Published:
2023
Citation:
Wu S., Sun H., Kitavi M., Hamilton J.P., Gesteira G.S., Mollinari M., Zeng Z-B., Yencho C., Buell R., Fei Z. Advances in the Development of Chromosome-Scale Haplotype-Resolved Genome Assemblies of Hexaploid Sweetpotatoes in: International Plant & Animal Genome 30 Conference https://plan.core-apps.com/pag_2023/abstract/18b6ff6a-ca63-4074-ae87-e4e99fea4f11
|