Progress 02/03/21 to 09/30/21
Outputs Target Audience:The target audience is predominantly scientists, bioinformaticists, database developers, database curators, both national and international, as well as, U.S. industry stakeholders from the 25 crops covered in this project. Scientists have been engaged through peer-reviewed publications, workshops, presentations at scientific conferences and meetings, posts to mailing list, webinars, online surveys of users, emails about the website, and meetings with advisory groups. Bioinformaticists and database developers have been engaged through participation in monthly meetings on Tripal software, the release of new versions of Tripal, the tripal.info website, a two day hackathon, monthly Agricultural Biological Database (AgBioData) group meetings, peer-reviewed publications, and presentations at conferences and meetings. Industry stakeholders are being engaged through presentations and interactions at commodity group meetings and advisory committee meetings. Changes/Problems:
Nothing Reported
What opportunities for training and professional development has the project provided?Training opportunities for undergraduate, graduate, and postdoctoral researchers included participation and presentation at workshops, conferences, and meetings (mostly virtual), as well as manuscript writing and participation in short courses (includes DEI awareness training). How have the results been disseminated to communities of interest?Disseminating the results of this project has further increased the visibility and need for the resources provided by NRSP10. Between October 1, 2019 to January, 5 2022 this has been accomplished through 22 peer-reviewed publications; 32 presentations at 14 conferences: 2019 & 2021 North America Pulse Improvement Conferences, 2019 & 2021 Biennial Bean Improvement Cooperative Meetings, 2020 & 2021 Cotton Beltwide Conferences, 2020 & 2022 International Plant, and Animal Genome Conferences, 2020 International Rosaceae Genomics Conference, 2020 & 2021 American Society of Horticultural Science Conferences, 2021 Bioinformatics Open Source Conference, 2021 National Association of Plant Breeders Annual Meeting, 2021 International Vaccinium Symposium); Rosaceae and Vaccinium training workshops and Tripal codefests, several webinars, newsletters, and short "How to Videos". All of these presentations and webinars are available from the database websites and the NRSP10 project website. What do you plan to do during the next reporting period to accomplish the goals?In year 3 of this NRSP10 renewal project we will continue to provide user training materials, update the databases with data, make progress on our outstanding tool/module development goals, and use the information gathered from our sustainability analysis to develop sustainability plans for the databases which we plan to start implementingin year 4 of this 5 year project.
Impacts What was accomplished under these goals?
Progress inexpanding the online community databases for Rosaceae, citrus, cotton, pulse, and Vaccinium crops: We have continued to add genomic, genetic, and breeding data to our five databases and add/modify tools for increased functionality. For GDR, for example, thisincluded the addition of 120million genotypes, 52 genomes, 46 genetic maps, 800 K markers, 27K markers, 400 QTL, haplotypes,and conducting 1052 synteny and gene function analyses of the 85 published Rosaceae genomes.All the other databases had similar types of data added and analyses conducted. All our databases have work completed pages detailing data and tools added by date. Between10/01/19 to 02/15/22 usage of the databases was as follows:GDR- 78,719 users from 180 countries, with 2,723,689 pages viewed over 217,737 visits (22% U.S.);CottonGen- 54,879 users from 181 countries, 739,061 pages viewed over 112,781 visits (27%) U.S.);Pulse Crop Database- 17,877 users from 143 countries, 113,874 pages viewed over 23,984 visits (41% U.S.);Citrus Genome Database- 24,127 users from 167 countries, 286,005 pages viewed over 41,821 visits; andGDV- 12,350 users from 109 countries, 186,170 pages viewed over 21,990 vests (34% U.S.). For the first 5 years of the NRS10 project (10/01/14-9/30/19), 3,679,433 pages of these databases were accessed by users. In just the first 2 years of the NRSP10 renewal, over 4 Mdatabase pages have been accessed by the research community this support project serves and the databases have been cited in over 800 publications. Progress on Tripal module development: (a)Enhancing TripalMap: TripalMap,our genetic map comparison, and visualization viewer, now includes a genome comparison view with genomecorrespondence matrixand organism filtering for genomes (TripalMap v2.0). Chromosomes and linkage groups are linked by shared markers, allowing users to explore the genomic features around QTL, even when only the genetic position is available. The genes in chromosome view in TripalMap hyperlink to our genome viewer JBrowse. This has been implemented in the cotton, rosaceae, and vaccinium databases. (b)Enhancing the Breeding Information Management System:We continuedthe development of the Tripal Breeding Information Management System (BIMS), a program embedded in each of our five databases. New functionality addedincluded(1) Making BIMS compliant with BrAPI. This enables breeders to send data directly from any BrAPI compliant resource, such as the Field Book App, to BIMS, (2) Importingof public data from the community database into their private BIMS program, (3) Searching crosses, (4) Downloading stock site information (5) Changing the owner of the Breeding Program (e.g. when Breeders retire or leave a program), (5) Uploading haplotype data (6) Configuration page allows users to choose regular and custom properties to be used as phenotype search categories (7) Searching by phenotype and downloading genotype data (8) Loading and searching genotyping data (9) Created the BreedWithBIMS (https://www.breedwithbims.org)website so all the training manuals/webinars/presenations/publications for the BIMS programs are accessible from a central location for anyone using BIMS through the NRSP10 databases (or other databases using BIMS), (10) BIMS manual updated, (11) Webinar trainings held for five breeding programs (Rose at Texas A&M, strawberry/raspberry at Agriculture Canada, strawberry at U of New Hampshire, blueberry at U of Florida, soybean at U of Minnesota), and the Vaccinium CAP Project breeders,(12) BIMS presented at 11 conferences (13), BIMS 1.0.0 released to the Tripal community (https://gitlab.com/mainlabwsu/bims), (14) BIMS published in peer-reviewed journal, (15) Genotype, phenotype and location data for global performance predictions were gathered for peach, (15) BIMS templates and loaders updated to cover various source-dependent metadata, (16) Updated BIMS data loading system to shorten the loading time of big genotype data using a server-side object, (17) Developed a prototype Global prediction tool where users can load genotypic data of their germplasm of interest, and choose an environmental condition and trait name to view predicted trait values. (c)Develop a Tripal module for visualization of epigenomics data: Work is scheduled tobegin on this module in October 2022. Progress inProvidingAdditional Tools and Resources as Requestedby our Crop Communities: (a)At the request ofthe American Pomological Society and the ASHS we have developed an online tree fruit and nut register of cultivars database (https://www.fruitandnutlist.org/) which collates and standardizes information fromthe last 51 tree fruit and nut registers into searchable data for use by breeders and growers.(b) At the request of the US Plant Breeding Coordinating Committee (PBCC SCC-80) we have developed web pages (https://www.nrsp10.org/PBCC_about_us)within the NRSP10 websitefor PBCC information management and resource dissemination to the plant breeding community. (c) At the request of the National Association of Plant Breedingleadership team, we are designing and developinga new website for the NAPB using Drupal 9. Once this is complete to their satisfaction the PBCC site will be added and the entire site provided to the NAPB web hosting contractors. The old site was not easy to edit or manage. (d) Collection of data and access to information about US Plant Breeding Programs continues to be updated using tools available on the NRSP10 site (https://www.nrsp10.org/us-breeding-program). Progress onIdentifying Database Sustainability Options: Phoenix Bioinformatics has developed a database user survey to gauge the willingness of users/institutions/ to financially support their community database which is currently undergoing IRB compliance checking at WSU before being sent to the Rosaceae, Cotton and Vaccinium research community. They have contacted key industry representatives to discuss possible sustainability models and their willingnessto provide financial support toward sustainability. One non-federal funding organization has indicated its willingness to allow principal investigators to request funds in their research proposals to support their crop database. USDA ARS has submitted a funding request for $500,000 per year to be appropriated to support CottonGen. New federal grant proposals included funds to support data collection and curation of genomic, genetic, and breeding data for the citrus and vaccinium databases. New proposals are pending that will support the NRSP10 tree fruit, berry, nut, and pulse crops databases.On September 1, 2021, NSF funded ($600,000) a three-year Research Coordination Network project "RCN: Reimagining a Sustainable Data Network to Accelerate Agricultural Research and Discovery". The project team of NRSP10 helped lead this proposal and the AgBioData website (https://www.agbiodata.org) is developed and hosted through the NRSP10 project. A sustainability review and report will provide a roadmap for the future of GGB databases.
Publications
- Type:
Journal Articles
Status:
Published
Year Published:
2021
Citation:
Jung, S., Lee, T., Gasic, K., Campbell, B.T., Yu, J., Humann, J., Ru, S., Edge-Garza, D., Hough, H. and Main, D., 2021. The Breeding Information Management System (BIMS): an online resource for crop breeding. Database, baab054.
- Type:
Journal Articles
Status:
Published
Year Published:
2021
Citation:
Jung, S., Cheng, C.H., Buble, K., Lee, T., Humann, J., Yu, J., Crabb, J., Hough, H. and Main, D., 2021. Tripal MegaSearch: a tool for interactive and customizable query and download of big data. Database, baab023.
- Type:
Journal Articles
Status:
Published
Year Published:
2021
Citation:
Ru, S., Hardner, C., Evans, K., Main, D., Carter, P.A., Harshman, J., Sandefur, P., Edge-Garza, D. and Peace, C., 2021. Empirical evaluation of multi-trait DNA testing in an apple seedling population. Tree Genetics & Genomes, 17(1), pp.1-10.
- Type:
Journal Articles
Status:
Published
Year Published:
2021
Citation:
Yu, J., Jung, S., Cheng, C.H., Lee, T., Zheng, P., Buble, K., Crabb, J., Humann, J., Hough, H., Jones, D. and Campbell, J.T., 2021. CottonGen: The Community Database for Cotton Genomics, Genetics, and Breeding Research. Plants, 10(12), p.2805.
- Type:
Journal Articles
Status:
Published
Year Published:
2021
Citation:
Bari, M., Zheng, P., Viera, I., Worral, H., Szwiec, S., Ma, Y., Main, D., Coyne, C. J., McGee, R. J., & Bandillo, N. (2021). Harnessing Genetic Diversity in the USDA Pea Germplasm Collection Through Genomic Prediction. Frontiers in genetics, 12, 707754.
- Type:
Journal Articles
Status:
Published
Year Published:
2021
Citation:
Staton, M., Cannon, E., Sanderson, L.A., Wegrzyn, J., Anderson, T., Buehler, S., Cobo-Sim�n, I., Faaberg, K., Grau, E., Guignon, V. and Gunoskey, J., 2021. Tripal, a community update after 10 years of supporting open source, standards-based genetic, genomic and breeding databases. Briefings in bioinformatics, 22(6), p.bbab238.
- Type:
Journal Articles
Status:
Published
Year Published:
2021
Citation:
Rajendran, K., Coyne, C.J., Zheng, P., Saha, G., Main, D., Amin, N., Ma, Y., Kisha, T., Bett, K.E., McGee, R.J. and Kumar, S., 2021. Genetic diversity and GWAS of agronomic traits using an ICARDA lentil (Lens culinaris Medik.) Reference Plus collection. Plant Genetic Resources, 19(4), pp.279-288.
- Type:
Journal Articles
Status:
Published
Year Published:
2021
Citation:
Edge-Garza, D.A., Villamil-Castro, M., Main, D. and Hardner, C., 2021. Correction of database SNP accession numbers misassigned to SNPs of the 8K IRSC apple SNP array. Tree Genetics & Genomes, 17(2), pp.1-4.
- Type:
Journal Articles
Status:
Published
Year Published:
2021
Citation:
Hardner, C., Kumar, S., Main, D. and Peace, C., 2021. Global Genomic Prediction in Horticultural Crops: Promises, Progress, Challenges and Outlook. Frontiers of Agricultural Science and Engineering, 2021, 8(2): 353355.
- Type:
Journal Articles
Status:
Published
Year Published:
2021
Citation:
Callahan, A.M., Zhebentyayeva, T.N., Humann, J.L., Saski, C.A., Galimba, K.D., Georgi, L.L., Scorza, R., Main, D. and Dardick, C.D., 2021. Defining the HoneySweetinsertion event utilizing NextGen sequencing and a de novo genome assembly of plum (Prunus domestica). Horticulture research, 8(1), pp.1-13.
- Type:
Journal Articles
Status:
Submitted
Year Published:
2021
Citation:
Bett et al.,Genomic rearrangements have consequences for introgression breeding as revealed by genome assemblies of wild and cultivated lentil species. Nature Genetics (submitted).
- Type:
Journal Articles
Status:
Submitted
Year Published:
2021
Citation:
Edger et al. There and back again; historical perspective and future directions for Vaccinium breeding and research studies. Horticulture Research (submitted).
- Type:
Conference Papers and Presentations
Status:
Published
Year Published:
2021
Citation:
Gasic, K., Jung, S., Cheng, C.H., Lee, T., Zheng, P., Yu, J., Humann, J., Evans, K., Peace, C., DeVetter, L., Mcferson, J., Coe, M.L. and Main, D. (2021). Resources in the Genome Database for Rosaceae for peach research. Acta Hortic. 1304, 99-106
- Type:
Conference Papers and Presentations
Status:
Published
Year Published:
2021
Citation:
Yu, J., Jung, S., Cheng, C.H., Lee, T., Humann, J. Zheng, P., Crabb, J., Hough, H., Campbell, B.T., Udall, J., Jones, D., Main, D. 2021. CottonGen: A Central Data Repository and Analysis Resource for the Cotton Community. Proceedings of the National Cotton Council Beltwide Cotton Conference, January 5-7, 2021 (virtual).
- Type:
Conference Papers and Presentations
Status:
Published
Year Published:
2021
Citation:
Ramnath, R., Sanderson, L-A., Buehler, S., Burns, J., Buble, K., Senalik, D., Richter, P., Caron, C., Almsaeed, A., Condon, B., Staton, M., Main, D., Bett, K., Wegrzyn J.L., and Ficklin, S.P. 2021. Tripal Creates Online Biological, Community Based Web Portals for Research and Analysis. Proceedings of the 2021 Bioinformatics Open Source Conference, July 29-30, 2021 (virtual).
- Type:
Conference Papers and Presentations
Status:
Published
Year Published:
2021
Citation:
Jung, S., Lee, T., Cheng, C.H., Gasic, K., Humann, J.L., Yu, J., Hough, H., Main, D. 2021. Update on the Breeding Information Management System (BIMS). Proceedings of the American Society of Horticultural Science Annual Meeting, August 5-9, 2021, Denver, CO, USA.
- Type:
Conference Papers and Presentations
Status:
Published
Year Published:
2021
Citation:
Gasic, K., Jung, S., Lee, T., Cheng, C.H., Humann, J.L., Yu, J., Hough, H., Main, D. Specialty Crop Breeding Data Management. Proceedings of the National Association of Plant Breeders Annual Meeting Conference, August 15-19, 2021 (virtual).
- Type:
Conference Papers and Presentations
Status:
Published
Year Published:
2021
Citation:
Humann, J.L., Cheng, C-H., Lee, T., Zheng, P., Crabb, J., Hough, H., Jung, S., Iorizzo, M., Main, D. 2021. GDV Workshop - Using the GDV for Genetics, Genomics, and Breeding Research. 2021. Proceedings of the XII International Vaccinium Symposium - August. 30- September 1, 2021 (virtual).
- Type:
Conference Papers and Presentations
Status:
Published
Year Published:
2021
Citation:
Main, D., Jung, S., Lee, T., Humann, J.L., Cheng, C-H., Zheng, P., Crabb, J., Hough, H., Jung, S., Iorizzo, M., 2021. GDV Workshop - Using GDV for Genetics, Genomics, and Breeding Research. 2021. Proceedings of the XII International Vaccinium Symposium - August 30 - September 1, 2021 (virtual).
- Type:
Conference Papers and Presentations
Status:
Published
Year Published:
2021
Citation:
Humann, J., Crabb. J., Cheng. C-H., Lee. T., Zheng, P., Buble, K., Jung, S., Yu J, Hough H. Coyne C, McGee R, Main D. 2021. The Pulse Crop Database: A Resource for Pulse Crop Research and Improvement. Proceedings of the 2021 Biennial NAPIA and BIC Bean Meeting, November 4-6, 2021 (virtual).
- Type:
Conference Papers and Presentations
Status:
Published
Year Published:
2021
Citation:
Main, D., Baum, D. 2021. The Polyploid Tools Community Website Resource. Proceedings of the Tools for Genomics-Assisted Breeding in Polyploids Workshop, January 13-15, 2021 (virtual).
- Type:
Websites
Status:
Published
Year Published:
2021
Citation:
NRSP10 project website (https://www.nrsp10.org).
- Type:
Websites
Status:
Published
Year Published:
2021
Citation:
Tripal informational and support website (https://tripal.info).
- Type:
Websites
Status:
Published
Year Published:
2021
Citation:
RosBREED project website (https://www.rosbreed.org).
- Type:
Websites
Status:
Published
Year Published:
2021
Citation:
Agricultural Biological Databases Consortium Website, AgBioData (https://www.agbiodata.org).
- Type:
Websites
Status:
Published
Year Published:
2021
Citation:
Fruit and Nut Cultivars Database (https://www.fruitandnutlist.org/).
- Type:
Websites
Status:
Published
Year Published:
2021
Citation:
Tools for polyploids community resource website (https://www.polyploids.org)
- Type:
Other
Status:
Published
Year Published:
2021
Citation:
Jung, S., Humann, J.L., Lee, T., Hough, H., Gasic, K., Campbell, B.T., Main, D. 2021. GDV Webinar - Breeding Data and Tools. August 19, 2021 (virtual).
- Type:
Other
Status:
Published
Year Published:
2021
Citation:
Humann, J.L., Cheng, C-H., Lee, T., Zheng, P., Crabb, J., Hough, H., Jung, S., Iorizzo, M., Main, D. 2021. GDV Webinar - Genetics Data and Tools. August 19, 2021 (virtual).
- Type:
Other
Status:
Published
Year Published:
2021
Citation:
Main, D., Humann, J.L., Cheng, C-H., Lee, T., Zheng, P., Crabb, J., Hough, H., Jung, S., Iorizzo, M. 2021. GDV Webinar - Introduction and Overview. August 19, 2021 (virtual).
- Type:
Other
Status:
Published
Year Published:
2021
Citation:
Humann, J.L., Cheng, C-H., Lee, T., Zheng, P., Crabb, J., Hough, H., Jung, S., Iorizzo, M., Main, D. 2021. GDV Webinar - Genomics Data and Tools. August 19, 2021 (virtual).
|