Progress 09/01/24 to 08/31/25
Outputs Target Audience:The target audience during this reporting period was high-dimensional plant phenotype domain experts, mainly university faculty working adjacent to the plant breeding space. As the project continues, the target audience will broaden to include plant breeders working with a wide range of crops. Changes/Problems:
Nothing Reported
What opportunities for training and professional development has the project provided?Project personnel participated in several training and professional development activities during the reporting period. Internal training occurred through expert consultations on metabolomics and microbiome data models, which provided feedback that directly shaped project outputs while building cross-disciplinary knowledge between plant and computer scientists. Professional development opportunities supported technical skill development in high-dimensional phenotype (HDP) data management and integration, as well as broader growth in plant breeding and informatics. Co-PI Lukas Mueller, collaborator Peter Selby, and team members Chaney Courtney and Ben Maza attended the 2025 BrAPI Hackathon in Los Baños, Philippines, where Selby introduced the project and Maza presented on BrAPI use with HDPs. The hackathon fostered collaborative software development and offered professional visibility within the global BrAPI community. Additional professional development occurred through conference participation. PD Jenna Hershberger delivered an invited oral presentation at the 2025 National Association of Plant Breeders (NAPB) Annual Meeting. Postdoc McKena Wilson presented posters on phenotypic data infrastructure at the Clemson University Postdoctoral Research Symposium and the NAPB Annual Meeting. Maza also gave an invited seminar at the Boyce Thompson Institute, strengthening communication skills with diverse audiences. Collectively, these activities advanced technical expertise, communication, and professional networks for the early-career scientists on the project. Hershberger, J. (2025 May 21). Building phenomics capacity in a new public vegetable breeding program. [Oral Presentation]. 2025 National Association of Plant Breeding Annual Meeting, Kona, HI, USA. Maza, B. (2025, June, 2-6). Using BrAPI with High Dimensional Phenotypes [Oral Presentation]. BrAPI Hackathon, Los Baños, Philippines. Maza, B. (2025, July, 7). Helping Plant Breeders Manage Complex Data [Oral Presentation]. Boyce Thompson Institute, Ithaca, NY, USA. Wilson, M., Courtney, C., Jannink, JL., Maza, B., Mueller, L., Powell, A., Rife T., Selby P., Hershberger, J. (2025, April, 22). Managing the Complexity: Phenotypic Data Infrastructure for Next-Gen Plant Breeding [Poster]. Clemson UniversityPostdoctoral ResearchSymposium, Clemson, SC, USA. Wilson, M., Courtney, C., Jannink, JL., Maza, B., Mueller, L., Powell, A., Rife T., Selby P., Hershberger, J. (2025, May, 19-23). Managing the Complexity: Phenotypic Data Infrastructure for Next-Gen Plant Breeding [Poster]. 2025 National Association of Plant Breeders Annual Meeting, Kona, HI, USA. How have the results been disseminated to communities of interest?Results and project activities were disseminated through Zoom calls with HDP domain experts and invited and contributed presentations and posters (see professional development section above). At the 2025 NAPB Annual Meeting, PD Hershberger's invited talk highlighted project goals and early outcomes for the broader plant breeding community. Wilson's poster presentations at NAPB and Clemson's Postdoctoral Research Symposium further shared project progress with meeting attendees. International dissemination occurred through project participation in the 2025 BrAPI Hackathon in the Philippines. By contributing project use cases and demonstrations, the team engaged directly with a global network of plant breeding software developers. Maza's invited seminar at the Boyce Thompson Institute also provided an opportunity to communicate project goals and progress to potential software users and researchers outside the immediate plant breeding community. What do you plan to do during the next reporting period to accomplish the goals?Objective 1: Generate appropriate data models for HDPs We will develop a draft data model for proteomic data and review it with proteomics domain expert Dr. Aleksandra Skirycz (Michigan State University) to ensure it accurately represents data structure and metadata. Previously developed data models for metabolomic, transcriptomic, and microbiome metagenomic data will be reviewed with domain experts to incorporate feedback and ensure alignment with research workflows. Objective 2: Develop BrAPI standards for HDPs BrAPI calls for storing and retrieving HDP data within Breedbase will be refined, incorporating feedback from domain experts. This includes finalizing and testing the user interface for uploading and downloading metabolomic, transcriptomic, and microbiome data. Objective 3: Design and implement HDP storage structures in BrAPI-enabled breeding databases We will continue improving data management within Breedbase, enhancing user interface features for organizing and monitoring uploaded HDP datasets to ensure efficient storage and accessibility. Objective 4: Integrate HDP BrAPI calls into widely used data collection tools We will release an updated Field Book app with spectrometer integration features developed through this project and will update Field Book documentation to support adoption by users. We will begin work to support spectral data transfer from Field Book to Breedbase via BrAPI. ?Objective 5: Develop HDP BrAPI-enabled analysis applications (BrAPPs) We will continue development of BrAPPs for HDP data analysis and visualization, including spectral prediction, filtering, clustering, principal component analysis, heatmaps, dataset creation, and plotting based on user-selected parameters. These tools will be informed by features in the existing (but non-BrAPI-enabled) Sol Genomics Expression Atlas and Breedbase spectral data analysis tools and ongoing consultation with domain experts. Implementation of HDP support in the QBMS R package will also continue in collaboration with Khaled Al-Sham'aa (ICARDA). In addition to addressing these specific objectives, the project team will continue to participate in professional development, training, and dissemination activities to support skill growth and community engagement.
Impacts What was accomplished under these goals?
The overarching goal of this project is to increase the accessibility and usability of high-dimensional phenotypes (HDPs), including spectral, transcriptomic, metabolomic, proteomic, and microbiome data, for plant breeding programs by developing tools for their collection, transfer, storage, and analysis. Significant progress has been made toward this aim across several objectives during the reporting period. To support Objective 1, we first developed introductory materials to share with domain experts in order to guide discussions and gather feedback. We then met with experts in plant metabolomics (Dr. Lauren Brzozowski, University of Kentucky, and Dr. Gaurav Moghe, Cornell University) and microbiome and metagenomics (Dr. Jason Wallace, University of Georgia) to review proposed metadata fields and formats. These consultations informed the development of draft data models for metabolomics and microbiome metagenomic data that capture both data structure and metadata required for plant breeding and genetics applications. In addition, we finalized the spectral and transcriptomic data models that had been drafted prior to the project start, ensuring that they were well aligned with domain-specific needs and workflows. For Objective 2, we worked toward extending Plant Breeding API (BrAPI) standards to accommodate HDPs by translating the data models developed in Objective 1 into draft BrAPI standards. We have posted these draft standards online for use by other BrAPI developers at https://brapinewconceptpreview.docs.apiary.io/#/reference/high-dimensional-phenotypes. In alignment with Objective 3, we implemented BrAPI calls for spectral, transcriptomic, and metabolomic data within Breedbase, enabling the upload, download, and retrieval of these data types. This functionality was extended through several contributions to the Sol Genomics Network GitHub repository, with 8 pull requests referencing "NIRS" and 10 referencing "BrAPI" during the reporting period. For example, pull request 5491 (https://github.com/solgenomics/sgn/pull/5491) added a new spectral data section to the trial detail page, while pull request 5381 (https://github.com/solgenomics/sgn/pull/5381) created a landing page for the management of transcriptomic data. Progress toward Objective 4 was achieved by integrating support for mobile spectrometers and colorimeters into the Field Book Android application, including implementation of a snowflake database structure to facilitate current and future device integration. Testing is underway to finalize these updates prior to release. Finally, to advance Objective 5, a BrAPI-enabled application was developed in Breedbase to visualize and plot spectral data for individual samples within field trials, providing a proof of concept for analysis tools that integrate HDPs with other breeding data types. To support further BrAPI application (BrAPP) development, we are working to integrate HDP BrAPI calls into QBMS, an R package for BrAPI, with contributions planned in coordination with package developer Khaled Al-Sham'aa (ICARDA). We also consulted with Dr. Trupti Joshi (University of Missouri) to explore opportunities for incorporating BrAPI into her existing web-based HDP analysis tools, such as G2PDeep, SoyKB, and KBCommons. Collectively, these accomplishments demonstrate strong progress toward developing robust, standardized frameworks for managing HDPs within plant breeding software ecosystems, while building broad community engagement to ensure that the tools are well aligned with domain-specific workflows and priorities. Progress to date is in line with the proposed project timeline.
Publications
|