Source: UNIVERSITY OF PUERTO RICO submitted to NRP
GENOMICS AND BIOINFORMATICS FOR INSULAR AGRICULTURE AND FOOD SCIENCE SKILLS TO MEET THE CHALLENGES OF THE FUTURE
Sponsoring Institution
National Institute of Food and Agriculture
Project Status
ACTIVE
Funding Source
Reporting Frequency
Annual
Accession No.
1031269
Grant No.
2023-70008-41027
Cumulative Award Amt.
$200,000.00
Proposal No.
2023-05148
Multistate No.
(N/A)
Project Start Date
Sep 1, 2023
Project End Date
Aug 31, 2026
Grant Year
2023
Program Code
[DEG]- Distance Education Grants for Insular Areas
Recipient Organization
UNIVERSITY OF PUERTO RICO
AVE PONCE DE LEON
SAN JUAN,PR 00918-1000
Performing Department
(N/A)
Non Technical Summary
Genomics and Bioinformatics are rapidly advancing fields with the potential to transform agriculture and food production by providing tools and techniques to address some of the most pressing challenges facing the industry today. The agriculture and food industry require a well-trained workforce in order to address these challenges. However, there is a paucity of bioinformatics instruction at the undergraduate and graduate level and this, along with computing infrastructure, is especially lacking at minority and insular institutions. This proposal aims to address this deficiency by conducting a training program in genomics and bioinformatics in Puerto Rico, aimed at undergraduates and graduates, utilizing a framework of remote one-on-one instruction. In doing so, this project will (i) build instructional resources and conduct genomics and bioinformatics training applied to agricultural crops, insect pests and livestock; (ii) convert instructional material for use as stand-alone on-demand remote instruction; (iii) provide support for analysis of students' independent research projects; and (iv) provide professional development opportunities in agriculture.The proposed project will impact the research activities of students and faculty at the University of Puerto Rico as well as the professional development of students and agronomists in Puerto Rico, and in the long-term, the agricultural practices across the island, including improved crops, more efficient animal breeding, and better food safety. The extent of these impacts will be measured by monitoring the number of students in different categories (academic research, agriculture, food science, etc.) through follow-up surveys designed to evaluate their professional development. Overall, this program has the potential to impact over 100 graduate and 900 undergraduate students in biology and computer sciences at the University of Puerto Rico, of whom more than 95% are Hispanic and other minorities and close to 600 members of the College of Agronomists of Puerto Rico.
Animal Health Component
(N/A)
Research Effort Categories
Basic
(N/A)
Applied
(N/A)
Developmental
(N/A)
Classification

Knowledge Area (KA)Subject of Investigation (SOI)Field of Science (FOS)Percent
90373101080100%
Goals / Objectives
The overall goal of this proposal is to increase the bioinformatics literacy of students in agriculture and food sciences from underrepresented communities in Puerto Rico and beyond. To achieve our goal, we propose the following target objectives:Provide hands-on training on the most current bioinformatic pipelines for genomic analyses. We propose a series of short courses on (i) Experimental design, (ii) Introduction to Unix and R, (iii) RNA-seq and differential gene expression analyses, (iv) Genome sequencing and Variant calling with the aim to train the next generation of bioinformatics researchers in agriculture.Offer one-on-one training to students. Students with ongoing research projects will benefit by receiving individually tailored support--from experimental design to completion of a research project and publication--from participating instructors and the in-house bioinformatician who will guide student growth, help troubleshoot, and provide other advice.Foster collaborative interactions and a supportive system between instructors, faculty, and students.To promote collaboration and communication among students and faculty in different laboratories, activities that facilitate interactions among course participants will be implemented.Facilitate genomic research.By training the participating students, who will become the next generation of scientists, agronomists, and life scientists in Puerto Rico, this program will also facilitate the development of research programs of multiple research faculty with ongoing genomics and genomic-related projects as well as the research and development practices of multiple agronomical enterprises in Puerto Rico.
Project Methods
The main objective of the project is to train and facilitate students' acquisition of bioinformatics and research skills that can best serve them in confronting problems in agriculture and food science. This will be accomplished by developing resources to teach the most important genomic techniques used in agricultural sciences.Instructional Program: The yearly instructional program will comprise five different modules:Module 1: Experimental Design: A mix of lectures and interactive workshops to introduce the different types of genomic experiments in agriculture and life sciences, the principles of experimental design, and developing a genomics research proposal.Module 2: Introduction to UPR BoquerĂ³n computing system, Linux, Bash scripting, and R studio. A mix of lectures and interactive workshops to introduce the computing environment and programming languages, including the UPR high-performance computing server (Boqueron) server, Linux, Bash scripting, and R studio.Module 3: Gene Expression (RNA-seq):A mix of lectures and interactive workshops on differential gene expression analysis, including: how to Identify a data set from NCBI from a paper that they are interested in; a complete overview of the RNA-seq analysis pipeline; running of analysis pipelines; and interpretation of data.Module 4: Genome sequencing and variant calling:Students will be shown how to analyze data sets from various representative organisms relevant to agriculture: such as plants (corn, rice, wheat, soy), insects (honey bee, soybean aphid, white fly), ruminants (cow, sheep), and bacteria associated with food contamination. Students can elect to use their own data set for the training and analysis in addition to the data that will be included in the training.Module 5: Writing and presentation of work:A mix of workshops and lectures designed to teach students how to write a journal article using their own data (which will facilitate the publication of their research), preparation, and presentation of results obtained.The project will be managed across three different consecutive years in the following manner:Year 1will consist of a two-semester program that covers the five different modules of experimental design, bioinformatics, and genomic analysis. Students can receive a total of 6 credits after their participation and completion of the training.After the completion of each module and at the end of the instructional program, a student instructional evaluation will be conducted. A total of 30 students will be accepted into the course for this year.Year 2:Re-evaluate and modify as needed the instructional program by utilizing observations and experience of instructors and student evaluations and re-run the program for the second year. Similarly, re-evaluate the instructional program after the completion of each module. At the end of the instructional program, a student instructional evaluation will be conducted. A total of 30 students will be accepted into the course for this year.Year 3:Conduct final edits and convert instructional materials to an online format suitable for uploading to the distance learning platforms such as the UPR Division of Continuing Education and Professional Studies (DECEP) and the College of Agronomists of Puerto Rico continuing education platform. During this period, all modules will be open for self-guided study. A maximum of 30 students per module would be allowed to enroll.One-on-one instruction and consulting:Students with ongoing research projects will benefit by receiving individually tailored support--from experimental design to completion of a research project and publication--from participating instructors and the in-house bioinformatician who will guide student growth, help troubleshoot, and provide other advice.Other activities:During years one and two, activities that facilitate interactions among course participants will be implemented. These include, (i) a monthly journal club, where papers on cutting-edge genomic work applied to agriculture will be discussed, (ii) a seminar series, where invited experts that apply genomic methods to agricultural problems will share their successes and failures; and (iii) an online communication hub for posting questions and exchanging information between students and instructors. Speakers will be chosen from industry, government agencies, and academia so that students can also gain an appreciation of the different approaches used at different places of work. This exposure will also serve to further prepare them for making their own professional choices.Student selection:For the first two years of the program, participants will be selected through a competitive application process. Call for applications will be advertised on all campuses of the UPR system through the UPR Division of Continuing Education and Professional Studies (DECEP), which has a significant social media presence, and through the College of Agronomists of Puerto Rico continuing education platform. Applicants will be ranked based on academic performance and responses to questions in the application. These questions will include student status (juniors and seniors will be given preference), the reason for taking the course, a plan for the incorporation of genomics and bioinformatics in their research or practice, and future professional goals.Evaluation Plan:Evaluation of the impact of the Genomics and Bioinformatics Course on the development of students will be done at different levels: (1) Pre-and Post-course Assessments: These assessments will help evaluate the knowledge gained by students throughout the course. We will use the same assessment tool before and after the course to compare the results and determine how much the students have learned. (2) Projects and Assignments: implemented to test the knowledge and skills gained by students during the course. The quality of their work will be evaluated to determine how well they have understood and applied the concepts taught in the course. (3) Course Evaluation: Students will provide feedback through course evaluations at the end of the course. These evaluations will provide insights into the strengths and weaknesses of the course and help to improve it for future instructional efforts. (4) Follow-up Surveys: Follow-up surveys will be conducted after the course to determine the overall impact of the course on the student's development. These surveys will include questions about how the course has helped them in their studies, research, and career development. The evaluation methods will be applied repeatedly throughout the three years of the course to assess and make yearly improvements to ensure that the instructional material meets the needs of the students in agricultural studies.

Progress 09/01/23 to 08/31/24

Outputs
Target Audience:This project aimed to provide genomics and bioinformatics training to students in agricultural and biological sciences. The target audience was graduate and undergraduate students at the University of Puerto Rico and other professionals in agricultural sciences on the island of Puerto Rico. During this reporting period, we conducted the first pilot workshop, which was attended by a total of 27 students. Of these, 4 % were undergraduates, 22 % were master's students, 56% were PhD students, and 7% were postdoctoral fellows. In addition, the workshop was also attended by faculty members (4%) and professionals who work for the USDA-ARS Tropical Agriculture Research Station in Puerto Rico (7%). In addition, 4% percent of the attendees were affiliated with the University of Puerto Rico, Ponce, a primarily undergraduate institution. All students enrolled in the workshop were from underrepresented minorities: 96.3% Hispanic and 3.7% African descent. Changes/Problems:The workshop was implemented as planned and proceeded smoothly with no problems that could not be readily addressed at the time of their occurrence. One of the problems we encountered was the ability of some students to have viable internet connections at all times. This shortcoming was resolved by connecting students via working groups, providing instructional information on the website, and facilitating communication between students and instructors via email. On the other hand, there were some delays that impacted the rate of expenditure for year 1. Even though the grant was activated in September 2024, the official project start date was November 1, 2024, as originally specified in the proposal submitted. This caused a slight decrease in the expenditure for year 1. Due to an administrative delay, the contract for Dr. Zimin (bioinformatics consultant) was not finalized until the end of year 1. Thus, his role inone-on-one training with studentshas shifted from years 1 and 2 to years 2 and 3 of the project. Due to ongoing renovations in the computing framework at the UPR High-Performance Computing facility, the purchasing of computing nodes to enhance the HPCf cluster was postponed to year 2. The HPCf administrators recommended that we wait until this upgrade is complete before purchasing new computing nodes. It is important to note that these expenditure delays did not affect the project schedule or goals. All expenditure delays will be remedied in years two and three of the project. What opportunities for training and professional development has the project provided?For students, this workshop served as a means to acquire a new skill set that facilitated the completion of their research projects and, therefore, facilitated the attainment of their higher educational degree. For professionals (post-docs, research faculty, and USDA investigators), the workshop provided new analysis skills that have broadened their research fields. In addition, experts in genomics sequencing, bioinformatics, and big data management were invited to present and interact with participants, providing new knowledge and networking opportunities. We organized three seminars/workshops with invited speakers open to the university community. This seminar included talks by: Sara Goodwin, PhD.Research Assistant Professor and Genomics Core Head at Cold Spring Harbor Laboratory: "Third Generation Sequencing Technologies" Jianguo (Jeff) Xia, PhD.Associate Professor and the Canada Research Chair in Bioinformatics and Big Data Analytics from McGill University, Macdonald Campus. "Networkanalyst a platform for comprehensive gene expression profiling via network visual analytics" Peter Cooper, PhD.Staff Scientist at the National Library of Medicine, National Institute of Health: "Exploring NCBI Gene Expression and Other Biomolecular Resources" How have the results been disseminated to communities of interest?All workshop material (videos, assignments, reading material, and student presentations) has been posted on the official course website, which is freely accessibleand available to everyone. What do you plan to do during the next reporting period to accomplish the goals? Provide hands-on training on bioinformatic pipelines for genomic analyses: Evaluate participants learning outcomes during the first round of the workshop using the pre- and post-course student surveys obtained as part of the workshop. Curate the videos created during the first reporting period to streamline and annotate content to facilitate future navigation. Develop the second workshop focused on genome assembly and annotation. One-on-one training with students: We will continue to provide one-on-one training to students enrolled in the next reporting period, focusing on a new bioinformatic topic: Genome assembly and annotation. Fostering collaborative interactions: We will continue to promote collaborations between students and faculty as in the previous reporting period. We will also organize another series of speakers to complement the workshop topic. Facilitate genomic research: For the next reporting period, we expect to train a new cohort of students and continue to enhance their research projects by developing new genomic and bioinformatic skills.

Impacts
What was accomplished under these goals? Hands-on training on bioinformatic pipelines for genomic analyses: During the reporting period, we created and implementeda workshop on genomics and bioinformatics, focused on transcriptomic analysis. The workshop consisted of 15 lectures of 3 hours each and was led by six instructors with different expertise. The workshop covered three instructional modules: (1) Experimental Design for Agricultural Genomics, (2) Introduction to Computational Platforms and Bioinformatic Tools, and (3) Hands-on Analysis of RNA-seq data sets.Each lecture consisted of a theoretical introduction followed by a hands-on learning experience, where students implemented the learned knowledge on real-life biological problems. All lectures and activities were recorded and made available to students for further on-demand distance learning. A total of 45 hours of instructional videos were created and posted, along with supporting scientific literature, to serve as additional instructional material. A course webpage was created to administer the course, share supporting material (scientific articles, class videos, and take-home assignments), and as a means of communication between students and instructors/speakers for one-on-one assistance. This website will host the open-source distance-learning platform that will be the main product outcome of this proposal. One-on-one training with students: During this period, 27 students enrolled in the course were engaged in individual research projects. Students received individual support from course instructors. Students also received advice regarding the experimental design of their ongoing and future projects. Students were also trained in approaches to reporting their research findings and preparing for future publications. Fostering collaborative interactions: During the workshop, students were divided into teams to encourage interactions and individual contributions toward solving problems addressed in class. The student teams were encouraged to meet outside the workshop time to further practice skills acquired during the workshop. Members of the teams could also directly contact the instructors for advice and feedback. Facilitate genomic research: The workshop thus far has impacted the research of 14 different research laboratories at the University of Puerto Rico, Rio Piedras (11 labs), and the USDA Agricultural Station in Puerto Rico (3 labs). The research projects include an extensive array of topics, including but not limited to: Bacterial diversity of Puerto Rican coffee leaves. Enhancing agricultural practices for coffee production Coffee bean pathogens and biological control Fungal diversity and its effects on coffee bean health Integrated management of cover crops, biochar, andBeauveria bassianafor soil and plant health in coffee production. Effects of urban shade on seagrass health. Effects of soil microbiome on banana cultivation. Characterization of insulin-like growth factors in sea cucumberHolothuria glaberrima. Cellular and Molecular Characterization of Muscle Regeneration inHolothuria glaberrima. Characterization of astroglia markers in sea cucumbersHolothuria glaberrima. Molecular basis of holothurian radial nerve cord regeneration. Epigenetics of alcohol neuroadaptation inDrosophila. Insecticide resistance in mosquitoes. Alcohol neuroadaptation in insects. Environmental neuroadaptation in insects. Conservation of highland coqui species in Carite and El Yunque . Patterning and coloring of butterfly wings. Genomic characterization of land snails. Role of Septin9 in inflammatory breast cancer. Microbiome dynamics in inflammatory breast cancer. Role of Basonuclin-1 in inflammatory breast cancer. Genetic diversity and systematics of three species of Ilex from the Greater Antilles. Integrated pest management strategies on mango inflorescence in Puerto Rico. Molecular mechanisms for understanding interactions between tropical plants and pathogens. Enhancing cacao (Theobroma cacao) rootstock resilience: A comprehensive germplasm evaluation for drought tolerance. How does the microbiome on the host and parasitoid change at different locations?

Publications

  • Type: Journal Articles Status: Submitted Year Published: 2024 Citation: Avalos A., Scannapieco A, Monmany-Garzia AA, Marcelino J, Donthu R, Giordano R, Giray T, Galindo-Cardona (2024). Scientific Note: Relative representation of Africanization in Apis mellifera from Argentina. Apidologie (submitted).