Source: UNIV OF IDAHO submitted to NRP
SUPERMINDS: SUSTAINABLE UNDERGRADUATE PROGRAM FOR EXTENSION/RESEARCH/MANAGEMENT IN DATA SCIENCE
Sponsoring Institution
National Institute of Food and Agriculture
Project Status
ACTIVE
Funding Source
Reporting Frequency
Annual
Accession No.
1030134
Grant No.
2023-68018-40323
Cumulative Award Amt.
$749,999.00
Proposal No.
2022-09085
Multistate No.
(N/A)
Project Start Date
Jun 1, 2023
Project End Date
Aug 31, 2028
Grant Year
2023
Program Code
[A7401]- Research and Extension Experiences for Undergraduates
Recipient Organization
UNIV OF IDAHO
875 PERIMETER DRIVE
MOSCOW,ID 83844-9803
Performing Department
(N/A)
Non Technical Summary
Data science is an integrated part of 21st-century agriculture, yet the educational needs have not been adequately met in schools and universities. The SUPERMinDS project goals are to train the next generation of professionals with data science skills and fill key gaps in agricultural data and information.To achieve these goals, three diverse student groups are met with three sets of immersive research and extension activities. Out of the 5-year total of 72 SUPERMinDS student participants, 20 students will be recruited from College of Southern Idaho, Idaho's first Hispanic-Serving Institution; 24 from Randolph College, a former women's college; and 28 from the University of Idaho main campus.The three project objectives dictate that SUPERMinDS activities should encompass training in all aspects of data science with directly applicable results for Idaho agriculture: students involved in data-gathering activities will engage with local stakeholders and produce up-to-date enterprise budgets for major Idaho crops; students involved in data analytics activities will utilize big data analytics techniques to produce real-time forecasts, what-if calculators, and risk management strategies for Idaho farmers and ranchers; and students involved in data management activities will use a variety of coding software to manage the gathered data, visualize results from data analytics, and design user-friendly website.The diverse group of mentors will leverage their past success to gauge interest from target students and work closely with the project evaluator to measure short-term learning and long-term metrics, such as student placement and where the experiential learning occurs. The long-term sustainability of the program is maintained through industry partners, directed studies, and capstone projects.
Animal Health Component
80%
Research Effort Categories
Basic
20%
Applied
80%
Developmental
0%
Classification

Knowledge Area (KA)Subject of Investigation (SOI)Field of Science (FOS)Percent
6012010301020%
6011310301020%
6011550301020%
6011640301020%
6011543301020%
Goals / Objectives
The goals of this Research and Extension Experiences for Undergraduates (REEU) project are to train the next generation of professionals with data science skills (including data gathering, data analytics, and data management), and fill key gaps in agricultural data and information. Furthermore, such training shall be sustained in long-term agricultural education.To achieve these goals, we developed three objectives for SUPERMinDS student participants:Objective 1. Faculty at the University of Idaho (UI) Twin Falls Research and Extension Center (TFREC) will instruct undergraduate students on data gathering and database-building activities for generating enterprise budgets for Idaho's main crops. Students will also learn how to disseminate the gathered information via Extension research reports, fact sheets, and associated presentations and videos.Objective 2. Faculty on the UI main campus in Moscow, Idaho, will instruct undergraduates on data analytics activities such as empirical estimation and forecasting and building web-based decision aid tools.Objective 3. Our collaborator at Randolph College will lead training on digital data management for undergraduates. Specifically, mathematics/computer science students will develop websites and database management systems that use algorithms to automatically gather and store publicly available agriculture-related data (e.g., prices, production). The final product that encompasses all these components will be a website we call RAPIDS (Real-time Analytics Programming Information Dissemination System), on which we will disseminate outputs, host the decision aid tools, and link to an REEU student recruitment and project outcome website. The novelty of this project lies not only in the immersive research and Extension experiences for undergraduate students but also in a new form of collaboration: we will both train agricultural students with new technological tools from STEM disciplines and STEM students with new applications in agriculture.
Project Methods
Efforts:Activity 1: Data Gathering (for Objective 1).A1.1 Enterprise budget basics: include introducing students to the basic concept of enterprise budgets and their main cost and return components in guest lectures in relevant CSI Agriculture courses. After these guest lectures, students will be invited to apply to officially join SUPERMinDS as an Agricultural Data Associate. After application review by Dr. Hatzenbuehler and Dr. Quesnell of CSI, students will be selected to form an Agricultural Data Associate team.A1.2 Data gathering strategies and methods: will include implementing more formal training in data-gathering strategies and methods based on the existing UI Extension curriculum on enterprise budget development that is currently delivered to agricultural producers throughout the state in Farm Management courses. This curriculum includes a detailed discussion of the budget cost and returns components, identification of data sources for each cost and return variable, strategies for sampling among multiple sources, and the discussion of the usefulness of indexes from secondary sources for obtaining variable estimates.A1.3 Data gathering implementation: involves actual data gathering by engaging with local stakeholders to obtain agricultural data via email, phone calls, and/or in-person meetings. To implement this aspect, each Agricultural Data Associate will develop a Data Gathering Plan, in which they identify whether primary or secondary data are needed, potential sources of data, and strategies for obtaining the data. Data Gathering Plans will be reviewed by Dr. Hatzenbuehler and feedback will be provided after review. Students will then implement the data-gathering activities as consistent with the approved plan. The gathered data will be encompassed in a database that is shared among all Agricultural Data Associates and project partners.A1.4 Information dissemination:this is disseminating the data and information through publishing reports, filming associated videos--in which the students describe how they developed the enterprise budgets and how they are different from older budgets--and delivering presentations at relevant UI Extension programs. Dr. Hatzenbuehler will facilitate the travel for students to UI Extension programs.Activity2: Data Analytics (for Objective 2).A2.1 Forecasting and big data analytics:The training will provide web-based analytical results using state-of-art tools such as machine learning-based forecasting for SUPERMinDS project participants. The algorithm will automatically update results as new data become available.The analysis will use big data in all forms: students will utilize publicly available data about the weather, demographics, economics, geography, production, and public opinion from multiple sources including data gathered from data-gathering activities.A2.2 Risk and risk management analysis:Utilizing enterprise budget information gathered in Objective 1, students are able to conduct research on comparing costs of production in Idaho relative to nationally representative benchmark data and help farmers be more confident in making marketing and production decisions about when to cut losses or expand a certain product. This study is especially useful in labor-intensive operations in Idaho where, due to fluctuation in labor availability, uncertain financing costs, and variation in other input costs, there is a growing trend of automation, which introduces new uncertainties in cost structure for farms of all sizes. From these baseline results, Du will alsocollaborate with SUPERMinDS students to do research on developing: (i) optimal risk management strategies for farms of varied sizes and (ii) indicators of the financial performance of various enterprises across farms of varied sizes. Examples of the latter include analysis by crop acreage, total farm acreage, the magnitude of sales, and livestock operations, by herd size. Simulation analyses will use these indicators as a base level to evaluate the likelihood of triggering policy responses from crop insurance or government support programs.A2.3 Information dissemination:this is co-building theRAPIDSwebsite in conjunction with students' activities in Data Management. Leveraging the UI AgBiz website, the project team will build the dashboard-style webpageRAPIDShosted under the UI AgBiz website.RAPIDSincludes a web-based interactive information dashboard where data visualization will be provided and can be customized according to users' needs. TheRAPIDSwebpage is expected to be deployed in January 2024. Example tools that will be hosted withinRAPIDSinclude data visualization for important production and market information for main Idaho crops, real-time forecasts of market conditions, and a "what-if" scenario simulator for issues like yield losses and insurance coverage.Activity3: Data Management (for Objective 3).Dr. Wan will lead data management activities.Students involved in this activity will focus on:A3.1 Conceptual Training and Coding Camp with USDA database:We will use it as the training for python script for data scraping, USDA database introduction, and time-series data basics.A3.2 USDA API time-series data training:a) work with Python packages such as Pandas, NumPy, and Plotly and b) data filtering and virtualization with Python and SQLite.A3.3 Automation with Cloud Services: a) researching cloud service solutions; b) getting data plotted automatically; c) redesigning and implementing data visualization with tools such as D3 and RShiny; d) researching the efficiency of data plotting for the above methods.A3.4 Frontend Redesign and Implementation: a) learning web page design basics: HTML, CSS, js, and D3; and b) redesigning frontend features such as smooth pop-out windows and intuitive navigation.A3.5 Big Data Algorithm Designand Implementation: a) learning introductory coding examples and linear interpolation methodsto manage missing data in R; b) being introduced to coding examples, such as the Holt-Winter exponential smoothing model in R for time-series data forecasting; c) being introduced to machine learning algorithms such as ARIMA for time series data smoothing; d) developing new features and/or adjustments as the other SUPERMinDS team members develop their study based on interactions with farmers and ranchers at Extension programs.Project EvaluationNav Ghimire will serve as the project evaluator. He will conduct a formative evaluation annually to assess the program outcomes. The data from the formative assessment will serve two purposes: to report project progress and to improve the training curriculum and programs throughout SUPERMinDS project implementation. Ghimire will conduct a summativeevaluation after the completion of the project to develop a final report required to submit to the funder. The objectives of the project and the following evaluative questions will serve as aframework to inform the design and implementation of the evaluation plan: a) To what extent does the program improves students' knowledge and skills about data science and itsapplications in the agricultural field, 2) To what extent does the program fill gaps in the agricultural data. 3) To what extent does the program attract STEM students pursuing careers inagriculture. We will use qualitative, quantitative, and mixed methods to collect evaluation data on the project's outcomes, success, and challenges. The evaluation audience includes students, mentors, stakeholders, and others interested in such student activities.SUPERMinDS student participants will be tracked using the following strategy: 1) before the exit, students will fill out a form that includes their mailing address, personal email, and phone number, their employment industry and/or graduate school, and 2) they will join a social media group named "SUPERMinDS Alumni."

Progress 09/01/24 to 08/31/25

Outputs
Target Audience:Undergraduate students,economists, and researchers at professional meetings. Changes/Problems: Nothing Reported What opportunities for training and professional development has the project provided?The project has offered a range of valuable training and professional development opportunities designed to support students' academic growth and career readiness. These include: One-on-One Mentorship: Each student worked closely with a faculty mentor, receiving individualized guidance on research design, data analysis, project management, and technical skills relevant to their field. This personalized support helped students build confidence, refine their problem-solving abilities, and deepen their understanding of research methodologies. Participation in Conferences, Workshops, and Seminars: Students were encouraged and supported to attend local, regional, and national conferences where they presented their research findings. These experiences helped them develop public speaking skills, receive feedback from experts, and build professional networks. They also participated in discipline-specific workshops and seminars that exposed them to current topics, tools, and trends in the field. Peer-to-Peer Training and Collaboration: Students were given opportunities to mentor newer team members, reinforcing their own knowledge while developing leadership, communication, and teamwork skills. This collaborative environment encouraged knowledge sharing and fostered a sense of community within the research group. Skill Development in Research and Technology: Students were trained in essential research skills such as literature review, experimental design, coding, data collection and analysis, and use of specialized software and tools (e.g., machine learning platforms, image recognition software, or statistical analysis tools). Career Preparation and Professionalism: In addition to technical training, students received guidance on applying to graduate programs, and exploring career paths in research and industry. They also gained experience adhering to project deadlines, managing responsibilities, and presenting themselves professionally in academic and professional settings. How have the results been disseminated to communities of interest? Developed a flyer to promote internship opportunities, which was distributed at the CSI Ag Career Fair. Promoted the project in undergraduate classes and successfully recruited student participants. Students received training not only in research tasks but also in mentoring future student researchers, ensuring a smooth transition and continuity within the team. Their experience and insights will be invaluable for onboarding new members. Several students have completed their roles as research interns and have either graduated or are in the process of graduating. Their contributions and the skills they acquired through the project provide a strong foundation for their future academic and professional endeavors. We offer students the opportunity to present their work at professional conferences, allowing them to engage with targeted audiences and gain valuable experience in academic and industry settings. What do you plan to do during the next reporting period to accomplish the goals?We plan to recruit additional students to join the program and provide them with targeted training to enhance their skills in their respective fields of study. Through hands-on research experience, mentorship, and skill-building activities, students will gain practical knowledge and competencies that will prepare them for future academic and career opportunities.

Impacts
What was accomplished under these goals? Our faculty at theUniversity of Idaho (UI) Twin Falls Research and Extension Center (TFREC)trained threeCSI student Agricultural Data Associates on agricultural data gathering principles and methods, and enterprise budget basics.Student participants offered positive feedback in theirinternship reports. Our faculty at theUniversity of Idaho (UI) Main Campus trained one undergraduate student on using LLM for automating enterprise budgets. The student made a presentation of his findings within the department. Moreover, the faculty at UI main campus has developed a new curriculum for training future year students on data analytics. Our research team at Randolph College has made progress in advancing our initiatives, expanding our team, and integrating new technologies to further our research capabilities. 1. Team Expansion.We have successfully expanded our research team by recruiting new undergraduate researchers. This year, we welcomed two new student researchers, David Kline and Saniyha Atchison. With Jennifer from the previous year having graduated, our team now comprises four undergraduate students majoring in mathematics and computer science dedicated to advancing our research projects. 2. Data Archiving and Analysis A major focus this year was revisiting and updating our data archives. We incorporated new data on crop harvests, including potato and barley. In addition, we provided training to new members on data analysis techniques using machine learning algorithms for processing time sequence data. We also explored new categories of data, notably including weather data from NOAA, and conducted statistical studies on crop performance in relation to weather conditions. 3. Development of AI Tools. In alignment with technological advancements, our senior student researcher, Saniyha Atchison, developed a potato blight image recognition bot under my supervision. This tool currently achieves over 70% accuracy in identifying early and late blight in potatoes. We are continuing to improve this tool's accuracy and plan to expand its functionality to develop a more comprehensive "PlantPal" bot for pre-screening various crop diseases. 4. SUPERMinDS Webpage Refactoring. Our team began refactoring the SUPERMinDS webpage, a key component of our digital outreach. Student researchers created a new landing page and are actively reorganizing content and refining the underlying code. This includes the integration of a page featuring the PlantPal bot. 5. Enhancing Website Performance. To enhance the performance of our website, we are developing a REACT tutorial and demo package. Our goal is to transition our website to a server-side executed platform, which will result in a faster, more efficient, and modern online presence. 6. Presentations and Outreach.This year, we had the opportunity to share our research at the Randolph College Annual Research Conference. The presentation, titled "An Artificial Intelligence Development and Research: Potato Blight Image Recognition Bot," was authored by Saniyha A. Atchison and supervised by Jia Wan. Presenting our work allowed us to highlight our progress, engage with fellow researchers, and receive valuable feedback to guide our future efforts.

Publications


    Progress 09/01/23 to 08/31/24

    Outputs
    Target Audience:Undergraduate students;economistsand researchers at professional meetings. Changes/Problems:We have a Co-PI Nav Ghimire left the University of Idaho. We will find another person to fill Nav's role as an evaluator. What opportunities for training and professional development has the project provided?1. one-on-one work with a mentor. Students learned how to usedifferent Large Language Models to see which model fits better for enterprise budgets for different crops. 2.Developdraft Enterprise Budgets for provision to the Moscow-based undergraduate intern for use in their research investigation of using Large Language Models to create enterprise budgets. 3. Training on the following topics was processed via engaging students in the process from data collecting to website development. There are introductions and hands-on activities for each of them: a.data collection with Python programs; b. working with API;c. data frame management with Python and JavaScript;d.web development basics: introduction to HTML, CSS, and javascript;e.basic data visualization with D3. How have the results been disseminated to communities of interest?1.Met with the College of Southern Idaho (CSI) Agriculture Department Chair to discuss student recruitment plans twice. Developed a flyer for promoting internship opportunities and disseminated the flyer at the CSI Ag Career Fair. 2. Advertise our project inundergraduate classes and recruit students. 3. Students were trained not only inresearch tasks but also were trained to assist future student researchers in transitioning smoothly into their roles. Their valuable experience and knowledge will be instrumental in onboarding future team members effectively. 4. Students pass good words. We had/having students graduated/graduating, and completed their tasks as research interns. Theircontributions and learnings from the project will serve as a valuable foundation for their future endeavors. What do you plan to do during the next reporting period to accomplish the goals?We will engage new students in the program and have the current student researchers participate trainingnew members. We will keep developingthe website.

    Impacts
    What was accomplished under these goals? 1. Have one paper presented at conferences with a full manuscript ready for publication. The audience asked questions and gave goodcomments which showed their interest in this study. 2. Instruct undergraduate students to do research. After gaining research experience, some students are interested in applying to graduate school. 3. Data Analysis and Visualization Projects: This area encompassed various sub-projects utilizing specific technologies: a. Data Mining using USDA API;b. Fundamentals of Webpage Development;c. Interactive Data Visualization Techniques: Scales, Axes, Motion, and Transitions;d. Geomapping. Students got increased knowledge andskills in using the abovetechnologies. 4.We achieveda new form of collaboration: we both train agricultural students with new technological tools (e.g. Chat GPT) from STEM disciplines and STEM students with new applications in agriculture (automatic data collection/cleaning for major crops' prices and production).

    Publications

    • Type: Conference Papers and Presentations Status: Published Year Published: 2024 Citation: Wells, Connor (speaker); Devin Schafer; Xiaoxue Du; Liang Lu; Patrick Hatzenbuehler; Brett Wilder. "Using GPTs for enterprise budgets." presentation at 2024 Agricultural & Applied Economics Association (AAEA) Annual Meeting.
    • Type: Journal Articles Status: Other Year Published: 2024 Citation: Devin Schafer; Wells, Connor; Xiaoxue Du; Liang Lu; Patrick Hatzenbuehler; Brett Wilder."Using GPTs for enterprise budgets." for publication at Applied Economics Teaching Resources. Status: abstract submitted and approved for final manuscript review.