Risk Management Tool to Increase Farm Safety

RISK MANAGEMENT TOOL TO INCREASE FARM SAFETY

Sponsoring Institution

National Institute of Food and Agriculture

Project Status

ACTIVE

Funding Source

SMALL BUSINESS GRANT

Reporting Frequency

Annual

Accession No.

1031128

Grant No.

2023-39410-40700

Cumulative Award Amt.

$644,705.00

Proposal No.

2023-03924

Multistate No.

(N/A)

Project Start Date

Sep 1, 2023

Project End Date

Aug 31, 2025

Grant Year

2023

Program Code

[8.12]- Small and Mid-Size Farms

Recipient Organization
VISIMO LLC
400 MAIN ST
CORAOPOLIS,PA 151081629

Performing Department
(N/A)

Non Technical Summary
VISIMO proposes to develop a risk management platform, referred to as Demeter, that leverages a custom Machine Learning (ML) model to enhance the efficiency, profitability, and safety of small and mid-sized farms. Demeter will advance the state of farm safety by identifying risks, providing mitigating actions, and offering economic cost-benefit analyses for suggested mitigations. The proposed application connects to agricultural-related manufacturing technology by improving the safety of agricultural practices, increasing productivity, improving operational management, and training farmers and producers.Farming is among the most dangerous occupations in the United States with an annual death rate of 20/100,000 persons (U.S. Bureau of Labor Statistics, 2021). Nationally, workers in Agriculture, Forestry, and Fishing (AFF) industries are up to 33 times more likely to die on the job than workers in other industries (U.S. Bureau of Labor Statistics, 2021). The average annual cost of occupational injuries in agriculture is $8.3B, including both medical costs and lost productivity, and an estimated 167 agricultural workers per day lose productivity due to occupational injuries (Agricultural Safety & Health Council of America, 2015). Farming injuries represent 30% higher cost rates for personal injuries per year than the national average across all industries (Leigh et al., 2001).People under the age of 16 years old are especially impacted by agricultural hazards. Farms, ranches, and other agricultural operations are often both worksites and homes (Reames, L., Jan 31, 2023 VISIMO interview). Of the 731,000 youths working in agriculture, 65% of them work on a family farm (ASHCA, 2015), while a total of 893,000 total youths lived on farms in the U.S. (NIOSH, 2016). In agriculture, one child fatality occurs approximately every three days (NCCRAHS, 2020; Goldcamp et al., 2004; Hendricks & Hard, 2014), and 38 children are injured on farms daily (ASHCA, 2015). Young workers are 7.8 times more likely to be fatally injured in agriculture compared to the average injury rates of all other industries combined (14.57/100,000 Full Time Employee (FTE) vs. 1.87/100,000 FTE) (NIOSH, 2019).Despite high injury rates in agriculture, there are no fully comprehensive, pre-existing datasets for agricultural hazard analysis and existing datasets are limited in their composition. AgInjuryNews (Weichlet & Gorucu, 2018), which tracks agricultural injuries by compiling news articles, is one of the most comprehensive existing data sources, but it is limited by the small set of binary variables that it tracks, which do not fully encompass all agricultural risk.VISIMO proposes the development of a customizable decision-support tool, Demeter, which will allow producers to assess and mitigate risk in real time, increasing the safety, efficiency, and productivity of small and mid-size farms.The primary outcome of the Phase II effort will be a prototype of the Demeter application. Demeter will accurately assess risk and provide mitigation suggestions, including cost-benefit analyses, for each suggestion. As development progresses, improvements will be made to the GNN and to the arithmetic formula to improve risk mitigation. Over the period of performance, VISIMO will collect between 5,000 and 7,000 data entries on small and mid-size beef and dairy farms.In addition to the creation of a prototype, Phase II will also result in a dataset for agricultural risks far more detailed and voluminous than any existing dataset. Current datasets rely heavily on limited information identified after a safety incident. In many cases, not all the variables are known because the data source (e.g., a news article) does not include this information. Through the creation of a new dataset, VISIMO will enable new research and education.VISIMO's primary customers include extension agencies, cooperatives, insurers, banks and crediting agencies, professional associations, and educational organizations that seek to address occupational safety issues affecting U.S. agriculture.

Animal Health Component

60%

Research Effort Categories

Basic

20%

Applied

60%

Developmental

20%

Classification

Knowledge Area (KA)	Subject of Investigation (SOI)	Field of Science (FOS)	Percent
723	7410	3100	50%
903	7410	3100	50%

Knowledge Area
903 - Communication, Education, and Information Delivery; 723 - Hazards to Human Health and Safety;

Subject Of Investigation
7410 - General technology;

Field Of Science
3100 - Management;

Keywords

agriculture

artificial intelligence (ai)

farm

machine learning (ml)

prediction

risk

Goals / Objectives
VISIMO proposes the development of a customizable decision-support tool, Demeter, which will allow producers to assess and mitigate risk in real time, increasing the safety, efficiency, and productivity of small and mid-size farms.The primary outcome of the Phase II effort will be a prototype of the Demeter application. Demeter will accurately assess risk and provide mitigation suggestions, including cost-benefit analyses, for each suggestion. As development progresses, improvements will be made to the GNN and to the arithmetic formula to improve risk mitigation. Over the period of performance, VISIMO will collect between 5,000 and 7,000 data entries on small and mid-size beef and dairy farms.Phase II Technical Objectives:Gather Requirements:Formally document R&D specifications for the work identifying necessary features. Establish metrics and benchmarks for future validation. Recruit testers an d end users.Enhance PWA and Backend:Gather preferences from potential end users and design a user-friendly User Interface (UI). Test functionality. Iterate the online backend, using Django with GraphQL; use React JS for the frontend.Develop NLP and Object Detection Tools:Add features that incorporate data uploaded by users and that uses NLP to facilitate data. These will be established early for ongoing data collection. Models will be trained on real user data to support NLP and CV.Conduct Alpha Testing:Identify end users and deliver an Alpha version of the PWA to testers; assist with install and use. Collect user feedback, focusing on functionality issues and support, continually gathering data and feedback to iteratively improve components of Demeter. Compile data points collected in Alpha testing for use in updating synthetic data and train the GNN.Conduct Beta Testing:Deliver Beta version of PWA to testers and assist with install, use, and support. Collect feedback from users, focusing on usability concerns. Analyze usability concerns and enhancement prioritization, gathering feedback to iteratively improve components of the application. Continue compiling data points collected during Alpha testing, updating synthetic data for GNN training.Perform Iterative Improvements:Release a 1.0 version of the PWA available to a general audience, allowing users to provide feedback. Review Phase I assumptions in synthetic data generation against incoming real data and make corrections. Use data to continue improving the existing ML components of the application.Proposed Phase II Success Metrics:Process Based:Triplet Loss: Measures how well embedding space translates similarities and differences between hazards.Data Quantity: Number of data entries collected.Application Performance: Metrics tracking speed, bug frequency, and prominence.Usage Statistics: Quantity and demographics of users.Outcome Based:Precision: Fraction of actual accidents among predicted accidents.Recall: Fraction of actual accidents among predicted non-accidents.User Satisfaction: Survey results about user experience.MAE: The amount of agreement between the arithmetic formula and SMEs.

Project Methods
The R&Dphase will be divided into three steps, with Step I as the creation of a mobile Progressive Web Application (PWA).The PWA will be constructed using ReactJS for the frontend and Django, a Python web development framework, for the backend.The goal of Step II is to gather as much ground-truth data as possible, and VISIMO will encourage end users to submit records daily. A single data record is a hierarchy consisting of: (1) an activity, (2) one or more categories, and (3) one or more hazard observations for each category. A hazard refers to an object, structure, or condition that might negatively influence safety. As an example of a full observation, a common activity on dairy farms is scraping and cleaning manure. Some categories underneath this activity include manure storage pond, and above-ground manure storage. Specific hazards within the category of a manure storage pond include the quality of the fence that surrounds the pond, and whether appropriate warning signs have been placed around the pond. Based on user feedback from VISIMO's construction JSA tool,producers will have to spend less than three minutes inputting variables prior to a task to receive mitigation suggestions, with an accompanying cost-benefit analysis. After a period of work, users will update which mitigations they chose to act on, and whether a safety incident occurred.VISIMO expects Demeter's ML components will require around 5,000 to 7,000 individual data entries on which to train. Because the scope of Demeter is still limited to beef and dairy farms, it uses only 100 variables. Therefore, the number of data entries required for model training decreases significantly. With an estimated safety incident rate below 1%, collecting at least 5,000 data entries also helps to ensure that records with injuries will be present in the data. VISIMO expects to collect this amount of data within 18 months through our committed collection partners Virginia Future Farmers of America (FFA) and VA Farm Bureau. This collection window will span all four seasons, which will impact the collected data. For example, during the winter, pasture management activities are less frequent, and there is a 17% increase in the number of total injuries in agriculture (Farm Injury Resource Center, 2018). In spring and fall, there will be an increased frequency of hazards related to milking cows, as some farms milk seasonally instead of year-round (NIWA, 2023). Understanding and accounting for these seasonal impacts will help eliminate bias and improve Demeter's performance. While collecting data, VISIMO will establish both a means of uploading image data to the database as well as accepting written descriptions of hazards on the scene, in addition to Demeter's standard drop-down menu options for risk analysis. These alternative forms of data will set the foundation for the Computer Vision (CV) and NLP components discussed in Step III.In Step III, VISIMO will incorporate ML into Demeter's risk estimation capabilities, using the data gathered in Step II to train its ML models. Performing risk analysis using ML will enable the tool to model the intricate and hidden relationships between hazard variables and to create more accurate risk estimates. For ML modeling, a GNN will be implemented as GNNs aredesigned to handle complex, nested data structures (i.e., graphs), such as the hierarchical tree structure described above and used to define the observations. When reviewing an individual node on the observation structure, a traditional neural network is unable to reference non-local information to make decisions (Zhou et al., 2020), meaning network types are unable to look at the rest of the tree while evaluating a specific item. GNNs, on the other hand, make continual use of non-local information during both training and prediction, as they are focused on learning the relationships between different nodes on the tree, and can identify complex, non-linear relationships between these different variables.Phase II will focus on obtaining real-world data to train and test the model built in Phase I. Phase II will also focus on end user and purchaser testing to ensure the developed prototype meets the needs of the market. Demeter's Phase II use will create an unprecedented dataset within agriculture, providing researchers and practitioners with access to hundreds of variables and thousands of case studies. VISIMO has secured commitments from individual producers to allow data collection on their farms during Phase II, as well as a commitment from VA FFA and VA Farm Bureau to collect data for Demeter. VA FFA will utilize their 300 teachers who work with the 520 dairy and 23,000 beef farms across the state, and VA Farm Bureau will collect data during their 150 annual farm visits for safety auditing services.Precision measures how well the model avoids false positives; i.e., how likely a scenario identified as high-risk by the model actually is high-risk; while recall measures how well the model avoids false negatives; i.e., the probability that a truly high-risk scenario will be flagged as high-risk by the model. Evaluating the performance of the GNN on the synthetic data resulted in a precision of 0.3521 and a recall of 0.8333. Since the GNN can learn on the synthetic data which models the complex relationships found in real data, its performance when trained on real data is likely to achieve similar results. Continually training the model with a combination of the SME-created synthetic data and a larger volume of real data from VISIMO's committed partners for Phase II increases the likelihood of higher performance.The second method for evaluating the GNN compares its performance to that of the arithmetic formula, using the synthetic data as an evaluation set. VISIMO developed this arithmetic formula (the third result of Phase I) as a way to provide useful risk estimates to users even before sufficient data has been collected to train the ML models. Since this formula is based on SME estimates and proceeds via straightforward calculations, it can provide risk estimates in the early stages of Demeter's deployment, when comparatively little data has been collected.While agriculture is a riskier industry than others, injuries are relatively rare, with only 25 to 50 injuries expected in a dataset with 100,000 records. Therefore, the use of SMEs enables risk estimation in the initial phases of data collection prior to recording incidents. The SMEs assigned a risk value on the same scale as the arithmetic formula and VISIMO computed the MAE between the arithmetic risk scores and the SME risk scores. The MAE score indicates how well the formula agrees with SME scoring. The arithmetic formula had an MAE of 0.2110 on a scale of 0 to 1, suggesting it was effective in estimating SME scoring. After validating that the arithmetic formula was capable of approximating SME scoring, VISIMO used it as an additional means of evaluating the GNN. Because the formula outputs a value between 0 and 1, and the GNN outputs a binary prediction, the formula scores were converted to binary predictions. To do this, any observation with a score greater than 0.39 was considered a high-risk observation, while any with a score below 0.39 were considered not high-risk (the threshold value of 0.39 was determined to be optimal via simple calculations.) The formula was then applied to the synthetic data and was able to achieve a recall of 0.7666 and a precision of 0.0653. The GNN was able to far surpass these scores with a recall of 0.8333 and a precision of 0.3521, indicating that the GNN model will perform more accurately than the SME-validated formula on real-world data.

Progress 09/01/23 to 08/31/24

Outputs
Target Audience:The following market segments (i.e., customers and end users) were explored and actively considered through the first year of the period of performance: Primary Customer Extension Agencies: Bringing evidence-based science and modern technologies to farmers, consumers, and families to improve lives. Cooperatives: Ensuring safety and economic viability and serve as advocates for their members and have expressed interest in the tool's potential use as an educational resource. Insurers: Helping producers reduce accidents and lower insurance rates through proactive safety auditing. Professional Associations: Their reputation in the agriculture industry, as they support producers through the provision of guidance on best practices, training and operational resources, and connect producers with others in their sector. Educational Organizations: Supporting the future of agriculture through public school curricula focused on farming (not solely for future producers, but future agricultural insurers, businesspeople, veterinarians, etc.). Secondary Customer Small-to-Mid Sized Farms: Care about safety, sustainability and profitability Primary End User Agriculture Extension Agents: Need tool that assist in best-practice safety training and audits with producers. Insurance Underwriters: Need tools that assist them with site visits and voluntary safety audits to appropriately assess risks for small and mid-size producers. Underwriters have already sought such a tool, but nothing was available on the market. Agriculture Educators: Need tools that give young students firsthand experience with safety, risk, and the benefits of mitigation. Secondary End User Producers: Care about Sustainability; Profitability; Keeping family safe. Changes/Problems:Technical work for the first year of the period of performance went mostly as planned with a few minor adjustments to the planned workflow. For example, minor modifications were made to the alpha testing procedure. Alpha testing was broken into three stages, with time between each stage for iterative refinement and development to occur. The first stage was VISIMO internal testing. During this period, VISIMO employees not working on this project were tasked with following the user guide and testing a series of components to ensure functionality and identify bugs. In the next stage, VISIMO's educational and industry partners were provided with the tool. Using their subject matter expertise, they provided suggestions for changes to make Demeter easier to use. Finally, VISIMO provided key stakeholders with the application to solicit additional feedback. Between each of these stages and after the final stage, the application was updated to integrate the changes recommended by alpha testers and to ensure bugs were fixed. The primary modification to the planned alpha testing phase was delaying robust data collection. VISIMO decided to delay data collection until the beta testing stage to ensure data collectors had the most functional and usable application possible. The purpose of this is to guarantee VISIMO collects as much usable data as possible, and providing an unpolished application could be detrimental to users' trust and willingness to use the app. Because of this, training the GNN on real data must be delayed until the beta testing phase. The primary challenge that has been encountered has been the data requirements for training the deep learning models. VISIMO will conduct data collection during the beta testing stage because a more refined front end of the application will the improve quantity and quality of data that the team will be able to collect. This presents a challenge in the interim, because the models which are being developed must be trained and tested using synthetic data. Despite this challenge, VISIMO has developed training pipelines which will automate and speed up hyperparameter tuning once the data is available. This will mitigate the risk of beginning the training at a later stage of the project life cycle and will help to ensure the milestones of the project can be completed on time. What opportunities for training and professional development has the project provided?During the first year of working on this award, the VISIMO staff augmented their skills across a variety of topics. First, the data science team conducted robust investigation into Large Language Models (LLMs) and assessed the feasibility of integrating them into Demeter. Capitalizing on the newest trends and state-of-the-art advancements in LLMs has been a priority during the development of this project. Additionally, the software team has cross-trained on a variety of software best practices, to include robust unit testing and greatly enhancing the User Interface (UI) (i.e., intuitiveness) of the application. A fundamental goal on this project is establishing a user-friendly application that facilitates significant amounts of data collection in the field. Lastly, new project managers have assisted with this award by investigating customer and end-user perspectives and understand the USDA contracting process, which has allowed success on subsequent USDA-funded Phase I awards. How have the results been disseminated to communities of interest?Throughout the first year of the period of performance, the VISIMO team conducted customer and end-user discovery with a variety of stakeholders, to include those mentioned in the "Target Audience Section" (e.g., Extension Agents, Cooperatives, Insurers, and Educational Organizations). Specifically, members of participating organizations were provided Demeter credentials and access to help with external alpha testing. Their experience and candid feedback accelerated the development and allowed a robust improvement to the application prior to fielding to a broader audience. What do you plan to do during the next reporting period to accomplish the goals?VISIMO intends to begin the process of beta testing and robust data collection which will last for the remainder of the period of performance. Additionally, VISIMO intends to begin beta testing in September of 2024. This process will include data collectors from key stakeholders, and other supporting cooperatives and partners. The goals of beta testing will be to (1) collect data through observations of real activities on farms, and (2) solicit additional feedback and recommendations from a broader audience on the utility of Demeter. VISIMO will continue to investigate the integration of LLMs as an alternative method for users to enter data into Demeter. This will streamline the data entry process and will provide additional insights using historical data and general insights into safety and efficiency on a farm. Integrating an LLM as an alternative form of data entry could provide some users with a more intuitive user experience by allowing them to write out their daily activities using natural language, or even to speak it into their device using a voice-to-text feature. VISIMO plans to fully integrate state-of-the-art technology for voice-to-text capabilities in Demeter during the next year in the period of performance. This feature will enhance the productivity and overall experience of end users, particularly given diverse conditions.

Impacts
What was accomplished under these goals? VISIMO's technical work during the first half of the Phase II period of performance can be broken into three categories: (1) iteration on the PWA and alpha testing; (2) improvements to the Graph Neural Network (GNN); and (3) improvements to the formula-based risk assessment model. There were several major modifications made to the PWA, informed by feedback from external alpha testers, designed to increase flexibility and ease of use. One of these new features is the introduction of a new organizational structure to simplify the process of data collection and creation. One such structure consists of named collections of categories and hazards. Another feature designed to ease data entry is the "Template" feature, which allows input of tasks that are repeated frequently (e.g., a series of actions completed every day). Both allow for work to be planned and decrease the amount of time it takes in the field to fill out an observation. Secondly, there were two main focuses for the modification of the GNN. The first focus was on improving the GNN architecture used as the primary predictive model in the application. This additional structural information will help the network exploit the relative relationships between variables in the data collected. Furthermore, VISIMO modified the inputs to the model, so they are not limited to a specific set of categories. To do this, the team integrated a model to embed text descriptions of the variables rather than a pre-determined set of inputs. This allows the model to operate on natural language descriptions uploaded by users which will increase both the amount and quality of data that can be collected. Lastly, the mathematical risk estimation formula provides the ability to produce risk scores without the need for massive amounts of collected data. VISIMO investigated a method by which the AgInjuryNews dataset can be used to correlate observations in Demeter with the incident rate corresponding to a risk score. The key principle behind this method is that while the total amount of incidents in both AgInjuryNews and Demeter are unable to be matched, the fraction of incidents corresponding to an activity in Demeter should be related to the fraction of incidents seen in the AgInjuryNews dataset that involve that activity by a factor corresponding to the risk score of an observation and the inherent risk of an activity. The result of this research was that the expected calibration error will be less than 10% even if fewer than five actual incidents are seen in the Demeter data.

Publications