Source: BIG DATA IN A BOX, LLC submitted to NRP
SPIDER: A SMART PORTABLE INTERACTIVE DATA EXTRACTION AND REPORTING TOOL FOR ORGANIC FARMERS
Sponsoring Institution
National Institute of Food and Agriculture
Project Status
COMPLETE
Funding Source
Reporting Frequency
Annual
Accession No.
1022751
Grant No.
2020-33610-32057
Cumulative Award Amt.
$100,000.00
Proposal No.
2020-00426
Multistate No.
(N/A)
Project Start Date
Sep 1, 2020
Project End Date
Jan 31, 2023
Grant Year
2020
Program Code
[8.12]- Small and Mid-Size Farms
Recipient Organization
BIG DATA IN A BOX, LLC
2700 S LOOP DR
AMES,IA 50010
Performing Department
(N/A)
Non Technical Summary
Keeping accurate and timely records is a challenge for all farmers. Organic farmers, in particular, struggle to manage required records for maintaining certification, which is one of the main reasons organic farmers are not able to bridge the gap between supply and demand for organic products. Owners of small- and mid-sized farms face this issue more acutely as they transition to or work to maintain their organic certification. A recent study of 1,800 farmers has shown that ~40% report record-keeping as a key obstacle to staying in business. Record-keeping and filing requirements vary by the type/variety of crops and produce. A typical owner of a small or mid-sized farm grows multiple types of crops, which makes record-keeping and filing requirements more onerous and often dissuades farmers from seeking organic certification. Big Data in a Box LLC (BDiB) proposes to develop and validate a portable device to capture and digitize requisite data, making it efficient and affordable for these farmers to transition to organic or to maintain certification. Phase I goals are to devel-op 1) a cost-effective and easy-to-use system that supports farmer-owner facility/certification compliance, and 2) an innovative, robust, cost-effective portable edge-computing device that is network-independent. The commercial BDiB software product resulting from this SBIR project will be provided via subscription, along with a cost-effective data capture device.The retail market for organic products was valued at $50 billion by the Organic Trade association in 2018 and continues to grow at 20% per year. Reducing the barriers to record-keeping will support farmers' efforts to transition to or maintain organic certification, thus meeting the growing demand. Through its edge computing features, BDiB's Smart Portable Interactive Data Extraction and Reporting (SPIDER) tool is expected to work with or without an Internet connection, and will ease the documentation burden of organic certification. Once the technology tool is tested and validated with farmers, it will be marketed via a subscription service to owners of small- and mid-sized farms, initially to the organic sector and then to conventional farms. Our market potential spans small- and mid-sized farms, which constitute 94% of the 2.1 million farms nationwide. With an anticipated subscription price of $60 per year for the basic tool, we hope to reach a significant portion of the market within five years or less. Because our tool is based on an edge-computing model, in which the data can be manipulated at or near the source of the data, so users will have more autonomy with data collection, storage, and use.We plan to use a Software as a Service (SaS) business model. We will price a basic record-keeping system of a standalone product with software, with an option to update the software libraries, and offer more capabilities with a premium subscription model.
Animal Health Component
40%
Research Effort Categories
Basic
20%
Applied
40%
Developmental
40%
Classification

Knowledge Area (KA)Subject of Investigation (SOI)Field of Science (FOS)Percent
20501991060100%
Knowledge Area
205 - Plant Management Systems;

Subject Of Investigation
0199 - Soil and land, general;

Field Of Science
1060 - Biology (whole systems);
Goals / Objectives
Develop a cost-effective and easy-to-use system that supports farmer-owner facility/certification complianceDevelop an innovative, robust, cost-effective, and portable edge-computing device that is network independent.Testing will be carried out on regional farms for Phase I proof of concept.
Project Methods
EffortsMeet with participating farmers and establish data collection process and testing goalsPrepare a "gap analysis" of compliance requirements for each participating farmDesign and implement the data collection processDevelop static web pages of the information presented in several different designsBuild and test data collection forms on the deviceSet up scenarios for data collection in the fieldDocuemnt the best combinations of record-keeping methods for farmers, such as online references (e.g., extension documents, photos) and farmer-developed notesDevelop a software library of data collection and analysesCreat the software library using open source toolsEvaluationDevelop and administer surveys to the farm participants before the end of the project to determine their satisfaction with the data collections forms, the device interface and usability, and the support for the technology use and trainingCollect data on system performance and develop additional tests as neededPerform economic feasibilty of the current toolDevelop a pre-test to identify data requirements for certification of specific crop or produce that will be farmed as organicMeasure the ability of our farmer participants to identify data requirementsMeasure data collection success when using our tool for certification

Progress 09/01/20 to 01/29/23

Outputs
Target Audience:We have engaged three Iowa fams: Mustard Seed Farms, Root to Rise farms, and SimpleLife Farms. We have prepared surveys to query about 500 organic farmers in Iowa. Before sending out the survey, we plan to conduct a focus group study of six to nineIowa small produce farmers. Changes/Problems:During our project, we realized from field experience and feedback from the farmers that they preferred a software solution instead of the proposed hardware-based solution. Based on this feedback, we repurposed our goals to build a software-based solution and have it available via an easy-to-access web portal. We continue to refer to this portal-based application as SPIDER, as originally named. We also moved away from hardware-based data input (e.g., using a finger, stylus, or keyboard) to voice input. Further, we used artificial intelligence tools to convert voice input to actionable data. What opportunities for training and professional development has the project provided?We have employed several interns, both graduate, and undergraduate. Our interns represent minority females in the STEM field. These interns helped BDiB in our research efforts and SPIDER application development. Also, we engaged with three freshmen honors students from the computer science program of Iowa State University to work on the SPIDER technology. How have the results been disseminated to communities of interest?We have demonstrated the capabilities of our SPDER application to participating farms, namely Mustard Seed Farms and Root to Rise Organic farm. We now participate in the NSF-NIFA multi-university consortium research project led by Iowa State University (AIIRA). We also are a principal participant with Translational AI Center at Iowa State (TrAC). What do you plan to do during the next reporting period to accomplish the goals? Nothing Reported

Impacts
What was accomplished under these goals? We developed a web-based application called SPIDER. SPIDER assists the farmer in gathering all of the daily farm activitiesthrough voice input. With open AI, the audio input is transformed intostructured data and saved in a protected and secure database. Only authorized and authenticated users (farmers and BDiB developers) will have access to the reports and information in the database. The farmer will eventually use this information to submit an application for organic certification.

Publications


    Progress 09/01/21 to 08/31/22

    Outputs
    Target Audience:We have engaged two Iowa farms namely Mustard Seed Farms and Root to Rise Farm in our research efforts. We have prepared surveys to query about 500 organic farmers of Iowa. Changes/Problems:Problems Encountered During the interviews with farmers regarding their pain points in terms of meeting the USDA NOP compliance requirements, the issue about not having to type too many times to enter the data was a challenging one. We were able to resolve it by planning to use voice recording. We are currently able to record voice when the device is connected to wifi but we are trying to figure out a solution so that voice input can be recorded without wifi. The other challenges we are facing are converting the voice data to automatically get transferred to the intermediary tables and conversion of digital images into data. We hope to figure out a way for all or most of these problems. We are fine tuning conversion of the voice input into digital data. We are currently focused on addressing the problem of porting our App to other mobile platforms as well as web platforms. What opportunities for training and professional development has the project provided?We have employed two interns from Iowa State University who represent minority female. These interns helped us in our research efforts and in early stages of SPIDER too; App development. In addition we also engaged with three freshman honors students from the computer science program of Iowa State University to work on the App. How have the results been disseminated to communities of interest?We have engaged two Iowa farms namely Mustard Seed Farms and Root to Rise Farm in our research efforts. We have prepared surveys to query about 500 organic farmers of Iowa. What do you plan to do during the next reporting period to accomplish the goals?Possibility of IP: We request the farmers to use key phrases to record their daily activities. Because of the context sensitive key phrases in transcription, we are in a position to use them to convert them into our data fields. Since converting the transcribed data into the data fields is a time-consuming process. we are adapting the AI technology to the transcripts to extract the relevant information using our proprietary algorithm and feed it to the application that we developed. This is unique to us and we intend to claim IP on it. Once we are able to test and validate the effectiveness of this approach, we would be in a position to claim IP. We are initiating the conversations with the Iowa State University Office of the Intellectual Property to file copyright protection of the App and its algorithms. We would be demonstrating our App to a national consortium of universities involved in the multi-year USDA-NSF project (AIIRA) where we are a stakeholder.

    Impacts
    What was accomplished under these goals? We have developed a standalone app to aid in the collection and storing of the farm operational data. This is accomplished through a voice input mechanism and AI tools.We developed theApp of the SPIDER tool using open source resources and proprietary algorithms. Surveys have been developed by Field Research Specialist Dr. Priyanka Jayashankar and we are planning to send the surveys to 500 organic farmers from all over Iowa and hope to get the response from around 250 farmers.

    Publications


      Progress 09/01/20 to 08/31/21

      Outputs
      Target Audience:Project Director Dr. Reddy, Co-Project Director Dr. Nilakanta from Big Data in a Box, LLC (henceforth BDiB) and BDiB Ag Consultant Kenneth Beamer visited multiple farms to discuss with the farmers dealing with vegetable and fruit crops as well as row crops. The discussion revolved around the crops they grow, challenges faced by them in decision making process every year, problems related to data collection and to know what they would need to make the ideal decisions. BDiB also checked with the farmers regarding their interest in obtaining organic certification and challenges related to obtaining and maintaining the organic certification. BDiB decided to work with two small vegetable and fruit crop farms: one natural food growing famer (Mustard Seed Farm) that is interested in obtaining the organic certification for part of the farm and one farmer (Root to Rise Farm) who obtained organic certification in spring of 2020. The idea is to see how BDiB device can help the farmers in storing data towards obtaining organic certification or to collect data to renew the organic certification respectively. We also will be working with one mid-size row crop farmer (Story County Farm) to see how our device can help in collecting the data and to see if they would be interested in future organic certification. Changes/Problems:Problems Encountered During the interviews with farmers regarding their pain points in terms of meeting the USDA NOP compliance requirements, the issue about not having to type too many times to enter the data was a challenging one. We were able to resolve it by planning to use voice recording. We are currently able to record voice when the device is connected to wifi but we are trying to figure out a solution so that voice input can be recorded without wifi. The other challenges we are facing are converting the voice data to automatically get transferred to the intermediary tables and conversion of digital images into data. We hope to figure out a way for all or most of these problems. We are fine tuning conversion of the voice input into digital data. What opportunities for training and professional development has the project provided?Female under-represented minority undergarduate and graduate STEM students from Iowa State University participated in our research projects as paid interns. How have the results been disseminated to communities of interest? Nothing Reported What do you plan to do during the next reporting period to accomplish the goals?Successes To Date The first success was for Dr. Reddy, Dr. Nilakanta and Ag Consultant Mr Beamer be able to collect the requirements of the farmers and classify them into major and minor pain points. Major pain points are 1) inability to type too many characters while collecting the data in the field as hands get wet and/or dirty very fast, and problem in storing and 2) finding the sales receipts, input expense receipts etc. We were able to provide voice input recording and digital image capture as solutions to the above two problems was our first success. However we are still trying to fine tune the voice data input and digital image capture. We are able to record the voice without the wifi. We are now recommending the farmers to do voice recordings in the room or barn to avoid the interference by the wind. The other success we had was realization that the voice recordings done using key phrases has improved the ability to transcribe the digital voices. Tasks in Progress Possibility of IP: We request the farmers to use key phrases to record their daily activities. Because of the context sensitive key phrases in transcription, we are in a position to use them to convert them into our data fields. Since converting the transcribed data into the data fields is a time-consuming process. we are adapting the AI technology to the transcripts to extract the relevant information using our proprietary algorithm and feed it to the application that we developed. This is unique to us and we intend to claim IP on it. Once we are able to test and validate the effectiveness of this approach, we would be in a position to claim IP. The development of exit surveys will be done after we validate our technology and analysis of data from all the tasks will be analyzed towards the end of the project which would be towards the end of the year as we have received the extension of the project.

      Impacts
      What was accomplished under these goals? Task1.1: Kickoff meeting, farmer input and goal setting for data collection process and testing The various steps involved in obtaining the seeds and or cuttings, field preparation, soil testing, water quality testing, seeding/planting, fertilizer/pesticide/herbicide application, harvest, produce cleanup, produce shipping/delivery and/or sale of produce were discussed for each farm. Mustard Seed Farm delivers the produce directly to their customers through volunteers, Root to Rise Farm arranges for pickup and direct sale in local farmers market while Story County Farm sells it to the local coop or other entity based on demand. Task 1.2: Collect data requirements for crop types: Two major requirements suggested by all farmers for any ideal electronic device for their needs included the following i) an alternative to entering data by reducing the necessity of using touch pad as the fingers of the farmers can be big or rough or dusty and hence hinder the typing using touch pad ii) having picture capture capability. Based on the discussion with farmers, we also identified the following main gaps for three farms. Mustard Seed Farm (Interested in organic certification): Currently have data mostly through hand written notes, need help in converting them into electronic format. Need easier way for direct electronic data entry, can use BDiB device. No formal field map and data driven crop rotation decision making process exists. Don't have information as to what exactly is needed for organic certification even though they do not apply any product that is not in the National Organic Program (NOP) substance list. Need help in identifying the best yielding fruit trees. Root to Rise Farm (Interested in renewing organic certification): Currently have data mostly through hand written notes, need help in converting them into electronic format. Google Earth based field map, can number the fields in the farm so as to make crop rotation decisions easily. May need help with electronic data entry. Need easier way for direct electronic data entry, can use BDiB device. Story County Farm (currently not interested in organic certification): Have a different device for electronic data entry. May need information about help with electronic data entry. Need to be informed about NOP. Compare data entry process in the currently used expensive device vs BDiB device Task 1:3: Design/implementation of data collection process at participating farms We have outlined the steps needed for the different farms based on their requirements and made a plan of action towards the compliance with certification needs based on USDA NOP guidelines. PI Dr. Reddy discussed with Ag Consultant Kenneth P. Beamer to develop intermediary data collection tables shown later in this section. The idea is to test the voice input being automatically converted into digital data that can automatically get transferred into these tables. Table 1. Inputs including purchase receipts, product labels of inputs used for fertility, pesticide, Irrigation management Table 2. Seed inputs including purchase receipts, enquiry from atleast three sources, planting, transplanting Table 3. Test ouputs: including lab analyses reports of water and soil Table 4. Harvest output including storage, sales and left over harvested produce Task 1.4: Farmer interview/surveys and development of support documents Summer intern and Iowa State University student Adriana Gerardo worked with the farmers at Mustard Seed Farms and Root to Rise Farm. Adriana interviewed the farmers and found about the improvements that can be brought to use the voice recordings and digital image capture. Farmers are happy with using the SPIDER tool for the possible outcome they derive but practical problems like the device going into sleep mode and unclear recording when wind is high, sweat running onto the devices etc. were pointed out. Task 2.1: Pre-test and develop data collection procedures to meet certification compliance We plan to develop data collection procedures using the Operating System that can integrate the communication and software development all on one platform. Project Director, Co-Project Director, Ag Consultant and Programmer Indira Reddy discussed and decided that the data collection will be done in different folders that are directly related to the NOP requirements. This provides an ability to easily take pictures of documents related to a given folder and store it there for later use. This alleviates the problem of losing documents by capturing the images soon so that the time stamp also can be used as the date of a given activity. We decided to develop and fine tune these data collection procedures to address the organic certification process of Iowa Department of Agriculture and Land Stewardship (IDALS) as the farms we are targeting for this work are all in Iowa. The intention is to focus on research leading to appropriate data capture for the given farm for alignment with NOP compliance. Task 2.2.: Choose certification guidelines for specific agencies The certification guidelines referred on IDALS site https://www.iowaagriculture.gov/AgDiversification /producersPage.asp will be used for developing our device. The idea is to get the device tested by farmers based in Iowa. Since each agency has their own guidelines, we eventually want to develop tools to meet the requirements for the agency chosen by the individual farm. However, we want to demonstrate the efficacy of our research using the IDALS NOP requirements as the benchmark for a transitioning farm (Mustard Seed Farm) or a continuing farm (Roots to Rise Farm). Task 2.3: Test, analyze, and validate data collection procedures The data collections procedures are still being standardized and once that is completed, we plan to develop a library of procedures for accurate record-keeping. Currently we are focused on recording and converting voice data to automatically fill in the intermediary tables that we created. Task 2.4: Identify scope for system testing and desired additional tests Since the season began in the end of April, we took extension and tested the edge computing, to determine the applicability of the SPIDER tool and software for different organic crops and produce. Task 2.5: BDiB tool and software specifications and development and economic feasibility Voice input using key phrases is helping in transcribing the digital voices. As described in the section Task 1.3, different intermediary data collection tables have been designed for the standardization of this method.

      Publications