DSFAS-AI: Development of Convolutional Neural Networks that Connect Molecular Signatures to Rapid Optical Readouts on the Health of Chickens

DSFAS-AI: DEVELOPMENT OF CONVOLUTIONAL NEURAL NETWORKS THAT CONNECT MOLECULAR SIGNATURES TO RAPID OPTICAL READOUTS ON THE HEALTH OF CHICKENS

Sponsoring Institution

National Institute of Food and Agriculture

Project Status

ACTIVE

Funding Source

AFRI COMPETITIVE GRANT

Reporting Frequency

Annual

Accession No.

1028280

Grant No.

2022-67021-36644

Cumulative Award Amt.

$299,944.00

Proposal No.

2021-11527

Multistate No.

(N/A)

Project Start Date

Jan 15, 2022

Project End Date

Sep 14, 2025

Grant Year

2022

Program Code

[A1541]- Food and Agriculture Cyberinformatics and Tools

Recipient Organization
UNIVERSITY OF ARKANSAS
(N/A)
FAYETTEVILLE,AR 72703

Performing Department
Chemical Engineering

Non Technical Summary
With the world population projected to reach 9.8 billion by 2050, there is immense pressure on animal agriculture to increase production to meet the growing demand for high-quality protein. Broiler chickens play an important role in worldwide meat production, as they are one of the most efficient food sources, and support the livelihoods and food security of billions of people. Genetic selection for high growth rate and enhanced muscle development has made impressive progress in terms of breast yield, feed efficiency, and reduction of market age; however, this increase has not come without serious negative metabolic consequences. Additionally, the warming planet introduces several key challenges including long-term issues related to redistribution of infectious disease and immediate concerns related to rising maximum temperatures.In the context of poultry and livestock, heat stress (HS) is a key challenge exacerbated by climate change. For this reason, we chose HS as a model stressor for testing out the hypothesis laid out in the proposal due to its relevance in the context of production and welfare of poultry. HS in chickens can occur in the summer months, when temperatures often reach more than 95°F (35°C) in the regions of the United States where most broiler chickens are raised for production. Despite the use of combination of ventilation and evaporative cooling pads in broiler houses, HS still alters bird's welfare by inducing stress depressing feed intake,inducing thirst , causing immunosuppression , reducing growth, and in extreme cases increasing mortality rate by spiraling hyperthermia. HS results in an estimated annual economic loss of more than $128 million in the USA and has detrimental effects on feed intake, growth, meat yield, welfare, and mortality in modern broilers.Although we consider HS as a model environmental stressor to build our rapid reporting technique, we emphasize that both the experimental and ML-based algorithms we develop are general and have broad applicability in a wide-range of diseases and welfare-issues relevant to poultry and livestock.We will carry out this work with two key objectives: 1) Microscopy techniques will be employed to generate large data sets of chicken RBCs in LC containing both healthy and HS-inflicted cells 2) Algorithms will be developed classify the data sets in a rapid and automated fashion between the two sets 3) Validation of the algorithm will be carried out via testing on untrained data sets.

Animal Health Component

30%

Research Effort Categories

Basic

40%

Applied

30%

Developmental

30%

Classification

Knowledge Area (KA)	Subject of Investigation (SOI)	Field of Science (FOS)	Percent
315	3299	2020	100%

Knowledge Area
315 - Animal Welfare/Well-Being and Protection;

Subject Of Investigation
3299 - Poultry, general/other;

Field Of Science
2020 - Engineering;

Keywords

animal welfare

convolutional neural networks

Goals / Objectives
Overview of Project Goals: Heat stress (HS) is an important stressor that is determinental to the welfare, production and the health status of poulty. Our research efforts in this seed grant seek to build experimental datasets (optical micrographs) that will processed through machine learning (ML)-based algorithms, to eventuate the development of rapid reporting techniques that inform on the presence of HS in poultry. The fundamental hypothesis that drives our research is that the cells overexpressing well-established intracellular stress chaperones such as heat shock proteins (HSP70) also undergo changes to their mechanical properties. We introduce the concept of rapidly characterizing mechanical properties of red blood cells (RBCs) of chickens using fluids called liquid crystals (LCs). A key aspect of the development of our LC-based platform involves building ML-based convolutional neural networks (CNNs) that can generate classifiers to separate image sets of RBCs dispersed within LCs into healthy ones those of chickens experiencing HS. The RBCs dispersed in LCs provide with value image features such as cell extension, cell orientation and LC texture which are all connected to our fundamental hypothesis and enable classification via CNN. The expression levels of HSP70 will be used to define our classes for the CNN framework which would then be used to classify the experimental micrographs. Through this seed grant, we seek to establish a strong preliminary dataset (both experimental and CNN) to test this fundamental hypothesis and enable us to apply for a standard grant in this program.Goals expanded:Current methods for monitoring stress rely on the identification of molecular and protein markers such as corticosterone and HSPs. Although methods that report on molecular and protein markers have increased our understanding of HS, these methods are usually time-intensive and are not immediately accessible to the end user (farmer, technician on a production line) seeking to make informed decisions on the health and stress levels of chicken. Therefore, there is a critical need to identify reliable, and rapid ways to monitor HS in poultry.A key innovation in the proposed work in our methodology is to connect the expression of HSP70 to rapid optical readouts which characterize the health of the blood cells of chickens. We seek to build a unified picture connecting protein markers of HS and rapid readouts of their mechanical properties. The PI has previously developed an LC-based technique and it was deployed to rapidly report on the health of human RBCs. Molecules of LC are perturbed from the preferred parallel orientation when an inclusion, for instance a colloidal particle is present withing the LC fluid. This creates an orientational strain within the LC. However, if the inclusion is soft, such as an RBC, the LC can stretch out the cell and release some of the strain contained within the fluid. This sharing of strain is intimately coupled to the mechanical properties of the RBCs which we expect to change as they experience HS. LCs enable rapid readouts of the mechanical properties of cells, for instance, a simple experiment of dispersing a few µl of blood in LCs can be used to understand the health status of over a thousand cells within a few minutes. Another key innovation of our approach is the development of CNNs algorithms that connect the micrographs of chicken RBCs dispersed in LCs to molecular expression of HSP70. Our preliminary data reveals that chicken RBCs dispersed in LCs exhibit several image features such as cell orientation, cell extension, texture of LC around the cell etc, which are all a function of the HS status of the cell. These feature-rich datasets are good candidates for developing our CNN algorithms and for classifying these features to the expression of HSP70. This will enable the development of a comprehensive picture from the molecular level to the cellular level of the state of chickens experiencing HS and enable rapid readouts with the LC-based technique.

Project Methods
Methods:Polarized Optical Microscopy: The RBC-in-LC samples will be imaged through an Olympus BX51 microscope connected to a Canon DSLR camera. Imaging will be performed in a cross-polarized mode. Vapor Pressure Osmometry: Comprehensive mapping of the osmotic pressures of the LC phases formed by DSCG will be performed by Wescor 5550 osmometer at temperatures relevant to HS. Building datasets for ML methods: We emphasize that all experimental conditions related to imaging the samples (light intensity, sample orientation, polarizer orientation) will be tightly controlled to make sure that experimental artifacts like cell density or cell aggregation do not influence the CNN method. We will develop ImageJ based thresholding algorithms that screen for cell density and cell cutoff sizes in the datasets. Images not meeting the cutoff will be excluded from the CNN analysis. RT-qPCR: Chicks from the south-east Asian jungle fowl the wild-type ancestor of the modern broiler and commercial broiler strains will be sexed and wing tagged for identification. Blood will be collected via heart puncture from 7d old males and placed into K2 EDTA blood collection tubes (Becton Dickinson, Franklin Lakes, NJ) to prevent coagulation. To isolate RBCs, blood collection tubes will be centrifuged at 3000 rpm for 3 min at 4°C. Isolated cells will be diluted in RPMI1640 (Life Technologies, Carlsbad, CA) supplemented with 10% FBS and 1% penicillin/streptomycin and seeded at a density of 2 x 106 cells per well in 6-well culture dishes. All cells will be maintained at 37°C in a humidified atmosphere of 5% CO2 and 95% air. Heat stress (HS) will be induced by incubating cells at 45°C for 2h. Cells will then be collected for mRNA extraction and analysis. Total RNA will be extracted from RBC using Trizol reagent (Life Technologies, Carlsbad, CA) according to the manufacturer's instructions, and concentration and quality will be determined by using the Take3 micro-volume plate and the Synergy HT multimode microplate reader (BioTek, Winooski, VT). The ratio of absorbance (A260/280) as well as agarose gel electrophoresis will be used to assess RNA integrity and quality. RNA will be reverse transcribed using qScript cDNA Sythesis Supermix (Quanta Biosciences, Gaithersburg, MD), and amplified by qPCR (Applied Biosystems 7500 Real Time System) with SYBR green master mix (Life Technologies, Carlsbad, CA). Relative expression of the target genes will be determined using the 2-ΔΔCT method, with normalization to 18s expression.Efforts: Dissemination of the new knowledge gained through our experiments will be carried out through publications in journals relevant to poultry science and machine learning. The PI will also include modules on using liquid crystals to strain cells as a part of outreach activities carried out by University of ArkansasEvaluation: As indicated in the timeline of the grant, we expect the identification of an LC that is isoosmotic with the interiors of chicken RBCs and able to strain them in the first 9 months. We then expect a feedback loop between our ML algorigthms and further experiments which will lead to a specific and rapid identification of HS in chickens within the 2 year timeframe.

Progress 01/15/23 to 01/14/24

Outputs
Target Audience:Poultry farms and processing plants. Changes/Problems:NA What opportunities for training and professional development has the project provided?My students presented the work at the AICHE meeting. How have the results been disseminated to communities of interest?Two conference presentations and one journal publication (impact factor: 3.5) What do you plan to do during the next reporting period to accomplish the goals?We plan to study the specificity of the sensor

Impacts
What was accomplished under these goals? ?A simple, lightweight, convolutional neural network ML model, 3 convolution layers deep, was found to be capable of distinguishing between minute differences in the shapes or the aspect ratios of RBCs. The model was trained on microscope images of cells immersed in an isotonic solution of a nematic liquid crystal (DSCG). We demonstrate that LCs biomaterial would 'sense' small differences in the mechanical properties of cells, as evident from the changed shape of the cell or the liquid crystalline color pattern around the cells (in the polarized images) and could be used to detect the presence/onset of diseases, or microbes in air or water in tandem with trained ML models. An underlying and important hypothesis - that biochemical changes in an organism could affect mechanical changes in its cells - makes for a rich and interesting endeavor for future researchers; similar to how the expression of HSP70 protein was shown to make the heat-stressed RBCs stiffer in this study. The 250K parameter model is easily trainable on GPU equipped personal computers using just a few hundred study-specific micrographs. The trained model is lightweight enough to fit in a sensor computer and fast enough to virtually instantly perform the classification on every new 'photograph'. In the context of the current rise of foundation vision models, whose fine-tuning and training are prohibitively expensive and possible only at very large research laboratories, our approach is geared towards empowering labs and individuals all round the world to construct and train their own ML models on general or niche tasks related to imaging and/or sensing. In experiments, it was found that crosslinking the chicken and human RBCs using glutaraldehyde in order to simulate a diseased cell was an adequate strategy for planning, building, training, and evaluating valid ML models ahead of collecting of actual training data. In our case, no model-tweaking was found to be necessary while going from the simulated to the real heat-stressed cells. Because biological data could often be available in less quantities and later in the study, we believe that our simulation example might come in handy for researchers looking to work on their ML models while waiting, or those looking to simply augment their data in an appropriate manner. The successful implementation of the CNN algorithm to rapidly identify HS also opens up new questions related to our line of investigation including: i) Are final static response (micrographs) enough to develop a ML classifier to discriminate (specify) different RBC strains? ii) How do the dynamic conditions (response of cells as a function of time under HS) contribute to developing efficient and specific classifiers? iii) Can static images at TN conditions predict the expression levels of HSP70 at HS conditions. Future work to answer these questions we will focus on collecting and maintaining a large and diverse image data base of strained and unstrained samples, and by developing dedicated deeper CNNs that are computationally efficient through use of ResNets. Additionally, CNN frameworks that process multi-channel image datasets, that include spectral channels, and dynamic images, beyond conventional 3 channel RGB images.

Publications

Type: Other Journal Articles Status: Published Year Published: 2024 Citation: A liquid crystal-based biomaterial platform for rapid sensing of heat stress using machine learning, P. Verma, E. Adeogun, E. S. Greene, S. Dridi, U. Nakarmi, K. Nayani*. Sensors and Diagnostics. 3 (11), 1843 (2024)

Progress 01/15/22 to 01/14/23

Outputs
Target Audience:In this project period our work was mainly disemminated via publications ( one published and one submitted). Therefore, the target audience was the populations engaging in scholary publications including scientists, farmers and agricultural researchers. Changes/Problems: Nothing Reported What opportunities for training and professional development has the project provided?Graudate student was trained in obtaining optical microgaphs and then in coding the CNN layers. How have the results been disseminated to communities of interest?We have published one manuscript in a scientific journal and currently have the second manuscript under review. What do you plan to do during the next reporting period to accomplish the goals?We aim to reach a 99% accuracy for the model in identifying heat stress inpreviously unseen optical data. We expect to work on tuning the hyper parameters to acheive this goal. Further, we aim to conduct new experiments on the specificity for the test and train a new model wherein there are several other stressors affecting the chicken

Impacts
What was accomplished under these goals? This year, we were able to successfully accomplished the following sub-goals: Sample preparation and optical microscopy As-received RBCs were dispersed in a 154 mM NaCl isotonic solution; an isotonic solution ensured that cells maintained their natural elliptical shape. Typically, about 5 µL RBCs were mixed in about 60 µL isotonic NaCl solution to obtain the RBC dispersion. A 17.3% (w/v) DSCG aqueous solution was prepared by mixing DSCG in water in a vortex mixer for 4 hours. Previous works have showed that 17.3% (w/v) aqueous DSCG is isotonic with the interiors of an RBC [ref], which ensured that the RBC shape change was solely due to the mechanical interaction between the LC field and the RBC. Since RBCs naturally strain when put in a DSCG solution, strained cell samples were obtained by adding 2 μL of dispersed RBCs to 60 μL of DSCG solution and gently swirled. To prevent straining in DSCG and obtain unstrained cell samples, glutaraldehyde was used to crosslink and stiffen the RBCs [ref]. A stock solution of 5% v/v of glutaraldehyde in water was used; 5 µL RBCs were slowly pipetted into 0.2 µL of this stock to effect crosslinking. The final glutaraldehyde concentration in the cells was chosen to be around 0.2 % to make sure the individual cells were fixated and do not form aggregates. The solution was slowly mixed on a shaker for an hour to allow the glutaraldehyde to completely crosslink. About 2 µL of crosslinked RBC was then added to 60 µL of DSCG solution and gently swirled.HSP70 RBCs were collected from 21-day old broiler chickens that had been exposed to acute heat stress (35°C for 2 hours). Whole blood was collected into EDTA coated tubes and the RBCs were isolated from the whole blood by centrifugation and washing with PBS three times. For imaging, RBC samples were transferred (post swirling) to microscope slides. Micrographs were obtained using an Olympus BX41 optical microscope fitted with a 40x objective lens. Polarized and brightview images were captured in the presence and absence of a polarizer respectively. Building datasets Images from the microscope(s) were obtained in a variety of sizes, aspect ratios and formats. Images with differences in tint, brightness, contrast, lighting were included. Images with scratch marks on the microscope slides or of samples containing foreign objects (like dirt or lint) or containing things other than the RBCs were also included. This was done to increase the diversity in the dataset, keeping in mind the plausible diversity that the trained model may encounter during testing and after deployment. Collected images were at least 1800 px in height, either 3:2 or 4:3 in aspect ratio (width to heigh ratio), and saved in one of jpeg, bmp, tiff, or raw format. All images were RGB (containing information in red, green, and blue channels). Before building the dataset, all images were cropped and resized to the same size and converted to jpeg format. A square section from the center of the image was selected. This was done to (1) discard the sides which sometimes contained portion of the microscope slide or stage outside of the actual sample and (2) standardize the aspect ratio to 1:1 from 3:2 or 4:3. The resulting square image was scaled down in size to exactly 1000 px wide and 1000 px tall using bicubic resizing algorithm in PIL. Each image was assigned labels (such as species, chemicals used, and magnification etc.) that were stored in a tabular form within excel files. These labels were used to programmatically find images matching a certain criterion. For instance, polarized images of unstrained chicken RBCs were found by logical querying of these labels: polarized is "True", chemicals used contains "glutaraldehyde", species is "chicken", and cell type is "RBC". Ten distinct subsets of the dataset (called classes) were built using this process and are summarized in Table 1. Each class was shuffled and split into training (train), validation (val), and test (test) sets in 70:15:15 ratio, respectively. Table 1. Split of images between training, validation and test set in each dataset class or dataset name train val test brightview unstrained chicken 112 24 24 strained chicken 585 125 125 unstrained human 350 75 75 strained human 350 75 75 strained HS chicken 489 105 105 polarized unstrained chicken 798 171 171 strained chicken 1264 271 271 unstrained human 390 83 83 strained human 350 75 75 strained HS chicken 423 90 90 Building the CNN Convolutional neural networks (CNNs) were built using the TensorFlow library (that utilizes Keras API) in Python. Images from the datasets were imported using flow_from_directory function of Keras's preprocessing module, which yields batches of images indefinitely during training. The preprocessing module was also set to randomly flip the images horizontally or vertically during training, to add to the richness of the dataset as part of the data augmentation process. The batch size was set to 32 and the imported image size was set to 500 x 500 pixels. A simple binary classifier CNN, meaning it classified images into one of two input classes, was built. The model is schematically shown in Figure 2b. It consisted of three convolutional layers, each followed by a max-pooling layer. A ReLU activation function was used with each convolutional layer. Image tensors were flattened to a single dimension before being passed to the final dense layer consisting of a single neuron; this neuron yielded the probability that a particular image belonged to one of the two classes. This probability was yielded by a sigmoid activation function, that maps the input to a value between 0 and 1. The model's parameters are shown in Table 2. Model was compiled using the optimizer Adam and the loss function sparse_categorical_crossentropy. Performance of the model was measured using accuracy as the metric. Table 2. Summary of the CNN architecture obtained using Keras's summary method after defining the model. The type, shape of the output tensor, and the number of parameters (params) for each Keras layer are shown. layer name (type) output shape params conv2d_1 (Conv2D) (None, 498, 498, 16) 448 maxPooling-1 (MaxPooling2D) (None, 249, 249, 16) 0 conv2d_2 (Conv2D) (None, 247, 247, 32) 4640 maxPooling-2 (MaxPooling2D) (None, 123, 123, 32) 0 conv2d-3 (Conv2D) (None, 121, 121, 64) 18496 maxPooling-3 (MaxPooling2D) (None, 60, 60, 64) 0 flatten_1 (Flatten) (None, 230400) 0 dense_1 (Dense) (None, 1) 230401 Total params: 253,985 Trainable params: 253,985 Training the CNN In a given machine-learning experiment, a model could be trained to distinguish between two of any of the classes shown in Table 1. Specifically, models were trained to distinguish between unstrained and strained cells ('A and B', 'C and D', 'F and G', 'H and I') and between healthy and HS chicken cells ('B and E', 'G and J'). Experiments are depicted as first initials of class names (such as 'AB', 'CD' and so on) used in it, for convenience. Training was performed using the fit function on training image set, and accuracy was calculated and recorded during training for both training and validation sets. A receiver operating characteristic (ROC) curve was plotted for each experiment for a range of decision thresholds (101 linearly spaced threshold values from 0 to 1) for predictions made on validation sets. The best threshold was selected to be the one that was closest to the point (0, 1) on the ROC plot. The confusion matrix corresponding to the best threshold has been reported. A confusion matrix was also calculated for the predictions made on the test set using the best threshold (that was evaluated on the validation set) and has been reported.

Publications

Type: Journal Articles Status: Published Year Published: 2022 Citation: Effects of heat stress on cyto(chemo)kine and inflammasome gene expression and mechanical properties in isolated red and white blood cells from 4 commercial broiler lines and their ancestor jungle fowl. Elizabeth S. Greene, Elizabeth Adeogun, Sara K. Orlowski, Karthik Nayani, Sami Dridi. Poultry Science, Volume 101, Issue 6, June 2022, 101827