Progress 08/01/23 to 07/31/24
Outputs Target Audience: Scientists, epidemiologists, and swine veterinarians Government and policymaking agencies involved in animal disease control and prevention Swine industry stakeholders (e.g., swine production companies, producers, pharmaceutical companies, biosecurity companies) Changes/Problems:Due to the interest of our post-doctoral assocaite at the University of Minnesota in structural modeling, the primary work for WP2 is now being done at Minnesota rather than at Roslin Institute. What opportunities for training and professional development has the project provided?The project has provided two postdoctoral researchers with opportunities to learn and develop skills in machine learning, phylogenetics and structural bioinformatics. This student also supports one graduate student work on PRRSV bioinformatics. How have the results been disseminated to communities of interest?The results have been disseminated to communities of interest through various channels. The veterinary diagnostic laboratory at the University of Minnesota, which processes a significant volume of swine disease samples in the U.S. Midwest, has integrated the new machine learning algorithm for PRRSV-2 variant classification into its services, replacing the traditional RFLP typing with our classification system in the ORF5 sequencing report provided to clients. Additionally, we presented the development and application of this classification system to different stakeholders in the swine industry. An infographic detailing the variant classification system was distributed as a leaflet at the 2024 AASV meeting, where swine practitioners convened, and we delivered oral presentations on this topic at the 2024 Allen D. Leman Swine Conference, attended by scientists, practitioners, and producers. Ultimately, the findings from the WP1 study have been published in a peer-reviewed journal, along with a publicly accessible variant classification platform available on a dedicated website. Academic conference attended by industry: • Kimberly VanderWaal, Nakarin Pamornchainavakul: PRRSV-2 genetic classification (Infographic). AASV Annual Meeting. Nashville, Tennessee, February 24-27, 2024. • Kimberly VanderWaal, Paul Yeske: PRRSV-2 genetic variant classification: What is it and why we need it? Allen D. Leman Swine Conference, St. Paul, MN September 21-24, 2024. Presentations to industry groups: - We have done 3different orientaitons thus far, where we have met with veterinary practices/swine production companies to oreint and educate them to the new clasificaiton system. This includes Fairmont Veterinary Clinic, Swine Vet Center, Vaxxinova. Media: • 2024 Article in National Hog Farmer: "Surprising but true: PRRSV one of the most sequenced viruses in the world" By: Kimberly VanderWaal. https://www.nationalhogfarmer.com/livestock-management/surprising-but-true-prrsv-one-of-the-most-sequenced-viruses-in-the-world • 2024 Morison Swine Health Monitoring Program: "Surprising but true: PRRSV one of the most sequenced viruses in the world" By: Kimberly VanderWaal. https://umnswinenews.com/2024/08/23/surprising-but-true-prrsv-is-one-of-the-most-sequenced-viruses-in-the-world/#:~:text=For%20those%20outside%20the%20pig,is%20unparalleled%20by%20human%20medicine. What do you plan to do during the next reporting period to accomplish the goals?As we conclude the first year of the project, we have successfully completed Activity 1 (Variant Clustering) of Aim WP1 ahead of schedule, and we anticipate finalizing Activity 2 (Automated Variant Classifier) by the end of this year. In line with our project plan, we have also made at least 25% progress on Aim WP2a. Furthermore, we have initiated Aim WP3 earlier than planned (initially scheduled for year three) to address the active use of the application developed in Aim WP1. Moving forward, we aim to adhere to the proposed timeline or even accelerate some aims in anticipation of Aim WP1 results, which are foundational to the subsequent aims and will soon be ready for implementation. Our detailed plans for the next reporting period include the following: WP1 - The manuscript on Activity 2 (Automated Variant Classifier) has been submitted to a journal and is currently under peer review, with expected publication by the end of 2024. During this period, we will continue to refine our machine learning algorithm based on feedback from ongoing and new users, including veterinary diagnostic laboratories and their clients. Any adjustments, along with updates to the variant classification model trained on new sequencing data from the Morrison Swine Health Monitoring Project (MSHMP), will be uploaded to our GitHub page and RShiny app website on a quarterly basis. WP2a and WP2b - For the AlphaFold analysis, we plan to complete the structural prediction of 158 GP5/M heterodimers, along with separate GP5 and M proteins using various models and cleavage site predictions. We will compare the structural information, including protein conformations and confidence parameters, across all models to identify the best prediction. This optimized structural configuration will then be applied to GP5 and M proteins from newly generated PRRSV-2 whole genome sequences in cross-neutralization experiments. We aim to measure the association between structural divergence, amino acid variation, and antigenic distances and develop a machine learning model for antigenic and phenotypic prediction based on these features. WP3 - The primary challenge we currently face with this aim is improving the accuracy of the virus fitness predictive model. We will explore strategies such as downsampling data from overrepresenting classes to address dataset imbalance or modifying the predictor variables to enhance the model's performance. Our goal is to achieve at least 70% accuracy by the end of the first quarter of 2025. WP4 - We will continue to add trained modes from WP2 and WP3 into the PRRSLoom webtool as they become available. Additionally, we plan to present our research findings at upcoming conferences, including NAPRRS 2024 and the AASV meeting in 2025.
Impacts What was accomplished under these goals?
WP1- Completion status = 95% We utilized a database of over 28,730 sequences from 2010 to 2021 to develop a fine-scale classification system for PRRSV-2 variants. We systematically compared 140 approaches, assessing various tree-building methods and criteria for defining variants. The three most effective approaches yielded reproducible classifications, with the average genetic distance among sequences within the same variant ranging from 2.1% to 2.5%, while the divergence between variants was 2.5% to 2.7%. We trained machine learning algorithms that accurately assigned new sequences to existing variants with over 95% accuracy. This work is supported by a published paper and another under review. The classification tool is publicly available at https://stemma.shinyapps.io/PRRSLoom-variants/ with code accessible on https://github.com/kvanderwaal/prrsv2_classification. WP2a- Completion status = 25% We incorporated 153 nucleotide sequences of ORF5 and ORF6 genes from NCBI GenBank, representing various PRRSV-2 lineages and sub-lineages in the U.S. from 1992 to 2021, along with five common commercial live attenuated vaccine strains. These sequences were translated into GP5 and M proteins for structural prediction analysis. Using AlphaFold2 on the University of Minnesota's supercomputer, we predicted the structures of PRRSV-2 GP5 and M proteins in several configurations, achieving predictions for over 75% of the data. Preliminary findings suggest a disconnect between structural distances and amino acid distances for GP5/M proteins, indicating the need for further exploration, particularly focusing on epitopic sites in subsequent analyses. WP2b- Completion status = 50% We have collaborating with a company, Phibro Animal Health, who has shared with our team data on cross-neutraliztiaon between a panel of PRRSV-2 variants. We used these data to train a randomForest model in R to predict the potential of anti-sera generated against one PRRSV-2 variant to provide protection against a different variant. The model is reasonably accurate, with accuracy >70%. We also have obtained data from this group on lung lesions in immunized and challenged aniamls, to allow us to assess wheether the ML algoirthm predictions (trained on in vitro data) provided an approxiation of in vivo protection. Results look promising thus far, with general agreement between the in vivo and in vitro studies. A manuscript is being drafted. WP3- Completion status = 20% We utilized the same dataset from WP1 to apply machine learning for predicting the epidemiologic fitness of new PRRSV-2 variants. We identified 20 candidate features from ORF5 sequence alignments, variant classification data, and the phylogenetic tree to forecast variants with potential population expansion exceeding 200% in 12, 24, or 36 months. Fourteen machine learning classifiers were trained using 70% of the data with 10-fold cross-validation, achieving accuracy rates between 50% and 65% depending on the model and prediction timeframe. Notably, the random forest and extra tree classifiers performed the best, with local branching index and within-variant genetic distance emerging as key features. Further refinement of features and methodologies is underway to enhance prediction accuracy and reduce processing time. WP4- Completion status = 50% The classification webtool from WP1 allows end-users to classify their own sequences into variants. This tool also includes additional contextual information that a user can look-up once they have their variant ID. Notably, they can view the current prevalence of their variant in the entire dataset, and also get a report on the "occurrence" trend, i.e., if the occurrence of the variant is accerlated (doubled over 12 months), decellerating (decreased by 1/2 over the past 12 months), or is stable.
Publications
- Type:
Journal Articles
Status:
Published
Year Published:
2024
Citation:
VanderWaal K, Pamornchainavakul N, Kikuti M, Linhares DCL, Trevisan G, Zhang J, Anderson TK, Zeller M, Rossow S, Holtkamp DJ, Makau DN, Corzo CA and Paploski IAD (2024) Phylogenetic-based methods for fine-scale classification of PRRSV-2 ORF5 sequences: a comparison of their robustness and reproducibility. Front. Virol. 4:1433931. doi: 10.3389/fviro.2024.1433931
- Type:
Journal Articles
Status:
Under Review
Year Published:
2024
Citation:
Kimberly VanderWaal, Nakarin Pamornchainavakul, Mariana Kikuti, Jianqiang Zhang, Michael Zeller, Giovani Trevisan, Stephanie Rossow, Mark Schwartz, Daniel C.L. Linhares, Derald J. Holtkamp, Jo�o Paulo Herrera da Silva, Cesar A. Corzo, Julia P. Baker, Tavis K. Anderson, Dennis N. Makau, Igor A.D. Paploski (2024) PRRSV-2 variant classification: a dynamic nomenclature for enhanced monitoring and surveillance. bioRxiv 2024.08.20.608841; doi: https://doi.org/10.1101/2024.08.20.608841
- Type:
Conference Papers and Presentations
Status:
Published
Year Published:
2024
Citation:
K. VanderWaal. Rapid evolution of PRRSV: Is it possible to predict the emergence of new PRRSV variants? 27th International Pig Veterinary Society Congress. Leipzig, Germany. June 4-7, 2024.
- Type:
Conference Papers and Presentations
Status:
Published
Year Published:
2024
Citation:
K. VanderWaal & N. Pamornchainavakul. Predicting PRRSV-2 Variant Emergence: Insights from a Decade of Genomic Analysis. 27th International Pig Veterinary Society Congress. Leipzig, Germany. June 4-7, 2024.
- Type:
Conference Papers and Presentations
Status:
Published
Year Published:
2024
Citation:
Kimberly VanderWaal and Paul Yeske. "PRRSV-2 genetic variant classification: What is it and why we need it?" Oral presentation at Allen D. Leman Swine Conference. September 23, 2024, St. Paul, MN.
- Type:
Conference Papers and Presentations
Status:
Published
Year Published:
2024
Citation:
Kimberly VanderWaal. "Chasing a moving target: The emergence of new PRRSV genetic variants in U.S. swine." The University of Minnesota College of Veterinary Medicine Research, Innovation, Discovery, and Education (RIDE) Seminar Series. September 11, 2024, St. Paul, MN.
|