Progress 01/15/16 to 01/14/17
Outputs Target Audience:The following target audiences were reached during this project: Other scientists, particularly those involved in: antibiotic resistance research livestock production use of shotgun metagenomic sequencing for microbiome-resistome research development of computer and statistical algorithms for analysis of shotgun metagenomic data Livestock producers, including: feedlot operators slaughterhouse owners cow-calf producers Students, including: undergraduate and graduate computer science students who are helping to implement computer algorithms for purposes of microbiome-resistome analysis graduate students involved in microbiome-resistome research in livestock production veterinary students involved in livestock production and population health Other USDA NIFA pre- and post-doctoral fellows Changes/Problems:
Nothing Reported
What opportunities for training and professional development has the project provided?Thanks to this project, I was able to undertake the following training and professional development opportunities: Data Visualization for High-Dimensional Data, led by data visualization experts from the University of Denver Center for Data Visualization and Statistics. July 11th, 2016 in Fort Collins, Colorado, ½-day of theory and hands-on training. I organized and funded this workshop as part of my post-doctoral fellowship, and offered it to colleagues and collaborators. Mining Microbial Genomes and Metagenomes for Biotechnological Applications, led by staff from the Joint Genome Institute (Department of Energy) during ASM/Microbe Conference, June 18th, 2016, Boston. Hands-on instruction on use of the JGI IGM suite of online tools and databases. Genome-wide Association Studies and Comparative Genomics for Tracking Multi-resistant and Hypervirulent Bacterial Clones, taught by multiple academic experts in bacterial population genetics during the ASM/Microbe Conference, June 19th, 2016, Boston. Comprehensive theoretical overview and hands-on practice with population genetic analysis of microbial populations. Introductory Network Analysis, led by Drs. Zvonimir Poljak and Olaf Berke at the Department of Population Medicine at Ontario Veterinary College, May 18th - 20th, 2016. Intensive 5-day instruction on use of network analysis in epidemiological investigations.'' In addition, I was able to improve upon my science communication skills through the following opportunities: Podcast interview, eLife. Topic: Publication of article "Resistome diversity in cattle and the environment decreases during cattle production." Available at: https://elifesciences.org/podcast/episode28 Written interview, MedicalResearch.com Topic: Publication of article "Resistome diversity in cattle and the environment decreases during cattle production." Available at: http://medicalresearch.com/author-interviews/study-finds-no-antibiotic-resistant-genes-in-meat-products-shipped-to-groceries/22592/ Blog interview, Dr. Richard Raymond's Feedstuffs.com blog. Topic: "Shotgun metagenomics: Not just a pretty face." Available at: http://feedstuffsfoodlink.com/blogs-shotgun-metagenomics-pretty-face-commentary-10799. Radio interview, KNEB radio. Topic: Antimicrobial resistance in livestock production. Two, 3-minute segments produced. Available as mp3 files upon request. Finally, I was able to complete a Graduate Teaching Certificate program, which included 20 hours of hands-on teaching experience, 12 credits in teaching and pedagogy, and completion of a Teaching Portfolio (available on request). Short courses/seminars completed: Best Practices for Online Course Design (3-week online course) Threading Information Literacy Throughout Course Curricula (4-week hybrid course) An Introduction to Audio/Visual Methods for Student Learning (4 hours of in-class lecture) More than Passive Listeners: Peer Instruction in the Lecture Setting (2-hour seminar) Overcoming barriers to learning in large STEM classrooms (2-hour seminar) Humanizing the Classroom (2-hour seminar) Using Canvas Quiz Statistics to Create Stronger Exams (1-hour seminar) What is Learning Analytics: An Introduction for Everyone (1-hour seminar) Survey Feedback in Online Courses: How do we get it? How do we use it? (1-hour seminar) Reflections on an instrument to provide quantitative feedback on teaching (1-hour seminar) How have the results been disseminated to communities of interest?Results have been disseminated through the following mediums: Journal articles Conference presentations Invited speaker presentations (including extension talks, conference keynotes, and expert panels) Media interviews (podcasts, blog posts) Press coverage Seminar presentations What do you plan to do during the next reporting period to accomplish the goals?
Nothing Reported
Impacts What was accomplished under these goals?
Objective #1--We identified several open-source tools that can be used for hierarchical Bayesian modeling: jags Stan RStan rjags R2jags runjags mcmc OpenBugs We identified the following packages for Bayesian Network Analysis (including constructing Bayesian Networks, analyzing the networks and visualizing results): bnlearn gRain deal snow/parallel igraph sna gephi Objective #2--After installing and testing all of the tools for hierarchical Bayesian modeling, we found that Stan was the most-suited to our needs, as it is written in C++ (and therefore is incredibly fast for large datasets), and it can accomodate mixed models. However, jags is more widely used at this point (as Stan is newer), and therefore we implemented models in both jags and Stan. rjags is the most comprehensive and user-friendly R package for interfacing between jags and R, while RStan is the only currently-available package to interface with Stan. For Bayesian Network Analysis, we found bnlearn to be the most flexible (and the only such software that can accomodate both categorical and continuous data). In order to speed computation, we also implemented parallelization using the snow package. Finally, we were able to transfer the network characteristics into igraph and sna to perform network analysis, and then from igraph into Gephi for visualization. Objective #3-- Using the tools identified and valdiated above, we were able to develop a new hierarchical Bayesian model using Dirichlet priors for proportional datato model shotgun metagenomic resistome counts that had previously been modeled using a non-Bayesian approach. We found that the Bayesian approach was more conservative and likely more accurate, as we were able to incorporate hyperpriors to account for increased variance in shogun metagenomic data. This model will help researchers to focus on associations with management factors andantimicrobial resistance that are more likely to be "true positives" (as opposed to false positives, of which there are many using non-Bayesian methods). This cangreatly increase the efficiency of shotgun metagenomic resarch, particularly as a hypothesis generation tool. Using a Bayesian Network Analysis approach, we also uncovered novel associations in a dataset of microbiome, demographic, diagnostic and behavioral factors for human bacterial vaginosis. This dataset was used to validate the Bayesian Network approach because it is a well-studied, highly-validated dataset with many disparate data types. We found that using the Bayesian Network approach, we can both re-affirm already-known associations between microbiome, host and environment, while also uncovering novel associations. This approach translates seamlessly to other datasets, and we are currently generating datasets with much more robust metadata in order to utilize this method. Finally, we designed and validated a pre-sequencing enrichment system for antimicrobial resistance and virulence genes in shotgun metagenomic data. This system includes >23,000 baits that were custom-designed to capture >5,500 resistance and virulence genes; as well as use of randomly-generated 22-mer unique molecular indicators (UMIs) within sequencing adapters. Using the baits and UMIs, we are bow able to count individual resistance genes within shotgun metagenomic data (including PCR duplicates), while simultaneously increasing on-target sequencing of resistance/virulence genes from an average of <0.1% up to nearly 50% of sequence data. This system will be made publicly available upon publication, and will allow shotgun metagenomic resistance researchers to greatly decrease sequencing costs, while also improving sensitivity of the approach. This has already allowed us to uncover novel associations between environment and management practices that influence the presence of rare resistance elements -- associations which we were not able to identify without this pre-sequencing enrichment system. We also discovered that the resistome (i.e., all of the resistance elements in the microbiome) is much more diverse than previously known.
Publications
- Type:
Journal Articles
Status:
Accepted
Year Published:
2016
Citation:
1. Lakin SM*, Dean C*, Noyes NR*, Dettenwanger A, Ross A, Doster E, Rovira P, Abdo Z, Jones KL, Ruiz J, Belk KE, Morley PS, Boucher CA. MEGARes: an antimicrobial resistance database for high throughput sequencing. Nucleic Acids Research 2016; 45 (D1): D574-D580.
*Co-first authors these authors contributed equally to this work.
- Type:
Journal Articles
Status:
Under Review
Year Published:
2016
Citation:
Noelle R. Noyes, Maggie E. Weinroth, Jennifer Parker, Chris A. Dean, Steven E. Lakin, Robert A. Raymond, Pablo Rovira Sanz, Enrique Doster, Zaid Abdo, Jennifer Marti2, Kenneth L. Jones, Jaime Ruiz, Christina A. Boucher, Keith E. Belk, Paul S. Morley. MEGaRICH: A Pre-Sequencing Capture System for Enriching and Counting Resistance Genes within Metagenomic Samples.
- Type:
Journal Articles
Status:
Awaiting Publication
Year Published:
2016
Citation:
Muggli M, Bowe A, Gagie T, Raymond R, Noyes NR, Morley PS, Belk KE, Puglisi S, Boucher CA. Succinct Colored de Bruijn Graphs.
- Type:
Journal Articles
Status:
Under Review
Year Published:
2016
Citation:
Joseph R. Owen, Noelle Noyes, Daniel J. Prince, Amy E. Young, Beate M. Crossley, Patricia C. Blanchard, Terry W. Lehenbauer, Sharif S. Aly, Jessica H. Davis, William J. Love, Sean M. ORourke, Zaid Abdo, Keith Belk, Michael R. Miller, Paul Morley, Alison L. Van Eenennaam. Whole-Genome Sequencing of Bacterial Isolates Associated With Bovine Respiratory Disease.
- Type:
Conference Papers and Presentations
Status:
Published
Year Published:
2016
Citation:
1. Noyes NR, Weinroth M, Doster E, Rovira Sanz P, Yang X, Dean C, Boucher CA, Jones KL, Abdo Z, Morley PS, Belk KE. The beef fecal resistome differs from other commodities. Beef Industry Safety Summit, Austin, TX, March 2016.
- Type:
Conference Papers and Presentations
Status:
Published
Year Published:
2016
Citation:
1. Noyes NR, Abdo Z, Boucher CA, Belk KE, Morley PS. Becoming Bayesian: research and other during my USDA NIFA postdoctoral fellowship. USDA Fellowship Meeting, Washington DC, August 2016.
- Type:
Conference Papers and Presentations
Status:
Published
Year Published:
2016
Citation:
Muggli M, Bowe A, Gagie T, Raymond R, Noyes NR, Morley PS, Belk KE, Puglisi S, Boucher CA. Succinct Colored de Bruijn Graphs. ISMB, Orlando, FL, July 2016.
|