Source: UNIVERSITY OF ILLINOIS submitted to
FUNCTIONAL GENOMICS OF RUMINOCOCCUS FLAVEFACIENS FD-1
Sponsoring Institution
National Institute of Food and Agriculture
Project Status
TERMINATED
Funding Source
Reporting Frequency
Annual
Accession No.
0190218
Grant No.
2002-35206-11634
Project No.
ILLU-538-557
Proposal No.
2001-03339
Multistate No.
(N/A)
Program Code
42.0
Project Start Date
Dec 15, 2001
Project End Date
Dec 31, 2005
Grant Year
2002
Project Director
White, B.
Recipient Organization
UNIVERSITY OF ILLINOIS
2001 S. Lincoln Ave.
URBANA,IL 61801
Performing Department
ANIMAL SCIENCES
Non Technical Summary
This research is expected to result in a "genetic blueprint" for ruminal fiber degredation by R. flavefaciens FD-1. These studies will result in microarrays which are essential for fully enabling the use of "post-genomics" technologies and bioinformatics, to elucidate which enzyme(s) or other ecological or physiological process(es) are rate-limiting to fiber degradation, and how this might change relative to dietary composition. The enabling technology will be beneficial to all of us interested in animal growth and (or) biomass conversion.
Animal Health Component
(N/A)
Research Effort Categories
Basic
100%
Applied
(N/A)
Developmental
(N/A)
Classification

Knowledge Area (KA)Subject of Investigation (SOI)Field of Science (FOS)Percent
3034010110050%
3053910102050%
Goals / Objectives
The objectives of this proposal are to apply these techniques to the cellulolytic ruminal bacterium Ruminococcus flavefaciens FD-1 to detect and identify candidate genes expressed in response to different dietary sources. These studies will result in microassays which are essential for fully enabling the use of "post genomics" technologies and bioinformatics, to elucidate which enzyme(s) or other ecological or physiological process(es) are rate-limiting to fiber degradation, and how this might change relative to dietary composition.
Project Methods
Our functional genomics approach combines the utility of draft genome sequence (95% representation of the genome) and microarray analysis to screen for dietary response genes. This research is expected to result in a "genetic blueprint" for ruminal fiber degradation by R. flavefaciens FD-1. We will sequence the genome of R. flavefaciens FD-1 using a small insert library (1-3 kb) "shotgun" sequencing approach to 3X coverage. We will then use microarrays of approximately 2000 ORFs including unannotated ORFs; ORFs representing proteins involved in plant cell wall hydrolisis and ORFs representing regulatory proteins to identify the candidate genes involved in cellulolysis. Functional linking genes identified via the microarray analysis will serve as a platform for future post-genomics studies.

Progress 12/15/01 to 12/31/05

Outputs
The Ruminococcus flavefaciens FD-1 sequencing project is at 4x coverage (4.4 Mb genome). The 1,143 assembled contigs have an average size of 3,686 bp and range from 205 to 31,132 bp. Cellulosome organization in strain FD-1 appears similar to that of strain 17. Analysis of the assembly revealed a cluster of genes encoding for the putative scaffoldin proteins, ScaA, ScaB and ScaC. Like in strain 17, the FD1 scaB is situated downstream of scaA and the scaC equivalent is located upstream of ScaA. Compared to R. flavefaciens 17, the architecture of the FD1 proteins is very similar, except for the number of cohesins in ScaA and ScaB. In silico analysis for identification of putative dockerin modules, novel glycoside hydrolases and putative scaffoldins was conducted. Dockerin motifs were used to search for dockerin containing ORFs, revealing an unprecedented number (171) of putative dockerin modules. This is more than double the number so far detected in C. thermocellum-the bacterial standard for comparison of cellulosome systems. Phylogenetic analysis indicated five major clusters of dockerin groups; EndB dockerins in 52 ORFs, CesA (or type-3) dockerins in 25 ORFs, XynE dockerins in 54 ORFs, a ScaA dockerin in one ORF, and ScaB dockerins in 3 ORFs. These dockerin-containing ORFs have associated protein families including xylanases, cellulases, proteinases, serpins, surface-anchoring proteins and unknown activities. Catalytic domains and CBMs implicated in plant cell wall breakdown are found associated with EndB, XynE, type-3, and ScaB groups. Proteinases are found in the XynE and type-3 groups. A ScaB X-module-dockerin (XDoc) pair has been discovered in an orf that is situated between scaB and scaE in the sca gene cluster of both strains 17 and FD-1. The expressed ORF protein binds to cellulose and its C-terminal XDoc binds selectively to the ScaE cohesin. This surface-associated ORF protein and ScaB could thus account for the elusive cellulose-binding function in this bacterium. The genome was used to develop a microarray (~6,000 sequences). Expression profiling with mRNA from bacteria grown with either cellobiose or as carbon source was performed. Of the up-regulated contigs, those containing dockerin sequences were further analyzed. The EndB type dockerins seem to be the most common in the up-regulated contigs. The multi-domain gene families of the dockerin containing contigs with the highest up-regulation were GH_11/CBM_4_9/GH_10/GH_11/polysaccharide deacetylase, CE_3/CBM 22/CesA, Ricin B-like lectin CBM/CBM 6/lipase GDSL, alpha-glucosidase, GH 30/CBM 22/CE_3/Ricin-like CBM/PL_11, and ScaC/ScaA/ScaB. These gene families range from 23-fold to 3-fold up-regulation respectively. A gene similar to a bifunctional xylanase found in R. flavefaciens 17 was the most highly up-regulated gene (10 to 23-fold). The putative scaffoldin proteins, ScaA, ScaB, and ScaC were up-regulated approximately 4-fold. The ScaA dockerin was the most highly up-regulated. Multi-domain fiber degrading proteins appear to be highly up-regulated and it seems that the presence of a dockerin sequence is highly correlated with up-regulation.

Impacts
The Ruminococci are the predominant cellulolytic Gram-positive cocci inhabiting the rumen community. They are responsible for releasing simpler substrates from cellulolytic materials that other Eubacterial and Archaeal species can utilize. Applying genomic technologies to understanding this rate limiting step offers a new method to underscore not only differences between organisms in the rumen environment but also the sequence imprint that occurs from living in such a unique environment. The fact that R. flavefaciens strain FD-1 has a cellulosome complex which is orthologous to that of strain 17 suggests that the R. flavefaciens cellulosome system is the most complex, diverse and heterogeneous known in cellulolytic bacteria. Microarray analysis has enabled us to identify a complement of genes responsible for cellulose attachment and degradation and how expression of these genes changes cellulosome composition. This database of expressed sequences will allow us to determine genes that are differentially expressed in response to plant cell wall components. The expressed genes will be the basis for studying the biochemistry of cohesin-dockerin interactions and cellulosome orthologues and allow us to unravel the complex molecular events in plant cell wall hydrolysis by R. flavefaciens. In the future, a combined genomic and proteomic approach is dictated to understand the regulation and assembly of this remarkable cadre of cellulosome components. Multi-domain fiber degrading proteins with a dockerin sequence are prime candidates for future systematic studies.

Publications

  • No publications reported this period


Progress 01/01/04 to 12/31/04

Outputs
The current Ruminococcus flavefaciens FD-1 genome sequencing project at the University of Illinois at Urbana-Champaign is currently at 3.36x-coverage. We are manually curating the Phred/Phrap-mediated assembly in order to reduce the number of contigs (1,292) made up of 4,047,136 bases. There are several sequences with relevance to cellulose degradation, which are indicative of a Ruminococcus-like cellulosome. Homologues of the scaffoldin proteins ScaA (only 2 cohesins) and ScaB (9 cohesins and with 2 unusually short linkers, just a PT) and ScaC (85% including dockerin and cohesin missing the N-terminal part) have been found which have similar motifs and architecture to those described from R. flavefaciens 17. Additionally, further searches utilizing other cellulase components uncovered two putative ORFs with repeated X domains. Analysis of these domains suggests that they represent a new class of X domains from glyosyl hydrolases. These approaches will facilitate the identification of relevant ORFs for further study using microarrays and proteomic-assisted technologies.

Impacts
The Ruminococci are the predominant cellulolytic Gram-positive cocci inhabiting the rumen community. They are responsible for releasing simpler substrates from cellulolytic materials that other Eubacterial and Archaeal species can utilize. Applying genomic technologies to understanding this rate limiting step offers a new method to underscore not only differences between organisms in the rumen environment but also the sequence imprint that occurs from living in such a unique environment.

Publications

  • Antonopoulos, D.A., Bayer, E.A. and White, B.A. 2004. Identification of cellulase components from a random-phase genome sequence of Ruminococcus flavefaciens FD-1. In: Biotechnology of Lignocellulose Degradation and Biomass Utilization, (eds. K. Shimada, S. Hoshino, K. Ohmiya, K. Sakka, Y. Kobayashi and S. Karita) Uni Publishers Co., LTD., Tokyo, Japan. p. 309-316.
  • Antonopoulos, D.A., Nelson, K.E., Morrison, M. and White, B.A. 2004. Strain specific genomic regions of Ruminococcus flavefaciens FD-1 as revealed by combinatorial random-phase genome sequencing and suppressive subtractive hybridization. Environ. Microbiol. 6:335-346.


Progress 01/01/03 to 12/31/03

Outputs
The current Ruminococcus flavefaciens FD-1 genome sequencing project at the University of Illinois at Urbana-Champaign is currently at 2x-coverage. We are manually curating the Phred/Phrap-mediated assembly in order to reduce the number of contigs (1,500) made up of 14,437 reads. Automated BlastX searches of the non-redundant amino acid GenBank database using the assembled contigs as queries yields 151 sequences with similarity to fiber-degrading encoding sequences and/or sequences that may be important in substrate adhesion. There are also several sequences with relevance to cellulose degradation, which are indicative of a Ruminococcus like cellulosome. Homologues of the scaffoldin proteins ScaA and ScaB have been found which have similar motifs and architecture to those described from R. flavefaciens 17. Additionally, further searches utilizing other cellulase components uncovered two putative ORFs with repeated X domains. Analysis of these domains suggests that they represent a new class of X domains from glyosyl hydrolases. These approaches will facilitate the identification of relevant ORFs for further study using microarrays and proteomic-assisted technologies.

Impacts
The Ruminococci are the predominant cellulolytic Gram-positive cocci inhabiting the rumen community. They are responsible for releasing simpler substrates from cellulolytic materials that other Eubacterial and Archaeal species can utilize. Applying genomic technologies to understanding this rate limiting step offers a new method to underscore not only differences between organisms in the rumen environment but also the sequence imprint that occurs from living in such a unique environment.

Publications

  • Antonopoulos, D.A., Nelson, K.E., Morrisom, M. and White, B.A. 2003. Strain specific genomic regions of Ruminococcus flavefaciens FD-1 as revealed by combinatorial random-phase genome sequencing and suppressive subtractive hybridization. Environ. Microbiol. (In Press).
  • Antonopoulos, D.A., Bayer, E.A. and White, B.A. 2004. Identification of cellulase components from a random-phase genome sequence of Ruminococcus flavefaciens FD-1. In: Biotechnology of Lignocellulose Degradation and Biomass Utilization, (eds. K. Shimada, S. Hoshino, K. Ohmiya, K. Sakka, Y. Kobayashi and S. Karita) Uni Publishers Co., LTD., Tokyo, Japan (In Press).
  • Antonopoulos, D.A., Russell, W.M. and White, B.A. 2003. Phylogenetic reconstruction of Gram positive organisms based on comparative sequence analysis of molecular chaperones from the ruminal microorganism Ruminococcus flavefaciens FD-1 FEMS Microbiol. Lett. 227:1-7.


Progress 01/01/02 to 12/31/02

Outputs
We are currently sequencing the genome of the cellulolytic bacterium Ruminococcus flavefaciens FD-1. A shotgun library was constructed from nebulized genomic DNA that was size-fractionated (1.5-3 kb), polished and cloned into pCR"4Blunt-TOPO". The cumulative number of bases sequenced so far with a Phred quality of 20 or higher is 4,117,421 from 10,272 reads. These have been assembled into 4,302 contigs which yields just under 1x coverage since the estimated size of this genome is 4.4 Mb based on PFGE of R. flavefaciens 17. Library quality was assessed periodically by checking the redundancy of sequences in the early phase of sequencing and efforts are now directed towards the next round of sequence coverage.

Impacts
Genomic information will accelerate the data gathering steps needed to address the important questions related to the regulation of cellulose hydrolysis. These approaches will revolutionize our understanding of the biochemistry, physiology and genetics of R. flavefaciens FD-1. These experiments will also allow us to unravel the complex molecular events in plant cell wall hydrolysis and apply this technology in mixed rumen cultures, artificial rumen systems and ultimately the animal itself.

Publications

  • No publications reported this period