Source: UNIV OF MINNESOTA submitted to NRP
APPLICATION OF NEW SEQUENCING TECHNOLOGIES TO INVESTIGATE LOCUS REMODELING AND CNV IN THE MHC OF THE TURKEY
Sponsoring Institution
National Institute of Food and Agriculture
Project Status
COMPLETE
Funding Source
Reporting Frequency
Annual
Accession No.
0229770
Grant No.
(N/A)
Cumulative Award Amt.
(N/A)
Proposal No.
(N/A)
Multistate No.
(N/A)
Project Start Date
Apr 1, 2012
Project End Date
Sep 30, 2013
Grant Year
(N/A)
Program Code
[(N/A)]- (N/A)
Recipient Organization
UNIV OF MINNESOTA
(N/A)
ST PAUL,MN 55108
Performing Department
Veterinary Biomedical Sciences
Non Technical Summary
This study is designed to further our investigations into the composition and function of the turkey MHC. Specifically we will sequence and assemble he genome of the closest sister species of the turkey. Second we will use a capture array method to selectively resequence the core MHC genes from a group of genetically diverse turkeys to explore the potential for copy number variation within the species.
Animal Health Component
(N/A)
Research Effort Categories
Basic
(N/A)
Applied
(N/A)
Developmental
(N/A)
Classification

Knowledge Area (KA)Subject of Investigation (SOI)Field of Science (FOS)Percent
3033230104050%
3043230104050%
Goals / Objectives
This study is designed to 1) sequence the genome of the closest sister species to M. gallopavo, the Ocellated turkey (M. ocellata) to gain better insight into the evolutionary history of the turkey MHC and 2) develop and implement an MHC capture array for comparative MHC sequencing of turkeys from diverse genetic backgrounds (16 haplotypes). Completion of this project will provide an ideal comparative sequence for the turkey MHC, a new platform for broader application of CNV, and a wealth of unique resources for future studies.
Project Methods
Aim 1. Whole genome sequencing of the Ocellated turkey. Working with the Minnesota Zoo we have obtained DNA (blood) samples from two Ocellated turkeys (male and female) housed at the Dallas Zoo. High molecular weight DNA has been extracted and prepared for sequencing. To assemble the genome we will obtain sequence reads from two platforms. Approximately 60Gb of sequence data (paired-end reads (2 x 100bp)) will be generated from a random sheared library (~600bp insert length) on the Illumina HighSeq 2000 at the BMGC. Sequence assembly will be conducted at the Virginia Bioinformatics Institute. Sequences of the M. gallopavo MHC will be used to identify within the new assembly homologous sequences in M. ocellata. Additional comparisons will be made to the newly released genome of the green anole lizard. This genome provides a resource for identifying MHC genes that are not included in the current turkey MHC assembly. Sequence assembly of the M. ocellata MHC will be verified by PCR and sequencing. The Agilent SureSelect platform is a target enrichment system that utilizes ultra-long (120mer) cRNA baits to capture sequence regions of interest from next-generation sequencing libraries by hybridization. The cRNA baits are designed from the target sequence which in our experiment will be the assembled sequence of the turkey MHC (Genbank DQ993255, FJ601914, HM851444). Briefly, gDNA is randomly sheared and used to prepare NGS libraries. Libraries are hybridized to the cRNA baits and targeted DNA fragments are recovered using magnetic streptavadin beads and then directed to automated sequencing (Illumina MiSeq). As an internal control for this system we will use the DNA from the same source used to create the BAC libraries ("Nici", Chaves et al., 2009) and from which our assembled sequences are derived. The remaining birds will be chosen from those previously determined to have diverse MHC haplotypes (Chaves et al., 2010). In addition, we will test both of our M. ocellata samples to determine the applicability of the custom platform in this species. Sequences obtained from the SureSelect enrichment will be aligned to the assembled turkey MHC sequence. For each individual variation in sequence depth across the locus (indicative of gene duplication), sequence polymorphisms (SNPs) and alignment inconsistencies (indicative of structural rearrangements) will be recorded. Differences indicative of CNV and rearrangements will be directly tested by PCR and sequencing.

Progress 04/01/12 to 09/30/13

Outputs
Target Audience: University faculty and students Fellow participants in Regional Project NC-1170 Genearl poultry scientific community Changes/Problems: Nothing Reported What opportunities for training and professional development has the project provided? This project involved particpation by a graduate student (Melissa Monson) for data analysis, and an two undergraduate students (Molly Greenshields and Adam Busaker) who performed lab experiments. How have the results been disseminated to communities of interest? Oral and poster presentations at regional and international conferences and through University seminar presentations. What do you plan to do during the next reporting period to accomplish the goals? Nothing Reported

Impacts
What was accomplished under these goals? Aim 1.Whole genome sequencing of the Ocellated turkey. To assemble the genome we have obtained sequence reads from two platforms. First, high molecular weight DNA was submitted to the BMGC at the Univeristy of Minnesota for Illumina sequencing. The sequencing library was prepped using the longer length protocol (~500bp). Approximately 550M paired-end reads (2 x 100bp) were generated from a random sheared library on the Illumina HighSeq 2000 (3 flow cell lanes). This represents almost 60x genome coverage. In addition, ~1.6M paired-end reads were generated from a large insert library (3-5 kb) on the Roche 454 platform at Virginia Tech University. A second large insert library (~8kb) was also sequenced. Sequence assembly is being conducted at the Virginia Bioinformatics Institute (with collaborator Dr. Robert Settlage) by alignment of reads with the latest build of the domestic turkey genome (UMD5.0). In addition, the domestic turkey MHC sequence was used to identify sequence reads within the Illumina data set to assemble the MHC of the ocellated turkey. The majority of this sequence have now been verified and oriented using PCR and direct sequencing. We are still completing data analysis and manuscript preparation. Aim 2. Development of an MHC sequence capture array and comparative MHC sequencing. Working with the Agilent SureSelect design team, a platform of over 12,000, 120mer cRNA baits was designed to capture MHC sequence for next-generation sequencing. Sheared genomic DNA libraries were created for 16 individuals including 14 turkeys (domestic, heritage and wild), an ocellted turkey (a second bird other than used above), and a chicken, and hybridized to the cRNA baits. Recovered fragments were subjected to automated sequencing (151bp paired-end run on the Illumina MiSeq platform). Included in our samples was the DNA the bird used to sequence the turkey genome ("Nici") and the chicken genome (RJF-256). The remaining birds were chosen from those previously determined to have diverse MHC haplotypes. M. ocellata and chicken (Gallus gallus) samples were included to determine the applicability of the custom platform to related species. Sequencing of the library pool has been completed (~6M reads representing 309K-458K reads/library). Initial analysis of this data indicates good sequence recovery with average match to the turkey genome exceeding 85% and match to MHC in excess of 50%. From this data we have a assembled a large array of SNPs for all individuals examined. The results of this aim are being summarized for publication.

Publications

  • Type: Conference Papers and Presentations Status: Other Year Published: 2013 Citation: Monson M, Mendoza KM, Settlage R, and Reed KM. Initial assembly and analysis of the MHC B-locus of the ocellated turkey. Plant, Animal and Microbial Genomes Conference XXI, San Diego, CA. Poster
  • Type: Conference Papers and Presentations Status: Other Year Published: 2013 Citation: Reed KM, Monson M, Settlage RE, Mendoza KM and RA Dalloul. MHC composition and variation in turkeys of the Americas. 100th annual meeting of the American Association of Immunologists, Honolulu, HI. Poster
  • Type: Conference Papers and Presentations Status: Other Year Published: 2013 Citation: Reed KM. Use of a capture platform for target enrichment and massively parallel sequencing to assess variation in the turkey MHC. Oral presentation NC1170 Poultry Workshop Plant & Animal Genome XXI


Progress 01/01/12 to 12/31/12

Outputs
OUTPUTS: Our plan is to create a high density whole genome sequence of the Ocellated turkey (Aim 1) in conjunction with MHC-specific sequencing (Aim 2) using the newly developed SureSelect target enrichment system (Agilent). Because of the constraints placed on release of the first year of funding, the time line for the projects was adjusted to focus collection of sequence data for both aims in year 1 leaving year 2 for data analysis. Aim 1. Whole genome sequencing of the Ocellated turkey. To assemble the genome we have obtained sequence reads from two platforms. First, high molecular weight DNA was submitted to the BMGC at the Univeristy of Minnesota for Illumina sequencing. The sequencing library was prepped using the longer length protocol (~500bp). Approximately 550M paired-end reads (2 x 100bp) were generated from a random sheared library on the Illumina HighSeq 2000 (3 flow cell lanes). This represents almost 60x genome coverage. In addition, ~1.6M paired-end reads were generated from a large insert library (3-5 kb) on the Roche 454 platform at Virginia Tech University. A second large insert library (~8kb) is currently being sequenced and should be completed in the next 2 weeks. Sequence assembly will be conducted independently at the Virginia Bioinformatics Institute (with collaborator Dr. Robert Settlage) and at the University of Maryland (collaborator Dr. Aleksy Zimin). We have already used the domestic turkey MHC sequence to identify sequence reads within the Illumina data set and are working to assemble the MHC of the ocellated turkey. This sequence will be verified and oriented using PCR and direct sequencing. Aim 2. Development of an MHC sequence capture array and comparative MHC sequencing Working with the Agilent SureSelect design team, a platform of over 12,000, 120mer cRNA baits was designed to capture MHC sequence for next-generation sequencing. Sheared genomic DNA libraries were created for 16 individuals including 14 turkeys (domestic, heritage and wild), an ocellted turkey, and a chicken, and hybridized to the cRNA baits. Recovered fragments were subjected to automated sequencing (151bp paired-end run on the Illumina MiSeq platform). Included in our samples was the DNA the bird used to sequence the turkey genome ("Nici") and the chicken genome (RJF-256). The remaining birds were chosen from those previously determined to have diverse MHC haplotypes. M. ocellata and chicken (Gallus gallus) samples were included to determine the applicability of the custom platform to related species. Sequencing of the library pool has been completed (~6M reads representing 309K-458K reads/library). Initial analysis of this data indicates good sequence recovery with average match to the turkey genome exceeding 85% and match to MHC in excess of 50%. PARTICIPANTS: Kent M. Reed, Department of Veterinary and Biomedical Sciences, University of Minnesota. Role: project PD. Kristelle M. Mendoza, Department of Veterinary and Biomedical Sciences, University of Minnesota. Role: laboratory scientist. Melissa Monson, Department of Veterinary and Biomedical Sciences, University of Minnesota. Role: graduate student, data analysis. Molly Greenshields, Department of Veterinary and Biomedical Sciences, University of Minnesota. Role: Undergraduate laboratory assistant. Robert Settlage, Virginia Bioinformatic Institute, Virginia Tech University Role: collaborator, data analysis. Aleksey Zimin, University of Maryland Role: collaborator, genome analysis. TARGET AUDIENCES: Nothing significant to report during this reporting period. PROJECT MODIFICATIONS: Because of the constraints placed on the first year of funding, the time line for the project was adjusted to focus collection of sequence data for both aims 1 and 2 in year 1 leaving year 2 for data analysis.

Impacts
The results of this project will generate the genome sequence of a second turkey species (Ocellated) and a detailed assembly of the Major Histocompatibility Complex (MHC) region (Aim 1). Re-sequencing of multiple individuals through focused sequence capture (Aim 2) will provide a means to test for copy number variation in the MHC for the turkey. Locus remodeling (gene rearrangement) and copy number variation (CNV) within the MHC are increasingly identified as having significant impact on disease susceptibility/resistance.

Publications

  • No publications reported this period