Source: UNIV OF MINNESOTA submitted to NRP
DETERMINING THE GENETIC BASIS FOR SHIVERS USING NEXT-GENERATION WHOLE GENOME SEQUENCING
Sponsoring Institution
State Agricultural Experiment Station
Project Status
COMPLETE
Funding Source
Reporting Frequency
Annual
Accession No.
1000529
Grant No.
(N/A)
Cumulative Award Amt.
(N/A)
Proposal No.
(N/A)
Multistate No.
(N/A)
Project Start Date
Aug 1, 2013
Project End Date
Mar 20, 2015
Grant Year
(N/A)
Program Code
[(N/A)]- (N/A)
Recipient Organization
UNIV OF MINNESOTA
(N/A)
ST PAUL,MN 55108
Performing Department
Veterinary Population Medicine
Non Technical Summary
Our hypothesis is that Shivers is a shared genetic condition in both Warmblood and Belgian Draft horses caused by a variation in DNA in a specific region on ECA8. To begin to uncover this variation in DNA sequence, we have performed whole genome sequencing in two Belgian Draft horses affected with Shivers and two controls. This information has helped us identify 1,956 DNA variants of which 345 are in a region of special interest called a noncoding RNA. Thirty variants in the non-coding RNA are only found in the two Shivers horses as compared to the two control horses. Variations in noncoding RNA are now known to be a cause of a number of neurologic and movement disorders in humans. In this grant, we propose to complete whole genome sequencing of three additional affected Belgian draft horses with Shivers and three additional Belgian control horses. We present preliminary data to show that we have successfully mastered the complex techniques to analyze this sequencing data. Results will narrow the number of potential variants associated with Shivers. Potential associated variants will be validated in our remaining 46 Belgian Shivers cases, 93 control Belgians as well as our bank of Warmblood and Thoroughbred horses with Shivers. We hypothesize that the genetic cause of Shivers in Belgian Draft and Warmblood horses will be identical. In support of this hypothesis, we previously found that the molecular basis for another disease, polysaccharide storage myopathy, was shared across Belgian Drafts, Warmbloods and many other breeds. These results will lead to a DNA test for Shivers susceptibility. Without studies such as this, Shivers will continue to be confounding.
Animal Health Component
(N/A)
Research Effort Categories
Basic
100%
Applied
(N/A)
Developmental
(N/A)
Classification

Knowledge Area (KA)Subject of Investigation (SOI)Field of Science (FOS)Percent
30338101080100%
Knowledge Area
303 - Genetic Improvement of Animals;

Subject Of Investigation
3810 - Horses, ponies, and mules;

Field Of Science
1080 - Genetics;
Goals / Objectives
Hypothesis: Our overall hypothesis is that Shivers in Draft and Warmblood breeds is a chronic progressive genetic disorder affecting neurons in the caudal spinal cord that regulate backwards locomotion. We believe that the stimulus of walking backwards triggers dysregulation of flexor and extensor muscles within and between the pelvic limbs because of degeneration of a precise group of neurons. Our specific hypothesis for this proposal is that Shivers is the result of a genetic mutation in a non-coding RNA that is located on ECA8 in the region of 59.34-59.72 Mb. Objective 1: To identify molecular variants in a specific region of equine chromosome 8 that are consistently present in Shivers, but not control, horses.
Project Methods
We have selected 3 Shivers-affected Belgian geldings and 3 age-matched control Belgian geldings based upon their genotypes from the SNP-chip (i.e. the 3 affected cases were chosen as having the "affected" haplotype, for the associated SNPs on ECA8 from 59.34-59.72 while the 3 unaffected horses have the "unaffected" haplotype). DNA from these 8 horses will be submitted to the Biomedical Genomic Center at the University of Minnesota where the DNA will be fragmented to create a DNA library for each horse. This library will then be used to perform whole-genome next generation sequencing on the Illumina HiSeq2000 sequencing machine. Each read (line of sequence) will be a paired (i.e. will have a "forward" and a "reverse") read and contain 100 base pairs. Each horse will be sequenced on 1 lane of the flow-cell, which will provide 168 million reads/horse. Based on the size of the equine genome (2.7 Gb), this will be equivalent to a targeted 12x coverage of the entire genome per horse. Once the reads appear to be of sufficient quality, they will be mapped to the reference sequence (http://www.ncbi.nlm.nih.gov/genome/145) using BWA for Illumina mapping program,10 also available through the Galaxy interface. Once all sequenced horses have had their genomes aligned to the reference sequence, they will be evaluated using the Genome Analysis Toolkit (GATK)11Finally, SNPeffect,12 will be used to characterize these variants by phenotype (i.e. a difference in Shivers-affected versus control cases), location within the genome.

Progress 08/01/13 to 03/20/15

Outputs
Target Audience: Academic veterinarians Practicing veterinarians Horse owners Veterinary students Changes/Problems: Nothing Reported What opportunities for training and professional development has the project provided? Nothing Reported How have the results been disseminated to communities of interest? Scientific meetings with abstracts 1. Finno CJ, Valberg SJ, Peterson J, Draper ACE, , Livesey L, Baird J, Firshman AM. Genome-wide association analysis of Shivers in Belgian horses. Plant and Animal Genetics Conference. San Diego CA 2013 2. Finno CJ, Valberg SJ, Mickelson JM. Whole genome next-generation sequencing of 4 Belgian horses. College of Veterinary Medicine Research Day October 2013 Veterinary meetings Valberg SJ Movement Disorders in Horses: Shivers, Stringhalt and other conundrums. California Bay Area Equine Practitioners. October 2012 Valberg SJ. Shivers in Horses. Purina Equine Conference. October 2013 Valberg SJ. Shivers, Stringhalt and other Movement Disorders SIVE, Mila Italy February 2014 Hores owner magazines Spring 2014. University of Minnesota Medical Bulletin. From humans to horses January 2015: Equus magazine. Feature article on our research on Shivers research What do you plan to do during the next reporting period to accomplish the goals? Nothing Reported

Impacts
What was accomplished under these goals? Next-generation sequencing identified 13.3 million variants (12 million SNPs and 1.3 million indels). Of these variants, 65% were intergenic, 30% were intronic, 2.5% were exonic and 2.5% were 500bp upstream or downstream of an annotated gene. The region on ECA8 that contained the associated SNPs from the GWAS (59.34-59.72 Mb; 382 kb in length) was evaluated first to determine if any of the variants segregate with the disease phenotype. Within this region, there were no variants that segregated with the disease phenotype, thereby excluding ECA8:59.34-59.72 Mb as a putative region for a genetic mutation for Shivers. To evaluate additional genomic regions for an association with the Shivers phenotype, an association analysis was performed across all 13.3 million detected variants. Using the program SNPSift Case control, each horse was labeled according to its phenotype and a Fisher Exact Test using four different models (dominant, recessive, allelic and genotypic) was performed for each of the 13.3 million variants. The allelic model was evaluated for significant associations of coding variants. Using a cut-off significance level of P=0.05 and selecting only non-synonymous variants, 88 variants were found to segregate with the phenotype whereby the affected horses possessed the non-reference allele. Haplotypes were identified within these 88 variants and 57 regions were prioritized for further assessment. Genotyping was performed on these variants using Sequenom genotyping technology on 43 affected Belgian horses and 66 controls. A non-synonymous coding mutation was identified in PMS1, a gene associated with DNA repair. All affected cases were homozygous for the missense mutation and 3/5 control horses were heterozygous for this mutation. The mutation resulted in a conservative valine to leucine amino acid change. Further investigation of this mutation is currently underway in additional Shivers-affected horses. Additionally, a custom pipeline has been designed, using FreeBayes as a variant-calling algorithm, to effectively identify larger (>10bp) indels in this next-generation sequence data and this algorithm will be run on this dataset.

Publications

  • Type: Journal Articles Status: Published Year Published: 2014 Citation: 1. Draper ACE, Bender JB, Firshman AM, Baird JD, Reed S, Mayhew IJ and Valberg SJ. Epidemiology of characterisation of Shivers in horses. Equine Vet J 2014 May 6. doi: 10.1111/evj.12296. [Epub ahead of print] 2.Draper ACE, Trumble, TN, Firshman AM, Baird JD, Reed S, Mayhew IJ and Valberg SJ Posture and movement characteristics of forward and backward walking in horses with Shivers and acquired bilateral Stringhalt. Equine Vet J. 2014 Mar 10. doi: 10.1111/evj.12259. [Epub ahead of print]


Progress 08/01/13 to 09/30/13

Outputs
Target Audience: Nothing Reported Changes/Problems: Nothing Reported What opportunities for training and professional development has the project provided? A DVM student worked on this project throughout the year as part of her postdoctoral research experience. How have the results been disseminated to communities of interest? Data was reported at the University of Minnesota Research Day and genome workshops such as Havemeyer 2013 and PAG 2013. What do you plan to do during the next reporting period to accomplish the goals? The next step is a second Sequenom analysis to examine the remaining 41 regions, including SNVs on ECAX and ECA18 in additional breeds. These regions may still be associated with Shivers but have not achieved significance due to a high allele frequency in the Belgian draft horse breed.

Impacts
What was accomplished under these goals? Our original plan was to select 3 Shivers-affected Belgian geldings and 3 age-matched control Belgian geldings; however, we combined funding from USEF and UMEC for Shivers to expand the number of horses to 5 Shivers-affected Belgian geldings and 5 age-matched control Belgian geldings and progress is reported for the combined analysis. DNA from ten horses was submitted to the Biomedical Genomic Center at the University of Minnesota, where the DNA was fragmented to create a DNA library for each horse and whole-genome next generation sequencing was performed on 100 bp paired-end reads using the Illumina HiSeq2000 sequencing machine. After performing quality control on the sequences obtained, we had, on average, 184 million reads/horse (13.6x coverage of genome). Of these reads, 98% of reads mapped successfully to the equine reference genome, with 96% properly paired. The genome analysis tool kit (GATK) software package was used to call variants (single nucleotide variants or SNVs and insertions/deletions or indels). 16 million variants were identified using a quality cut-off score of 4. Fifteen million SNVs and 1 million indels were identified. Of these, 65% were intergenic, 30% were intronic, 2.5% were exonic, 2.5% were 500 bp upstream ordownstream of an annotated gene. A case/control Fisher Exact Test was performed on variants; using an allelic model with a cut-off of <0.001, 133 variants were identified and using an allelic model with a cut-off of <0.01 20,179 variants were identified. Based on these results, we excluded the original region on ECA8 (59.3-57.72). We had an excellent average quality score (phred score) for variant calls of 815 (phred score of 50 has a call accuracy of 99.999%) and 57 regions were identified in which variants were associated with a haplotype block. A Sequenom assay (Illumina, San Diego, CA) using 43 Belgian draft horses with Shivers and 66 controls was performed for 63 of the variants identified by next generation whole genome sequencing. Four variants failed the multiplex design, 10 were not informative (not polymorphic) and 49 variants remained (13 non-synonymous, 13 intronic, 23 intergenic). An association analysis and logistic regression with sex as a covariate were performed. One SNV on the X chromosome (chrX_70406663; unadjusted p value of 0.02 for regression) was identified but no gene or transcript was found in this region. Two SNVs on ECA18 (66320637, 66321297; unadjusted p value of 0.06 for regression) were also identified. These SNVs are located within a gene that is involved in DNA repair. One of the SNVs causes a non-synonymous substitution within this gene. Although these values approached significance, based on the number of multiple test performed, an adjusted p value of 0.001 would be considered significant for this analysis. This first Sequenom analysis excluded 16 regions with variants.

Publications