Use of dihaploid populations to unravel the genetics of potato agronomic traits

USE OF DIHAPLOID POPULATIONS TO UNRAVEL THE GENETICS OF POTATO AGRONOMIC TRAITS

Sponsoring Institution

National Institute of Food and Agriculture

Project Status

COMPLETE

Funding Source

AFRI COMPETITIVE GRANT

Reporting Frequency

Annual

Accession No.

1010863

Grant No.

2017-67011-26038

Cumulative Award Amt.

$86,456.00

Proposal No.

2016-04723

Multistate No.

(N/A)

Project Start Date

Jan 1, 2017

Project End Date

Dec 31, 2019

Grant Year

2017

Program Code

[A7101]- AFRI Predoctoral Fellowships

Recipient Organization
MICHIGAN STATE UNIV
(N/A)
EAST LANSING,MI 48824

Performing Department
Plant Biology

Non Technical Summary
Potatofaces special challenges in terms of genetic improvement due to the nature of its genome. Potato possesses four copies of each chromosome, making it more difficult than usual to incorporate desirable agronomic traits, such as disease resistance and yield,during the breeding process. This trait of possessing more than two copies of each chromosome, called polyploidy, also makes it difficult to identify the genes and genetic variants underlying traits of interest. To simplify the system and make genetic studies more feasible, one strategy that can be utilized is the production of potatoes that possess only two copies of each chromosome, called dihaploids. Potato dihaploids are made by fertilizing the target plant with a special group of potatoes known as in vitro pollinators. Once the pollination is performed, the eggs from the target plant develop with only half the number of chromosomes as a usual potato plant. This makes them far easier to use in genetic studies, and they can also be used to incorporate genetic material from wild potato that only possess two copies of each chromosome and are otherwise incompatible with cultivated potato. During the formation of dihaploids, some genetic material is sometimes incorporated from the pollinator plant into the mother plant. This outcome is undesirable, and the proposed projects seek to identify the prevalence of this phenomenon in two dihaploid populations generated from the common potato cultivars Atlantic and Superior. The project will also utilize the dihaploid populations from these plants to study, on a genome-wide scale, how changes at the genetic and epigenetic level influence agronomic traits in the field and expression of genes at the molecular level. The studies, altogether, will contribute to the community's knowledge of how dihaploid production works andprovide the potato breeding community withmore fine-scaled knowledge on how variation in the potato genome contributes to observable trait differences between different plants.

Animal Health Component

Research Effort Categories

Basic

100%

Applied

Developmental

Classification

Knowledge Area (KA)	Subject of Investigation (SOI)	Field of Science (FOS)	Percent
201	1310	1080	100%

Knowledge Area
201 - Plant Genome, Genetics, and Genetic Mechanisms;

Subject Of Investigation
1310 - Potato;

Field Of Science
1080 - Genetics;

Keywords

Goals / Objectives
This project will investigate the molecular mechanism of dihaploid induction in potato, a common technique to reduce ploidy level for genetic studies and breeding. Previous work demonstrates that DNA from the dihaploid inducer is transferred to progeny, an undesirable result. The first goal of this project seeks to characterize the extend of this phenomenon. The objectives to meet his goal are as follows:Characterize single nucleotide polymorphism in potato dihaploid populations and the dihaploid inducer parent plant. This will be achieved through the analysis of next-generation sequencing data.Discover instances of incidental introgression from the dihaploid inducerDevelop bioinformatics approaches to explain cases of introgressionDihaploid populations will then be used to characterize genes and genomic variation influencing field-measured agronomic traits, including yield, vine vigor, and height. This will be accomplished with the following objectives:Identify and characterize genome-wide copy number variation in dihaploid populationsIn QTL regions associated with traits, determine differential expression regulated by copy number variation of genesDetermine differential gene regulation regulated by histone marks contributing to extreme phenotypes

Project Methods
Aim 1. Determine extent and mechanisms of incidental IVP introgressionHypothesis and experimental approaches: For the following studies, dihaploid (2x) populations were generated by the fertilization of two tetraploid (4x) varieties, Atlantic and Superior, using a haploid-inducing S. phureja in vitro pollinator, IVP101. IVP101 has been demonstrated to show no evidence of somatic translocation during haploid induction using AFLP markers (Straadt and Rasmussen, 2003). However, another in vitro pollinator, IVP48, has been demonstrated to cause introgressions in the maternal genome (Clulow and Rousselle-Bourgeois, 1997; Straadt and Rasmussen, 2003; Wilkinson et al., 1995). A whole-genome assessment has not yet been performed to search for incorporation chromosomal segments from IVP101 during dihaploid induction. To assess this possibility, the Atlantic and Superior dihaploid populations will be analyzed at the whole-genome level to determine if chromosomal translocation has occurred, which would result in the invasion of Atlantic and Superior haplotypes by IVP101 genetic material.Atlantic and Superior tetraploids have already been sequenced and genotyped in previous work focused on intragenome heterogeneity of six tetraploid cultivars (Pham et al. in preparation). Using this information, and newly generated sequence data from IVP101, IVP101-specific SNPs will be identified for use as markers for chromosomal translocation in the dihaploid populations (Figure 1). At this time, the Atlantic and Superior dihaploid populations consisting of 158 and 95 individuals, respectively, have been sequenced using a whole-shotgun approach at a depth of approximately 8X genome coverage per line (Figure 1). The Atlantic and Superior dihaploids, as well as S. phureja IVP101, will be genotyped using the Genome Analysis Toolkit workflow. Occurrences of chromosomal translocation will be inferred by locating IVP101-specific SNPs in the dihaploid individuals. Bioinformatics methods will be developed to determine the possible molecular mode of introgression, including non-homologous end joining, non-crossover double-stranded break repair, and crossover double-stranded break repair (Table 1).Potential pitfalls and alternative strategies: Low-coverage sequencing, depending on filtering stringency, may result in spurious genotype calls. To address this issue, genotypes called in the dihaploid populations will only be considered if they are monomorphic in the entire population, effectively using the combined read depth in multiple samples to ensure that homozygous sites are properly identified. These sites can then be used to locate IVP101-specific alleles. Additionally, it will be difficult to distinguish between non-homologous end-joining and aneuploidy because it is not possible to tell if chromosomal segments are fused or separate. To distinguish these two, it may be necessary to implement use of a breakpoint determining software.Aim 2. Study differential gene expression and epigenetic regulation of QTL regions underlying agronomic traitsHypothesis and experimental approaches: The Atlantic and Superior dihaploid populations have been grown at two field locations in Michigan and data for various agronomic traits, including emergence, plant height, total tuber yield, and other traits, were collected in 2014 and 2015. Using SNP genotyping data generated by whole-genome sequencing, a genetic map will be constructed and QTLs for the measured traits will be identified. A subset of three lines each has been selected to represent high, medium, and low vigor individuals. For these individuals, RNA sequencing will be performed on tubers and leaves, and ChIP sequencing will be performed surveying H3K9me3, a mark of repression, and H3K4me3, a mark of activation. The RNA-seq data will be analyzed to find differential expression of genes between high, medium, and low vigor individuals, and differential histone methylation will be examined to determine if gene expression differences are epigenetically regulated. Additionally, these data can be used on a genome-wide scale to find other genes that may be transcriptionally regulated by epigenetic changes that are responsible for phenotypic extremes in potato. Because DNA sequence data will also be available for these lines, intragenome variation can be characterized to determine the role of structural variation and SNPs in phenotypic variation. For example, the inheritance of different gene copy numbers within QTLs may be a key driver of plant vigor. The inheritance of recessive deleterious alleles in a homozygous state may also result in deleterious phenotypes.Potential pitfalls and alternative strategies: Focusing on QTL regions is a strategy to limit the focus of genomic analyses on pre-defined regions of the genome. This narrow focus may dilute the importance of genome-wide changes in epigenetic status, for example. To address this, global epigenetic changes will be compared between lines if comparisons between high and low vigor lines in QTL regions indicates that there may be larger genome-wide changes affecting plant vigor. Additionally, if there have been gross structural genomic aberrations during dihaploid induction, this may impact QTL analysis results. To address this, the results from Aim 1 will be considered during analysis of data in Aim 2 to account for the possibility of large-scale genomic alterations.

Progress 01/01/17 to 12/31/19

Outputs
Target Audience:The research findings provide important genomic resources for basic and applied biologists in the fields of plant biology and plant breeding. Changes/Problems: Nothing Reported What opportunities for training and professional development has the project provided? Nothing Reported How have the results been disseminated to communities of interest?Sequence data were released in the National Center for Biotechnology Information's Sequence Read Archive. What do you plan to do during the next reporting period to accomplish the goals? Nothing Reported

Impacts
What was accomplished under these goals? This project sought to use genomic methods to understand the potential genomic alterations that occur during dihaploid production, a method used in potato breeding where the number of chromosomes are reduced to half their normal count. This method is used to create individuals that can be used in different breeding schemes, including making crosses with wild potato relatives to introduce valuable traits into cultivated potato. In addition, the project aimed to provide genomic resources for the potato breeding and biology community. The results and data generated from this work has provided insights into the possible consequences of using haploid inducer lines to create dihaploid populations in potato (Pham et al., 2019). In addition, we have generated data which will be used to create a new potato genome sequence that will be a valuable resource for additional research. A dihaploid population derived from a cross between the tetraploid potato cultivar, Superior, and the haploid inducer line, IVP101, was genotyped using whole-genome shotgun sequencing. Single nucleotide polymorphisms were inferred using custom bioinformatic methods and the results were used to determine cases where potential DNA translocations occurred in the individuals. Lower levels of putative introgression were found relative to previous research published on dihaploid potato populations derived from crosses that used different haploid inducer lines. We found that the introgressions were most likely due to local DNA repair events, and that these events were correlated with DNA associated with histone marks that signify active chromatin. In addition, 8% of individuals showed primary trisomy events, highlighting the frequent occurrence of meiotic nondisjunction in Superior. As documented in the previous report, the Superior dihaploid population was used for a QTL mapping study that identified loci associated with important agronomic traits, including total tuber yield, tuber set, average tuber weight, vigor, and plant height (Manrique-Carpintero et al., 2018). The population was evaluated in Montcalm, MI, and East Lansing, MI.

Publications

Type: Journal Articles Status: Published Year Published: 2019 Citation: Pham GM, Braz GT, Conway M, Crisovan E, Laimbeer FPE, Manrique-Carpintero NC, Newton L, Douches DS, Jiang J, Veilleux RE, Buell CR (2019). Genome-wide inference of somatic translocation during potato dihaploid production. Plant Genome 12:180079.

Progress 01/01/18 to 12/31/18

Outputs
Target Audience:The research findings impact applied and basic researchers studying potato, including breeders and biologists. Additionally, other plant biologists may use the information for comparative purposes. Changes/Problems: Nothing Reported What opportunities for training and professional development has the project provided?This project provided the opportunity for travel to the USDA project directors' meeting in Washington D.C., which was an excellent networking and learning opportunity. Results from the project were presented to other project directors at this meeting, increasing exposure of other researchers to potato research. How have the results been disseminated to communities of interest?Sequence data, as described in the previous progress report, are publicly available through the National Center for Biotechnology Information's Sequence Read Archive. What do you plan to do during the next reporting period to accomplish the goals?The associated publication, which describes the results in detail, will be revised and published this year. The publication is currently under review.

Impacts
What was accomplished under these goals? This project examined the use of dihaploid production in potato and its molecular consequences on the potato breeding germplasm. The cultivar Superior, commonly used for producing potato chips, showed widespread evidence of DNA changes in its dihaploid progeny. This has implications for potato breeding programs, which use dihaploid production frequently and expect that the DNA in dihaploid progeny reflect the DNA content of their parents. Additionally, this research suggests that DNA changes are coupled to biological processes that influence the structure of DNA in the cell. This provides new information to the biological research community, who could potentially use this information to understand haploid production in other crop species. The results from this project demonstrate that in a population of 95 dihaploid potato individuals derived from a cross between the cultivar Superior and the dihaploid inducer IVP-101, IVP DNA translocation occurred at rates between 0.16 and 1.86%. The translocation sites showed allelic compositions suggesting that most events occurred in individual cell lineages. The translocations occurred in regions of the DNA suspected to be in open chromatin conformation. This conclusion is based on statistical analyses to examine the relationship between translocation sites and gene expression patterns using a potato gene expression atlas. Additionally, the translocations were associated with the histone marks H3K4 dimethylation and H4K5 acetylation, which are associated with open chromatin. The Superior dihaploid population was also used in a QTL mapping study (Manrique-Carpintero et al., 2018). The population showed quantitative variation in several agronomic traits at two different field sites in Montcalm, MI and East Lansing, MI. Using single nucleotide polymorphisms, QTLs were identified for total tuber yield, tuber set, average tuber weight, vigor, and height. Additionally, a case of double reduction resulting in the homozygous status of a mutation in the ga1 locus was found to cause a dwarf genotype in one dihaploid individual.

Publications

Type: Journal Articles Status: Under Review Year Published: 2019 Citation: Pham GM, Braz GT, Conway M, Crisovan E, Laimbeer FPE, Manrique-Carpintero NC, Newton L, Douches DS, Jiang J, Veilleux RE, Buell CR (2019). Genome-wide inference of somatic translocation during potato dihaploid production.

Progress 01/01/17 to 12/31/17

Outputs
Target Audience:The research-related work targets potato breeders whouse dihaploids in their breeding programs to introduce genetic material from wild potato.Regarding outreach and teaching efforts, the target audience is graduate, undergraduate, and elementary school students. Changes/Problems: Nothing Reported What opportunities for training and professional development has the project provided?Through laboratory instruction and development of course materials for Plant Genomics, a course offered at Michigan State University, graduate students and undergraduate students in plant science fields were targets for genomics analysis training. This afforded the opportunity to craft teaching materials and gain experience in teaching students how to get involved in computational biology.I have also participatedin an outreach event called Girls' STEMday, which involved preparation and execution of an activity for kindergarten through 4th grade girls called "Are Yeast Alive?" This gave me the opportunity to engage in community development. How have the results been disseminated to communities of interest?Sequencing data from the Superior dihaploid population, Superior, and IVP-101 have been uploaded to theNational Center for Biotechnology Information'sSequence Read Archiveand are publicly available at this time. What do you plan to do during the next reporting period to accomplish the goals?Dfferential allele expression analysis will be used to determine how histone marks may contribute to differences in cultivated potato phenotype and the alleles which are preferentially expressed. This will be accomplished using RNA-seq and ChIP-seq data from leaf tissue from one of the dihaploid individuals and its progenitor parent Superior. Repressive and activating histone marks will be examined for changes going from the tetraploid to diploid level and the influence of these marks on preferential allele expression. The role of heterozygosity in cultivated potato will also be explored by determining the extent of deleterious mutation in the genome.

Impacts
What was accomplished under these goals? Work on this project has resulted in the genetic characterization of 95 dihaploid potato individuals and their progenitor parents, Superior and the dihaploid inducer IVP-101. Conventional softwares for inference of single nucleotide polymorphisms (SNPs) was not sufficiently sensitive for detecting cases of somatic translocation from IVP-101 DNA to dihaploid progeny DNA. Thus, custom scripts have beendeveloped to detect low-frequency alleles in individuals. The approach to this method is based on the theory of mutation detection in human cancers, where mutations may be at low frequencies in a population of cells. Using this more sensitive approach to detecting IVP-101-specific alleles, most individuals show less than 1% translocation of IVP-101 alleles in their genomes, which may be due to errors. However, four individuals showed much higher percentages of translocation, with regions of the genome ranging from less than 1% to 15% IVP-101 translocation. Preliminary analyses suggestthat there may be "translocation hot spots" in the genome. Additionally, several individuals in the population showed primary trisomy in chromosomes 2, 3, 5, and 8 from sequence information. Eight individuals in total showed this trisomy and four of them were in chromosome 2, suggesting that trisomy is a relatively common phenomena during the formation of gametes in potato. We used fluorescence in-situ hybridization on two individuals to confirm our bioinformatic approaches for identifying trisomy and verified both trisomy events in these individuals.

Publications

Type: Journal Articles Status: Published Year Published: 2017 Citation: Extensive genome heterogeneity leads to preferential allele expression and copy number-dependent expression in cultivated potato Gina M. Pham, Linsey Newton, Krystle Wiegert-Rininger, Brieanne Vaillancourt, David S. Douches and C.Robin Buell