Source: AGRICULTURAL RESEARCH SERVICE submitted to NRP
GENOME-WIDE DISCOVERY OF GENES/GENE VARIANTS DETERMINING OIL TRAITS AND OIL FUNCTIONAL MARKERS BY EXPLORING SOYBEAN GENETIC DIVERSITY
Sponsoring Institution
Agricultural Research Service/USDA
Project Status
ACTIVE
Funding Source
Reporting Frequency
Annual
Accession No.
0426257
Grant No.
(N/A)
Cumulative Award Amt.
(N/A)
Proposal No.
(N/A)
Multistate No.
(N/A)
Project Start Date
Apr 1, 2014
Project End Date
Dec 31, 2016
Grant Year
(N/A)
Program Code
[(N/A)]- (N/A)
Recipient Organization
AGRICULTURAL RESEARCH SERVICE
(N/A)
COLUMBIA,MO 65211
Performing Department
(N/A)
Non Technical Summary
(N/A)
Animal Health Component
10%
Research Effort Categories
Basic
80%
Applied
10%
Developmental
10%
Classification

Knowledge Area (KA)Subject of Investigation (SOI)Field of Science (FOS)Percent
20418201040100%
Knowledge Area
204 - Plant Product Quality and Utility (Preharvest);

Subject Of Investigation
1820 - Soybean;

Field Of Science
1040 - Molecular biology;
Keywords
Goals / Objectives
The following specific objectives will be achieved in the project. 1. Sequence RNA populations (transcriptomes) in seeds for 100 germplasm lines important to US soybean breeding and agriculture that vary in seed oil content and composition. 2. Discover genes specifically regulating oil quality traits and their gene variations that lead to the difference in oil content and composition among those germplasm using a variety of data analysis and mining strategies, and further develop and validate a set of highly effective functional markers. 3. Develop a relational database and G-browser for soybean community to access and utilize the sequencing data, genes, functional markers and networks. 4. Develop transgenic plants with altered activity for two identified genes specifically controlling oil content.
Project Methods
The difference in seed oil composition and content among soybean germplasm is caused by variation in protein coding sequences and/or expression levels of the genes involved in oil synthesis and storage. To identify those oil genes and their variations, we had proposed that we would use next generation sequencing technology to determine transcript sequences and accumulation levels of all genes expressed in seeds, and develop a bioinformatic pipeline to identify transcript sequence variations that lead to seed quality variation among the germplasm. Although the soybean genome is large, transcribed sequences only account for less than 5% of its entire genome. In comparison with whole genome sequencing approach, the transcriptome sequencing approach does not only dramatically reduces the cost of determining both protein coding sequences and expression levels of seed genes in soybean germplasm, also significantly reduces the background noise from non-functional genome sequence to identify the genes and gene variants for oil quality traits. Low cost of sequencing for each germplasm makes it feasible to sequence large number of germplasm to identify the genes and gene variants using genome-wide association studies.