Ag Data Commons
Browse

Exome sequencing of diverse collection of wheat landraces, cultivars and breeding lines

dataset
posted on 2024-11-23, 21:57 authored by Kansas State University
The exome sequenced hexaploid wheat accessions were selected from a worldwide population of landraces and cultivars obtained from the USDA National Small Grains Collection and the Australian Grains Genebank (formally the Australian Winter Cereal Collection). The worldwide population comprised 3990 accessions from 106 countries that were genotyped using the Infinium iSelect 90K SNP wheat bead chip array. The MCG method was used to select accessions for exome sequencing. This method uses a genetic relationship matrix calculated from genotype data to iteratively chose and arbitrary number of accessions (in the case of this study 20% of the accessions) that collectively explain the largest proportion of variance in genetic relationships among the whole set. Genomic DNA was extracted from leaf tissue for each accession using the Agencourt DNAdvance Genomic DNA Isolation Kit (Beckman Coulter) and subjected to sequence capture using the NimbleGen SeqCap EZ wheat whole genome assay. In brief, 1 µg of DNA was fragmented with the Covaris S2 instrument to obtain an average fragment length of 300 bp. Shorter fragments were removed using Agencourt AMPure XP beads (Beckman Coulter). The KAPA Library Preparation Kit (Kapa Biosystems) and adapters supplied in the NimbleGen SeqCap EZ Reagent Kit Plus v2 (Roche) were used to prepare sample libraries for the capture assay, according to the KAPA protocol but excluding steps 8.1-8.22. The quality and yield of each sample library was assessed using an Agilent 2200 TapeStation (Agilent Technologies) before proceeding to the sequence capture assay following the Roche protocol. Each target capture sequence enriched sample library was sequenced on an Illumina HiSeq2000 instrument (Illumina) to generate about 30 million reads per accession. The project leaders are Matthew Hayden (Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, Victoria, Australia; La Trobe University, Bundoora, Victoria, Australia) and Eduard Akhunov (Kansas State University, USA). This project was supported by the National Research Initiative Competitive Grants 2017-67007-25939 (Wheat-CAP) and 2016-67013-24473 from the USDA National Institute of Food and Agriculture. Exome sequencing of Canadian wheat cultivars was supported through the Canadian Triticum Applied Genomics grant (CTAG) funded by Genome Canada, Genome Prairie, Saskatchewan Ministry of Agriculture, and the Western Grains Research Foundation. Corteva Agriscience, Agriculture Division of DowDuPont provided financial support through collaboration with Agriculture Victoria Services enabling the development of the SNP dataset and technologies used in this project. More information can be found on our project website at http://wheatgenomics.plantpath.ksu.edu/1000EC

Funding

USDA: 2017-67007-25939

History

Data contact name

BioProject Curation Staff

Publisher

National Center for Biotechnology Information

Temporal Extent Start Date

2019-01-29

Theme

  • Non-geospatial

ISO Topic Category

  • biota

National Agricultural Library Thesaurus terms

sequence analysis; genomics; genome

Pending citation

  • No

Public Access Level

  • Public

Accession Number

PRJNA517692

Preferred dataset citation

It is recommended to cite the accession numbers that are assigned to data submissions, e.g. the GenBank, WGS or SRA accession numbers. If individual BioProjects need to be referenced, state that "The data have been deposited with links to BioProject accession number PRJNA517692 in the NCBI BioProject database (https://www.ncbi.nlm.nih.gov/bioproject/)."

Usage metrics

    Categories

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC