Ag Data Commons

File(s) stored somewhere else

Please note: Linked content is NOT stored on Ag Data Commons and we can't guarantee its availability, quality, security or accept any liability.

Data from: Assessing metabolomic and chemical diversity of a soybean lineage representing 35 years of breeding

posted on 2024-02-13, 13:12 authored by Miyako Kusano, Ivan Baxter, Atsushi Fukushima, Akira Oikawa, Yozo Okazaki, Ryo Nakabayashi, Denise J. Bouvrette, Frederic Achard, Andrew R. Jakubowski, Joan M. Ballam, Jonathan R. Phillips, Angela H. Culler, Kazuki Saito, George G. Harrigan

Information on crop genotype- and phenotype-metabolite associations can be of value to trait development as well as to food security and safety. The unique study presented here assessed seed metabolomic and ionomic diversity in a soybean (Glycine max) lineage representing ~35 years of breeding (launch years 1972–2008) and increasing yield potential. Selected varieties included six conventional and three genetically modified (GM) glyphosate-tolerant lines. A metabolomics approach utilizing capillary electrophoresis (CE)-time-of-flight-mass spectrometry (TOF-MS), gas chromatography (GC)-TOF-MS and liquid chromatography (LC)-quadrupole (q)-TOFMS resulted in measurement of a total of 732 annotated peaks. Ionomics through inductively-coupled plasma (ICP)-MS profiled twenty mineral elements. Orthogonal partial least squares-discriminant analysis (OPLS-DA) of the seed data successfully differentiated newer higher-yielding soybean from earlier lower-yielding accessions at both field sites. This result reflected genetic fingerprinting data that demonstrated a similar distinction between the newer and older soybean. Correlation analysis also revealed associations between yield data and specific metabolites. There were no clear metabolic differences between the conventional and GM lines. Overall, observations of metabolic and genetic differences between older and newer soybean varieties provided novel and significant information on the impact of varietal development on biochemical variability. Proposed applications of omics in food and feed safety assessments will need to consider that GM is not a major source of metabolite variability and that trait development in crops will, of necessity, be associated with biochemical variation.

Resources in this dataset:

  • Resource Title: Pointer to Electronic Supplementary Material.

    File Name: Web Page, url:

    Link to Electronic Supplementary Material at Metabolomics. Files are:

    Supplementary material 1: Full metabolite profile and ionomic dataset - Download Excel

    Supplementary material 2: List of annotated metabolites - Download Excel

    • List of the 681 annotated metabolites obtained by using 4 platforms after data summarization and removal of 50% missing value in the metabolite profile data. Some metabolite peaks could not be summarized and thus kept in the list.

    Supplementary material 3: Monsanto_ionomics_Data_Baxterla - Download Excel

    Supplementary material 4: Metabolomics metadata - Download .docx

    • Plant context metadata; Chemical analysis metadata.

    Supplementary material 5: Spearman correlations between yield and metabolites/ions - Download Excel

    Supplementary material 6: Supporting Tables and Figures - Download .docx

    • Supporting Table 1.Similarity matrix: Genetic similarity of different soybean varieties based on genetic fingerprint data.
    • Supporting Table 2. Metabolite Coverage of Analytical Platforms.
    • Supporting Table 3. Summary of Statistically Significant Differences in Ionomic Profiles.
    • Supporting Figure A. PCA (principal components one and two) based on the genotypic data of 1,484 pre-commercial and commercial proprietary Monsanto lines.
    • Supporting Fig. 1. Evaluation of the achieved coverage of metabolite profile data.
    • Supporting Fig. 2. Principal component analysis of the identified or annotated metabolites/ peaks.
    • Supporting Fig. 3. Principal component analysis of the identified or annotated metabolites/peaks and including the ionomics data.
    • Supporting Fig. 4. The score scatter plot of OPLS-DA using the identified or annotated metabolites/ peaks and including the ionomics data.
    • Supporting Fig. 5. Graphic representation of nodes of the first neighbors in the yield-to-metabolite correlation networks of samples harvested at ILJA and ILJE.


Data contact name

Baxter, Ivan

Data contact email



Intended use

To assess soybean seed metabolomic and ionomic diversity in a lineage representing ~35 years of breeding (launch years 1972–2008) and increasing yield potential.

Temporal Extent Start Date


Temporal Extent End Date



  • Not specified

Geographic Coverage


Geographic location - description

Illinois - Jerseyville [ILJE] and Jacksonville [ILJA]

ISO Topic Category

  • biota

National Agricultural Library Thesaurus terms

metabolomics; soybeans; DNA fingerprinting; Glycine max; capillary electrophoresis; crops; gas chromatography; genetic engineering; glyphosate; herbicide resistance; ionomics; liquid chromatography; mass spectrometry; metabolites; plant breeding; seeds; transgenic plants; variance; United States; food security; minerals; correlation; genetic variation; safety assessment; biochemical polymorphism

OMB Bureau Code

  • 005:18 - Agricultural Research Service

OMB Program Code

  • 005:040 - National Research

Primary article PubAg Handle

Pending citation

  • No

Public Access Level

  • Public

Preferred dataset citation

Kusano, Miyako; Baxter, Ivan; Fukushima, Atsushi; Oikawa, Akira; Okazaki, Yozo; Nakabayashi, Ryo; Bouvrette, Denise J.; Achard, Frederic; Jakubowski, Andrew R.; Ballam, Joan M.; Phillips, Jonathan R.; Culler, Angela H.; Saito, Kazuki; Harrigan, George G. (2018). Data from: Assessing metabolomic and chemical diversity of a soybean lineage representing 35 years of breeding. Metabolomics.

Usage metrics



    Ref. manager