Ag Data Commons

Data and code from: Severity of charcoal rot disease in soybean genotypes inoculated with Macrophomina phaseolina isolates differs among growth environments

dataset
posted on 2025-04-28, 15:58 authored by Alemu Mengistu, Quentin Read, Christopher R. Little, Heather M. Kelly, Peter M. Henry, Nacer Bellaloui

This dataset includes the raw data and the R code used to analyze them and to produce the figures, tables, and in-text results of the associated manuscript:

Mengistu, A., Q. D. Read, C. R. Little, H. M. Kelly, P. M. Henry, and N. Bellaloui. 2025. Severity of charcoal rot disease in soybean genotypes inoculated with Macrophomina phaseolina isolates differs among growth environments. Plant Disease. DOI: 10.1094/PDIS-10-24-2230-RE.


The data included here come from a series of tests designed to evaluate methods for identifying soybean genotypes that are resistant or susceptible to charcoal rot, a widespread and economically significant disease. Four independent experiments (two field tests, a greenhouse test, and a growth chamber test) were performed to determine how disease severity varies by soybean genotype and by isolate of the charcoal rot fungus. The tests differed in the number of genotypes and isolates used, as well as in the method of inoculation. The accuracy of identifying resistant and susceptible genotypes varied by study, and the same isolate often produced highly variable disease severity across studies. Our results indicate that the non-field methods are not reliable ways to identify sources of charcoal rot resistance in soybean.

The models fit in the R script archived here are Bayesian general linear mixed models with AUDPC (area under the disease progress curve) as the response variable. One-dimensional clustering is used to divide the genotypes into resistant and susceptible based on their model-predicted AUDPC values, and this result is compared with the preexisting resistance classification. Posterior distributions of the marginal means for different combinations of genotype, isolate, and other covariates are estimated and compared. Code to reproduce the tables and figures of the manuscript is also included.
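As a rough illustration of two ideas in the paragraph above, the sketch below computes AUDPC with the standard trapezoidal rule and then splits a set of AUDPC values into two groups. It is written in Python for illustration only and is not the archived R code. The function names, genotype labels, and numbers are hypothetical, and the split shown (the cut that minimizes within-group sum of squares) is an assumed stand-in, since the clustering algorithm used in AUDPC_fits.R is not named here. Labeling the lower-AUDPC group as resistant is likewise an assumption of the sketch.

```python
from statistics import fmean


def audpc(times, severities):
    """Area under the disease progress curve by the trapezoidal rule."""
    if len(times) != len(severities) or len(times) < 2:
        raise ValueError("need at least two paired observations")
    area = 0.0
    for i in range(len(times) - 1):
        dt = times[i + 1] - times[i]
        if dt <= 0:
            raise ValueError("times must be strictly increasing")
        # Mean severity over the interval times the interval length.
        area += (severities[i] + severities[i + 1]) / 2 * dt
    return area


def two_group_threshold(values):
    """Cut point that splits sorted values into two contiguous groups
    with the smallest total within-group sum of squares (assumed method)."""
    ordered = sorted(values)
    if len(ordered) < 2:
        raise ValueError("need at least two values to split")

    def sse(xs):
        m = fmean(xs)
        return sum((x - m) ** 2 for x in xs)

    best_k = min(
        range(1, len(ordered)),
        key=lambda k: sse(ordered[:k]) + sse(ordered[k:]),
    )
    # Midpoint between the two values on either side of the best cut.
    return (ordered[best_k - 1] + ordered[best_k]) / 2


# Hypothetical severity ratings at 0, 7, 14, and 21 days after inoculation.
example_audpc = audpc([0, 7, 14, 21], [0, 10, 30, 50])

# Hypothetical model-predicted AUDPC values for four made-up genotypes.
predicted = {"G1": 120.0, "G2": 135.0, "G3": 410.0, "G4": 455.0}
cut = two_group_threshold(predicted.values())
labels = {
    g: "resistant" if v < cut else "susceptible"  # assumed: lower AUDPC = resistant
    for g, v in predicted.items()
}
```

In the archived analysis, the values being clustered are model-predicted AUDPC values from the Bayesian mixed models, not raw AUDPC as in this toy input.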

The following files are included:

  • README.pdf: Full description, with column metadata for the data spreadsheets and text description of each R script
  • data2023-04-18.xlsx: Excel sheet with data from three of the four trials
  • cleaned_data.RData: all data in analysis-ready format; generates a set of data frames when imported into an R environment
  • Modified Cut-Tip Inoculation on DT974290 and LS980358 on first 32 isolates.xlsx: Excel spreadsheet with data from the fourth trial
  • data_cleaning.R: Script that formats the data from the .xlsx files into analysis-ready format (running this script is not necessary to reproduce the analysis; you may instead begin with the following script, which imports the cleaned .RData object)
  • AUDPC_fits.R: Script containing code for all model fitting, model predictions and comparisons, and figure and table generation


Funding

USDA-ARS: 6066-21220-015-000D

Data contact name

Read, Quentin D.

Data contact email

quentin.read@usda.gov

Publisher

Ag Data Commons

Intended use

The data and R code archived here are intended to allow anyone to reproduce the results presented in the associated manuscript.

Use limitations

The R code is single-purpose statistical software code intended only to analyze this specific dataset and should not be used to analyze other datasets without extensive modification.

Temporal Extent Start Date

2014-09-01

Temporal Extent End Date

2022-10-31

Theme

  • Non-geospatial

Geographic Coverage

{"type":"FeatureCollection","features":[{"geometry":{"type":"Point","coordinates":[-88.84650, 35.62288]},"type":"Feature","properties":{}}]}

Geographic location - description

USDA, Jackson, TN work site, West Tennessee Research and Education Center, 35.62288 N, 88.84650 W

ISO Topic Category

  • farming
  • environment

National Agricultural Library Thesaurus terms

soybeans; genotype; disease susceptibility; charcoal rot; disease severity; Macrophomina phaseolina; greenhouses; growth chambers; models; Bayesian theory; disease course

OMB Bureau Code

  • 005:18 - Agricultural Research Service

OMB Program Code

  • 005:040 - National Research

ARS National Program Number

  • 303

ARIS Log Number

425663

Pending citation

  • No

Public Access Level

  • Public