Data and code from: Stem borer herbivory dependent on interactions of sugarcane variety, associated traits, and presence of prior borer damage
This dataset contains all the data and code needed to reproduce the analyses in the manuscript:
Penn, H. J., & Read, Q. D. (2023). Stem borer herbivory dependent on interactions of sugarcane variety, associated traits, and presence of prior borer damage. Pest Management Science. https://doi.org/10.1002/ps.7843
Included are two .Rmd notebooks containing all code required to reproduce the analyses in the manuscript, two .html files of rendered notebook output, three .csv data files that are loaded and analyzed, and a .zip file of intermediate R objects that are generated during the model fitting and variable selection process.
Notebook files
01_boring_analysis.Rmd
: This RMarkdown notebook contains R code to read and process the raw data, create exploratory data visualizations and tables, fit a Bayesian generalized linear mixed model, extract output from the statistical model, and create graphs and tables summarizing the model output including marginal means for different varieties and contrasts between crop years.
02_trait_covariate_analysis.Rmd
: This RMarkdown notebook contains R code to read raw variety-level trait data, perform feature selection based on correlations between traits, fit another generalized linear mixed model using traits as predictors, and create graphs and tables from that model output including marginal means by categorical trait and marginal trends by continuous trait.
HTML files
These HTML files contain the rendered output of the two RMarkdown notebooks. They were generated by Quentin Read on 2023-08-30 and 2023-08-15.
01_boring_analysis.html
02_trait_covariate_analysis.html
CSV data files
These files contain the raw data. To recreate the notebook output, the CSV files should be at the file path project/data/ relative to where the notebook is run. Columns are described below.
BoredInternodes_26April2022_no format.csv
: primary data file with sugarcane borer (SCB) damage
- Columns A-C are the year, date, and location. All location values are the same.
- Column D identifies which experiment the data point was collected from.
- Column E, Stubble, indicates the crop year (plant cane or first stubble)
- Column F indicates the variety
- Column G indicates the plot (integer ID)
- Column H indicates the stalk within each plot (integer ID)
- Column I, # Internodes, indicates how many internodes were on the stalk
- Columns J-AM are numbered 1-30 and indicate whether SCB damage was observed on that internode (0 if no, 1 if yes, blank cell if that internode was not present on the stalk)
- Column AN indicates the experimental treatment for those rows that are part of a manipulative experiment
- Column AO contains notes
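As a rough illustration of how the wide internode layout described above can be summarized per stalk, the following base R sketch builds a small toy table and counts damaged and scored internodes. It is not taken from the notebooks. Only Stubble, # Internodes, and the numbered internode columns are named in the description; the other column names, the crop year codes, and all values here are hypothetical.

```r
# Toy stand-in for the wide layout (not real data).
# "Variety" and "Stalk" are hypothetical column names, and the Stubble codes
# are placeholders. With the real file, something like
#   read.csv("project/data/BoredInternodes_26April2022_no format.csv",
#            check.names = FALSE)
# would keep headers such as "# Internodes" and "1" unchanged.
toy <- data.frame(
  Stubble = c("PC", "PC", "1S"),
  Variety = c("A", "A", "B"),
  Stalk = c(1L, 2L, 1L),
  `# Internodes` = c(3L, 2L, 4L),
  `1` = c(0L, 1L, 0L),
  `2` = c(1L, 0L, 0L),
  `3` = c(0L, NA, 1L),
  `4` = c(NA, NA, 1L),
  check.names = FALSE
)

# Internode columns are those whose names are purely numeric.
internode_cols <- grep("^[0-9]+$", names(toy), value = TRUE)

# Damaged internodes per stalk; blank cells (NA) mean the internode is absent.
toy$n_bored <- rowSums(toy[, internode_cols], na.rm = TRUE)

# Internodes actually scored; this should agree with "# Internodes".
toy$n_scored <- rowSums(!is.na(toy[, internode_cols]))

# Proportion of scored internodes with SCB damage.
toy$prop_bored <- toy$n_bored / toy$n_scored
```

Comparing n_scored against # Internodes is a quick consistency check before any modeling.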
variety_lookup.csv
: summary information for the 16 varieties analyzed in this study
- Column A is the variety name
- Column B is the total number of stalks assessed for SCB damage for that variety across all years
- Column C is the number of years that variety is present in the data
- Column D, Stubble, indicates which crop years were sampled for that variety ("PC" if only plant cane, "PC, 1S" if there are data for both plant cane and first stubble crop years)
- Column E, SCB resistance, is a categorical designation with four values: susceptible, moderately susceptible, moderately resistant, resistant
- Column F is the literature reference for the SCB resistance value
Select_variety_traits_12Dec2022.csv
: variety-level traits for the 16 varieties analyzed in this study
- Column A is the variety name
- Column B is the SCB resistance designation as an integer
- Column C is the categorical SCB resistance designation (see above)
- Columns D-I are continuous traits from year 1 (plant cane), including sugar (Mg/ha), biomass or aboveground cane production (Mg/ha), TRS or theoretically recoverable sugar (g/kg), stalk weight of individual stalks (kg), stalk population density (stalks/ha), and fiber content of stalk (percent).
- Columns J-O are the same continuous traits from year 2 (first stubble)
- Columns P-V are categorical traits (in some cases continuous traits binned into categories): maturity timing, amount of stalk wax, amount of leaf sheath wax, amount of leaf sheath hair, tightness of leaf sheath, whether leaf sheath becomes necrotic with age, and amount of collar hair.
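The categorical SCB resistance designation can be handled in R as an ordered factor, which also yields an integer code. The sketch below is illustrative only: the four labels come from the description of variety_lookup.csv, but the ordering, the example values, and the resulting integer codes are assumptions and may not match the integer column in the trait file.

```r
# The four category labels are from the dataset description.
# Ordering them from least to most resistant is an assumption.
resistance_levels <- c(
  "susceptible",
  "moderately susceptible",
  "moderately resistant",
  "resistant"
)

# Hypothetical values standing in for the categorical resistance column.
scb_resistance <- c("resistant", "susceptible", "moderately resistant")

resistance <- factor(scb_resistance, levels = resistance_levels, ordered = TRUE)

# Integer codes follow the level order (1 = susceptible, 4 = resistant).
# The integer coding actually used in the trait file may differ.
resistance_code <- as.integer(resistance)
```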
ZIP file of intermediate R objects
To recreate the notebook output without having to run computationally intensive steps, unzip the archive. The fitted model objects should be at the file path project/ relative to where the notebook is run.
intermediate_R_objects.zip
: This file contains intermediate R objects that are generated during the model fitting and variable selection process. You may use the R objects in the .zip file if you would like to reproduce final output including figures and tables without having to refit the computationally intensive statistical models.
binom_fit_intxns_updated_only5yrs.rds
: fitted brms model object for the main statistical model
binom_fit_reduced.rds
: fitted brms model object for the trait covariate analysis
marginal_trends.RData
: calculated values of the estimated marginal trends with respect to year and previous damage
marginal_trend_trs.rds
: calculated values of the estimated marginal trend with respect to TRS
marginal_trend_fib.rds
: calculated values of the estimated marginal trend with respect to fiber content
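A minimal sketch of loading the intermediate objects listed above, assuming the archive has been unzipped so that the files sit directly under project/ relative to the working directory. The variable names are illustrative, and the notebooks may load these files differently. Files that are not found are skipped, so the snippet runs even before the archive is unzipped.

```r
# Assumed location of the unzipped objects; adjust if your layout differs.
obj_dir <- "project"

rds_files <- c(
  "binom_fit_intxns_updated_only5yrs.rds",
  "binom_fit_reduced.rds",
  "marginal_trend_trs.rds",
  "marginal_trend_fib.rds"
)
rds_paths <- file.path(obj_dir, rds_files)

# Read each .rds file that is present; report the ones that are missing.
objects <- list()
for (p in rds_paths) {
  if (file.exists(p)) {
    objects[[basename(p)]] <- readRDS(p)
  } else {
    message("Not found, skipping: ", p)
  }
}

# An .RData file restores objects under their saved names;
# load() returns those names as a character vector.
rdata_path <- file.path(obj_dir, "marginal_trends.RData")
if (file.exists(rdata_path)) {
  restored <- load(rdata_path)
  print(restored)
}
```

Printing or summarizing the two fitted model objects will likely require the brms package to be installed, since they are described as brms model objects.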
Resources in this dataset:
Resource Title: Sugarcane borer damage data by internode, 1993-2021.
File Name: BoredInternodes_26April2022_no format.csv
Resource Title: Summary information for the 16 sugarcane varieties analyzed.
File Name: variety_lookup.csv
Resource Title: Variety-level traits for the 16 sugarcane varieties analyzed.
File Name: Select_variety_traits_12Dec2022.csv
Resource Title: RMarkdown notebook 2: trait covariate analysis.
File Name: 02_trait_covariate_analysis.Rmd
Resource Title: Rendered HTML output of notebook 2.
File Name: 02_trait_covariate_analysis.html
Resource Title: RMarkdown notebook 1: main analysis.
File Name: 01_boring_analysis.Rmd
Resource Title: Rendered HTML output of notebook 1.
File Name: 01_boring_analysis.html
Resource Title: Intermediate R objects.
File Name: intermediate_R_objects.zip
Funding
Agricultural Research Service, 6052-21000-017-000-D
History
Data contact name
Penn, Hannah
Data contact email
hannah.penn@usda.gov
Publisher
Ag Data Commons
Intended use
The data and code provided here will reproduce all analyses presented in the manuscript, including processing the raw data into analysis-ready format, fitting statistical models, doing variable selection, extracting output from the models, and creating graphs and tables.
Use limitations
The R code is only intended to analyze the data provided and would need to be modified to work with other similar datasets.
Temporal Extent Start Date
1993-01-01
Temporal Extent End Date
2021-12-31
Theme
- Not specified
Geographic Coverage
{"type":"FeatureCollection","features":[{"geometry":{"type":"Point","coordinates":[-90.84895,29.63199]},"type":"Feature","properties":{}}]}
Geographic location - description
Ardoyne Research Farm, Houma, Louisiana, USA
ISO Topic Category
- biota
- environment
- farming
National Agricultural Library Thesaurus terms
boring insects; herbivores; sugarcane; data collection; computer software; data visualization; Bayesian theory; statistical models; graphs; Saccharum; Diatraea saccharalis; varietal resistance; integrated pest management; plant age; Papaipema nebris; stubble; crop year; internodes; sugars; biomass; population density; fiber content; leaves
OMB Bureau Code
- 005:18 - Agricultural Research Service
OMB Program Code
- 005:040 - National Research
ARS National Program Number
- 305
Pending citation
- No
Public Access Level
- Public