File(s) not publicly available
Data and code from: Identification of a key target for elimination of nitrous oxide, a major greenhouse gas
Note: Data files will be made available upon manuscript publication
This dataset contains all code and data needed to reproduce the analyses in the manuscript:
IDENTIFICATION OF A KEY TARGET FOR ELIMINATION OF NITROUS OXIDE, A MAJOR GREENHOUSE GAS.
Blake A. Oakley (1), Trevor Mitchell (2), Quentin D. Read (3), Garrett Hibbs (1), Scott E. Gold (2), Anthony E. Glenn (2)
- Department of Plant Pathology, University of Georgia, Athens, GA, USA.
- Toxicology and Mycotoxin Research Unit, U.S. National Poultry Research Center, United States Department of Agriculture-Agricultural Research Service, Athens, GA, USA
- Southeast Area, United States Department of Agriculture-Agricultural Research Service, Raleigh, NC, USA
citation will be updated upon acceptance of manuscript
Brief description of study aims
Denitrification is a chemical process that releases nitrous oxide (N2O), a potent greenhouse gas. The NOR1 gene is part of the denitrification pathway in Fusarium. Three experiments were conducted for this study. (1) The N2O comparative experiment compares denitrification rates, as measured by N2O production, of a variety of Fusarium spp. strains with and without the NOR1 gene. (2) The N2O substrate experiment compares denitrification rates of selected strains on different growth media (substrates). For parts 1 and 2, linear models are fit comparing N2O production between strains and/or substrates. (3) The Bioscreen growth assay tests whether there is a pleiotropic effect of the NOR1 gene. In this portion of the analysis, growth curves are fit to assess differences in growth rate and carrying capacity between selected strains with and without the NOR1 gene.
Code
All code is included in a .zip archive generated from a private git repository on 2022-10-13 and archived as part of this dataset.
The code is contained in R scripts and RMarkdown notebooks. There are two components to the analysis: the denitrification analysis (comprising parts 1 and 2 described above) and the Bioscreen growth analysis (part 3). The scripts for each are listed and described below.
Analysis of results of denitrification experiments (parts 1 and 2)
NOR1_denitrification_analysis.Rmd
: The R code to analyze the experimental data comparing nitrous oxide emissions is all contained in a single RMarkdown notebook. This script analyzes the results from the comparative study and the substrate study.n2o_subgroup_figures.R
: R script to create additional figures using the output from the RMarkdown notebook
Analysis of results of Bioscreen growth assay (part 3)
bioscreen_analysis.Rmd
: This RMarkdown notebook contains all R code needed to analyze the results of the Bioscreen assay comparing growth of the different strains. It could be run as is. However, the model-fitting portion was run on a high-performance computing cluster with the following scripts:bioscreen_fit_simpler.R
: R script containing only the model-fitting portion of the Bioscreen analysis, fit using the Stan modeling language interfaced with R through the brms and cmdstanr packages.job_bssimple.sh
: Job submission shell script used to submit the model-fitting R job to be run on USDA SciNet high-performance computing cluster.
Additional scripts developed as part of the analysis but that are not required to reproduce the analyses in the manuscript are in the deprecated/
folder.
Also note the files nor1-denitrification.Rproj
(RStudio project file) and gtstyle.css
(stylesheet for formatting the tables in the notebooks) are included.
Data
Data required to run the analysis scripts are archived in this dataset, other than strain_lookup.csv
, a lookup table of strain abbreviations and full names included in the code repository for convenience. They should be placed in a folder or symbolic link called project
within the unzipped code repository directory.
N2O_data_2022-08-03/N2O_Comparative_Study_Trial_(n)_(date range).xlsx
: These are the data from the N2O comparative study, wheren
is the trial number from 1-3 anddate range
is the begin and end date of the trial.N2O_data_2022-08-03/Nitrogen_Substrate_Study_Trial_(n)_(date range).xlsx
: These are the data from the N2O substrate study, wheren
is the trial number from 1-3 anddate range
is the begin and end date of the trial.Outliers_NOR1_2022/Bioscreen_NOR1_Fungal_Growth_Assay_(substrate)_(oxygen level)_Outliers_BAO_(date).xlsx
: These are the raw Bioscreen data files in MS Excel format. The format of each file name includes the substrate (minimal medium with nitrite or nitrate and lysine), oxygen level (hypoxia or normoxia), and date of the run. This repository includes code to process these files, but the processed data are also included on Ag Data Commons, so it is not necessary to run the data processing portion of the code.clean_data/bioscreen_clean_data.csv
: This is an intermediate output file in CSV format generated bybioscreen_analysis.Rmd
. It includes all the data from the Bioscreen assays in a clean analysis-ready format.
Funding
Agricultural Research Service, 6040-42000-046-000D
History
Data contact name
Read, QuentinData contact email
quentin.read@usda.govPublisher
Ag Data CommonsIntended use
This dataset is intended to allow reproducing all analyses presented in the above-cited manuscript.Use limitations
The code included in this dataset is only designed to work with the input data provided and would need to be modified if running similar analyses on different input data.Temporal Extent Start Date
2021-12-08Temporal Extent End Date
2022-03-08Theme
- Not specified
Geographic Coverage
{"type":"FeatureCollection","features":[{"geometry":{"type":"Point","coordinates":[-83.3563255,33.928033]},"type":"Feature","properties":{}}]}Geographic location - description
Athens, Georgia, USAISO Topic Category
- environment
- farming
National Agricultural Library Thesaurus terms
nitrous oxide; greenhouse gases; data collection; denitrification; greenhouse gas emissions; comparative study; models; USDA; oxygen; lysine; hypoxia; normoxia; silver; information processing; nitrogen; sodium nitrite; sodium nitrate; air pollution control; bioassays; microbial growth; culture media; growth curves; computer software; Fusarium verticillioides; Fusarium oxysporum f. sp. vasinfectum; Fusarium graminearum; plant pathogenic fungi; mutants; strainsOMB Bureau Code
- 005:18 - Agricultural Research Service
OMB Program Code
- 005:040 - National Research
ARS National Program Number
- 108
Pending citation
- No
Public Access Level
- Public