Data from: Random forest regression to predict Farinograph traits from GlutoPeak output in wheat wild relative backcross lines
Flour quality is a key breeding target in hard winter wheat cultivar development. The Farinograph is perhaps the most important device for assessing quality prior to cultivar release in the United States, but large sample size requirements and long test times make in impracticable for early-stage selection. We used random forest regression to predict key Farinograph parameters from novel features we calculated from the raw data output of the GlutoPeak, which requires less time and less sample, in a winter wheat population containing wild relative introgressions. Here, we present the raw GlutoPeak data and Farinograph data used in model development.
GlutoPeak output for 68 wheat samples, contained in folder "GP_upload". Some lines including wild relative introgressions. Files with the same number prior to the underscore represent multiple replications of the same sample - one file was randomly selected for model construction.
FarinoGraph output for 68 wheat samples, some lines including wild relative introgressions.
Funding
National Science Foundation: IIP-1338897
USDA-ARS: 3020-21000-012-000D
History
Data contact name
Guttieri, Mary, J.Data contact email
mary.guttieri@usda.govPublisher
Ag Data CommonsTemporal Extent Start Date
2023-11-01Temporal Extent End Date
2024-02-29Theme
- Non-geospatial
ISO Topic Category
- biota
- farming
National Agricultural Library Thesaurus terms
plant breeding; winter wheat; Triticum; wild relatives; introgression; models; wheat flourOMB Bureau Code
- 005:18 - Agricultural Research Service
OMB Program Code
- 005:040 - National Research
ARS National Program Number
- 301
Pending citation
- No
Public Access Level
- Public