Ag Data Commons
Browse

Data from: High-throughput classification and quantification of skinning phenotype in sweet potatoes

dataset
posted on 2025-11-23, 02:59 authored by Zachary Bloom, Joshua Larsen, Enrique Pena Martinez, Cranos Williams, Daniela Jones, Michael Kudenov
<p>Sweetpotatoes (SPs) (<em>Ipomoea batatas</em>) are crops valued for their color, flavor, and nutrition.  Harvesting is labor-intensive, requiring hand-picking to maintain skin quality.  Mechanical harvesting often causes skin damage, known as “skinning,” where skin is cut, scraped, or torn, leading to lower quality during packing. To manage this, packers may conduct a costly “field-switch” to reduce skinning in the production line. Currently, skinning levels are visually assessed by the packers and stakeholders.  Field-switches involve transitioning between multiple fields during harvest to meet specific customer orders (e.g., supermarkets, processing plants, or end users) that require higher-quality SPs. This process aims to minimize skinning and ensure the SPs meet the desired quality standards for those orders.  This study introduces a computer vision (CV) pipeline to automate skinning assessment using a ResNet50-based DeepLabV3+ semantic segmentation model. The CV system was trained to identify three classes: skinning, intact skin\textbackslash SP, and background. A machine vision camera, mounted above a conveyor belt, captured images throughout several full production days. The pipeline calculated the percentage of skinning <em>PS</em> as the ratio between the predicted skinning area and the total SP surface area (<em>As</em> + <em>Ap</em>). Using this method, daily production trends and field-switch decisions were studied with image data from six production days, chosen by our grower collaborator. Percent skinning ratings calculated from random sub-samples of imaged SPs—where only a portion of the full image set was analyzed—showed no significant differences compared to those derived from complete image sets. This demonstrates that sub-sampling can reduce computational processing times by 90% while maintaining accuracy. When data was binned in 30-minute intervals, field-switches occurred when there was approximately 1.5 PS. Across 2,417,907 SP instances, the model achieved a root mean square error (RMSE) of 0.55%, R^2 of 0.84, 80.03% recall, 99.99% specificity, 77.44% F1 score, and 99.98% grading accuracy. This offers a promising improvement for automatic skinning detection on a commercial scale.</p>

Funding

North Carolina State University: 573000

USDA: 2023-34141-40975

USDA: 2021-51181-35865

USDA: 2021-2020-08183

History

Related Materials

  1. 1.
    DOI - Is supplement to https://doi.org/10.1002/ppj2.70023

Data contact name

Bloom, Zachary

Data contact email

zabloom@ncsu.edu

Publisher

Dryad

Theme

  • Not specified

ISO Topic Category

  • biota

National Agricultural Library Thesaurus terms

models; flavor; surface area; cameras; phenotype; color; computer vision; nutrition; stakeholders

Pending citation

  • No

Public Access Level

  • Public

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC