Ag Data Commons
Browse
ARCHIVE
forest_soil_images_masks_for_testing_training.zip (42.89 MB)
ARCHIVE
6_leaf_training_testing_images_and_masks_for_paper.zip (12.92 MB)
ARCHIVE
Almond_bud_tube_D_P6_training_testing_images_and_masks.zip (11.65 MB)
1/0
3 files

X-ray CT data with semantic annotations for the paper "A workflow for segmenting soil and plant X-ray CT images with deep learning in Google’s Colaboratory"

dataset
posted on 2024-02-16, 22:25 authored by Devin RippnerDevin Rippner, Mina Momayyezi, Kenneth Shackel, Pranav Raja, Alexander Buchko, Fiona Duong, Dilworth Y. Parkinson, J. Mason Earles, Elisabeth J. Forrestel, Andrew J. McElrone

Leaves from genetically unique Juglans regia plants were scanned using X-ray micro-computed tomography (microCT) on the X-ray μCT beamline (8.3.2) at the Advanced Light Source (ALS) in Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA USA).

Soil samples were collected in Fall of 2017 from the riparian oak forest located at the Russell Ranch Sustainable Agricultural Institute at the University of California Davis. The soil was sieved through a 2 mm mesh and was air dried before imaging. A single soil aggregate was scanned at 23 keV using the 10x objective lens with a pixel resolution of 650 nanometers on beamline 8.3.2 at the ALS.

Additionally, a drought stressed almond flower bud (Prunus dulcis) from a plant housed at the University of California, Davis, was scanned using a 4x lens with a pixel resolution of 1.72 µm on beamline 8.3.2 at the ALS

Raw tomographic image data was reconstructed using TomoPy. Reconstructions were converted to 8-bit tif or png format using ImageJ or the PIL package in Python before further processing. Images were annotated using Intel’s Computer Vision Annotation Tool (CVAT) and ImageJ. Both CVAT and ImageJ are free to use and open source. Leaf images were annotated in following Théroux-Rancourt et al. (2020). Specifically, Hand labeling was done directly in ImageJ by drawing around each tissue; with 5 images annotated per leaf. Care was taken to cover a range of anatomical variation to help improve the generalizability of the models to other leaves. All slices were labeled by Dr. Mina Momayyezi and Fiona Duong.To annotate the flower bud and soil aggregate, images were imported into CVAT. The exterior border of the bud (i.e. bud scales) and flower were annotated in CVAT and exported as masks. Similarly, the exterior of the soil aggregate and particulate organic matter identified by eye were annotated in CVAT and exported as masks. To annotate air spaces in both the bud and soil aggregate, images were imported into ImageJ. A gaussian blur was applied to the image to decrease noise and then the air space was segmented using thresholding. After applying the threshold, the selected air space region was converted to a binary image with white representing the air space and black representing everything else. This binary image was overlaid upon the original image and the air space within the flower bud and aggregate was selected using the “free hand” tool. Air space outside of the region of interest for both image sets was eliminated. The quality of the air space annotation was then visually inspected for accuracy against the underlying original image; incomplete annotations were corrected using the brush or pencil tool to paint missing air space white and incorrectly identified air space black. Once the annotation was satisfactorily corrected, the binary image of the air space was saved. Finally, the annotations of the bud and flower or aggregate and organic matter were opened in ImageJ and the associated air space mask was overlaid on top of them forming a three-layer mask suitable for training the fully convolutional network. All labeling of the soil aggregate and soil aggregate images was done by Dr. Devin Rippner.

These images and annotations are for training deep learning models to identify different constituents in leaves, almond buds, and soil aggregates

Limitations: For the walnut leaves, some tissues (stomata, etc.) are not labeled and only represent a small portion of a full leaf. Similarly, both the almond bud and the aggregate represent just one single sample of each. The bud tissues are only divided up into buds scales, flower, and air space. Many other tissues remain unlabeled. For the soil aggregate annotated labels are done by eye with no actual chemical information. Therefore particulate organic matter identification may be incorrect.


Resources in this dataset:

  • Resource Title: Annotated X-ray CT images and masks of a Forest Soil Aggregate.

    File Name: forest_soil_images_masks_for_testing_training.zip

    Resource Description: This aggregate was collected from the riparian oak forest at the Russell Ranch Sustainable Agricultural Facility. The aggreagate was scanned using X-ray micro-computed tomography (microCT) on the X-ray μCT beamline (8.3.2) at the Advanced Light Source (ALS) in Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA USA) using the 10x objective lens with a pixel resolution of 650 nanometers. For masks, the background has a value of 0,0,0; pores spaces have a value of 250,250, 250; mineral solids have a value= 128,0,0; and particulate organic matter has a value of = 000,128,000. These files were used for training a model to segment the forest soil aggregate and for testing the accuracy, precision, recall, and f1 score of the model.


  • Resource Title: Annotated X-ray CT images and masks of an Almond bud (P. Dulcis).

    File Name: Almond_bud_tube_D_P6_training_testing_images_and_masks.zip

    Resource Description: Drought stressed almond flower bud (Prunis dulcis) from a plant housed at the University of California, Davis, was scanned by X-ray micro-computed tomography (microCT) on the X-ray μCT beamline (8.3.2) at the Advanced Light Source (ALS) in Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA USA) using the 4x lens with a pixel resolution of 1.72 µm using. For masks, the background has a value of 0,0,0; air spaces have a value of 255,255, 255; bud scales have a value= 128,0,0; and flower tissues have a value of = 000,128,000. These files were used for training a model to segment the almond bud and for testing the accuracy, precision, recall, and f1 score of the model.

    Resource Software Recommended: Fiji (ImageJ),url: https://imagej.net/software/fiji/downloads


  • Resource Title: Annotated X-ray CT images and masks of Walnut leaves (J. Regia) .

    File Name: 6_leaf_training_testing_images_and_masks_for_paper.zip

    Resource Description: Stems were collected from genetically unique J. regia accessions at the 117 USDA-ARS-NCGR in Wolfskill Experimental Orchard, Winters, California USA to use as scion, and were grafted by Sierra Gold Nursery onto a commonly used commercial rootstock, RX1 (J. microcarpa × J. regia). We used a common rootstock to eliminate any own-root effects and to simulate conditions for a commercial walnut orchard setting, where rootstocks are commonly used. The grafted saplings were repotted and transferred to the Armstrong lathe house facility at the University of California, Davis in June 2019, and kept under natural light and temperature. Leaves from each accession and treatment were scanned using X-ray micro-computed tomography (microCT) on the X-ray μCT beamline (8.3.2) at the Advanced Light Source (ALS) in Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA USA) using the 10x objective lens with a pixel resolution of 650 nanometers. For masks, the background has a value of 170,170,170; Epidermis value= 85,85,85; Mesophyll value= 0,0,0; Bundle Sheath Extension value= 152,152,152; Vein value= 220,220,220; Air value = 255,255,255.

    Resource Software Recommended: Fiji (ImageJ),url: https://imagej.net/software/fiji/downloads

Funding

U.S. Department of Energy: DE-AC02- 05CH11231

USDA-ARS: 2071-21000-057-00D

History

Data contact name

Rippner, Devin A.

Data contact email

devin.rippner@usda.gov

Publisher

Ag Data Commons

Intended use

For training deep learning models to identify different constituents in leaves, almond buds, and soil aggregates

Use limitations

For the leaves, this is just a small sample of 1 leaf and some tissues (stomata, etc.) are not labeled. Similarly, both the almond bud and the aggregate represent just one single sample of each. The bud tissues are only divided up into buds scales, flower, and air space. Many other tissues remain unlabeled. For the soil aggregate annotated labels are done by eye with no actual chemical information. Therefore particulate organic matter identification may be incorrect.

Temporal Extent Start Date

2017-10-22

Theme

  • Not specified

Geographic Coverage

{"type":"FeatureCollection","features":[{"geometry":{"type":"Point","coordinates":[-121.87222194698,38.538663827994]},"type":"Feature","properties":{}},{"geometry":{"type":"Point","coordinates":[-121.75111484554,38.532670036266]},"type":"Feature","properties":{}}]}

ISO Topic Category

  • environment

National Agricultural Library Thesaurus terms

air; flowers; particulate organic matter; soil sampling; autumn; forests; California; air drying; image analysis; soil aggregates; computer vision; micro-computed tomography

OMB Bureau Code

  • 005:18 - Agricultural Research Service

OMB Program Code

  • 005:040 - National Research

ARS National Program Number

  • 305

Pending citation

  • No

Public Access Level

  • Public

Preferred dataset citation

Rippner, Devin A.; Momayyezi, Mina; Shackel, Kenneth; Raja, Pranav; Buchko, Alexander; Duong, Fiona; Parkinson, Dilworth Y.; Earles, J. Mason; Forrestel, Elisabeth J.; McElrone, Andrew J. (2022). X-ray CT data with semantic annotations for the paper "A workflow for segmenting soil and plant X-ray CT images with deep learning in Google’s Colaboratory". Ag Data Commons. https://doi.org/10.15482/USDA.ADC/1524793

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC