Ag Data Commons
CLEC.tar_0.gz (158.17 MB)
TCAS.tar.gz (233.09 MB)
DMEL.tar.gz (428.86 MB)
VDES.tar.gz (382.67 MB)
ONCFAS.tar.gz (164.94 MB)
AMEL.tar.gz (166.31 MB)
HVIT.tar.gz (242.76 MB)
LLUN.tar.gz (125.38 MB)
LHES.tar.gz (122.49 MB)
EAFF.tar.gz (196.33 MB)
CFLO.tar.gz (183 MB)
CCAP.tar.gz (179.1 MB)
AROS.tar.gz (356.3 MB)
APLA.tar.gz (164.36 MB)
14 files

Functional annotation for 15 diverse arthropod genomes

dataset
posted on 2023-11-30, 10:07 authored by Surya Saha, Amanda M. Cooksey, Anna K. Childers, Monica F. Poelchau, Fiona M. McCarthy

We present functional annotation results for 15 arthropod proteomes, produced with an open-source, open-access, containerized pipeline for genome-scale functional annotation of insect proteomes, which we applied to a diverse range of arthropod species. More information about the pipeline is available at our readthedocs site. The files for each genome include GOanna, InterProScan, and KOBAS predictions.

Arthropod genomes selected for this study and their assembly and annotation statistics.

  1. Apis mellifera (honey bee)
  2. Drosophila melanogaster (fruit fly)
  3. Tribolium castaneum (red flour beetle)
  4. Latrodectus hesperus (Western black widow spider)
  5. Limnephilus lunatus (caddisfly)
  6. Oncopeltus fasciatus (Large milkweed bug)
  7. Homalodisca vitripennis (Glassy-winged sharpshooter)
  8. Eurytemora affinis (calanoid copepod)
  9. Agrilus planipennis (emerald ash borer)
  10. Copidosoma floridanum (parasitoid wasp)
  11. Athalia rosae (turnip sawfly)
  12. Ceratitis capitata (Mediterranean fruit fly)
  13. Cimex lectularius (Cimicidae bed bug)
  14. Varroa destructor (parasitic mite)
  15. Diaphorina citri (Asian citrus psyllid)


  Resources in this dataset:

  • Resource Title: Cimex lectularius (Cimicidae bed bug) annotation.

    File Name: CLEC.tar.gz

    Resource Description: Functional annotation for Clec-OGSv1.2 protein set


  • Resource Title: Tribolium castaneum (red flour beetle) annotation.

    File Name: TCAS.tar.gz

    Resource Description: Functional annotation for TCAS_OGS_v3 protein set


  • Resource Title: Drosophila melanogaster (fruit fly) annotation.

    File Name: DMEL.tar.gz

    Resource Description: Functional annotation for DMEL_r6.38 protein set


  • Resource Title: Varroa destructor (parasitic mite) annotation.

    File Name: VDES.tar.gz

    Resource Description: Functional annotation for NCBI Varroa destructor Annotation Release 100 protein set based on Vdes_3.0 genome (GCA_002443255.1)


  • Resource Title: Oncopeltus fasciatus (Large milkweed bug) annotation.

    File Name: ONCFAS.tar.gz

    Resource Description: Functional annotation for oncfas_OGSv1.2 protein set


  • Resource Title: Apis mellifera (honey bee) annotation.

    File Name: AMEL.tar.gz

    Resource Description: Functional annotation for OGSv3.3 protein set from Amel_4.5 genome (GCA_000002195.1)


  • Resource Title: Homalodisca vitripennis (Glassy-winged sharpshooter) annotation.

    File Name: HVIT.tar.gz

    Resource Description: Functional annotation for HVIT-BCM_version_0.5.3 protein set based on Hvit_1.0 genome (GCA_000696855.1)


  • Resource Title: Limnephilus lunatus (caddisfly) annotation.

    File Name: LLUN.tar.gz

    Resource Description: Functional annotation for LLUN-BCM_version_0.5.3 protein set from Llun_1.0 genome (GCA_000648945.1)


  • Resource Title: Latrodectus hesperus (Western black widow spider) annotation.

    File Name: LHES.tar.gz

    Resource Description: Functional annotation for LHES-BCM_version_0.5.3 protein set from Lhes_1.0 genome (GCA_000697925.1)


  • Resource Title: Eurytemora affinis (calanoid copepod) annotation.

    File Name: EAFF.tar.gz

    Resource Description: Functional annotation for EAFF-BCM_version_0.5.3 protein set from Eaff_1.0 genome (GCA_000591075.1)


  • Resource Title: Copidosoma floridanum (parasitoid wasp) annotation.

    File Name: CFLO.tar.gz

    Resource Description: Functional annotation for CFLO-BCM_version_0.5.3 protein set based on Cflo_1.0 genome (GCA_000648655.1)


  • Resource Title: Ceratitis capitata (Mediterranean fruit fly) annotation.

    File Name: CCAP.tar.gz

    Resource Description: Functional annotation for Ccap-OGSv1 protein set based on Ccap_1.1 assembly (GCA_000347755.2)


  • Resource Title: Athalia rosae (turnip sawfly) annotation.

    File Name: AROS.tar.gz

    Resource Description: Functional annotation for AROS-BCM_version_0.5.3 protein set based on Aros_1.0 genome (GCA_000344095.1)


  • Resource Title: Agrilus planipennis (emerald ash borer) annotation.

    File Name: APLA.tar.gz

    Resource Description: Functional annotation for APLA-BCM_version_0.5.3 protein set based on Apla_1.0 genome (GCA_000699045.1)
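
Each resource above is distributed as a gzip-compressed tar archive. As a minimal sketch, the following Python snippet lists the regular files inside one downloaded archive using only the standard library. The local path `AMEL.tar.gz` and the helper name `list_archive_members` are assumptions for illustration; this record does not describe the internal layout of the archives, so the snippet only prints whatever member names it finds.

```python
import tarfile
from pathlib import Path


def list_archive_members(archive_path):
    """Return the sorted names of regular files in a .tar.gz archive."""
    # "r:gz" opens the archive for reading with gzip decompression.
    with tarfile.open(archive_path, mode="r:gz") as archive:
        return sorted(m.name for m in archive.getmembers() if m.isfile())


if __name__ == "__main__":
    # Hypothetical local path: assumes AMEL.tar.gz was downloaded beforehand.
    path = Path("AMEL.tar.gz")
    if path.exists():
        for name in list_archive_members(path):
            print(name)
    else:
        print(f"{path} not found; download it from the dataset page first")
```

Listing members before extracting is a cheap way to see which prediction files an archive contains without unpacking several hundred megabytes.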

    Funding

    Agricultural Research Service, 0500-00093-001-00-D

    Data contact name

    Saha, Surya

    Data contact email

    ss2489@cornell.edu

    Publisher

    Ag Data Commons

    Intended use

    These functional annotation data provide broad coverage of gene ontology (GO) terms and pathways for all proteins in a set of 15 arthropod genomes, including 3 reference species. They can be used for comparative and in-depth functional analysis of a wide range of gene functions and pathways.

    Temporal Extent Start Date

    2021-07-06

    Theme

    • Not specified

    Geographic Coverage

    {"type":"FeatureCollection","features":[{"geometry":{"type":"Polygon","coordinates":[[[-125.33203125,30.654452824401],[-125.33203125,48.848450835898],[-74.35546875,48.848450835898],[-74.35546875,30.654452824401],[-125.33203125,30.654452824401]]]},"type":"Feature","properties":{}}]}
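
The geographic coverage above is a GeoJSON FeatureCollection holding a single rectangular polygon. As a rough sketch, the snippet below parses that string and reduces it to a west/south/east/north bounding box. The helper name `bounding_box` is hypothetical, and the code assumes every feature is a `Polygon` whose coordinates are (longitude, latitude) pairs, as GeoJSON specifies.

```python
import json

# Geographic coverage value copied from this record.
GEOJSON = (
    '{"type":"FeatureCollection","features":[{"geometry":{"type":"Polygon",'
    '"coordinates":[[[-125.33203125,30.654452824401],'
    '[-125.33203125,48.848450835898],[-74.35546875,48.848450835898],'
    '[-74.35546875,30.654452824401],[-125.33203125,30.654452824401]]]},'
    '"type":"Feature","properties":{}}]}'
)


def bounding_box(feature_collection):
    """Return (west, south, east, north) over all Polygon features."""
    lons, lats = [], []
    for feature in feature_collection["features"]:
        # Assumption: Polygon geometry, i.e. a list of rings of [lon, lat].
        for ring in feature["geometry"]["coordinates"]:
            for lon, lat in ring:
                lons.append(lon)
                lats.append(lat)
    return min(lons), min(lats), max(lons), max(lats)


if __name__ == "__main__":
    west, south, east, north = bounding_box(json.loads(GEOJSON))
    print(f"longitude {west} to {east}, latitude {south} to {north}")
```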

    ISO Topic Category

    • biota

    National Agricultural Library Thesaurus terms

    proteome; prediction; statistics; Apis mellifera; honey bees; Drosophila melanogaster; fruit flies; Tribolium castaneum; Latrodectus hesperus; Limnephilus; Oncopeltus fasciatus; Homalodisca vitripennis; Eurytemora affinis; Agrilus planipennis; Copidosoma floridanum; parasitic wasps; Athalia rosae; Ceratitis capitata; Cimex lectularius; Varroa destructor; parasitic mites; Diaphorina citri; gene ontology; proteins; genes

    OMB Bureau Code

    • 005:18 - Agricultural Research Service

    OMB Program Code

    • 005:040 - National Research

    Primary article PubAg Handle

    Pending citation

    • No

    Public Access Level

    • Public

    Preferred dataset citation

    Saha, Surya; Cooksey, Amanda M.; Childers, Anna K.; Poelchau, Monica F.; McCarthy, Fiona M. (2021). Functional annotation for 15 diverse arthropod genomes. Ag Data Commons. https://doi.org/10.15482/USDA.ADC/1522860
