Functional annotation for 15 diverse arthropod genomes
We present the annotation results of 15 arthropod proteomes using an open source, open access and containerized pipeline for genome-scale functional annotation of insect proteomes and apply it to a diverse range of arthropod species. You can find more information about the pipeline at our readthedocs site. The files for each genome include GOanna, InterproScan and KOBAS predictions.
Arthropod genomes selected for this study and their assembly and annotation statistics.
- Apis Mellifera (honey bee)
- Drosophila melanogaster (fruit fly)
- Tribolium castaneum (red flour beetle)
- Latrodectus hesperus (Western black widow spider)
- Limnephilus lunatus (caddisfly)
- Oncopeltus fasciatus (Large milkweed bug)
- Homalodisca vitripennis (Glassy-winged sharpshooter)
- Eurytemora affinis (calanoid copepod)
- Agrilus planipennis (emerald ash borer)
- Copidosoma floridanum (parasitoid wasp)
- Athalia rosae (turnip sawfly)
- Ceratitis capitata (Mediterranean fruit fly)
- Cimex lectularius (Cimicidae bed bug)
- Varroa destructor(parasitic mite)
-
Diaphorina citri (Asian citrus psyllid)
Resources in this dataset:Resource Title: Cimex lectularius (Cimicidae bed bug) annotation.
File Name: CLEC.tar.gz
Resource Description: Functional annotation for Clec-OGSv1.2 protein set
Resource Title: Tribolium castaneum (red flour beetle) annotation.
File Name: TCAS.tar.gz
Resource Description: Functional annotation for TCAS_OGS_v3 protein set
Resource Title: Drosophila melanogaster (fruit fly) annotation.
File Name: DMEL.tar.gz
Resource Description: Functional annotation for DMEL_r6.38 protein set
Resource Title: Varroa destructor (parasitic mite) annotation.
File Name: VDES.tar.gz
Resource Description: Functional annotation for NCBI Varroa destructor Annotation Release 100 protein set based on Vdes_3.0 genome (GCA_002443255.1)
Resource Title: Oncopeltus fasciatus (Large milkweed bug) annotation.
File Name: ONCFAS.tar.gz
Resource Description: Functional annotation for oncfas_OGSv1.2 protein set
Resource Title: Apis Mellifera (honey bee) annotation.
File Name: AMEL.tar.gz
Resource Description: Functional annotation for OGSv3.3 protein set from Amel_4.5 genome (GCA_000002195.1)
Resource Title: Homalodisca vitripennis (Glassy-winged sharpshooter) annotation.
File Name: HVIT.tar.gz
Resource Description: Functional annotation for HVIT-BCM_version_0.5.3 protein set based on Hvit_1.0 genome (GCA_000696855.1)
Resource Title: Limnephilus lunatus (caddisfly) annotation.
File Name: LLUN.tar.gz
Resource Description: Functional annotation for LLUN-BCM_version_0.5.3 protein set from Llun_1.0 genome (GCA_000648945.1)
Resource Title: Latrodectus hesperus (Western black widow spider) annotation.
File Name: LHES.tar.gz
Resource Description: Functional annotation for LHES-BCM_version_0.5.3 protein set from Lhes_1.0 genome (GCA_000697925.1)
Resource Title: Eurytemora affinis (calanoid copepod) annotation.
File Name: EAFF.tar.gz
Resource Description: Functional annotation for EAFF-BCM_version_0.5.3 protein set from Eaff_1.0 genome (GCA_000591075.1)
Resource Title: Copidosoma floridanum (parasitoid wasp) annotation.
File Name: CFLO.tar.gz
Resource Description: Functional annotation for CFLO-BCM_version_0.5.3 protein set based on Cflo_1.0 genome (GCA_000648655.1)
Resource Title: Ceratitis capitata (Mediterranean fruit fly) annotation.
File Name: CCAP.tar.gz
Resource Description: Functional annotation for Ccap-OGSv1 protein set based on Ccap_1.1 assembly (GCA_000347755.2)
Resource Title: Athalia rosae (turnip sawfly) annotation.
File Name: AROS.tar.gz
Resource Description: Functional annotation for AROS-BCM_version_0.5.3 protein set based on Aros_1.0 genome (GCA_000344095.1)
Resource Title: Agrilus planipennis (emerald ash borer) annotation.
File Name: APLA.tar.gz
Resource Description: Functional annotation for APLA-BCM_version_0.5.3 protein set based on Apla_1.0 genome (GCA_000699045.1)
Funding
Agricultural Research Service, 0500-00093-001-00-D
History
Data contact name
Saha, SuryaData contact email
ss2489@cornell.eduPublisher
Ag Data CommonsIntended use
This functional annotation data provides a broad coverage of gene ontology (GO) terms and pathways for all proteins in a set of 15 arthropod genomes including 3 reference species. It can be used for comparative and deeper functional analysis of a wide range of gene functions and pathways.Temporal Extent Start Date
2021-07-06Theme
- Not specified
Geographic Coverage
{"type":"FeatureCollection","features":[{"geometry":{"type":"Polygon","coordinates":[[[-125.33203125,30.654452824401],[-125.33203125,48.848450835898],[-74.35546875,48.848450835898],[-74.35546875,30.654452824401],[-125.33203125,30.654452824401]]]},"type":"Feature","properties":{}}]}ISO Topic Category
- biota
National Agricultural Library Thesaurus terms
proteome; prediction; statistics; Apis mellifera; honey bees; Drosophila melanogaster; fruit flies; Tribolium castaneum; Latrodectus hesperus; Limnephilus; Oncopeltus fasciatus; Homalodisca vitripennis; Eurytemora affinis; Agrilus planipennis; Copidosoma floridanum; parasitic wasps; Athalia rosae; Ceratitis capitata; Cimex lectularius; Varroa destructor; parasitic mites; Diaphorina citri; gene ontology; proteins; genesOMB Bureau Code
- 005:18 - Agricultural Research Service
OMB Program Code
- 005:040 - National Research
Primary article PubAg Handle
Pending citation
- No
Public Access Level
- Public