Ag Data Commons
Browse

Cimex lectularius Genome Annotations v0.5.3

Download (211.82 MB)
dataset
posted on 2023-12-18, 17:11 authored by Daniel S.T. Hughes, Hsu Chao, Joshua B. Benoit, Jiaxin Qu, Kim C. Worley, Shwetha C. Murali, Stephen Richards

The Baylor College of Medicine recently sequenced and annotated the Cimex lectularius genome as part of the i5k pilot project. The C. lectularius research community has manually reviewed and curated the computational gene predictions and generated an official gene set, OGSv1.2.

The bed bug, Cimex lectularius, is a model organism for understanding dermonecrotic envenomation (sphingomylinase D). A key focus of research to understand neurotic fractions in venom. The bed bug also provides appropriate phylogenetic sampling across diversity of order Araneae.

This dataset presents the Cimex lectularius gene set BCM_v_0.5.3, which was generated computationally. RNA-Seq data was used with additional protein homology data for a MAKER automated annotation of the Cimex lectularius genome assembly 1.0. Further annotation method details will be available in a forthcoming publication.

NOTE: This gene set is an unstable pre-release (v0.5.3), and was provided to facilitate manual curation and analyses before the official gene set is released. Gene identifiers from this gene set will likely not be maintained.

If you wish to use this dataset, please follow the Baylor College of Medicine's conditions for data use: https://www.hgsc.bcm.edu/bcm-hgsc-conditions-use


Resources in this dataset:

  • Resource Title: Cimex lectularius genome annotations v0.5.3 for genome assembly Cimex lectularius v1.0.

    File Name: Cimex_lectularius_BCM_version_0.5.3_annotations.tar_.gz

    Resource Description:

    The attached tar.gz archive (Cimex_lectularius_BCM_version_0.5.3_annotations.tar.gz) contains the following folders:

    BCM_version_0.5.3-Primary_Gene_Set. This folder contains evidence files in gff3 format underlying the final gene predictions.

    BCM_version_0.5.3-Primary_Gene_Set/primary_gene_set. This folder contains the following files:

    CLEC.CDS.fna.gz. CDS sequences of Cimex lectularius genome annotations v0.5.3.

    CLEC.faa.gz. Amino acid sequences of Cimex lectularius genome annotations v0.5.3.

    CLEC.fna.gz. cDNA sequences of Cimex lectularius genome annotations v0.5.3.

    CLEC.Models.gff3.gz. Gff3 of all gene predictions of Cimex lectularius genome annotations v0.5.3.

    CLEC.Models-NALmod.gff3.gz. Gff3 of all gene predictions of Cimex lectularius genome annotations v0.5.3, modified by the National Agricultural Library to be compliant with gff3 specifications.

Funding

National Human Genome Research Institute

History

Data contact name

Benoit, Joshua B.

Data contact email

benoitja@UCMAIL.UC.EDU

Publisher

Ag Data Commons

Theme

  • Not specified

ISO Topic Category

  • health

Ag Data Commons Group

  • Insects - i5K

National Agricultural Library Thesaurus terms

genomics

Pending citation

  • No

Public Access Level

  • Public

Preferred dataset citation

Hughes, Daniel S.T.; Chao, Hsu ; Benoit, Joshua B.; Qu, Jiaxin; Worley, Kim C.; Murali, Shwetha C. ; Richards, Stephen (2015). Cimex lectularius Genome Annotations v0.5.3. . https://doi.org/10.15482/USDA.ADC/1196731

Usage metrics

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC