Ag Data Commons

Cimex_lectularius_BCM_version_0.5.3_annotations.tar_.gz (211.82 MB)

Cimex lectularius Genome Annotations v0.5.3

posted on 2023-12-18, 17:11 authored by Daniel S.T. Hughes, Hsu Chao, Joshua B. Benoit, Jiaxin Qu, Kim C. Worley, Shwetha C. Murali, Stephen Richards

The Baylor College of Medicine recently sequenced and annotated the Cimex lectularius genome as part of the i5k pilot project. The C. lectularius research community has manually reviewed and curated the computational gene predictions and generated an official gene set, OGSv1.2.

The bed bug, Cimex lectularius, is an obligate blood-feeding ectoparasite of humans. As a member of the order Hemiptera, it also provides phylogenetic sampling across the diversity of that order.

This dataset presents the Cimex lectularius gene set BCM_v_0.5.3, which was generated computationally. The MAKER automated annotation pipeline was run on the Cimex lectularius genome assembly 1.0, using RNA-Seq data together with additional protein homology data as evidence. Further annotation method details will be available in a forthcoming publication.

NOTE: This gene set is an unstable pre-release (v0.5.3), and was provided to facilitate manual curation and analyses before the official gene set is released. Gene identifiers from this gene set will likely not be maintained.

If you wish to use this dataset, please follow the Baylor College of Medicine's conditions for data use:

Resources in this dataset:

  • Resource Title: Cimex lectularius genome annotations v0.5.3 for genome assembly Cimex lectularius v1.0.

    File Name: Cimex_lectularius_BCM_version_0.5.3_annotations.tar_.gz

    Resource Description:

    The attached tar.gz archive (Cimex_lectularius_BCM_version_0.5.3_annotations.tar.gz) contains the following folders:

    BCM_version_0.5.3-Primary_Gene_Set. This folder contains the evidence files, in GFF3 format, that underlie the final gene predictions.

    BCM_version_0.5.3-Primary_Gene_Set/primary_gene_set. This folder contains the following files:

    CLEC.CDS.fna.gz. CDS sequences of Cimex lectularius genome annotations v0.5.3.

    CLEC.faa.gz. Amino acid sequences of Cimex lectularius genome annotations v0.5.3.

    CLEC.fna.gz. cDNA sequences of Cimex lectularius genome annotations v0.5.3.

    CLEC.Models.gff3.gz. GFF3 file of all gene predictions of Cimex lectularius genome annotations v0.5.3.

    CLEC.Models-NALmod.gff3.gz. GFF3 file of all gene predictions of Cimex lectularius genome annotations v0.5.3, modified by the National Agricultural Library to comply with the GFF3 specification.
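
As a quick sanity check after downloading, the gzipped GFF3 files listed above can be summarized by feature type. The following is a minimal sketch in Python using only the standard library. It is not part of the dataset or of the Baylor College of Medicine pipeline. The function names and the sample records (Scaffold1, gene1, and so on) are hypothetical, invented for illustration, and the exact path of the files after extraction is an assumption based on the folder names given in the resource description.

```python
import gzip
import io
from collections import Counter


def count_feature_types(gff3_lines):
    """Count the values of the third GFF3 column (feature type)."""
    counts = Counter()
    for line in gff3_lines:
        line = line.rstrip("\n")
        # An optional ##FASTA directive marks the end of the feature table.
        if line.startswith("##FASTA"):
            break
        # Skip blank lines, directives and comments.
        if not line or line.startswith("#"):
            continue
        columns = line.split("\t")
        if len(columns) != 9:
            continue  # not a well-formed GFF3 feature line
        counts[columns[2]] += 1
    return counts


def count_features_in_gz(path):
    """Open a gzipped GFF3 file as text and count its feature types."""
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        return count_feature_types(handle)


# Invented sample records, used only to show the expected input shape.
sample = "\n".join([
    "##gff-version 3",
    "Scaffold1\tmaker\tgene\t100\t900\t.\t+\t.\tID=gene1",
    "Scaffold1\tmaker\tmRNA\t100\t900\t.\t+\t.\tID=mrna1;Parent=gene1",
    "Scaffold1\tmaker\texon\t100\t400\t.\t+\t.\tID=exon1;Parent=mrna1",
    "Scaffold1\tmaker\texon\t600\t900\t.\t+\t.\tID=exon2;Parent=mrna1",
])
sample_counts = count_feature_types(io.StringIO(sample))
print(dict(sample_counts))
```

Assuming the archive extracts to the folder layout described above, a call such as count_features_in_gz("BCM_version_0.5.3-Primary_Gene_Set/primary_gene_set/CLEC.Models-NALmod.gff3.gz") would summarize the NAL-modified gene models. Because gene identifiers in this pre-release will likely not be maintained, counts by feature type are a safer cross-check between releases than comparisons by ID.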


National Human Genome Research Institute


Data contact name

Benoit, Joshua B.

ISO Topic Category

  • health

Ag Data Commons Group

  • Insects - i5K

Pending citation

  • No

Public Access Level

  • Public

Preferred dataset citation

Hughes, Daniel S.T.; Chao, Hsu; Benoit, Joshua B.; Qu, Jiaxin; Worley, Kim C.; Murali, Shwetha C.; Richards, Stephen (2015). Cimex lectularius Genome Annotations v0.5.3.
