Ag Data Commons
Browse

Heliothis virescens Official Gene Set OGS 1.1

Download (20.99 MB)
dataset
posted on 2024-12-05, 22:14 authored by Megan Fritz, Alexie Papanicolaou, Amanda M. Cooksey, Alexandra DeYonke, Stephen Micinski, John Westbrook, Fred Gould

Genome annotation ascribes function to specific areas of the genome sequence. Often, computationally generated gene annotations serve as foundational datasets to facilitate research on a species, as complete gene and protein sequences are necessary for experiments on gene and gene family function and evolution.

Here, we provide genome annotations for Heliothis virescens genome assembly GCA_002382865.2. The i5k Workspace@NAL mapped previous genome annotations (see below) to an updated genome assembly GCA_002382865.2. We used the program LiftOff v1.6.3 with default parameters except the 'polish' option. 15,066 out of 15,084 gene models were preserved. The name of this new dataset is Heliothis virescens Official Gene Set OGS 1.1. This dataset, and tools to interact with it, are also available at the i5k Workspace@NAL. These updated annotations should facilitate continued research on Heliothis virescens in the context of the improved, newer genome assembly.

Originally, the authors generated annotations for the Heliothis virescens genome assembly GCA_002382865.1 using the Just Annotate My Genome pipeline to investigate this pest insect’s lack of physiological adaptations to Bt toxin in the field. De novo gene predictors GeneMark.HMM-ET and Augustus were chosen as part of the JAMg pipeline. For a complete description of the methods, see https://doi.org/10.1111/mec.14430. See https://www.ncbi.nlm.nih.gov/datasets/gene/GCA_002382865.1/ for feature-level retrieval of this original dataset, or https://i5k.nal.usda.gov/data/Arthropoda/helvir-(Heliothis_virescens)/GCA_002382865.1/2.Official or Primary Gene Set/H_virescens_OGS1/ for file-level retrieval.


Funding

USDA-NIFA: 2012-33522-19793

USDA-NIFA: 2016-33522-25640

History

Data contact name

Megan, Fritz

Data contact email

mfritz13@umd.edu

Publisher

Ag Data Commons

Intended use

Annotation coordinates in the provided GFF3 file are relative to genome assembly GCA_002382865.2 (https://www.ncbi.nlm.nih.gov/datasets/genome/GCA_002382865.2/).

Temporal Extent Start Date

2024-11-06

Theme

  • Non-geospatial

ISO Topic Category

  • biota

Ag Data Commons Group

  • Insects - i5K

National Agricultural Library Thesaurus terms

Heliothis virescens; genes; nucleotide sequences; data collection; amino acid sequences; evolution; genome assembly; insect pests; Bacillus thuringiensis; bioinformatics; models

Pending citation

  • No

Public Access Level

  • Public

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC