Heliothis virescens Official Gene Set OGS 1.1
Genome annotation ascribes function to specific areas of the genome sequence. Often, computationally generated gene annotations serve as foundational datasets to facilitate research on a species, as complete gene and protein sequences are necessary for experiments on gene and gene family function and evolution.
Here, we provide genome annotations for Heliothis virescens genome assembly GCA_002382865.2. The i5k Workspace@NAL mapped previous genome annotations (see below) to the updated genome assembly GCA_002382865.2 using the program Liftoff v1.6.3 with default parameters, apart from enabling the 'polish' option. Of the 15,084 original gene models, 15,066 were preserved. The name of this new dataset is Heliothis virescens Official Gene Set OGS 1.1. This dataset, and tools to interact with it, are also available at the i5k Workspace@NAL. These updated annotations should facilitate continued research on Heliothis virescens in the context of the newer, improved genome assembly.
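As a rough illustration of how the gene model counts above could be checked, the following Python sketch counts `gene` features in GFF3 text and computes the share of gene models retained. It is not the workflow used by the i5k Workspace@NAL. The function names and the three-record example are invented for illustration, and the sketch assumes gene models are recorded with the feature type `gene` in column 3.

```python
import io


def count_features(gff3_lines, feature_type="gene"):
    """Count GFF3 records whose type column (column 3) equals feature_type."""
    count = 0
    for line in gff3_lines:
        if line.startswith("##FASTA"):
            break  # an embedded FASTA section follows; no more feature records
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines, directives, and comments
        columns = line.rstrip("\n").split("\t")
        if len(columns) == 9 and columns[2] == feature_type:
            count += 1
    return count


def retained_percent(mapped, original):
    """Percentage of the original gene models present after mapping."""
    return 100.0 * mapped / original


# Hypothetical three-record example, not taken from the real OGS 1.1 file.
EXAMPLE_GFF3 = (
    "##gff-version 3\n"
    "scaffold_1\tLiftoff\tgene\t100\t900\t.\t+\t.\tID=gene1\n"
    "scaffold_1\tLiftoff\tmRNA\t100\t900\t.\t+\t.\tID=gene1-RA;Parent=gene1\n"
    "scaffold_2\tLiftoff\tgene\t50\t400\t.\t-\t.\tID=gene2\n"
)

print(count_features(io.StringIO(EXAMPLE_GFF3)))  # 2 gene records in the example
print(f"{retained_percent(15066, 15084):.2f}% of gene models preserved")  # 99.88%
```

To apply this to the downloaded annotation file, pass an open file handle in place of the `io.StringIO` object, for example `count_features(open(path))`, where `path` is the local location of the GFF3 file.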
Originally, the authors generated annotations for the Heliothis virescens genome assembly GCA_002382865.1 using the Just Annotate My Genome (JAMg) pipeline to investigate this pest insect’s lack of physiological adaptations to Bt toxin in the field. The de novo gene predictors GeneMark.HMM-ET and Augustus were used as part of the JAMg pipeline. For a complete description of the methods, see https://doi.org/10.1111/mec.14430. See https://www.ncbi.nlm.nih.gov/datasets/gene/GCA_002382865.1/ for feature-level retrieval of this original dataset, or https://i5k.nal.usda.gov/data/Arthropoda/helvir-(Heliothis_virescens)/GCA_002382865.1/2.Official or Primary Gene Set/H_virescens_OGS1/ for file-level retrieval.
Funding
USDA-NIFA: 2012-33522-19793
USDA-NIFA: 2016-33522-25640
Data contact name
Fritz, Megan
Data contact email
mfritz13@umd.edu
Publisher
Ag Data Commons
Intended use
Annotation coordinates in the provided GFF3 file are relative to genome assembly GCA_002382865.2 (https://www.ncbi.nlm.nih.gov/datasets/genome/GCA_002382865.2/).
Temporal Extent Start Date
2024-11-06
Theme
- Non-geospatial
ISO Topic Category
- biota
Ag Data Commons Group
- Insects - i5K
National Agricultural Library Thesaurus terms
Heliothis virescens; genes; nucleotide sequences; data collection; amino acid sequences; evolution; genome assembly; insect pests; Bacillus thuringiensis; bioinformatics; models
Pending citation
- No
Public Access Level
- Public