Halyomorpha halys Official Gene Set v1.2
This dataset presents the Halyomorpha halys Official Gene Set (OGS) v1.2. OGSv1.2 is an update of Halyomorpha halys OGSv1.1 (https://doi.org/10.15482/USDA.ADC/1504240) to the coordinates of genome assembly GCA_000696795.3 (https://www.ncbi.nlm.nih.gov/assembly/GCA_000696795.3) using https://github.com/NAL-i5K/coordinates_conversion/.
The original OGSv1.0 is an integration of automatic gene predictions from NCBI's eukaryotic annotation pipeline, NCBI Halyomorpha halys Annotation Release 100 (https://www.ncbi.nlm.nih.gov/genome/annotation_euk/Halyomorpha_halys/100/; ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/696/795/GCF_000696795.1_Hhal_1.0), with manual annotations by the research community (performed via the Apollo manual curation software, http://genomearchitect.org/). Manual annotations performed by the community were downloaded from Apollo, QC'd, and merged with NCBI Halyomorpha halys Annotation Release 100 using the GFF3toolkit software (https://github.com/NAL-i5K/GFF3toolkit/releases/tag/v1.4.4). The resulting merged dataset was formatted for ingest into the i5k Workspace and GenBank databases, resulting in Halyomorpha halys Official Gene Set (OGS) v1.0.
Halyomorpha halys Official Gene Set halhal_OGSv1.1 is a minor update of halhal_OGSv1.0: Alias attributes were added to all manually annotated cathepsin models; six models from contaminated scaffolds were removed; and notes were added to three models located on possibly contaminated scaffolds.
Resources in this dataset:
Resource Title: Halyomorpha halys Official Gene Set OGSv1.2.
File Name: halhal_OGSv1.2.tar.gz
Resource Description: The attached tar.gz archive (halhal_OGSv1.2.tar.gz) contains the following files:
- halhal_OGSv1.2.gff. GFF3 of all gene predictions of Halyomorpha halys genome annotations OGSv1.2
- halhal_OGSv1.2_CDS.fa. CDS sequences of Halyomorpha halys genome annotations OGSv1.2
- halhal_OGSv1.2_pep.fa. Amino acid sequences of Halyomorpha halys genome annotations OGSv1.2
- halhal_OGSv1.2_trans.fa. Transcript sequences of Halyomorpha halys genome annotations OGSv1.2
- readme. Readme file describing Halyomorpha halys genome annotations OGSv1.2
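As a quick sanity check after downloading the archive, the GFF3 file can be read directly from the tar.gz and its feature types tallied. The following is a minimal sketch in Python using only the standard library. The function names are illustrative and not part of the dataset, and the sketch assumes the GFF3 member's path inside the archive ends with halhal_OGSv1.2.gff (the archive's internal directory layout is not documented here) and that the file is UTF-8 text.

```python
import io
import tarfile
from collections import Counter


def count_gff3_feature_types(gff_lines):
    """Count the values of GFF3 column 3 (feature type) in an iterable of lines."""
    counts = Counter()
    for line in gff_lines:
        line = line.rstrip("\r\n")
        if line.startswith("##FASTA"):
            # Per the GFF3 format, anything after this directive is sequence data.
            break
        if not line or line.startswith("#"):
            continue  # skip blank lines, comments, and other directives
        columns = line.split("\t")
        if len(columns) != 9:
            continue  # not a well-formed GFF3 feature line
        counts[columns[2]] += 1
    return counts


def count_features_in_archive(archive_path, member_suffix="halhal_OGSv1.2.gff"):
    """Find the GFF3 member in a tar.gz archive and count its feature types.

    member_suffix is an assumption about how the file is named inside the
    archive; adjust it if the actual member path differs.
    """
    with tarfile.open(archive_path, "r:gz") as archive:
        for member in archive.getmembers():
            if member.isfile() and member.name.endswith(member_suffix):
                handle = archive.extractfile(member)
                text = io.TextIOWrapper(handle, encoding="utf-8")
                return count_gff3_feature_types(text)
    raise FileNotFoundError(f"no archive member ending with {member_suffix!r}")


if __name__ == "__main__":
    # Assumes the archive has been downloaded to the working directory.
    for feature_type, n in sorted(count_features_in_archive("halhal_OGSv1.2.tar.gz").items()):
        print(f"{feature_type}\t{n}")
```

Matching on the end of the member name rather than the full path keeps the sketch working whether the files sit at the top level of the archive or inside a subdirectory.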
Funding
National Human Genome Research Institute: U54 HG003273
Data contact name
Sparks, Michael
Data contact email
Michael.Sparks2@USDA.GOV
Publisher
Ag Data Commons
Temporal Extent Start Date
2019-01-01
Theme
- Not specified
Geographic Coverage
{"type":"FeatureCollection","features":[{"geometry":{"type":"Polygon","coordinates":[[[-172.96875,-85.973919490277],[-172.96875,85.513398309887],[194.0625,85.513398309887],[194.0625,-85.973919490277],[-172.96875,-85.973919490277]]]},"type":"Feature","properties":{}}]}
ISO Topic Category
- biota
Ag Data Commons Group
- Insects - i5K
National Agricultural Library Thesaurus terms
genomics; Halyomorpha halys; genes; prediction; genome assembly; sequence analysis; genome
OMB Bureau Code
- 005:18 - Agricultural Research Service
OMB Program Code
- 005:040 - National Research
Pending citation
- No
Public Access Level
- Public