Ag Data Commons

sorry, we can't preview this file

H-halys_CYPs_7-12-18.faa_.gz (31.16 kB)

Halyomorpha halys Cytochrome P450 protein sequences

Download (31.16 kB)
posted on 2024-02-13, 13:42 authored by David R. Nelson

These data correspond to “Manually annotated cytochrome P450 genes observed in the genome of Halyomorpha halys, the brown marmorated stink bug.”

Cytochrome P450s from Halyomorpha halys were mined by batch blast of NCBI’s nr section with 52 P450 sequences representative of insects. Results from each search were combined and filtered to remove duplicate hits. The results were 212 gene models predicted by Gnomon from the genome. Some of these were fusions of adjacent genes that had to be split. After further refinement to split fusions and remove variants of the same gene 142 P450s remained. To look for any additional P450s, 126 of the 141 sequences were used to blast search the WGS section for genomic contigs. 38,000 hits distilled to just 65 contigs, indicating P450 gene clustering. The 65 contigs were BLASTX searched against named a database of insect P450s to find all exons for P450s in the genome and determine the start and stop coordinates.

Resources in this dataset:

  • Resource Title: Halyomorpha halys Cytochrome P450 protein sequences in fasta format.

    File Name: H-halys_CYPs7-12-18.faa.gz

    Resource Description: Halyomorpha halys Cytochrome P450 protein sequences in fasta format.


Data contact name

Nelson, David

Data contact email


Ag Data Commons


  • Not specified

ISO Topic Category

  • biota

Ag Data Commons Group

  • Insects - i5K

National Agricultural Library Thesaurus terms

cytochrome P-450; genes; Halyomorpha halys; amino acid sequences; insects; genomics; exons

Pending citation

  • No

Public Access Level

  • Public

Preferred dataset citation

Nelson, David R. (2019). Halyomorpha halys Cytochrome P450 protein sequences. Ag Data Commons.

Usage metrics


    Ref. manager