Monarch geneset OGS2.0

DPOGS214903
Transcript         DPOGS214903-TA   1074 bp
Protein            DPOGS214903-PA   357 aa
Genomic position   DPSCF300135 - 120731-121888
RNAseq coverage    34x (Rank: top 74%)
Annotation
Heliconius         HMEL004530         4e-113   52.84%
Bombyx             BGIBMGA003293-TA   4e-88    42.49%
Drosophila         Cyp6g1-PA          6e-40    31.31%
EBI UniRef50       UniRef50_B2BNZ4    2e-88    44.32%   Cytochrome p450 CYP337B1 n=3 Tax=Obtectomera RepID=B2BNZ4_HELAM
NCBI RefSeq        NP_001104827.1     3e-85    44.66%   cytochrome P450, family 337, subfamily a, polypeptide 1 [Bombyx mori]
NCBI nr blastp     gi|156619508       6e-88    44.32%   cytochrome p450 CYP337B1 [Helicoverpa armigera]
NCBI nr blastx     gi|156619508       1e-90    44.32%   cytochrome p450 CYP337B1 [Helicoverpa armigera]
Group
Gene Ontology      GO:0009055   9.1e-78   electron carrier activity
                   GO:0020037   9.1e-78   heme binding
                   GO:0016705   9.1e-78   oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
                   GO:0005506   9.1e-78   iron ion binding
                   GO:0055114   9.1e-78   oxidation-reduction process
KEGG pathway       nvi:100114023   2e-46
                   K07424 (CYP3A) maps-> Drug metabolism - cytochrome P450
                                         Drug metabolism - other enzymes
                                         Linoleic acid metabolism
                                         Steroid hormone biosynthesis
                                         Metabolism of xenobiotics by cytochrome P450
                                         gamma-Hexachlorocyclohexane degradation
                                         Retinol metabolism
InterPro domain    [3-350]     IPR001128   9.1e-78   Cytochrome P450
                   [149-166]   IPR002401   1.2e-20   Cytochrome P450, E-class, group I
Orthology group    MCL18573 Lepidoptera specific

Nucleotide sequence:

>DPOGS214903-TA
ATGTACAATATCTTAGAACGATCTTCATCAGATTTCGTCAAATATATTGAAAAGAATCCTCACATGAAAGATAACCCTTATAAAGCATTACATAAATTCACGTCGGCCTCGATCAGTGCATCAGTGTTTGGAATCAATCCGAATACCAAAAACCTCATAGATTCGCCATTGGTAGATATTATTTGGAATGTTGCAGATTCGTTTGCGTCATTCAATTTTAAATTAGCACTTGCAAATATATTTCCTAAGCTACACAACTTTTTAAATCTTAAAGTCTTCGGTGCCCAAAAAGACGTTGTAGTTGATGCGATCAAAAATATTTTGAAGTATCGGAGAAATACGAAGGAAAGATGCCATGATTTTATCGATGCATGCATGGAGATGGAAAACGAAGGCGTTATAAAAGACAACGTTACCCAATACAAGTTAAAAGTTACTCCGGAATTTTTAGGAGCTCAAGCATACTCTCTCTTTTTTGCCGGAGTTGACACAGTTGCAAACTCAATGCACTTTACATTATTAGAGTTGTCAAATAACTCTGAAATATTAAAAAAGGTCCATGACGAAATTGACAATGTATTCGATAATTGTGAAGGAAGCATTTCATTGAAAGATATTATGAATCTGAAATATTTGGATATGGTTATTAGCGAATCTTTAAGAAAATATCCTCCAATTGGATTAATGCAACGAATATGTGCTAATGAAACTTTTTTATCCAGTAATGTTAAAGTAGATAAAGGTTGCGTAGTAATTGTTCCTATTTATGGAATACATAGAGATCCAAGACATTTCCCTAATCCAGACAAGTTCGATCCCGAAAGATTCTCACCCCAAAATCGTATGAATATCTCAAAATTTTGTTATATTCCATTTGGTGAAGGAAATCGGATGTGTCTAGGAGCAAGATTCGCAATGATTCAAATGAAAAGTGGACTTGCATGGCTTCTTAAACATTATACTTTAAGGGGATATAACTACATGCCAAATTGTTTCGAGCCAAGTCTCTTTGTTATTCGAGATCCAAAAGCACGATACGATTTAATTGTTAGAAACGAAACTGTAATTAGTTAA

Protein sequence:

>DPOGS214903-PA
MYNILERSSSDFVKYIEKNPHMKDNPYKALHKFTSASISASVFGINPNTKNLIDSPLVDIIWNVADSFASFNFKLALANIFPKLHNFLNLKVFGAQKDVVVDAIKNILKYRRNTKERCHDFIDACMEMENEGVIKDNVTQYKLKVTPEFLGAQAYSLFFAGVDTVANSMHFTLLELSNNSEILKKVHDEIDNVFDNCEGSISLKDIMNLKYLDMVISESLRKYPPIGLMQRICANETFLSSNVKVDKGCVVIVPIYGIHRDPRHFPNPDKFDPERFSPQNRMNISKFCYIPFGEGNRMCLGARFAMIQMKSGLAWLLKHYTLRGYNYMPNCFEPSLFVIRDPKARYDLIVRNETVIS-