Monarch geneset OGS2.0

DPOGS206198
Transcript: DPOGS206198-TA (1212 bp)
Protein: DPOGS206198-PA (403 aa)
Genomic position: DPSCF300345 + 241094-249280
RNAseq coverage: 1x (Rank: top 93%)
Annotation
Heliconius: HMEL022586 (1e-70, 54.82%)
Bombyx: BGIBMGA000056-TA (3e-78, 41.94%)
Drosophila: Cyp4d20-PA (2e-33, 29.68%)
EBI UniRef50: UniRef50_UPI000224687E (4e-43, 30.75%) UPI000224687E related cluster n=1 Tax=unknown RepID=UPI000224687E
NCBI RefSeq: XP_972577.1 (8e-42, 27.18%) PREDICTED: similar to pheromone-degrading enzyme [Tribolium castaneum]
NCBI nr blastp: gi|345485110 (1e-42, 30.75%) PREDICTED: cytochrome P450 4C1 [Nasonia vitripennis]
NCBI nr blastx: gi|307190716 (1e-41, 32.05%) Cytochrome P450 4C1 [Camponotus floridanus]
Group
Gene Ontology:
  GO:0009055 (5.2e-71) electron carrier activity
  GO:0020037 (5.2e-71) heme binding
  GO:0016705 (5.2e-71) oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
  GO:0005506 (5.2e-71) iron ion binding
  GO:0055114 (5.2e-71) oxidation-reduction process
KEGG pathway 
InterPro domain:
  [24-398] IPR001128 (5.2e-71) Cytochrome P450
  [59-78] IPR002401 (1.2e-23) Cytochrome P450, E-class, group I
Orthology group: MCL21026, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206198-TA
ATGTTTCTGTTAATATTGCTGTGGACGTTGGTATTTCTATTATGGATATGGTCAAGAAGAGATAAAAATAAAGATGAGCCACCAACATTGCTTAATTTAGACTGGAGAAAACTATACCAATTTACAGAAGATACTGAAGGTTTCTGGAGTCTCATAAAAGAAATAAGCAGAGAGTGTCAAAAACAAGATGGAGTTATAAAAGTGACTATAGGTCCTAAAACCTTATATGTTCTAACAGACCCTGAAGATAGTTTGACTGTGTCAAATGCGTGCTTACAAAAGGATAGCGTTTATGACTTCGCAAAAAACTGGATTGGAAATGGCCTCATCACTGCGTCTTTACCTATTTGGAAAATACATCGAAAAGTATTAGATCCACTCTTCAGTGCTCGTCTATTGAATAACTTTATGGAGGTATTCAACAATCTTTCGCGTGTCCTCATCAAAAATCTAGAAGTAGAAGTTGAGAAAGGACCCTTTGATCCCTATGTTTATTCGAGACGGCACACTTTGGAAATAATATGTAGTTTCGAATTTGAACCATTTGCAGAAACTATTTTGAAAATTAGTGAGAATAATACGCAATTCACTGATAAGGATATAAGACAACATGTAGACACATTTATTGCAGCTGGTGAAGACACTTCCGCCGGGGTTATTATGTTATGTTTGATAACCGTGGGTTCTTATCCACAAGTGCAAGAAGAAATACACAAAGAGTTAAAACAGATTTTCGGTGATGAAGACAGAGATGTGACGAAAGAAGACCTTTCAAAACTAGTTTATTTAGAAGCAGTGATAAAAGAGATAATGCGATTATACCCAATAGTACCTATCGTAGCACGAGATCTAGATAAAGACGTCAAATTAAGTAACTGCACTTTATCAAAAGGTTGCACTGCAGTTCTATCGATCTATGGAATACATCGACATCCAATGTGGGGTCCAGACGTTGATGAGTTTAGACCGGAACGATGGCTTGACCTTCCATCAAATTATCAAAAGTATTTTGCTGCTTTTAGTTTGGGTCGCAGAATTTGTATAGGAAAAACCATGGCAATCGCATCGCTGAAAGTTACTATGGCTCACATATTTCGAAACTACATAATTCATGGCGAACACACAAATATGAAATTAAAGTTTGAACTTACTCTGAAAGCTGTTTCTGGACATCATATTTCCATTGGGAGAAGAATCAAAAATAAACCATAA

Protein sequence:

>DPOGS206198-PA
MFLLILLWTLVFLLWIWSRRDKNKDEPPTLLNLDWRKLYQFTEDTEGFWSLIKEISRECQKQDGVIKVTIGPKTLYVLTDPEDSLTVSNACLQKDSVYDFAKNWIGNGLITASLPIWKIHRKVLDPLFSARLLNNFMEVFNNLSRVLIKNLEVEVEKGPFDPYVYSRRHTLEIICSFEFEPFAETILKISENNTQFTDKDIRQHVDTFIAAGEDTSAGVIMLCLITVGSYPQVQEEIHKELKQIFGDEDRDVTKEDLSKLVYLEAVIKEIMRLYPIVPIVARDLDKDVKLSNCTLSKGCTAVLSIYGIHRHPMWGPDVDEFRPERWLDLPSNYQKYFAAFSLGRRICIGKTMAIASLKVTMAHIFRNYIIHGEHTNMKLKFELTLKAVSGHHISIGRRIKNKP-
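
The transcript and protein records above can be cross-checked against each other: 1212 bp corresponds to 403 codons plus one stop codon. The sketch below is a minimal illustration, not part of the database record. It assumes the standard genetic code (NCBI translation table 1) and that the trailing "-" in the protein record marks the stop codon. The names `translate` and `lengths_consistent` are hypothetical helpers introduced here.

```python
# Standard genetic code (NCBI translation table 1), listed in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(cds: str, stop: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop` ("-" here, assumed to match the
    convention of the protein record). Ambiguous bases such as N are
    not handled and raise KeyError.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop if aa == "*" else aa)
    return "".join(residues)


def lengths_consistent(nt_len: int, aa_len: int) -> bool:
    """True if nt_len equals aa_len codons plus one stop codon."""
    return nt_len == 3 * (aa_len + 1)


# First 30 nt and last 9 nt of DPOGS206198-TA, copied from the record.
print(translate("ATGTTTCTGTTAATATTGCTGTGGACGTTG"))  # MFLLILLWTL
print(translate("AAACCATAA"))                       # KP-
print(lengths_consistent(1212, 403))                # True
```

Passing the full DPOGS206198-TA sequence to `translate` and comparing the result with DPOGS206198-PA would extend the same check to the whole record.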