Monarch geneset OGS2.0

DPOGS206747
Transcript: DPOGS206747-TA, 1116 bp
Protein: DPOGS206747-PA, 371 aa
Genomic position: DPSCF300316 - 40456-41901
RNAseq coverage: 153x (Rank: top 53%)
Annotation
Heliconius       HMEL011177        1e-165  71.99%
Bombyx           BGIBMGA012386-TA  2e-113  53.44%
Drosophila       Cyp6a18-PA        4e-61   33.70%
EBI UniRef50     UniRef50_A5JTS8   2e-110  51.76%  Cytochrome P450 n=12 Tax=Ditrysia RepID=A5JTS8_BOMMO
NCBI RefSeq      NP_001104007.1    4e-111  51.76%  cytochrome P450, family 6, subfamily ab, polypeptide 5 [Bombyx mori]
NCBI nr blastp   gi|119067596      7e-130  61.14%  CYP6AB7 [Depressaria pastinacella]
NCBI nr blastx   gi|119067596      3e-128  61.14%  CYP6AB7 [Depressaria pastinacella]
Group
Gene Ontology
  GO:0009055  8.3e-78  electron carrier activity
  GO:0020037  8.3e-78  heme binding
  GO:0016705  8.3e-78  oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
  GO:0005506  8.3e-78  iron ion binding
  GO:0055114  8.3e-78  oxidation-reduction process
KEGG pathway
  dme:Dmel_CG9438  4e-59
  K00517 (E1.14.-.-) maps to:
    Naphthalene and anthracene degradation
    Stilbenoid, diarylheptanoid and gingerol biosynthesis
    Limonene and pinene degradation
    gamma-Hexachlorocyclohexane degradation
InterPro domain
  [1-368]    IPR001128  8.3e-78  Cytochrome P450
  [160-177]  IPR002401  8e-22    Cytochrome P450, E-class, group I
Orthology group: MCL10682 (Insect specific)

Nucleotide sequence:

>DPOGS206747-TA
ATGTTCCCTCTGATAGTGGAGAGAACGGAAAAATTAGCGAAGTTAGCTGAAGGTATGGCTGAGACATGTGATGAGCTTGATATCCGTGATCTGATGGCGAGGTACACCACAGATTTCATTGGGGCATGCGGTTTTGGTATAGACTCAGCCGCCCTTAACGACGAAAACTCAGATTTTCGAAAACTTGGCAAGCGTATTGTAACAGTAACCAAGAGAGACCACTTCGTGTCGCTTCTAAAGAGGATGATGCCGGTTTTATGCAAGGATTTACACTTTTTGGCACCAGAAATCGAAAATAACACCGTATCTATCGTCCAGCAGATTATGGCGCAAAGGAATTATAAGCCATCAGGCAGGAACGACTTCGTGGATATGATGTTGGAGCTTCGAGAGAAGGGGAATCTGGTCGGTGAATCGATCGAGCACAGGAATCCTGACGGTTCACCCAAGATCGTGGAAATAGAATTGGATGATCAGTTGATCGCTGCCCAGGTGTTCATTTTCTTTGTCGCCGGTTTCGAAACTTCTTCATCTGCCAGCAGCTTCTTGCTCCACATGCTGGCTTATCACCCGGAAGTACAGGAGAAATGCCGGAAGGAAGTCGACGAGGTGTTGAAGAATCACGAAGGGAAACTGTCCTTCGATGCTGTGAAGGACATGAAGTATCTGGAAATGTCCCTCAAGGAAAGCGTCAGGTTCTTGACTTCTCCTGGCTTTTTGATTCGTCGGACGGTCAATAAATGCACTCTACCGGGCACCAACTTTACGTTAGACGAAAACATGGTGATGATAGTGTCATCGCAGGCCATGAATATGGATGGAGAGCTTTTTGAGAACCCTGAGGAGTTCAGACCAGAGAGATTCAACCCAGACAACATTGGTGACATCAAAAAATGCACTTTCATGCCATTCGGCGATGGACCCAGGTCCTGCATTGGTGAGCGACTTGGAATCATGCAATCTTTGGCTGGAGTGGCCACAATTTTAAGTAAATTCACGTTGTCTCCATCACGTAGCTCGTTACGTAAGCCACGTATTGATCCGATGTCCACAATCGTTCAGACTGTTATCGGAGGACTGCCACTGTCACTTAAACGTAGAAAAGATATCTTATAA

Protein sequence:

>DPOGS206747-PA
MFPLIVERTEKLAKLAEGMAETCDELDIRDLMARYTTDFIGACGFGIDSAALNDENSDFRKLGKRIVTVTKRDHFVSLLKRMMPVLCKDLHFLAPEIENNTVSIVQQIMAQRNYKPSGRNDFVDMMLELREKGNLVGESIEHRNPDGSPKIVEIELDDQLIAAQVFIFFVAGFETSSSASSFLLHMLAYHPEVQEKCRKEVDEVLKNHEGKLSFDAVKDMKYLEMSLKESVRFLTSPGFLIRRTVNKCTLPGTNFTLDENMVMIVSSQAMNMDGELFENPEEFRPERFNPDNIGDIKKCTFMPFGDGPRSCIGERLGIMQSLAGVATILSKFTLSPSRSSLRKPRIDPMSTIVQTVIGGLPLSLKRRKDIL-
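
Consistency check (illustrative sketch):

The nucleotide and protein records above can be checked against each other. The sketch below is not part of the database record. It assumes that the transcript is a complete coding sequence (start codon through stop codon) and that the standard nuclear genetic code applies. The function names `translate` and `expected_protein_length` are hypothetical helpers introduced here for illustration. The record writes the stop codon as a trailing "-" in the protein sequence, whereas this sketch simply stops at it.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order. Assumed to apply to this transcript.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: codons minus the stop codon."""
    return cds_length_bp // 3 - 1


# First 30 nt of DPOGS206747-TA, copied from the record above.
print(translate("ATGTTCCCTCTGATAGTGGAGAGAACGGAA"))  # MFPLIVERTE
# Last 15 nt of DPOGS206747-TA, ending in the TAA stop codon.
print(translate("AAAGATATCTTATAA"))  # KDIL
# 1116 bp transcript -> 372 codons -> 371 residues plus stop.
print(expected_protein_length(1116))  # 371
```

Passing the full DPOGS206747-TA sequence to `translate` and comparing the result with DPOGS206747-PA (minus the trailing "-") extends the same check to the whole record.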