Monarch geneset OGS2.0

DPOGS205009
TranscriptDPOGS205009-TA1551 bp
ProteinDPOGS205009-PA516 aa
Genomic positionDPSCF300123 + 315912-337434
RNAseq coverage140x (Rank: top 55%)
Annotation
HeliconiusHMEL0094572e-16887.61% 
BombyxBGIBMGA010239-TA2e-7973.13% 
Drosophilashd-PB9e-12347.69% 
EBI UniRef50UniRef50_Q3LFR20.078.49%Ecdysone 20-hydroxylase n=2 Tax=Bombycoidea RepID=Q3LFR2_BOMMO
NCBI RefSeqNP_001106219.10.078.49%ecdysone 20-hydroxylase [Bombyx mori]
NCBI nr blastpgi|870824750.079.65%cytochrome P450 CYP314A1 [Manduca sexta]
NCBI nr blastxgi|870824750.079.65%cytochrome P450 CYP314A1 [Manduca sexta]
Group
Gene OntologyGO:00090553.8e-82electron carrier activity
GO:00200373.8e-82heme binding
GO:00167053.8e-82oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055063.8e-82iron ion binding
GO:00551143.8e-82oxidation-reduction process
KEGG pathwaydme:Dmel_CG134787e-121 
 K10723 (E1.14.99.22)maps-> Insect hormone biosynthesis
InterPro domain[59-515] IPR0011283.8e-82Cytochrome P450
[91-110] IPR0024011.1e-13Cytochrome P450, E-class, group I
Orthology groupMCL10619 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205009-TA
ATGTCTCTTCCTGGAGTTTTTCTATTTTCTCATTACGTGGAAAGCTTTTGGAGCGCACCGCCGCCTCTCGTGGACTGGTCAGGGGTGCCGACGTTGATTTTGGCATTGGTGGCGCTAGTTATGGCGGTGACAGCTCTACTTACGAAGACTGTAGACGGTAAAAGGCCCACTCGCTTACCGGGACCACCTGCTTTACCTTTCCTGGGATCGAGATGGTTATTCTGGAGCCGTTATAAGATGAACAAGCTGCATGAAGCGTACGAAGACATGTTCAGGCGATACGGGCCGGTGTTTGCTGAGACAACGCCGGGAGGCGCTCTACTAGTGTCCATCGCTGACAGGACAGCACTCGAAGCTGTGCTGAAGACACCAGCTCGAAGACCATATAGACCTCCAACAGAGATCGTTCAGGTGTACAGACGAAGCCGACCTGATAGATACGCGTCTACAGGGATAGTTAATGAGCAGGGTGAAAAATGGCACCACCTCCGTCGTCACCTAACGTCAGAGCTGACAAGCCCCCAAACTATCCACGGCTTCGTCCCTGAACTGAACAATATCTGCGATGACTTCATAAACCTGCTGAGGAACTCTCGCCGGCCGGATGGTTCCGTATTGGGATTCGACCAACTCACCAATCGAGTGGGTTTGGAATCTGTGTGTGGTCTAATGTTGGGGACGAGGTTGGGTTTCCTCGAGCGTTGGATGTCAGGGAGAGCTGCTGCGCTGGCTGCAGCGGTGAAGGCGCATTTCAGAGCTCAGAGGGATTCCTACTACGGAGCCCCCTTGTGGAAGTTTGCCCCGACGTCCGTGTATAAAACTTTTGTCAGGAGCGAAGAGACGATACATATGATAGTATCTGAACTGATGGAGGAGGCTCGCTGTCGGTCACGAGGCGCGGCTCAAGACGATCCGCTGCAGGAGATCTTCCTTAAGATCCTCGCCAACCCCGAGCTGGACATGAGGGACAAGAAAGCCGCTATCATCGACTTCATAACCGCTGGTATTGATACGCTCGCCAACAGCTTGGTGTTCCTTCTTTACTTGCTGAGCGGTCGCGTGGACTGGCAGCAGAGGATCCGGTCGGAGCTGCCGTCTTGTGGGGAGCTTCGTATTGAGGAGTTGTCGTCGGCGCCGTCGGTGAGGGCGGCGATCAACGAAGCCTTCAGACTGCTCCCCACGGCGCCCTTCCTGGCTAGACTTCTGGACGCGCCGATGACGATAGGAGAATATAGGCTCCCAGCCGGGACGTTCGTGTTAGCTCACACGGGCGCGGCTTGCAGGCGTGAGGAGAACTTCCACCGCGCGGGCGAGTACCTCCCGGAGCGTTGGGTGGAGCCCCGCGAACCCCACGCGCCCGGGCTACTGGCTCCGTTCGGACGGGGACGGAGAATGTGCCCCGGGAAGCGATTCGTTGAGATGCAGCTGCACTTGATACTGGCCAAGATCATCCAGCACTGGCGCGTGGAGTTCGACGGTGAGTTGGACATACAGTTTGATTTTCTTCTGTCACCAAAGTCCCCCGTGTCTCTGCGGCTAGTCGAGTGGTGA

Protein sequence:

>DPOGS205009-PA
MSLPGVFLFSHYVESFWSAPPPLVDWSGVPTLILALVALVMAVTALLTKTVDGKRPTRLPGPPALPFLGSRWLFWSRYKMNKLHEAYEDMFRRYGPVFAETTPGGALLVSIADRTALEAVLKTPARRPYRPPTEIVQVYRRSRPDRYASTGIVNEQGEKWHHLRRHLTSELTSPQTIHGFVPELNNICDDFINLLRNSRRPDGSVLGFDQLTNRVGLESVCGLMLGTRLGFLERWMSGRAAALAAAVKAHFRAQRDSYYGAPLWKFAPTSVYKTFVRSEETIHMIVSELMEEARCRSRGAAQDDPLQEIFLKILANPELDMRDKKAAIIDFITAGIDTLANSLVFLLYLLSGRVDWQQRIRSELPSCGELRIEELSSAPSVRAAINEAFRLLPTAPFLARLLDAPMTIGEYRLPAGTFVLAHTGAACRREENFHRAGEYLPERWVEPREPHAPGLLAPFGRGRRMCPGKRFVEMQLHLILAKIIQHWRVEFDGELDIQFDFLLSPKSPVSLRLVEW-