Monarch geneset OGS2.0

DPOGS206197
Transcript: DPOGS206197-TA (1293 bp)
Protein: DPOGS206197-PA (430 aa)
Genomic position: DPSCF300345 + 228796-235017
RNAseq coverage: 2x (Rank: top 91%)
Annotation
Heliconius      HMEL010935        8e-104  40.49%
Bombyx          BGIBMGA002267-TA  8e-76   37.37%
Drosophila      Cyp4s3-PA         5e-46   30.02%
EBI UniRef50    UniRef50_B1AAB6   2e-52   30.27%  CYP366A1 n=2 Tax=Obtectomera RepID=B1AAB6_BOMMO
NCBI RefSeq     XP_001945361.1    4e-55   32.79%  PREDICTED: similar to cytochrome P450 [Acyrthosiphon pisum]
NCBI nr blastp  gi|328703336      3e-55   32.94%  PREDICTED: cytochrome P450 4C1-like [Acyrthosiphon pisum]
NCBI nr blastx  gi|328703336      7e-55   31.99%  PREDICTED: cytochrome P450 4C1-like [Acyrthosiphon pisum]
Group
Gene Ontology
GO:0009055  1.3e-79  electron carrier activity
GO:0020037  1.3e-79  heme binding
GO:0016705  1.3e-79  oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:0005506  1.3e-79  iron ion binding
GO:0055114  1.3e-79  oxidation-reduction process
KEGG pathway 
InterPro domain
[42-425]   IPR001128  1.3e-79  Cytochrome P450
[139-157]  IPR002401  2.3e-14  Cytochrome P450, E-class, group I
Orthology group: MCL21026, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206197-TA
ATGTTCCTGTTAATATTGCTGTGGACATTGGTATTTCTATTTTGGATGTGGTTAAAAAGAGATACAAATAAAGATGAACCACCAACATTGGTTAATTTAGACTGGAGAAAAATACATCGTATTACAGCCGATACTGAGGTTCTAACAGACCCTGAAGATTGTTTGACTGTGTCAAATGCGTGCTTACATAAGGATCGTGTTTATGACTTCGCAAAAAACTGGATTGGAAATGGCCTCATCACTGCGTCTTTACCTATTTGGAAAATACATCGAAAAGTATTAGATCCACTCTTCAGTGCTCGTCTATTGAATAACTTTATGGAGGTATTCAACAATCTTTCGCGTGTCCTCATCAAAAATCTAGAAGTAGAAGTTGAGAAAGGACCCTTTGATCCCTATGTTTATTCGAGACGGCACACTTTGGAAATAATATGTATGATTTCTTTTGGAACGAAATTAATGGACAACATTGAATGCAAAAATAAATACATGGAGAAGGTAGAAAAGATAACGAATGTTTATATATCTAGAGTACAGGTTCTTAATGAAATCAAAACAAGCTCTAAAGCAGAAAAATCTTTTAATACGGACGAAAGGAAAACAAATGGTTTCGCATTCGAACCATTGGCGGAAATTATTTTGAAAATTTGTGAGAATAGTAAACAGTTCACAGATCTAGATATAAGACAACATGTAGACACATTTATTGCAGCTGGTGAAGACAGTTCCGCCGGGGTTATTATGTTATGTTTGATAACCGTGGGTTCTTATCCACAAGTGCAAAAAGAGATACACAAAGAGTTAAAACAGATTTTCGGTGATGAAGACAGAGATGTGACGAGAGAAGACCTTTCAAAACTAGTTTACTTAGAGGCAGTAATAAAAGAGACAATGCGTTTTTACCCAATAGTACCCATCATAGCAAGAGATCTGGATAAAGACATCAAATTAAGTAACTGCACTTTATCAAAAGGTCGCTCTGTAGTTTTATCGATCTATGGAATACATCGACATCCAATGTGGGGTCCAGACGCTGATGAGTTTAGACCGGAACGATGGCTTGACCTTCCATCAAATTATCAAAAGTATTTTGCTGCTTTTAGTTTGGGTCGCAGAATTTGTATAGGAAAAACCATGGCAATCGCATCGCTGAAAGTTACTATGGCTCACATATTTCGAAACTACATAATTCATGGCGAACACACAAATATGAAATTAAAGTTTGAACTTACTCTGAAAGCTGTTTCTGGACATCATATTTCCATTGGGAGAAGAATCAAAAATAAACCATAA

Protein sequence:

>DPOGS206197-PA
MFLLILLWTLVFLFWMWLKRDTNKDEPPTLVNLDWRKIHRITADTEVLTDPEDCLTVSNACLHKDRVYDFAKNWIGNGLITASLPIWKIHRKVLDPLFSARLLNNFMEVFNNLSRVLIKNLEVEVEKGPFDPYVYSRRHTLEIICMISFGTKLMDNIECKNKYMEKVEKITNVYISRVQVLNEIKTSSKAEKSFNTDERKTNGFAFEPLAEIILKICENSKQFTDLDIRQHVDTFIAAGEDSSAGVIMLCLITVGSYPQVQKEIHKELKQIFGDEDRDVTREDLSKLVYLEAVIKETMRFYPIVPIIARDLDKDIKLSNCTLSKGRSVVLSIYGIHRHPMWGPDADEFRPERWLDLPSNYQKYFAAFSLGRRICIGKTMAIASLKVTMAHIFRNYIIHGEHTNMKLKFELTLKAVSGHHISIGRRIKNKP-
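
Consistency check (illustrative):

The transcript and protein records above can be cross-checked: 1293 bp is 431 codons, which gives 430 amino acids plus one stop codon, shown as the trailing "-" in the protein sequence. The following is a minimal Python sketch of that check. It assumes the standard genetic code (NCBI translation table 1) and that DPOGS206197-TA is a complete coding sequence read in frame from its first base. The function name translate and the stop_symbol parameter are hypothetical names chosen for this example, not part of the geneset tooling. Only the first and last few codons of the transcript are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence; stop codons become stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for codons with ambiguous bases
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First 30 and last 12 bases of DPOGS206197-TA, copied from the record above.
first_codons = "ATGTTCCTGTTAATATTGCTGTGGACATTG"
last_codons = "AATAAACCATAA"

print(translate(first_codons))  # start of the protein sequence
print(translate(last_codons))   # end of the protein sequence, including the stop

# Length bookkeeping: 1293 bp -> 431 codons -> 430 residues + 1 stop.
transcript_bp = 1293
print(transcript_bp // 3 - 1)
```

Passing the full nucleotide sequence to translate and comparing the result with the DPOGS206197-PA record would extend the same check to the whole transcript.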