Monarch geneset OGS2.0

DPOGS207804
TranscriptDPOGS207804-TA1119 bp
ProteinDPOGS207804-PA372 aa
Genomic positionDPSCF300042 + 328440-332315
RNAseq coverage285x (Rank: top 39%)
Annotation
HeliconiusHMEL0175512e-4957.63% 
BombyxBGIBMGA005496-TA2e-6154.90% 
Drosophilasad-PA6e-5233.33% 
EBI UniRef50UniRef50_Q2HZZ46e-11059.43%Cytochrome P450 CYP315A1 n=4 Tax=Obtectomera RepID=Q2HZZ4_MANSE
NCBI RefSeqNP_001106224.15e-10659.02%cytochrome P450, family 315, subfamily a, polypeptide 1 [Bombyx mori]
NCBI nr blastpgi|864403152e-10959.43%cytochrome P450 CYP315A1 [Manduca sexta]
NCBI nr blastxgi|864403159e-10759.43%cytochrome P450 CYP315A1 [Manduca sexta]
Group
Gene OntologyGO:00090553.4e-53electron carrier activity
GO:00200373.4e-53heme binding
GO:00167053.4e-53oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:00055063.4e-53iron ion binding
GO:00551143.4e-53oxidation-reduction process
KEGG pathwaytca:6586658e-54 
 K10722 (SAD)maps-> Insect hormone biosynthesis
InterPro domain[31-372] IPR0011283.4e-53Cytochrome P450
[182-199] IPR0024014.9e-14Cytochrome P450, E-class, group I
Orthology groupMCL16945 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207804-TA
ATGTTCCGTCTGAAAAAAATCTCTCAAATGAAAATTGTGAAACAAAAAAGCTACATCTTGCCCGTTAAAAATTATTTAACTATTTCTGATATGCCCAAACCAAAATCTCTGCCAATTATAGGAACAAAATTGGAATTCTTGGCTGCAGGAGGCGCGAACAAAGATACTATGCAAACGTGGAAATTAAATTTGGAAAAAGGATGTCGATTTCCAAATATTGAATCTGAACTGTATAGGCTGTCGTCTAATGTTATTATTAATATTTTACTCGGTACCCACTCTTTAGAGCTAAGCGATCATTATAATGAAATGTTGTCATTGTTTTCGGATTCTATGAAGAAAATCTTCCAGACCACCACAAAATTGTATAGTATTCCTGTAAATTGGTGTCAAAAGCTAAATCTGAAAGTGTGGAGGGATTTCAAAGAATCCGTTGACCTTACACTGTTTTTAGGAAGAAAAATTACTCGTGAAATGATGTTTAATAAAAACAAAAGCGATGGTTTATTAAAAAGAATGACTGAAGAAAATATGTCACCTGAAATAATTACAAGAATTGTATCAGACTTGATCATTGCTGCGGGAGATACGACAACATATACCGCGTTATGGACTTTATTGTTACTGACAAGAAATGAAGATACATTGAAGGAAAGCAGGAAAGGAGATCAAAAATATATTAAATACATTGTCAAGGAATCAATGCGATTGTACCCTGTAGCTCCATTTCTGACGAGAATACTTCCACAAGAGACGATTTTAGGTGACTACAAACTGAGTAAAGGGACACCAATTATCGCTTCCATCTATACAACGGGAAGGGATAAGCAAAATTTTTCGGAACCAAACTCTTTTCTTCCTTACCGTTGGGACAAAACAGATCCACGCAAAAAAGATCTCATTAACCATGTTCCCCCAGCGACACTGCCTTTTGCATTGGGATCCCGTTCATGTATAGGCAAAAAAATTGCCATGAAACAATTATCGGAATTCATTAGTCAGATCACCTACAACTTTGACCTTAAATGTAATAACAATCAGCAAATAAAATCTGTGACATCTCAAATATTGATACCAGATCAGAACATAGATTTTTCATTATCTGTTAGGAAGCAATGA

Protein sequence:

>DPOGS207804-PA
MFRLKKISQMKIVKQKSYILPVKNYLTISDMPKPKSLPIIGTKLEFLAAGGANKDTMQTWKLNLEKGCRFPNIESELYRLSSNVIINILLGTHSLELSDHYNEMLSLFSDSMKKIFQTTTKLYSIPVNWCQKLNLKVWRDFKESVDLTLFLGRKITREMMFNKNKSDGLLKRMTEENMSPEIITRIVSDLIIAAGDTTTYTALWTLLLLTRNEDTLKESRKGDQKYIKYIVKESMRLYPVAPFLTRILPQETILGDYKLSKGTPIIASIYTTGRDKQNFSEPNSFLPYRWDKTDPRKKDLINHVPPATLPFALGSRSCIGKKIAMKQLSEFISQITYNFDLKCNNNQQIKSVTSQILIPDQNIDFSLSVRKQ-