Monarch geneset OGS2.0

DPOGS215325
TranscriptDPOGS215325-TA2550 bp
ProteinDPOGS215325-PA849 aa
Genomic positionDPSCF300120 + 204787-211915
RNAseq coverage158x (Rank: top 52%)
Annotation
HeliconiusHMEL0100241e-10390.77% 
BombyxBGIBMGA007963-TA0.077.82% 
Drosophilaphl-PB1e-17358.06% 
EBI UniRef50UniRef50_E9JEI80.077.82%Raf n=5 Tax=Endopterygota RepID=E9JEI8_BOMMO
NCBI RefSeqXP_396892.20.065.11%PREDICTED: similar to pole hole CG2845-PA [Apis mellifera]
NCBI nr blastpgi|3214000760.077.82%raf kinase, effector of Ras [Bombyx mori]
NCBI nr blastxgi|3214000760.077.82%raf kinase, effector of Ras [Bombyx mori]
Group
Gene OntologyGO:00167726.1e-63transferase activity, transferring phosphorus-containing groups
GO:00046724.8e-50protein kinase activity
GO:00064684.8e-50protein phosphorylation
GO:00055241.2e-41ATP binding
GO:00046741.2e-41protein serine/threonine kinase activity
GO:00047135.2e-24protein tyrosine kinase activity
GO:00071654.6e-21signal transduction
GO:00050574.6e-21receptor signaling protein activity
GO:00355561.7e-09intracellular signal transduction
KEGG pathwayame:4134480.0 
 K02644 (PHL)maps-> MAPK signaling pathway - fly
    Dorso-ventral axis formation
InterPro domain[517-793] IPR0110096.1e-63Protein kinase-like domain
[528-783] IPR0012454.8e-50Serine-threonine/tyrosine-protein kinase
[526-787] IPR0022901.2e-41Serine/threonine-protein kinase domain
[526-784] IPR0206355.2e-24Tyrosine-protein kinase, catalytic domain
[130-197] IPR0031164.6e-21Raf-like Ras-binding
[210-255] IPR0022191.7e-09Protein kinase C-like, phorbol ester/diacylglycerol binding
Orthology groupMCL11301 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215325-TA
ATGGCAGTAACTCGAGAGAAAGAAAATTGTACAAGAAAAATTAATGGAAATATACCTGCATTATCTCTTAACAGCGATAATATGTCGACATCGACCGACAGTGACGACGCAGCGGATCCAACATATGAAATTAGGAACATTCAGAGTATTATAAATGTGACCCGTGAAAACATTGACGCTTTAAATGCAAGGTTTGCTGGTTTGCAGCACCCACCTTCTCTGTATCTTTCGGAATATCAGGAGTTAACAGTAAAGTTGCATAATTTAGAACTTAGAGAACAGACACTTAGGGAATTGTTACAGTGCGATTCGCCGGATAGGAGCGAAGAATATTCATGGCAAGATGATTCTAAGAAGTATGATACTCTTACTCGGCAACCTAAAGTTTTCCTTAGAGCTTACTTGCCGAATCAACAGCGGACGAGTGTACAGGTGAAGGAGGGAGTTTCGCTTCGAGAAGCTCTCATGAAGGCTCTCAAGCTTCGTAACTTAACTTGTGAAATGTGCGAAGTGGTTCGGACAGGAAATAATGCTGTGATACCGTGGGATATCGATATTACTATGATTGATGCTGAGGAGGTGACAGTGCGAATTTTGGACAAATTGCCCATTATGTCTCACATGTCGCACCAGTTCACAAGAAAGACATTCTTCACACTCGCGTTCTGCGAGTGCTGTCGACGGCTGCTCTTCAACGGCTTCTACTGCTCGCAGTGCAACTTCAAGTTCCATCAGCGATGCGCTGATAAAGTTCCAAGTATTTGTCATCAGGTACGAATGACTGACACATACTACGCGGCTCTACTGGCCAAGAATCCGGAGACCCAGGCGGGGATCCTGCACGTGCCCGCCCACTACAGCTATCATCAGAACAAACATCAATATTTTTACTATTTGCTATCAGGCACATCACTAGCCTTGACCTACAAAAACATGTACAATGAGCCAATATTCCAGGTGACAGTGCGAATTTTGGACAAATTGCCCATTATGTCTCACATGTCGCACCAGTTCACAAGAAAGACATTCTTCACACTCGCATTCTGTGAGTGCTGTCGACGGCTGCTCTTCAACGGCTTCTACTGCTCGCAGTGCAACTTCAAGTTCCATCAGCGATGCGCTGATAAAGTTCCAAGTATTTGTCATCAGGTACGAATGACTGACACATACTACGCGGCTCTACTGGCCAAGAATCCGGAGACCCAGGCGGGGATCCTGCACGTGCCCGCTCACTACAGCTATCATCAGAACAAACAAGCCCACCCTCGCTCGCTGAACCAGCAAGACCGCTCCAATTCGGCGCCAAACGTGTGTGTCAACATGGTGAGACATTTGACCGATCACGGCCGGAAGAATAACCAGAGCAGCTCCCCGCAATCATCAAATTACCTTTCATCAGGGAGACACGGCAAAGACGACCGGGAGAAGGGCGAGCAGCAGAGTCAAAGCACTCAGGCCTCCCCCACGGGCACGCTTCGACCTCGAAGGGCGCGCGCTCGCTCCGCGGACGAGTCGTGGAAACATGCCCTGTCTCCTCGTGAAAGCTACGACGACTGGGTCATACACGCCGATGAAATACTCATAGGGGCCAGGATAGGCTCCGGTAGTTTTGGGACAGTGTATAAGGCGCATTGGCACGGACCGGTCGCTGTCAAGACGCTGAACGTGAAGACGCCCACCCCAGCTCAGCTACAGGCCTTCAAAAACGAGGTGGCTGTCCTCCGTAAGACGCGGCACTGCAACATCCTTCTGTTCATGGGCTGTGTGTCCAAGCCTTCCTTGGCGATAGTGACCCAGTGGTGTGAAGGCTCCTCGTTGTATCAACACCTGCACGTGCTGGAGACGCCGCTGCCGATGTTATACCTCATAGACGTGGCGCGCCAGACCGCCCAGGGCATGGACTACCTGCACGCCAAGTTGGATCTGAAATCTAACAACATATTCTTAAGAGACGACTGGTCGGTTAAGATTGGAGACTTCGGCCTGGCGACGGCCAAAGTGCGGTGGTCAGATTCGTCCACGCTGGGAGGCGTGCAGTGGCAGCAGCCGACCGGCTCGATACTGTGGATGGCGCCGGAAGTGATCCGTATGGAGGAACCGGCGCCGTACACCTTCCGGTCGGACGTGTACGCGTACGGGATCGTGCTGTTCGAGCTGACGGCCGGCGAGCTACCGTACGCGCACCTCAACAACAAAGACCAGATACTGTGGTCGGTGGGCCGCGGCCTGCTGCGACCGGACGTCCGTCGGTTGCGTCAGGACGCTCCGCAGGCCCTGCGCCGCCTGTTCGAGGACTGCATCCAGTTTGACCGGGAGCGCCGGCCGCTGTTCCGTCAGATCCTGGCGTCCCTGGAGGCCATGCTGCGAGCCATGCCCAAGATCACGCGCAGCGCGTCCGAGCCGTGCCTGCAACGCGCCCTCCACGCATCCGACGACTGTCTCGGGTACGCCTGTGCGGAACCAAAGACGCCCGTCAACTTCCAGTTCGACGCTCACACTAGCTTCCCGGCCTTCTACAGTATTCCAGTGCCTAGCCAGCGACACGCCTAG

Protein sequence:

>DPOGS215325-PA
MAVTREKENCTRKINGNIPALSLNSDNMSTSTDSDDAADPTYEIRNIQSIINVTRENIDALNARFAGLQHPPSLYLSEYQELTVKLHNLELREQTLRELLQCDSPDRSEEYSWQDDSKKYDTLTRQPKVFLRAYLPNQQRTSVQVKEGVSLREALMKALKLRNLTCEMCEVVRTGNNAVIPWDIDITMIDAEEVTVRILDKLPIMSHMSHQFTRKTFFTLAFCECCRRLLFNGFYCSQCNFKFHQRCADKVPSICHQVRMTDTYYAALLAKNPETQAGILHVPAHYSYHQNKHQYFYYLLSGTSLALTYKNMYNEPIFQVTVRILDKLPIMSHMSHQFTRKTFFTLAFCECCRRLLFNGFYCSQCNFKFHQRCADKVPSICHQVRMTDTYYAALLAKNPETQAGILHVPAHYSYHQNKQAHPRSLNQQDRSNSAPNVCVNMVRHLTDHGRKNNQSSSPQSSNYLSSGRHGKDDREKGEQQSQSTQASPTGTLRPRRARARSADESWKHALSPRESYDDWVIHADEILIGARIGSGSFGTVYKAHWHGPVAVKTLNVKTPTPAQLQAFKNEVAVLRKTRHCNILLFMGCVSKPSLAIVTQWCEGSSLYQHLHVLETPLPMLYLIDVARQTAQGMDYLHAKLDLKSNNIFLRDDWSVKIGDFGLATAKVRWSDSSTLGGVQWQQPTGSILWMAPEVIRMEEPAPYTFRSDVYAYGIVLFELTAGELPYAHLNNKDQILWSVGRGLLRPDVRRLRQDAPQALRRLFEDCIQFDRERRPLFRQILASLEAMLRAMPKITRSASEPCLQRALHASDDCLGYACAEPKTPVNFQFDAHTSFPAFYSIPVPSQRHA-