Monarch geneset OGS2.0

DPOGS200090
TranscriptDPOGS200090-TA1734 bp
ProteinDPOGS200090-PA577 aa
Genomic positionDPSCF300044 - 14966-33807
RNAseq coverage3050x (Rank: top 4%)
Annotation
HeliconiusHMEL0040640.074.49% 
BombyxBGIBMGA003607-TA0.075.15% 
Drosophilafray-PE0.071.48% 
EBI UniRef50UniRef50_B3LW890.072.19%GF16907 n=6 Tax=Coelomata RepID=B3LW89_DROAN
NCBI RefSeqXP_974782.10.073.31%PREDICTED: similar to serine/threonine protein kinase [Tribolium castaneum]
NCBI nr blastpgi|910830350.073.31%PREDICTED: similar to serine/threonine protein kinase [Tribolium castaneum]
NCBI nr blastxgi|910830350.073.31%PREDICTED: similar to serine/threonine protein kinase [Tribolium castaneum]
Group
Gene OntologyGO:00167721.8e-83transferase activity, transferring phosphorus-containing groups
GO:00055242.8e-81ATP binding
GO:00046742.8e-81protein serine/threonine kinase activity
GO:00064682.8e-81protein phosphorylation
GO:00046721.2e-63protein kinase activity
GO:00047131e-21protein tyrosine kinase activity
KEGG pathway 
InterPro domain[88-407] IPR0110091.8e-83Protein kinase-like domain
[102-376] IPR0022902.8e-81Serine/threonine-protein kinase domain
[103-376] IPR0174421.2e-63Serine/threonine-protein kinase-like domain
[102-376] IPR0206351e-21Tyrosine-protein kinase, catalytic domain
Orthology groupMCL10885 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200090-TA
ATGGAAGTGAGGCGTAAGAATAAAGGTTACTTCAGGATGGAGCCGGTGCCTGAGGTAGGCGGGCCGTTCAAGATAAAAACGGATAGAAACAGTAATATACAGGCAGGAATATTACCACCCGGTGTGAGTGGAGTTTTCCGCCAGAAATCTTTTGGAACATGCGACCGACAAATCTCAGCATTGTCGTCGTATGTGCGAGCTGCTTGCACCTTGAACACAGCCGGCAGCATTGGAAGTTTGTTCACTCGCAAAATGGCTGCCACTGCAAATACTTCATGTCCGTGGCCTAATAGTCAAGAAGATTACGATTTACGTGATGTTATTGGTGTGGGAGCAACTGCTGTTGTATATTCAGCATTTTGTAAACCAAGAAATGAAAAATGTGCTATAAAGAGAATAAATTTAGAGAAATGGAATACCTCAATGGACGAGCTACTAAAGGAAATTCAAGCAATGTCCAGTTGCAATCATCTCAATGTTGTCACATATTATACCAGTTTTGTGGTTAATGATGAATTGTGGCTTGTCCTTAAATTATTAGAAGGAGGGAGTCTCTTGGACGTAATCAAACATAAGATGCGGGTATCGAATTGTAAACACGGCGTATTTGATGAAGCAACGATCGCCACTGTCCTCAAAGAAGTGCTTAAAGGATTGGAGTACTTTCACAGCAACGGACAAATTCATAGGGATATAAAGGCGGGTAACATATTGCTGGGAGAGGATGGAACTGTTCAAATTGCTGACTTCGGGGTGTCAGCGTGGCTGGCGACCGGCAGGGATTTGTCGCGGCAGAAAGTCAGGCACACCTTCGTTGGTACTCCCTGCTGGATGGCACCCGAAGTCATGGAACAGGACCACGGTTACGACTTCAAAGCGGACATATGGTCCTTTGGTATAACAGCTATTGAGATGGCAACAGGCACCGCGCCTTATCATAAATACCCACCAATGAAGGTCCTGATGCTTACTCTGCAGAACGATCCACCAAACTTAGATACCGGTGCGGATGAAAAGGAACAATACAAAGTATACGGGAAAACCTTTAGGAAAATGATAATCGATTGTTTACAGAAAGATCCATCGAAGAGACCAACAGCGACAGAGTTACTTAAACATCAGTTTTTCAAAAAAGCTAAAGACAAAAAGTACCTAACACAGACTCTTGTCGCTATTGGACCAAGTATGGAGACGAGGGTTCATAAGGCAAGTAAACGGCAGCCGGGTGCTTCGGGTCGTCTTCATCGCACTGTAACTGGTGAATGGGTGTGGAGTGAGGAGGAAGACGAGGCTGGGAGAGAGTCGGATGAGGAGCCGGACAAACGACCCATGAACCAGTTACAGAGGGCAGACTCCAGTAGCGATAACGACGAGGAAGCAGGAGTGAGTAAACAAACAAATCAACTGGCTCGCGTAACATCTGAGTCAGTTGTTGTCAATCTAGTACTGAGGCTAAGGAATTCTCGTCGGGAATTGAACGATATACGCTTTGAATTCATCACGGGGAAGGATACTTCAGAAGGCATAGCTGGCGAGCTGGTCGGAGCCGGATTGGTGGATCCATTGGATTCTGTTCCGATATCCACTAACCTGGCCAAGCTGCTTGCACAGCGGAACACGCCGACGCCGGTCAACACTGTCACATTCCATCTTAATTCGACTCCTGCTAATGAACAGTTTGACGACAAAACTTTGATCGGTTTCGCGCAGATTTCCATAGTCGACATCGCTTAA

Protein sequence:

>DPOGS200090-PA
MEVRRKNKGYFRMEPVPEVGGPFKIKTDRNSNIQAGILPPGVSGVFRQKSFGTCDRQISALSSYVRAACTLNTAGSIGSLFTRKMAATANTSCPWPNSQEDYDLRDVIGVGATAVVYSAFCKPRNEKCAIKRINLEKWNTSMDELLKEIQAMSSCNHLNVVTYYTSFVVNDELWLVLKLLEGGSLLDVIKHKMRVSNCKHGVFDEATIATVLKEVLKGLEYFHSNGQIHRDIKAGNILLGEDGTVQIADFGVSAWLATGRDLSRQKVRHTFVGTPCWMAPEVMEQDHGYDFKADIWSFGITAIEMATGTAPYHKYPPMKVLMLTLQNDPPNLDTGADEKEQYKVYGKTFRKMIIDCLQKDPSKRPTATELLKHQFFKKAKDKKYLTQTLVAIGPSMETRVHKASKRQPGASGRLHRTVTGEWVWSEEEDEAGRESDEEPDKRPMNQLQRADSSSDNDEEAGVSKQTNQLARVTSESVVVNLVLRLRNSRRELNDIRFEFITGKDTSEGIAGELVGAGLVDPLDSVPISTNLAKLLAQRNTPTPVNTVTFHLNSTPANEQFDDKTLIGFAQISIVDIA-