Monarch geneset OGS2.0

DPOGS212581
TranscriptDPOGS212581-TA3372 bp
ProteinDPOGS212581-PA1123 aa
Genomic positionDPSCF300075 + 451851-467450
RNAseq coverage192x (Rank: top 48%)
Annotation
HeliconiusHMEL0120770.067.70% 
BombyxBGIBMGA012318-TA4e-16169.23% 
DrosophilaPR2-PA0.041.62% 
EBI UniRef50UniRef50_E0VHV90.048.18%Tyrosine-protein kinase pr2, putative n=1 Tax=Pediculus humanus corporis RepID=E0VHV9_PEDHC
NCBI RefSeqXP_002425703.10.048.18%tyrosine-protein kinase pr2, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420098660.048.18%tyrosine-protein kinase pr2, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420098660.046.96%tyrosine-protein kinase pr2, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00047137.2e-114protein tyrosine kinase activity
GO:00046721.5e-80protein kinase activity
GO:00064681.5e-80protein phosphorylation
GO:00167727.5e-69transferase activity, transferring phosphorus-containing groups
GO:00055241.4e-39ATP binding
GO:00046741.4e-39protein serine/threonine kinase activity
KEGG pathway 
InterPro domain[120-392] IPR0206357.2e-114Tyrosine-protein kinase, catalytic domain
[120-391] IPR0012451.5e-80Serine-threonine/tyrosine-protein kinase
[109-421] IPR0110097.5e-69Protein kinase-like domain
[120-390] IPR0022901.4e-39Serine/threonine-protein kinase domain
Orthology groupMCL16572 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212581-TA
ATGGATAACTCAGCTAATGATCAAGAGGATTTGATTGCATTTCTTGAAGAAGCTGATCTTGAGCAGTATTATGGGTGTTTTAGGGATATTTTGAAAGTCACAAAATTATCACAGCTGAAGTTTGTGGCTGTTGAAGATTTAATTCAAATTGGACTTTCAAAGCCAGAACAACGAAGATACAAGAAATTGTATTCAAGATACTTCCCGAACACATATATAACTAAACTGAAGAGACTTATAAGCGGCCACAGAAAGGCAGATTTAATGAAACAGGAGCAAATAAATGTAACACTTGATGTACAAACATCAGACATGAATGTAAAAGTGCCTACAAAACACATTATTTCAATTGAGGACATTACAATTAATAAGGAACTTGGTATGGGACAGTTTGGAGTTGTGCAGCAAGGCACATGGAATACTGGAAATCAAAGACTGCAAGTGGCAATCAAATGCCTTGGAAATGAAAAGATGACATCAAACTCAACAGAGTTCCTCAAGGAAGCAGCGGTGATGCATAGTATTGAACATCCCAATATTGTACGCTTGTATGGAGTTGTATTGCATTTAGATTCTTTGATGTTGGTGACTGAACTTGCACCATTGAGATCGCTTCTTGAGTGTTTGAGGGAGGTCACATTGCGTATTGACTTCTCTGTAGCTGTGCTTTGTGAGTTTGCGGAACAGATCTGTGATGGAATGACTTACTTAGAGAAGAAAAGACTGATACATAGGGATTTGGCAGCTCGGAACATCTTAGTATTCTGCAAGGATAGAGTGAAGATATCCGATTTTGGTCTGTCCCGAGCTCTGGGAGTTGGAAAAGACTACTACCAGACAAATTATAATGTGAATCTCAAATTGCCAGTAGCTTGGTGCGCCCCGGAATGCATCCTCTACCTTAGATTCACTTCTGCAAGCGATGTTTGGGCATTCGCTGTGTGTCTGTGGGAGATGTTCACGTATGGATTCCAGCCGTGGGCTGCTTTCTCAGGACAACAGATCCTTGAGGCCATTGACGCTCCCAATTTTCAGGCACTTTTTAACTCACTCTCAAACTTAGTACGTTTGGAGCGTCCCGATTGTTGCCCCGATTCTTATTATTCATTGATGTTGGAGTGCTGGTCGCATGATCCAAACGATCGACCAAAATTCAAAGACCTTGCCATCAAATTGCGTGGCATACGTCCTGAGGAGGTCCAGGCCATATCTACGTATAAGAAACATGAAGATGGCAGGCGAAGCAACTTTCTCGAATACAAGACGGGACAAAAGATCACCGTGATCGGGAAACAAAATTATACTAAATCTGCTATGTGGTATGGAGTGCTGCCAGGCGGTGCGTGCGGACTTTTCGATCCAAATCAAACCAAACCTTATGTAAAGCCCGAGCGAAGTTTGCCTGGCTCACTGTCTCATCCAACTCGCAATTCAACTCGTAACATTCGATCATCCCTATTGCGGTCGGATCTCAAAAGACACACATACTCCGGCAAACGTACCATACAGAGAAGCATGATATCAAGCCCACAGGGTGACGTTAAACACACCAGCCACTTTGGCCTTGACGGTGCATACTTTGGCGATATATCCTTCTTGGATTCCGCCGGCATTCCACCACGGCAGGTGGTGACTCCGTACAAACCATCGGAGGATCTGGAGCAGATGCCGCTCATAAATCCTCACTCGCCATCCCATTTCAGCGCCTTCCCGCTGCTTGATTCACCAAAACAGGATAATGATACAAACCCTGACGGCAAATTCAAAGGAATGATAAGACCTCTCAAAACCATCAATGAGCCGACTACAAGTAAAAGTTTCATCTACAAGGTGAAGAGTGCCACGTTGGGACGTTCGAACAAGAGCGATGAGGGTGATAACAACAACGAGGTCCACGAGTACCACGAGATATCTGACACCGATACGGAGAGTATTGCTGCTGAAAACAAAGGTGAAGGCACTCCAGAGAGTCCTGGGATTGTTGTAAAGAGCTATGCTGACTTCAGCAAGAGTCTTCTAGAAGAAATGGAGACTATCTTCCGCAGTCTAGAACACAAGCGGGGTGACGAAATCGTGAGAATCAAGGAAAGTGATGTTGAAGCAGCCTTGAGGCTCCATAATGTGAACCAGTTGGAGAAGAGGAAGGGTAACTCTGGTACAATGAAACCGATGTCGGCCCACGATGAAAAAACTCTCAATACGGCCGTGGCTATGGCCAATGAGATTACTACCAAGTCCATGAACGATCTCGGCGGTGTGACCAAGCCATTGAACTCACCGAGCTCCAAATTCCACTTCAGGTTCCCGTTAGTGACATCGCTTAACACCCACAGCATCCATCACGAATCTCCCAAGGAAAATGGAGTCCAACACCGCAACTTCACGGAGCAAGCGAGAAGCGTTCCTGATATACAGGCTTCCCTCACTGAGGAGAGTCGTCAGGCGTACGTGAGTTTGATCGAACAGCCTGTACCATCACGAGCTATGAGGGCCCAGTTGTCCCGCGAGATGCCAAGCACGTCCGGACTGCACTACACACCGCCGCCGCCGCCGCCTCCGCCGTCCATCACACCGCATGTCACGATGCCTAAAATTGAACATGAGCTTGGAAATTCTTCACAGGAGTCAGATTCAAGTGATGACGACGATGACAGCGAGAGCGGAAACGTCATTCCACTACCGCCTCGGGGCACCAAGCCGAAATTAGTTGAAAAACCGCGCCACGTTCGTAAATATCCTCTGCGTCTACCAAACGAGCCACCCGCGAGCGTAGTGCCAGAGCGTCCAGCGGCTGCTATATATCAGAACACGATCACTTGCGCCATCCACGGAGTGCAGATTGTAGACGGGCCAGCACCGGGGCCGAGTGGTGTCGACGTGTATGATGGCCCAGAACATAAAGACACCAATCCGTTTAATTCCATGCAGGACAGCGCCTCGCCTTCAGTGAACCCATTCCAATGTTACATATCAGATAATGTCGACGACAACTCCGATGACGAGTATGAAGGCCCCGTTCAAGTCGATGGACCTGGGGTTATAGAAGTCTCTCTATCCAAAGATCATGTATCAGTGGAGGATTTGCTAGAGTTTGCTGATCAAAAGCCGTCAGCTCAGCAGCGCGGCGTCGAGTCTGATGAAGTCAGGATTATGAACAAAGTCCTCAAACTTGAAGTTTCACCGGAAGAATGTCTTGAAGCTTTGGAGTTCAGCAACTGGAATGTCCACAACGCTATCAAAGTGTTGCGCGTCAAGATAGCATCCAAGAATGACAAAGTATCTCTGGAAGAATGCCAACGCGAACTGGATTCAAATGACGGAGATATACTTAAAGCAGCGAAACTCCTCCGCGATCACAAGATCAGCGAGTAA

Protein sequence:

>DPOGS212581-PA
MDNSANDQEDLIAFLEEADLEQYYGCFRDILKVTKLSQLKFVAVEDLIQIGLSKPEQRRYKKLYSRYFPNTYITKLKRLISGHRKADLMKQEQINVTLDVQTSDMNVKVPTKHIISIEDITINKELGMGQFGVVQQGTWNTGNQRLQVAIKCLGNEKMTSNSTEFLKEAAVMHSIEHPNIVRLYGVVLHLDSLMLVTELAPLRSLLECLREVTLRIDFSVAVLCEFAEQICDGMTYLEKKRLIHRDLAARNILVFCKDRVKISDFGLSRALGVGKDYYQTNYNVNLKLPVAWCAPECILYLRFTSASDVWAFAVCLWEMFTYGFQPWAAFSGQQILEAIDAPNFQALFNSLSNLVRLERPDCCPDSYYSLMLECWSHDPNDRPKFKDLAIKLRGIRPEEVQAISTYKKHEDGRRSNFLEYKTGQKITVIGKQNYTKSAMWYGVLPGGACGLFDPNQTKPYVKPERSLPGSLSHPTRNSTRNIRSSLLRSDLKRHTYSGKRTIQRSMISSPQGDVKHTSHFGLDGAYFGDISFLDSAGIPPRQVVTPYKPSEDLEQMPLINPHSPSHFSAFPLLDSPKQDNDTNPDGKFKGMIRPLKTINEPTTSKSFIYKVKSATLGRSNKSDEGDNNNEVHEYHEISDTDTESIAAENKGEGTPESPGIVVKSYADFSKSLLEEMETIFRSLEHKRGDEIVRIKESDVEAALRLHNVNQLEKRKGNSGTMKPMSAHDEKTLNTAVAMANEITTKSMNDLGGVTKPLNSPSSKFHFRFPLVTSLNTHSIHHESPKENGVQHRNFTEQARSVPDIQASLTEESRQAYVSLIEQPVPSRAMRAQLSREMPSTSGLHYTPPPPPPPPSITPHVTMPKIEHELGNSSQESDSSDDDDDSESGNVIPLPPRGTKPKLVEKPRHVRKYPLRLPNEPPASVVPERPAAAIYQNTITCAIHGVQIVDGPAPGPSGVDVYDGPEHKDTNPFNSMQDSASPSVNPFQCYISDNVDDNSDDEYEGPVQVDGPGVIEVSLSKDHVSVEDLLEFADQKPSAQQRGVESDEVRIMNKVLKLEVSPEECLEALEFSNWNVHNAIKVLRVKIASKNDKVSLEECQRELDSNDGDILKAAKLLRDHKISE-