Monarch geneset OGS2.0

DPOGS210157
TranscriptDPOGS210157-TA3318 bp
ProteinDPOGS210157-PA1105 aa
Genomic positionDPSCF300379 - 147039-155870
RNAseq coverage239x (Rank: top 43%)
Annotation
HeliconiusHMEL0119820.070.71% 
BombyxBGIBMGA004082-TA0.075.16% 
Drosophilahop-PA8e-10327.76% 
EBI UniRef50UniRef50_E0VH770.038.91%Tyrosine-protein kinase n=2 Tax=Neoptera RepID=E0VH77_PEDHC
NCBI RefSeqXP_002425471.10.038.91%tyrosine-protein kinase jak2, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420093910.038.91%tyrosine-protein kinase jak2, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420093910.039.18%tyrosine-protein kinase jak2, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00047132.2e-84protein tyrosine kinase activity
GO:00046723.4e-75protein kinase activity
GO:00064683.4e-75protein phosphorylation
GO:00167724.2e-64transferase activity, transferring phosphorus-containing groups
GO:00055241.6e-45ATP binding
GO:00046741.6e-45protein serine/threonine kinase activity
GO:00055153.5e-05protein binding
KEGG pathwaynvi:1001189960.0 
 K04447 (JAK2)maps-> Chemokine signaling pathway
    Leishmaniasis
    Adipocytokine signaling pathway
    Jak-STAT signaling pathway
InterPro domain[835-1100] IPR0206352.2e-84Tyrosine-protein kinase, catalytic domain
[836-1095] IPR0012453.4e-75Serine-threonine/tyrosine-protein kinase
[807-1097] IPR0110094.2e-64Protein kinase-like domain
[835-1101] IPR0022901.6e-45Serine/threonine-protein kinase domain
Orthology groupMCL10466 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210157-TA
ATGGCAGCGTCTGAGACAGTTAAGGTATCTGTCGTCATAGATCCGACGCCAAATATTATTCCCTGCTCAACAACTTTTACTGCTGAAGAATTGTGTATAGTTCTGTGTAAGAAATATAACATTCCGCCACTGACGCGTACCCTGTTTGCATTACGGATCAAAGGCTCAGACCTTTTCCTCAAAGATAACAGCAAAGTTCTTCTGAGTTCAAGAGATTATGAATTAAGGATACGGTTCAAGGTACCTCGACTGGGTCTGTTAATTACTTTAGATGAAACAACTTATGATTACTACTTTCAACAAGCAAGAAATGATGTTAATGAAAATAAAGTGCCTGAAATTAAGTACCCTGAAAATAAACAGGAACTTTTGGGTCTTGGCATAGCTGATATGACAAGGGCAATTATAGAAGAGAAACTACAATTAAATGATGTAGTTCGTAATCACAAGAAATATATTCCCAAAATTATAATAAGGAGACATGGTCCATATCCAAAAAAATATGCCATAGATTATCTACCGAAACTGTGTTCTGTGGGTCACAATGTTAACTATGTTAAAAATCTATATTTACAACAATTATATGGATTGGCACCAAACTATTTGGCTGAAGAGTATAACAATGTACTTTGGCAGAATGGTAGTGATGTATTGCCCGTTAAAGTTGTTGTTTCACCATTTCATCCCCACCAGCCAGGAATACGGCTTTTTAATATTACTAAAAGAGATTGGTTTCATGTTTGTACTATTGAGGAGTTGATATATCTGACTAGAAACGGAGACAACTGTTTGGAGATATCAAGACGTGGGACACCTTTGTTTCTTAAGTTCAAAAATGAGGAACAGTTATCATCGTTTATATCCCTATGTGATGGATACTATAGGTTGATGGTAAAGTGGACATTTAATTTGTCGAAGGATGATGAAACACCTTCGTTAAAGGAGCTCCAGAGAATAAAATGCCATGGGCCTGTGGGAGGTGCATTTTCATATCGTAAGCTTGAAGAGAAGCGATCTAAGAAGCATGGATGTTACATTCTTAGGCAATGTCAGGACGATTACAATGTATATTATTTGGATGTTTGCTCCAAAAATAGTACAACTGAAACATACAAAATAGAATTTAAAGGCCACTGCTATATATTTAATAAGGAAGAATATTTTAGCATCGAAAGATTGGTCAGTTGTCATCAAAATCCTGAAGGAAGGATATTTCTAAATGAATGCATACCACCTTCAGAGTATGATAAGTCTCAGTTACTGTTGTGTGGTGAACCATTAAAGAAAGGAGTTAGAATTGACCAAGCGGAGTTGCAAGAGATTTTAAAAGATAACAAAAGTCCAAGATGTCTGCCAAACAAAGATTTATTATTATACACTGGTTCTGAAAAAGTGGGTTCCGAAAATATAACAGCGACCTACAAGGCTCTTTGGCGTCTGGATGAAACTAAGAAATTGGTTGTGGCATTTAAGACATTACAAAGAGAAAAAGCTAATGATTATTTAAAGGATTTCATTGAGTTAGCAAGTAAATGGGCGTGCGTGCAATCTAGTTCGATAGTGAGGTTGTATGGAGTTACACTGAGTTCACCAACCGCTATGGTGTTGGAATACTTGCCCTATGGCCCCTTCGATGTATATCTTAGGGAGAATGAGGAGAATGTTAAGCCTATACACCTGAAGAAGGTTGCAGCGGGTCTGGCTCGTGCTTTATGGGATCTCTCAGAGGCTGGTATAGTACACGGAGCGATACGATGTCGCCGTTTACTACTGGCCTCTCACCACGATGACCGCATCATCGTCAAGCTCTCAGGACCGACACTCAGACAGTACTCGCCGCTCGATGTTCACTGGATGCCGGTAGAGTTCTTCGCTGATATGAACTTAGCGAAGAGATCTGTTTTAGGGGACATCTGGGCTTTCGCTACCACCCTGTGGCAGGTTTTCTCATACGGACACTCTCCGAATGATACCAATCCAGTTTTAACTGCAAGAAGTTACGAGATGGGTGACAGATTGCTTCGTCCGTCCCGGTGTCCTGGCGAGGTTTGGGCCTTGGTCAGATCCTGTTGGCAGAGCGACCCTCCAAGACCACAGGAGATTATGCGAGATATGAATCATATGCTGCATAGAGAATATGTTCCTTTACATGAGTACGAAGAACCAAAGATTTCATTGGATCACATGGAACATGCAGAAACGGTGTCGAGCGATCGTTTTATTCCCAGCGAGTTGAGTGACGCCGGCAGCAATAAGTCGTTAATATCAGTAGACAGCAGTGTACCGTCAACTAACGGGACAGTATCGGACTCGTATGATAATCCATTCGCTGACAACAAGAGTTCAAACTCGTTGGAATCAATGAATGCATTGACCTACGCACTGCGGTCGGAGGCTGTGATCAGTCGTAGCAACAGCGCGTGTGGACCCGACGAGCCGGATGATGGTGCGGGCGCCCCAAGACTCATGGAATCCATCGTGTCCCAGGGGAAGACATATCTCGTTACCATAACAAAGAAAATAGGAAGTGGCAATTACGGGCACGTCTTCAAGGGCTGGATGGAACGTGATAATCAGGAATCTCAAAGGAAAGAAGTAGCTGTTAAGAAATTAACTCGGCAAGCTTCAGAAAGAAATGGAAGCCTTTACGAGGACTTTAAAAATGAACTGGAAATTATGAAGTCCCTACAACACATCAACATAGTAGAGATCCTCGGTTATTCTTGGGATCACAGTTCGGATGTGCTGATAGTCATGGAATATTTAGAGGAGGGTTCCCTTAACTACTACCTCAAGTTCCAGGGAGATAAGCTAAGGATATCACATCTTTTGAAATACGCCAAAGATATTGCAACGGGCATGGATCACGTGTCAGCGAAGAACGTCGTGCATAGAGATCTAGCGACAAGGAACATTCTAGTTGTGAACAAATATCATGTGAAGATATCCGACTTTGGCTTAGCGAGGATCATACCTAAGGAGGAGAGTGCATACAGAATTAAGACAGAACGCCTTCTACCTATCAATTGGTACGCTCCGGAATCAGCAGTAGAGCCGTGGCATTTCTCCAGTAAGAGTGATGTGTGGTCGTACGGTGTTACAGCTTGGGAGATATTTACACGGGCTAGGACTGAAGTGCCCAAGTTTGATGTGGAAAGACCCAGGGAAAGGGCGTCGTGTTTTCAAATACCAGAAGGTTGTCCATCGGAGATATTCAGACATCTGATGAAAGAATGCTGGGCCCTAGACCCAAATTTACGTCCCAAGTTCATTGACCTCGTGCATATGTGCAAGCGATTTATGGATGAATACCAGTGA

Protein sequence:

>DPOGS210157-PA
MAASETVKVSVVIDPTPNIIPCSTTFTAEELCIVLCKKYNIPPLTRTLFALRIKGSDLFLKDNSKVLLSSRDYELRIRFKVPRLGLLITLDETTYDYYFQQARNDVNENKVPEIKYPENKQELLGLGIADMTRAIIEEKLQLNDVVRNHKKYIPKIIIRRHGPYPKKYAIDYLPKLCSVGHNVNYVKNLYLQQLYGLAPNYLAEEYNNVLWQNGSDVLPVKVVVSPFHPHQPGIRLFNITKRDWFHVCTIEELIYLTRNGDNCLEISRRGTPLFLKFKNEEQLSSFISLCDGYYRLMVKWTFNLSKDDETPSLKELQRIKCHGPVGGAFSYRKLEEKRSKKHGCYILRQCQDDYNVYYLDVCSKNSTTETYKIEFKGHCYIFNKEEYFSIERLVSCHQNPEGRIFLNECIPPSEYDKSQLLLCGEPLKKGVRIDQAELQEILKDNKSPRCLPNKDLLLYTGSEKVGSENITATYKALWRLDETKKLVVAFKTLQREKANDYLKDFIELASKWACVQSSSIVRLYGVTLSSPTAMVLEYLPYGPFDVYLRENEENVKPIHLKKVAAGLARALWDLSEAGIVHGAIRCRRLLLASHHDDRIIVKLSGPTLRQYSPLDVHWMPVEFFADMNLAKRSVLGDIWAFATTLWQVFSYGHSPNDTNPVLTARSYEMGDRLLRPSRCPGEVWALVRSCWQSDPPRPQEIMRDMNHMLHREYVPLHEYEEPKISLDHMEHAETVSSDRFIPSELSDAGSNKSLISVDSSVPSTNGTVSDSYDNPFADNKSSNSLESMNALTYALRSEAVISRSNSACGPDEPDDGAGAPRLMESIVSQGKTYLVTITKKIGSGNYGHVFKGWMERDNQESQRKEVAVKKLTRQASERNGSLYEDFKNELEIMKSLQHINIVEILGYSWDHSSDVLIVMEYLEEGSLNYYLKFQGDKLRISHLLKYAKDIATGMDHVSAKNVVHRDLATRNILVVNKYHVKISDFGLARIIPKEESAYRIKTERLLPINWYAPESAVEPWHFSSKSDVWSYGVTAWEIFTRARTEVPKFDVERPRERASCFQIPEGCPSEIFRHLMKECWALDPNLRPKFIDLVHMCKRFMDEYQ-