Monarch geneset OGS2.0

DPOGS206290
Transcript	DPOGS206290-TA	2331 bp
Protein	DPOGS206290-PA	776 aa
Genomic position	DPSCF300290	+	300851-312331
RNAseq coverage	286x (Rank: top 38%)
Annotation
Heliconius	HMEL013119	2e-71	64.96%
Bombyx	BGIBMGA010747-TA	0.0	75.47%
Drosophila	Gprk2-PA	5e-154	58.38%
EBI UniRef50	UniRef50_Q7PQK8	1e-164	56.73%	AGAP004117-PA n=16 Tax=Bilateria RepID=Q7PQK8_ANOGA
NCBI RefSeq	XP_002423550.1	4e-172	61.44%	cAMP-dependent protein kinase catalytic subunit, putative [Pediculus humanus corporis]
NCBI nr blastp	gi|383851812	7e-172	58.53%	PREDICTED: G protein-coupled receptor kinase 2-like [Megachile rotundata]
NCBI nr blastx	gi|193669324	2e-176	60.78%	PREDICTED: G protein-coupled receptor kinase 2-like [Acyrthosiphon pisum]
Group
Gene Ontology	GO:0016772	5.1e-61	transferase activity, transferring phosphorus-containing groups
	GO:0005524	2.1e-45	ATP binding
	GO:0006468	2.1e-45	protein phosphorylation
	GO:0004674	2.1e-45	protein serine/threonine kinase activity
	GO:0004672	1.3e-43	protein kinase activity
	GO:0004871	1.3e-11	signal transducer activity
	GO:0004713	2.4e-09	protein tyrosine kinase activity
	GO:0007165	4.3e-09	signal transduction
	GO:0004703	4.3e-09	G-protein coupled receptor kinase activity
KEGG pathway	phu:Phum_PHUM066650	1e-171
	K08291 (E2.7.11.16)	maps-> Chemokine signaling pathway
		Endocytosis
InterPro domain	[336-656]	IPR011009	5.1e-61	Protein kinase-like domain
	[341-560]	IPR002290	2.1e-45	Serine/threonine-protein kinase domain
	[342-521]	IPR017442	1.3e-43	Serine/threonine-protein kinase-like domain
	[175-336]	IPR016137	8.3e-25	Regulator of G protein signalling superfamily
	[196-326]	IPR000342	1.3e-11	Regulator of G protein signalling
	[341-572]	IPR020635	2.4e-09	Tyrosine-protein kinase, catalytic domain
	[93-105]	IPR000239	4.3e-09	GPCR kinase
	[561-640]	IPR000961	3.5e-08	AGC-kinase, C-terminal
Orthology group	MCL10689	Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206290-TA
ATGGAGCTTTTGAGTTTCCAGGCGGAGAGATTCGAAGTTGAGGTGGACGAGACGAGGGTGTCGCTTGCAGCCGAAGTTTTCGGACGGTATCTGAAGACGGATGACGTGGATGCCGTCACTGACGTAGTACCATCAGACGTCATCAATGATGCCAGCAAACTGCTCGAAGATTTCACTTTCTCGGGCGGTTCCAAGGACATTTTTAGCGAATGCATCCGGTGTGTGAAGAGTCACTTGGCGGGTACTCCGTTCGCGGAGTTCGAGAGGTCGATGTACTTTCATAGATACTTACAGTGGAAATGGCTGGAAGCTCAGCCGGTCACCCAGAACACCTTTAGGATGTACAGAGTCTTGGGAAAGGGAGGCTTTGGAGAAGTCTGCGCGTGTCAGGTGAGAGCCACTGGCAAAATGTATGCATGCAAGAAATTGGAAAAGAAACGCATCAAGAAGAGGAAAGGCGAGGCCATGGTGCTCATCGAGAAGCAAATCCTGCAAAGAATAAACTCGAGATTCGTCGTTAATTTAGCGTACGCCTACGAAACTAAGGACGCTCTGTGTCTGGTTCTAACTATTATGAACGATGTAGAATACGACTACGTGGTAGACCAGCAGCCGATAGGGAAGCTGCTCTTCAGGCAGTTCTGCGAAAGGAATCGTCCTCACTACCACAAATACAACAGTTTCTTGGATGCGCTTTTGAGTTTCCAGGCGGAGAGATTCGAAGTTGAGGTGGACGAGACGAGGGTGTCGCTTGCAGCCGAAGTTTTCGGACGGTATCTGAAGACGGATGACGTAGATGCCGTCACTGACGTAGTACCATCAGACGTCATCAATGATGCCAGCAAACTGCTCGAAGATTTCACTTTCTCGGGCGGTTCCAAGGACATTTTTAGCGAATGCATCCGGTGTGTGAAGAGTCACTTGGCGGGTACTCCGTTCGCGGAGTTCGAGAGGTCGATGTACTTTCATAGATACTTACAGTGGAAATGGCTGGAAGCTCAGCCGGTCACCCAGAACACCTTCAGGATGTACAGAGTCTTGGGAAAGGGAGGCTTTGGAGAAGTCTGCGCGTGTCAGGTGAGAGCCACTGGCAAAATGTATGCATGCAAGAAATTGGAAAAGAAACGCATCAAGAAAAGGAAAGGCGAGGCCATGGTGCTCATCGAGAAGCAAATCCTGCAAAGAATAAATTCGAGATTCGTCGTTAATTTAGCGTACGCCTACGAAACTAAGGACGCTCTGTGTCTGGTTCTGACTATTATGAACGGTGGTGATCTGAAGTTCCACATCTACAACATGTGTGGTGCTGAGAGCGGTTTAGGGCTAGAAAGGGCAAGGTTCTACGCGGCGCAGGTCGCCTGCGGACTTGAACACCTCCACAGGATGGGCATCGTTTACAGGGATTGCAAGCCGGAGAACATTTTGTTGGATGACGTTGGCCACGTCCGGATCTCGGATCTGGGGCTGGCTGTGGACGTGCCGGAGAGCGGCGGGGTCAGGGGTCGGGTGGGCACAGTGGGGTACATGGCGCCCGAGCACGAGCAGGAGAAGTACAGCAGTAAGTTTAGTGACTGTGCGCGGGCGCTGTGCTCGTCTCTCCTGGTGAAGTCCGCGGCTGGGCGGCTCGGGGCGGGCGGAGGGCGGCGCGGGGCGAGGCAGGTCAAATCTCACAGGTTCTTCGCCAACATGAACTGGGCTAGACTCGAGGCCGGCATGGTAGAGGCGCCCTTCGTACCGGATCCTCACGCGGTTTACGCTAAGGACGTGTTGGATATCGAGCAGTTCTCCACAGTGAAAGGCGTCAACCTCGACGCTGGTGATGATTCATTCTACTGCAAGTTCAGCACCGGCTCCGTCAGCATACCCTGGCAGAGAGAGATGATAGAGACCGGGTGCTTCAATGAGCTGAACGTGTTCGCGGAGGATTCTGGTCGTGAGTGTCGCAGTCAGGACCTGTCTCTATCACCCACCCCGCCCCGGGAGACCACCTCCGGCTGCTG
CGGACTCGCTCACAGGGTAACACACACATACACACACATAGAGGATTCTGGTCGTGAGTGTCGCAGTCAGGACCTGTCTCTGTCACCCACCCCGCCCCGGGAGACCACCTCCGGCTGCTGCGGACTCGCTCACAGGGTCCTTTGTCCCCAAAAGAAGATTCCAGCTCGTCTTCGTCCTATCCCGGTCCCAGAACATTTGTTGCAACCTTCATCGCCAGCGCCCAACTCCTCCAACGATACGCCCAACAAGACGCCGGACGTTCAAACCACACCGAACGATGTTCAAAACGCACCGAACGAGGTTCAGACGGGAACCGAACAGAACAGCTGA

Protein sequence:

>DPOGS206290-PA
MELLSFQAERFEVEVDETRVSLAAEVFGRYLKTDDVDAVTDVVPSDVINDASKLLEDFTFSGGSKDIFSECIRCVKSHLAGTPFAEFERSMYFHRYLQWKWLEAQPVTQNTFRMYRVLGKGGFGEVCACQVRATGKMYACKKLEKKRIKKRKGEAMVLIEKQILQRINSRFVVNLAYAYETKDALCLVLTIMNDVEYDYVVDQQPIGKLLFRQFCERNRPHYHKYNSFLDALLSFQAERFEVEVDETRVSLAAEVFGRYLKTDDVDAVTDVVPSDVINDASKLLEDFTFSGGSKDIFSECIRCVKSHLAGTPFAEFERSMYFHRYLQWKWLEAQPVTQNTFRMYRVLGKGGFGEVCACQVRATGKMYACKKLEKKRIKKRKGEAMVLIEKQILQRINSRFVVNLAYAYETKDALCLVLTIMNGGDLKFHIYNMCGAESGLGLERARFYAAQVACGLEHLHRMGIVYRDCKPENILLDDVGHVRISDLGLAVDVPESGGVRGRVGTVGYMAPEHEQEKYSSKFSDCARALCSSLLVKSAAGRLGAGGGRRGARQVKSHRFFANMNWARLEAGMVEAPFVPDPHAVYAKDVLDIEQFSTVKGVNLDAGDDSFYCKFSTGSVSIPWQREMIETGCFNELNVFAEDSGRECRSQDLSLSPTPPRETTSGCCGLAHRVTHTYTHIEDSGRECRSQDLSLSPTPPRETTSGCCGLAHRVLCPQKKIPARLRPIPVPEHLLQPSSPAPNSSNDTPNKTPDVQTTPNDVQNAPNEVQTGTEQNS-
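
Length check (illustrative sketch, not part of the original record): the stated transcript length of 2331 bp matches the stated protein length of 776 aa if one stop codon is counted, since 3 x (776 + 1) = 2331. The trailing "-" on the protein sequence is assumed here to mark that stop. The helper names read_fasta and codon_count_matches are hypothetical and belong to no database tool.

```python
def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped sequence lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is not None:
            records[header].append(line)
    return {name: "".join(parts) for name, parts in records.items()}


def codon_count_matches(transcript_bp, protein_aa):
    """True when the transcript is exactly the protein's codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# Lengths as stated in the record header (2331 bp, 776 aa).
lengths_consistent = codon_count_matches(2331, 776)
```

The same check can be run on the sequences themselves by passing the two FASTA records above through read_fasta and comparing len() of the nucleotide sequence with len() of the protein sequence after stripping the trailing "-".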