Monarch geneset OGS2.0

DPOGS211050
TranscriptDPOGS211050-TA3717 bp
ProteinDPOGS211050-PA1238 aa
Genomic positionDPSCF300202 + 277556-286500
RNAseq coverage298x (Rank: top 37%)
Annotation
HeliconiusHMEL0043320.060.64% 
BombyxBGIBMGA003800-TA0.061.64% 
DrosophilaInR-PC9e-6242.71% 
EBI UniRef50UniRef50_D6WJG81e-15934.63%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WJG8_TRICA
NCBI RefSeqXP_397038.32e-16034.08%PREDICTED: similar to Insulin receptor precursor (IR) [Apis mellifera]
NCBI nr blastpgi|3287825241e-15933.60%PREDICTED: hypothetical protein LOC413596 [Apis mellifera]
NCBI nr blastxgi|910822031e-14934.39%PREDICTED: similar to melanoma receptor tyrosine-protein kinase [Tribolium castaneum]
Group
Gene OntologyGO:00047131.4e-112protein tyrosine kinase activity
GO:00046721.9e-81protein kinase activity
GO:00064681.9e-81protein phosphorylation
GO:00167727.5e-76transferase activity, transferring phosphorus-containing groups
GO:00055242e-38ATP binding
GO:00046742e-38protein serine/threonine kinase activity
KEGG pathwaytca:6615245e-69 
 K04527 (INSR)maps-> Aldosterone-regulated sodium reabsorption
    Insulin signaling pathway
    Adherens junction
    Type II diabetes mellitus
InterPro domain[904-1171] IPR0206351.4e-112Tyrosine-protein kinase, catalytic domain
[905-1170] IPR0012451.9e-81Serine-threonine/tyrosine-protein kinase
[876-1194] IPR0110097.5e-76Protein kinase-like domain
[904-1178] IPR0022902e-38Serine/threonine-protein kinase domain
[432-754] IPR0018282.1e-10Extracellular ligand-binding receptor
Orthology groupMCL15553 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211050-TA
ATGTGTCGAGTGTCGTTTGTAGTGCTCGTTGTTCGCAGTTCAGTGTGCAGTGTTTACGCAGTAGTGAATAGTGACTGGGGATGTTGTGTGTGGACGGAGACAAAAGGACAGACTTACGGCGGACAGAATACAGCGGTTGGGCCCACGAGACCGGACCTGGACTCGGTATTTTGGAGACACATCATCTGCATCGTATTCATCTTCCTCGTCTATAGTTGTTCAGGCCAGCACGAGGAATTCGTCGCTAGCGGAAATGAAGATGCCGGACCGGTGCTGTCGGTGACAGCGTGTGTTCCGGGAGGCGGTGCAGCAGCTCTGGTGCAGGTCGCTGTGTTGTTGCTGCAGGCAGCAGGGGTGCCGGCTCGAGGCCTGCCGCCGCCCGCCTCCTGCCTGCCTCCTCATGCCGCGGACGGAGCCTCCTGCCGCGACGCCGTCGTGTCCGCACTCACTCACGGCCGACTGCACGTGTCGCTGCTGAGCGTGGCCGCCGCCAGTGCCTCCCGCCTGCGCCCTCTCCGGGACGCTAACGTGGACGAGCTGGGTGACGTGACCCCACCGAGCCGGCGCTTGTGTCTAGCTCCGGCCCCCTTACACTCCATTCTGGACCTCGCAGACCCGTCTATAGCAGGTCACTACTACCTGCGACCGGACGAGGTCGGCTCCATTGAATACTCTTTCACAAATGACTCTTCTTCGAGATACGATGGATCCACAGTTCTAAGCCCCGAGTGGTGTAAACCTTCTATCAAATGTGCCGCTATTTTGACAAGTGATATGGGAGAAGCTCTTATACTTTATAATACTATACAAGAGCAAAATCTGTACGCTGTTGTTTATGTACTTGGAGATAACTTAGCAACAGTAGCTACTAAACTAAAAGGAAAATTTATAATTTGTGATTGGGCTCCCGACACCAAGGAGCTATTGGGAACCAGAAGTCTTGGCCCACCACCATGTACACAGGACGCCGACAGTTGCCCTTTCGAAAGCCGTCGACTAGTTAAACTTGTCAACACCAGAGCCTTGGCCTTGCTACCGGGTGCAGTACAGGCTTTATTTCGACTCAGTATAAGTACAAATGAAATGCTAGAATTAAGAAAATTAGCTCGTTTTTCGGAGCCCAAAACTGCAGCTCTCAGATTTCTCTTCGCTCACCCTACTAAAAGAGTGATTAGGGAAGTACGAGTCGTGGTCCTTATACCAAATCCGACACCACGGGAAGCTTACGATGCACCATCACTGGTTGCCGCGTCAGCTTTAGCGGAAGCCGACTTGGAGGCTCACTGGTCTAAGGCTTCCAGATTTAAGGTGATAACTCACGACGATCACTGTGATGCGACGCATTCCTTCCAATACATAAGTGACGCACGCGTTTCTTCCGATTACTTGGAGTTGTCTGCGGTGGCGGGACCCGCTTGTGGAGCAGCCTTCGCTGATGTCGCCCGCCAGTCCACGTCTCACGGCCTGCTCGCCATTTCGTACACACCACAGGCTCCTCCACCCTCACCTGCCGCTGCCCTTACTCTGCTGGCGGCTGGTGATTCACGCATGTATGCGTCCGCGCTTGGCTCGCTGTTTGCCGAGCTAGGGTGGAGGCGGCTGGCAGCCCTCAGCGAACCGGCCACACGAGCGTCTCTGGCTTCCGCTCGTCTTCAAGCAGACATCGTTGCCCACTTGGAGTTGCCGGATGACCGCGCTGATTACGAACCTGAAATCTTTATACAGTGGGCAGAGCGCGTGGTGAGAGCTGACGGCCGCGTCGTATGGTTGTGTGTCGAAGATGCACGGGCAATGCGAGCTGCGCTTTGTGCAGGGCTTGCAGCCGGTCTGGAGCCGAGATCTGGAATCGCCTGGTTGCTGCCAGCGGCGCTGCCACCCTCCGCGTATAGAGTTGGACCTCGTGATGGGTGCTCCCAAGACCAGCTCGACATGATGCTAGAAGGTCACATAAGTGTCGCTCCAAACTGGTTGTTGCCTTGGCTCCAGAATGCTCATGGACGGAATTCTTCTGTAACTACAGACTTGAAGGAATCGAGCGGAGACAGTGTGTCTAATTGGACGTCCCGATGGCGAACTCAGTGCGGGTTCATAGACGGTGGGTGCACAACTCCGGGACCACACGCGGCCCTCCTCTACGATGCCCTTACATTGTGGGCTAATACGCTCACCGATCTTTTCCAGACTAACTCCACCAAATTCTACGACCTGCATAATCGACAACTTCTCCGATCCCTCGTCAGGAAAGCTACTAAAACCAGTTTCGTGGGAGAGCTGACCGGACGGTTTGAATGGATGGCTGTGAATAATGAGGACGACAACGGTACGGCGTACGCTCGCTCTGCTCCGCTGGTCATACTGCAATGGAACGGAGGAGTTAGACGCGAGGTTGCACATTGGAACCGTGGTCGACTAGAACTAGTCTTGGGTGCTCTGCGGTGGAGTACGGCCGACGGACGCGCCCCGCGTGACAGTTCCGACCACTGTGCTCTTCAAACCATCGCAGACATCCTTGGAGGAGATTGCCGCACAGCTTTTATCGTTTTAGGAGTTCTCATGTTGTTATCGGTCACTGTAGCCCTGAGTACCGCCGCATTTTATTGCAAGAAACGAGCGGAAAGAGAATATAAATCTCGGCTAGAAGCATTAGGACTACATCCATTAGTTCCGAAGACTGTAGGACTTGATCGATGGGAAATATCAAGAGAACGGGTGGTTATAAATCGTAAATTAGGCATGGGTGCCTTTGGAACCGTTTACGGGGGTCATGCACTTCTTGCAGAGGATCGAGGATGGACTGCAGTGGCGGTCAAGACTCTGAAAGCGGGAGCAACGACCGAAGAGAAACTCGATTTCCTTTCGGAAGCCGAGGCAATGAAGCGTTTCGATCACAGAAATGTCATTCGACTTCTGGCCGTCATAACGAAGACGGAACCTGTGTGTACGGTCATGGAGTTCATGTTGTACGGGGATCTCAAAAATTACCTGCTGGCTCGACGGCATCTGGCGTGCGGTGGAGAGGACGCAGACGAGCAGGTCTCAGCGAGGCGCTTGACGGCAGCGGCACTGGACGTGGCGCGGGCTCTTGCTTATCTGGCACAGCTGCGGTATGTGCACCGTGACGTTGCCGCCCGAAACTGTCTCGTCAGTGCCCGCCGGGTCGTCAAGCTGGCTGACTTCGGGATGACCAGACTTGTATTCGAGAATGATTACTATCGATTTAGTAGAAAAGGAATGTTGCCAGTACGCTGGATGGCACCAGAGAGTCTAGCTCTCGGAGTATTCTCACCAGCGTCGGACATCTGGTCATTCGGCGTTCTTCTCTATGAGATCGTGACGTTCGGGTCTCTTCCCTTTCAGGGACTTAGCAATGCCGAGGTACTCACGAAAGTGAAGGCTGGACACACGCTCGATCTACCACCAGGACTGAAGCCTCAGTTGGAGGCGCTCATCAAGTCATGCTGGCAGCAGGACAGCAAGTCGCGACCGACGGCGGACGAAGTGGCAGCGACGCTGGAGGACGCGCCACGACTATTGGCGCCGTGTCTGGACGTACCTCTGGACGCCCTGCCTCTAGACGCTGAACCACCGTGGCGTCTCCCGCGCGACCGCGCTGAGGCTCGCTGGTTGTCTTGGGCCGCTCCTACCTCGGCCGCCACCGATACCACCTACCTCAGTGCTGAAACGCAGCCACGGGACACAGACGCCTTTCTACCCTGA

Protein sequence:

>DPOGS211050-PA
MCRVSFVVLVVRSSVCSVYAVVNSDWGCCVWTETKGQTYGGQNTAVGPTRPDLDSVFWRHIICIVFIFLVYSCSGQHEEFVASGNEDAGPVLSVTACVPGGGAAALVQVAVLLLQAAGVPARGLPPPASCLPPHAADGASCRDAVVSALTHGRLHVSLLSVAAASASRLRPLRDANVDELGDVTPPSRRLCLAPAPLHSILDLADPSIAGHYYLRPDEVGSIEYSFTNDSSSRYDGSTVLSPEWCKPSIKCAAILTSDMGEALILYNTIQEQNLYAVVYVLGDNLATVATKLKGKFIICDWAPDTKELLGTRSLGPPPCTQDADSCPFESRRLVKLVNTRALALLPGAVQALFRLSISTNEMLELRKLARFSEPKTAALRFLFAHPTKRVIREVRVVVLIPNPTPREAYDAPSLVAASALAEADLEAHWSKASRFKVITHDDHCDATHSFQYISDARVSSDYLELSAVAGPACGAAFADVARQSTSHGLLAISYTPQAPPPSPAAALTLLAAGDSRMYASALGSLFAELGWRRLAALSEPATRASLASARLQADIVAHLELPDDRADYEPEIFIQWAERVVRADGRVVWLCVEDARAMRAALCAGLAAGLEPRSGIAWLLPAALPPSAYRVGPRDGCSQDQLDMMLEGHISVAPNWLLPWLQNAHGRNSSVTTDLKESSGDSVSNWTSRWRTQCGFIDGGCTTPGPHAALLYDALTLWANTLTDLFQTNSTKFYDLHNRQLLRSLVRKATKTSFVGELTGRFEWMAVNNEDDNGTAYARSAPLVILQWNGGVRREVAHWNRGRLELVLGALRWSTADGRAPRDSSDHCALQTIADILGGDCRTAFIVLGVLMLLSVTVALSTAAFYCKKRAEREYKSRLEALGLHPLVPKTVGLDRWEISRERVVINRKLGMGAFGTVYGGHALLAEDRGWTAVAVKTLKAGATTEEKLDFLSEAEAMKRFDHRNVIRLLAVITKTEPVCTVMEFMLYGDLKNYLLARRHLACGGEDADEQVSARRLTAAALDVARALAYLAQLRYVHRDVAARNCLVSARRVVKLADFGMTRLVFENDYYRFSRKGMLPVRWMAPESLALGVFSPASDIWSFGVLLYEIVTFGSLPFQGLSNAEVLTKVKAGHTLDLPPGLKPQLEALIKSCWQQDSKSRPTADEVAATLEDAPRLLAPCLDVPLDALPLDAEPPWRLPRDRAEARWLSWAAPTSAATDTTYLSAETQPRDTDAFLP-