Monarch geneset OGS2.0

DPOGS200085
TranscriptDPOGS200085-TA3684 bp
ProteinDPOGS200085-PA1227 aa
Genomic positionDPSCF300044 - 175901-206082
RNAseq coverage292x (Rank: top 38%)
Annotation
HeliconiusHMEL0150660.096.96% 
BombyxBGIBMGA002069-TA0.092.79% 
DrosophilaAbl-PB0.076.06% 
EBI UniRef50UniRef50_F4WV600.063.73%Tyrosine-protein kinase Abl n=7 Tax=Acromyrmex echinatior RepID=F4WV60_ACREC
NCBI RefSeqXP_392652.20.064.42%PREDICTED: similar to Abl tyrosine kinase CG4032-PA [Apis mellifera]
NCBI nr blastpgi|3407094190.064.46%PREDICTED: tyrosine-protein kinase Abl-like isoform 1 [Bombus terrestris]
NCBI nr blastxgi|665011750.050.41%PREDICTED: tyrosine-protein kinase Abl-like [Apis mellifera]
Group
Gene OntologyGO:00047132.1e-141protein tyrosine kinase activity
GO:00046727.3e-97protein kinase activity
GO:00064687.3e-97protein phosphorylation
GO:00167722.9e-86transferase activity, transferring phosphorus-containing groups
GO:00055242.2e-56ATP binding
GO:00046742.2e-56protein serine/threonine kinase activity
GO:00055159.3e-34protein binding
KEGG pathwayame:4091270.0 
 K06619 (ABL1)maps-> Axon guidance
    Pathogenic Escherichia coli infection
    Viral myocarditis
    Shigellosis
    Pathways in cancer
    Neurotrophin signaling pathway
    ErbB signaling pathway
    Cell cycle
    Chronic myeloid leukemia
InterPro domain[272-523] IPR0206352.1e-141Tyrosine-protein kinase, catalytic domain
[273-523] IPR0012457.3e-97Serine-threonine/tyrosine-protein kinase
[261-551] IPR0110092.9e-86Protein kinase-like domain
[272-527] IPR0022902.2e-56Serine/threonine-protein kinase domain
[153-255] IPR0009809.3e-34SH2 motif
[94-151] IPR0014521.5e-16Src homology-3 domain
Orthology groupMCL11307 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200085-TA
ATGGGGGCTCAGCAGGCGAAAGAACGCGGTACGGCCAGCGGCGCGTCCATGCGTTCGACGAGAAACAAGCCGCGAGTACCCAAGGACCCCCGCATGCTCGGCTCCAACATTTTTACCGAACACAGCGAGGCCCTTCTGCAAAGTCGGCCATTGCCTCACATCCCCGATTTGCCGGACGGTGAAGGCACCTCCGCAGCTCCAGCTACGCCGCAACCCCTTGATGCGAACAGATGGACCTCTAAAGAAAACTTACTAGCCCACCATGAGGAAGACGATCCTCAACTATTTGTTGCACTGTACGATTTTCAAGCGGGCGGCGAAAATCAACTTAGTCTCAAGAAAGGAGAGCAAGTTCGTATAATGAGCTACAATAAAAGCGGCGAGTGGTGTGAAGCTCACACTTTATCCGGAGGTGTGGGGTGGGTTCCAAGTAACTATGTGACGCCGGTAAACTCATTAGAGAAGCACTCATGGTACCACGGGCCAATATCAAGGAATGCTGCTGAATATTTGCTCTCATCGGGCATCAATGGTAGCTTTCTCGTACGCGAGTCCGAGTCTAGTCCTGGTCAAAGAAGTATATCACTCAGATACGAAGGTCGCGTCTATCATTATCGTATAAACGAAGACTCGGACGGCAAGGTGTATGTGACGTCAGAATCCAAATTCGGCACCTTGGCCGAACTTGTTCATCACCACTCGGTGCTCGGTGACGGTCTTATAACTCAGCTGTTGTATCCAGCCCCCAAACGCAACAAGCCAACAGTGTTCCCATTGGCTCCTCCCGATGAATGGGAGATCGATCGTACTGATATAGTTATGAAACACAAGCTTGGCGGAGGACAATACGGGGATGTATACGAAGCCACTTGGAAGAGATGCAATATGACGGTGGCGGTGAAAACCTTAAAAGATGACACGATGGCCCTAAAAGACTTCCTTGAGGAGGCGGCCATAATGAAGGAGATGCGTCACCCGAATCTGGTTCAACTGCTGGGCGTTTGTACTAGAGAACCTCCTTTTTACATTATCACGGAGTTCATGAGCCGTGGCAACCTCCTGGACTATCTCCGTACTGGGAACCGTGAACACATCGACGCTGTGGTTCTCATGTACATGGCCACACAAATTGCATCCGGAATGAGCTATTTGGAAAGCCGCTCGTTTATACACAGGGATCTGGCAGCGAGGAACTGCCTCGTCGGCGAGAATCATCTCGTTAAAGTGGCTGACTTTGGTCTTGCTCGTCTCATGCGTGACGATACGTACACGGCCCACGCTGGTGCCAAGTTCCCGATCAAATGGACAGCCCCTGAGGGCCTGGCCTACAACACTTTCAGTACTAAGTCTGATGTATGGGCCTTCGGAATACTTCTATGGGAGATAGCCACATACGGCATGTCGCCGTACCCCGGCGTTGATCTGGCAGATGTGTATCACATGCTAGAGAAGGGCTATCGTATGGAGTGTCCGCCCGGTTGTCCCGCTGCCGTCTACGAGTTAATGAGAGGGTGCTGGCAGTGGAGCCCCTCAGACCGACCCTCCTTCCGCGAGATACATCACGCGCTCGAGCACATGTTCCAGGATAACTCCATCACCGAAGAAGTGGAGAAGCAGCTTCAAGAGGGTGTGTGTGCTACACCACAGATGTCTCTGAAGAAGGGAGGAACAGGCGGCGATAGGGTTCAAATGAGAAGGCCAACCAACCGCCGCGGGAAACAGGCGCCCACGCCGCCTAAGAGAACCAGTCTCTTGTCATCGTGCAGTTCGTTCCGCGAGTCTCAGTACGCTGCTGAGGATCAAGCTGGGCCTGACGACGGGCTAGCTTCCCTTAACGGCGGGGGTCGTGGTGGGGGTCGCGAGGCTTCATCCGAGGGTTCCCTTGCGGAAGCAACCCCGGATACAGATGAGTCGGGGGCCGGTATGGGGGTCGCCGAACACCGACACAGATCAAAAAGAAGACACGCACACGCTCCGATCCAACACCAACACCCGCCCGCACACGAACTGCAGCCCAAGCAAGGGGTGCAAGTAGCTGCACTTGAAGTGCAAAATGTAAAGCGCGCCATCAATAGATATGGAACACTGCCGAAAGGGGCACGGATAGGTGCTTACTTAGAGTCTCTGAGGCAGAGCGGTGGTACGCCATGTGTTATACGAGAACCTCAAGACGAAGCCCGATCTCTATCACCCCGTACCGCTCGTGCTCAGCCGCACATGATACGTTCCAATTCATCAGGCGGGGTGACTGCACCGGCGCCGGCCTCACCACGTGCAGCCCGCGCTCCTCCCCCCCTTCGCTCATTCAACAGTCCGGCGAAACCACGACCGAGGCTAGCTGAACTAGAGTTCCCACCGCCACCGCCAGACCTACCACCACCCCCAGAAGATACACAACCTCCGCCACCACCTCCGCCACCCGATTGTTGTACGGAAACAAATGATACTGATGATACTATTATACAGGAACGAGAGAGCGAAGTTAAACCCGCAAAGCAAATCATGAAGGAGATGCTCGAACTGAAATTAGTTGCAGAAATCAAAGAGAGGGCAGATAAAAAATTAAGCAAACTAAAGGAATCACCGCCGCCGAATGAAGTTATGGAAATGCATACATCTTTCGGAGACCCCGTGACTCGTTTAGTTTCTGAGCTCTCCGAAAGTTTAAATATGGAAGCTTTGCGTAAACAAGAGAAGAAGATTGAACAACCGAAACCAATTGATAATGGCAAGGACACCATATCTCCTATTGATCTCAAGGCTAGTTTACGAAAGACGTCCTTTGGAAACAACGTTGAAAAGAAAACCGAACCAGAGACAAAAACTGATTTCAAATCACAATTAAAAAAAGTTGACACAAATAGAATGTGTATTACTAGTAAGGACTCAGAGGAGCCCGGTCGCTCAATAATTGACTTTAAGTCCCGATTGCGGAAAGTTGAAGGCAACTGTCCTGCTACAAACGGCACATCCAAAAAGTTGGAACAATGTTCCCCGGAAGATTCAAGAAAAGATATATCTAGCAAGAGTGATGAATCAGACAAGAAACGTCCAGAAAGCGCATCATTAGACACCAGCGGCGGCGACGACGAAGACAAAAGACGAAGCACTGGAAGCATCAGTAGCTTAAAAAAATTATGGGAGAGTAAGGAGAGTGATGAAAGACTCAGTCCAAAAATGAGACCAAAAGGGGAAGAGAGTGAAGAAGGTTCCCCAGAAGAAAGGAGTGCGCTCGGCAGAAGCGGAAGCATCCTAGCTAGAAGAGATGACAAGCCAACTGTGGCTAATAAGCCTGCAGTACGCGCACGACCTTTAGGAAAGGGTGGCATCTATGCGACTCCTTTAGCACCGACGACATCCGACGATGACGGAGATGCACTTGCCGCTCTCCGGACATTACTAGAGTGCTGCACTAACGAGGTTCGTCGAGGCTGTAGCGGTACAGCCGGCGGCCGTGGGTCGTTATGGCGTCTCCAGACATCCGAGCGCCTCGCACGTCTGAGTGGTGCTTGTGCGACAGCAGCCGGTAGTTGTGCGCCGCAGTTACGGCTGCAGCTGCGGGCTGTCGCTGCCCGACTAGAGGCTGAGGCGCGCGCCCTTGCCTCGCCCGCCCCCCACCACCACCACCTCGCTGACGGCGTCGAGAAGGCACTCAAAGATCTCGCTACGATCGTACATAGATAA

Protein sequence:

>DPOGS200085-PA
MGAQQAKERGTASGASMRSTRNKPRVPKDPRMLGSNIFTEHSEALLQSRPLPHIPDLPDGEGTSAAPATPQPLDANRWTSKENLLAHHEEDDPQLFVALYDFQAGGENQLSLKKGEQVRIMSYNKSGEWCEAHTLSGGVGWVPSNYVTPVNSLEKHSWYHGPISRNAAEYLLSSGINGSFLVRESESSPGQRSISLRYEGRVYHYRINEDSDGKVYVTSESKFGTLAELVHHHSVLGDGLITQLLYPAPKRNKPTVFPLAPPDEWEIDRTDIVMKHKLGGGQYGDVYEATWKRCNMTVAVKTLKDDTMALKDFLEEAAIMKEMRHPNLVQLLGVCTREPPFYIITEFMSRGNLLDYLRTGNREHIDAVVLMYMATQIASGMSYLESRSFIHRDLAARNCLVGENHLVKVADFGLARLMRDDTYTAHAGAKFPIKWTAPEGLAYNTFSTKSDVWAFGILLWEIATYGMSPYPGVDLADVYHMLEKGYRMECPPGCPAAVYELMRGCWQWSPSDRPSFREIHHALEHMFQDNSITEEVEKQLQEGVCATPQMSLKKGGTGGDRVQMRRPTNRRGKQAPTPPKRTSLLSSCSSFRESQYAAEDQAGPDDGLASLNGGGRGGGREASSEGSLAEATPDTDESGAGMGVAEHRHRSKRRHAHAPIQHQHPPAHELQPKQGVQVAALEVQNVKRAINRYGTLPKGARIGAYLESLRQSGGTPCVIREPQDEARSLSPRTARAQPHMIRSNSSGGVTAPAPASPRAARAPPPLRSFNSPAKPRPRLAELEFPPPPPDLPPPPEDTQPPPPPPPPDCCTETNDTDDTIIQERESEVKPAKQIMKEMLELKLVAEIKERADKKLSKLKESPPPNEVMEMHTSFGDPVTRLVSELSESLNMEALRKQEKKIEQPKPIDNGKDTISPIDLKASLRKTSFGNNVEKKTEPETKTDFKSQLKKVDTNRMCITSKDSEEPGRSIIDFKSRLRKVEGNCPATNGTSKKLEQCSPEDSRKDISSKSDESDKKRPESASLDTSGGDDEDKRRSTGSISSLKKLWESKESDERLSPKMRPKGEESEEGSPEERSALGRSGSILARRDDKPTVANKPAVRARPLGKGGIYATPLAPTTSDDDGDALAALRTLLECCTNEVRRGCSGTAGGRGSLWRLQTSERLARLSGACATAAGSCAPQLRLQLRAVAARLEAEARALASPAPHHHHLADGVEKALKDLATIVHR-