New model in OGS2.0 | DPOGS205518  |
---|---|
Genomic Position | scaffold2091:- 10568-25715 |
See gene structure | |
CDS Length | 4404 |
Paired RNAseq reads   | 2625 |
Single RNAseq reads   | 6966 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA000152 (0.0) |
Best Drosophila hit   | Alk (0.0) |
Best Human hit | ALK tyrosine kinase receptor precursor (8e-100) |
Best NR hit (blastp)   | AGAP012070-PA [Anopheles gambiae str. PEST] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to leukocyte receptor tyrosine protein kinase [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0005886 plasma membrane GO:0004714 transmembrane receptor protein tyrosine kinase activity GO:0004716 receptor signaling protein tyrosine kinase activity GO:0004713 protein tyrosine kinase activity GO:0006468 protein amino acid phosphorylation GO:0000186 activation of MAPKK activity GO:0048565 digestive tract development GO:0007522 visceral muscle development GO:0007169 transmembrane receptor protein tyrosine kinase signaling pathway GO:0005887 integral to plasma membrane GO:0005524 ATP binding GO:0005515 protein binding GO:0045610 regulation of hemocyte differentiation GO:0042051 compound eye photoreceptor development GO:0007411 axon guidance GO:0007498 mesoderm development |
InterPro families    | IPR001245 Serine-threonine/tyrosine-protein kinase IPR000998 MAM domain IPR002172 Low-density lipoprotein (LDL) receptor class A repeat IPR000719 Protein kinase, catalytic domain IPR017441 Protein kinase, ATP binding site IPR008266 Tyrosine-protein kinase, active site IPR023415 Low-density lipoprotein (LDL) receptor class A, conserved site IPR011009 Protein kinase-like domain IPR008985 Concanavalin A-like lectin/glucanase IPR002290 Serine/threonine-protein kinase domain IPR020635 Tyrosine-protein kinase, catalytic domain |
Orthology group | MCL12402 |
Nucleotide sequence:
ATGGAGAACATGGTGATTAAGTCACCACCATTCAGGGGATTGTGGGAGTTTTGCAAGTTC
GAGGCACGGTTACATCAGAGCAAAATGCAGAACGCCAGTATCCGTTTGGTCTCCGAGTCC
ACCATGTCGGAGGTGCAATGGATCCTGCAGGAAGTGGCCGGCAACAACTTCGAAGTATGG
AATCCAATGCGCTTTATAATAGGGAGGGTTCAGCAGGAAGTGCGTATCATAGTAGAAATA
ACCCGTCCAAGTTTTGTAAACCTGACGCAGACCTTGCCACACATAGCCCTGGACAACATC
CGAATGGTGGATTGCCTCCCTGAGCCTCCCGTCTTTAATGGGGAATGCCAGACTGGTCAG
CTAAAGTGCAAAATAATGAATAAGGAGTCGTGCATCAAGCTGCAGCGAGTCTGCGACCTG
GTGAGGGACTGTGATGACAACAGCGATGAAAATCAAAATTGTGATAAAATGCCATATGGA
TCGTATTGCAACTTCGAACAGGACAACTGTGGTTTTGAAAATGTGCCACAGCGCATCCTA
AAGTGGTCTCGGCATAGCGGGCCCACGCCCACGGATAAGACTGGCCCAAACTACGACCAT
ACCTGTGGTCCGCCTACCGCTTTCACCACCGCCAACCCCATGGTACCATTGTCACCAGCT
CACATGAACCACTCCCACGCCTGCGTGGGTTATTACTTCTTCGTGAACATGAATGTCACA
GGACCTAACAAAGAAAGAGCAGACTTCGCGTCGACGGCTGTCATGAAGACCGTCATATTC
AACCCGCCCCCCAAAGTCCACGGGGACATCAGCTCCAGATACTATAATTGTTGTATGATA
AGGTTCTACTATCAACAAAATGGGAGGAACTATGGTTCTTTGAGTGTGGACGTGGTGGAG
CTGACTCCGAGAGGTAACATTACCACGTCTCTCTGGTTCTCCACCAAGGACAAAGGCGAG
AACTGGTTCAGGGCGGGAATATTCCTGCCCAACATCACAAGAAGATACTATTTCCTTTTC
AAAACTCGTATGGGCATGCGGATTTACTCGGACTCCGCCATCGACGACTTCTCGATGGCT
CCAGAATGTTTCGGTTTTAACATAAACGCAACCGAACTCGGAGACTATAATTACTACGAT
CCCATATTCGACGAGAAAACGACGCCACATCCGGACTTCGTGGACGCCACTATGTATCGC
TTCTCGACTTGCAATGCGACAGGTCGGTTTGGTCCTCAACAGGACCAGTGTGACGCCTCC
TACACAGATATTGATGGAGGGGTCCTCGTCCTGAACACTCCGGGGATGAGGGGAGTTCAA
GTGTGGGAGGTTCCCAACGAGGGTCTCTACACTATCATAGCTACGGGAGCTTCGGGCGGG
TTAGGCTCGATGTGGGCGGGCGTGTCTCACGGCTCGTCTGCCAGGGGACTGTTCGAGCTG
CACCGGGGAGAGAAGGTGTACATGATGATCGGACAGCAAGGAATGAACGCCTGCAAGAAG
ACTCTATCAGCTCAAGAGAGCCAGGAATGCTCCAGAAGGGCGAACGAAACCACACAGGTG
TTCACCAGCAAGACCCACGAGATACGGAACACTCACGTCAACGACGGCGGCGGAGGGGGA
GGAGGGGCTACATACGTGTTCCTGATGGATAGAGAAGGCCAGCCGATCCCATTGTTGGTG
GGGGGAGGTGGTGGGGGATTGTCTGTGGGAGTGTTCAAGGACGACGGTTCACAACACGCC
CGGGGAAGAACAAACACGACCCCGGAATCTGGATATATGTATGGCCAACCGGGCCGCACT
TCCGGTGCTGGTGGTGGGTGGATCTCTCGTTCGGGGCCCATAACTCCTGGCTTTGAACTC
GTCAGAGGATCTGCTTTTAGAGAAGGTGGGGTTGGGGGCCTCGCTTGTACCGGCGGGGCC
CATGGCGGCTTCGGAGGGGGAGGGGGTGGCTGTCTTAGGGGGGGAGGTGGAGGGGGATGG
GTTGGTGGTAATACCAACGAGGGCGAGGGTCAGCACGGTGGTGGGGGGTGGTCGTTTGTT
GACGTCACTCGTGGGGTGCGAGAGTTCGGTGACGTCACCATGGCCCCTCGCACCGGCCCG
GGGGAGGTGGTCATCATACCAGCTATACATAAATGCGGCTGCAACTATCGGTGCGTGGCG
TTAGACGAACAGCGCGCTAATGTGCGCTGTTATTGTCCCGCGGCTTGGCGGACTGACCAC
GGAGATCCCACCAAGTGTGTCTACAACAAATACCAACGTAAGAAGGAAGCGGAACTTCGT
CAGAAACGCCTTCTGGAGCAGGAAGTACAGTTACACAGGTTACGAACAACACCGGGTGGT
GATAACGCTTTAAGTATGGCTTTCAACCCCCACTACGGCAGTGAAAGTTTTCTGCCACAA
GGGGTTGATGTCAGGGGTTTGCCGAAAGTAGCCAGAGAGAGTCTAACTTTGGTGAAAGCC
CTGGGCCAGGGTGCTTTCGGCGAGGTCTATCAAGGTCTCTACCGTCACAGAGGTGGTGAG
AGCGAAGAGATGCCAGTCGCGGTGAAAACTCTACCGGAGCTGTCGACAGGGCAGGCGGAG
TCAGATTTCCTTATGGAGGCTGCTATAATGGCTAAATTCAACCATCCGAATATCGTACAC
CTGATCGGCGTCTGCTTTGATAGACATCCGAGGTTCATAGTATTGGAACTGCTGGCTGGT
GGTGATTTGAAGAACTTCCTCAGGGAGAGTCGTCCAAGACCTGAGAGAGCCAGCGCCCTG
ACAATGAAAGACCTTCTTCTATGCGCACTCGATGTCTGTAAAGGATGCAAATATTTGGAG
GCCAAACGATTCGTGCACCGGGATATAGCCGCTCGTAATTGCCTCCTCACGTCTCGCGGT
CCGGGGAGGGTTGTCAAGATAGCTGATTTCGGTATGGCAAGAGACATATACCGCTCTGAT
TACTACAAGAAGGGAGGCAAAGCTATGCTGCCCATTAAATGGATGCCGCCCGAAGCGTAC
ATAGATGGAATATTCACCGTCAAGACTGATGTTTGGTCGTATGGTGTGTTGTTGTGGGAG
GTGTTTTCCCTCGGCATCATGCCGTACACCGGCTGCTCTAACAGGGAGGTCATGCAAATG
GTTTCTGGAGGAGGGAGATTAGAAAGACCCCACGGCTGTCCTCCTGAAATTTATCGTTTA
ATGTGTGAGTGTTGGAACCCCGCCCCCTCCGAGCGACCCTCTTTCGCACAGATGTTCGAC
AGGCTCCAGAGATATTTACAGGATCCGGAAATCACAAACGCGCCATTGCCGATGCTGCGT
TCGTTGTTGTCATTGTCTGAGGCACCGGAAGCGAACGGCCCCCAGCGCTCCGCGGGGGAC
TATCTCGTGCCGCTGTCGCCTCAGGACCAGCCGGCGAACGCTAGCTCAACGGATCACTTG
GTACCTCATAGCAAAACGCTAACGGCGGCGGAGAGGCCAGGAATGGAGGATTCACAATAT
CAACCTCTTCTGAAAGCATCACAAGAATATGGCAACATTCCAACGGTGTCGTTGAACGCG
AACGGAAAACGCGACGCCTCAAATGCTTCCGCCGCCGAGACAAAGAGCAATACAAAGTTT
CAGCAGTCTGATTCCACTGATAGGTTATTAAGTTCGTTAGAGGGTCCTATAGAGGTCGCA
GAGTCGCCGGAAGACAAACCCGCCGTGTGGGAGACCTCGTTCACGGAGACCAAAACGCCG
CCGAGCATCGAACCCGAACCCACGGTTGAGAAACTGATAAGTATAACACCAACTTCACCC
AAACCTCCAGAAAACGCTCTGTCGATAGCCATAGAGTCAACCAATGTCATAGACAAGACG
AACAAAACGCATTTGATACCAGAAAAGAAATCTGCACCTATAGCATCCCCCGACCCGCCG
CGCATAAACGCCTGGAATAGAGACGCACCCACCATCCCGAAACCTAGGCCGTTCTGTCTG
GACGCGGCCCAGTTAGAGCAGAACCTGAACGCCTTGAAGAAGAACAGCGGCTCAGTGAAC
CTCACCACGATGAACTCGATGAAGGTCCTCCCGCCGTATATAAACGTCGTCACCCCCAAT
AAGATATCAGAAGTGAAAACCATCCAGATAGACCCCAAGAAGGCCGTCATACACGATCTC
CAGTCTCCCAGCGAACCCGAGGACAAAGAGGAGAAGGGGAAGCTGAAACAGTCGAGTTCG
GCAGCCAGCTTACTGAACGGACCTCTCACGGAACTCCCTTACGCGGACAGCGACAACGCC
TCCTCCAGCTCCGGCAGCAACGCCCAGAACGTCATCAACAACCAGAAAAACAAAAACTAC
TCCGAAGTGGCCATCAACGGCAAAAACGTCAAAAGACAGGGTAGCAGCGGCGGGGAGTTC
GGTGAAGCTGAAATAAATTGTTAA
Protein sequence:
MENMVIKSPPFRGLWEFCKFEARLHQSKMQNASIRLVSESTMSEVQWILQEVAGNNFEVW
NPMRFIIGRVQQEVRIIVEITRPSFVNLTQTLPHIALDNIRMVDCLPEPPVFNGECQTGQ
LKCKIMNKESCIKLQRVCDLVRDCDDNSDENQNCDKMPYGSYCNFEQDNCGFENVPQRIL
KWSRHSGPTPTDKTGPNYDHTCGPPTAFTTANPMVPLSPAHMNHSHACVGYYFFVNMNVT
GPNKERADFASTAVMKTVIFNPPPKVHGDISSRYYNCCMIRFYYQQNGRNYGSLSVDVVE
LTPRGNITTSLWFSTKDKGENWFRAGIFLPNITRRYYFLFKTRMGMRIYSDSAIDDFSMA
PECFGFNINATELGDYNYYDPIFDEKTTPHPDFVDATMYRFSTCNATGRFGPQQDQCDAS
YTDIDGGVLVLNTPGMRGVQVWEVPNEGLYTIIATGASGGLGSMWAGVSHGSSARGLFEL
HRGEKVYMMIGQQGMNACKKTLSAQESQECSRRANETTQVFTSKTHEIRNTHVNDGGGGG
GGATYVFLMDREGQPIPLLVGGGGGGLSVGVFKDDGSQHARGRTNTTPESGYMYGQPGRT
SGAGGGWISRSGPITPGFELVRGSAFREGGVGGLACTGGAHGGFGGGGGGCLRGGGGGGW
VGGNTNEGEGQHGGGGWSFVDVTRGVREFGDVTMAPRTGPGEVVIIPAIHKCGCNYRCVA
LDEQRANVRCYCPAAWRTDHGDPTKCVYNKYQRKKEAELRQKRLLEQEVQLHRLRTTPGG
DNALSMAFNPHYGSESFLPQGVDVRGLPKVARESLTLVKALGQGAFGEVYQGLYRHRGGE
SEEMPVAVKTLPELSTGQAESDFLMEAAIMAKFNHPNIVHLIGVCFDRHPRFIVLELLAG
GDLKNFLRESRPRPERASALTMKDLLLCALDVCKGCKYLEAKRFVHRDIAARNCLLTSRG
PGRVVKIADFGMARDIYRSDYYKKGGKAMLPIKWMPPEAYIDGIFTVKTDVWSYGVLLWE
VFSLGIMPYTGCSNREVMQMVSGGGRLERPHGCPPEIYRLMCECWNPAPSERPSFAQMFD
RLQRYLQDPEITNAPLPMLRSLLSLSEAPEANGPQRSAGDYLVPLSPQDQPANASSTDHL
VPHSKTLTAAERPGMEDSQYQPLLKASQEYGNIPTVSLNANGKRDASNASAAETKSNTKF
QQSDSTDRLLSSLEGPIEVAESPEDKPAVWETSFTETKTPPSIEPEPTVEKLISITPTSP
KPPENALSIAIESTNVIDKTNKTHLIPEKKSAPIASPDPPRINAWNRDAPTIPKPRPFCL
DAAQLEQNLNALKKNSGSVNLTTMNSMKVLPPYINVVTPNKISEVKTIQIDPKKAVIHDL
QSPSEPEDKEEKGKLKQSSSAASLLNGPLTELPYADSDNASSSSGSNAQNVINNQKNKNY
SEVAINGKNVKRQGSSGGEFGEAEINC