DPGLEAN21367 in OGS1.0

New model in OGS2.0DPOGS208227 
Genomic Positionscaffold200:+ 52923-80405
See gene structure
CDS Length2577
Paired RNAseq reads  3964
Single RNAseq reads  10083
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006417 (0.0)
Best Drosophila hit  Fps oncogene analog, isoform A (6e-161)
Best Human hittyrosine-protein kinase Fes/Fps isoform 1 (2e-112)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC004367 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  tyrosine-protein kinase transforming protein FPS, putative [Pediculus humanus corporis] (0.0)
GeneOntology terms













  
GO:0004713 protein tyrosine kinase activity
GO:0005886 plasma membrane
GO:0016020 membrane
GO:0006468 protein amino acid phosphorylation
GO:0008594 photoreceptor cell morphogenesis
GO:0005737 cytoplasm
GO:0004715 non-membrane spanning protein tyrosine kinase activity
GO:0005524 ATP binding
GO:0005515 protein binding
GO:0046664 dorsal closure, amnioserosa morphology change
GO:0051017 actin filament bundle assembly
GO:0007391 dorsal closure
GO:0005912 adherens junction
GO:0007394 dorsal closure, elongation of leading edge cells
GO:0007411 axon guidance
InterPro families







  
IPR011009 Protein kinase-like domain
IPR001245 Serine-threonine/tyrosine-protein kinase
IPR017441 Protein kinase, ATP binding site
IPR008266 Tyrosine-protein kinase, active site
IPR000980 SH2 motif
IPR001060 Fps/Fes/Fer/CIP4 homology
IPR002290 Serine/threonine-protein kinase domain
IPR020635 Tyrosine-protein kinase, catalytic domain
IPR000719 Protein kinase, catalytic domain
Orthology groupMCL11259

Nucleotide sequence:

ATGGGGTACGCGTCGAACGGTGCGGGGCGGGCGGCGCACGAGGCCCTGCTGGCGCGCCAG
GACGCTGAGCTGAGGCTCATGGAGACCATGAAGAGAAGCCTCCAGGCTAAGATGAAAAGC
GACAGAGAATATGCTCTGGCGCTATCCGCTGCGGCTGCGCAAGGACAAAAGATGGACAAA
TGCGAAGAACTGAACGGTTCAGTGATTGCTTCTGCCTGGCGCACCATGACCGAGGAATGG
GAGAACACTAGCCGTCTGATCAAGGCCAATGCTGAAGCTCTCGATTCCAGGGCGTTGGAC
CGTCTCAATTCATTGATGACGGAGAGACGGAAGGCCCGCAAGGTCTATCATGAGGATCAT
TCAAAAATATCATCTCAGTTTACACAACTCTCAGAGGAAGTACAAAGAAAACAAAGTGAA
TATCAAAAATGTTTAGAGTCATACAAACAAATGAGGGCTAAATTCGAAGAGAATTATCTC
AAGTGTCCGACGAGCCGAGGGGGTCGTAAATTGGACGAAACGCGTGACAAATACGGAAAG
GCGTGTCGCAGGTTGCATAGAGCACACAACGAGTATGTATTACTGTTGGCCGAGGCCACG
GAATGCGAGCGAGCTCTGAGAACGGCGTCGCTACCAGCTCTGTTGGACCAACAGAGACGA
CAGGGAGAGGCCACGGCGTCTTCATGGAAAGGGATTTTGCTGGAGGTAGTTCAAAGAGCT
GACTTCACTTCAGAAAAATTCAGAGAAATCCAGAACAAGGTGGAGTCAGTTGTCCAGAAT
CTGAGGCCACAGGAGGAATACAAGGAGTTCATAGAAAAGTACAAATCGCCACCACAGATG
CCGATAACTTTCAATTTCGATGAAAGTCTGGTCGAGGATACCAGCGGTAAGCTGCAACCC
AACCAGCTGACTGTCGACAACCTCACAGTGGAATGGCTTAAAGAAAAGCTCACAGAACTT
GAGAATATACTACAGGAGAACAGAGACAAGCAGACCACACTACAAGGACCTGAGATTATT
GGCAATGGAGTCTGTAAAAATAATAATACAGGGATCAACGGAGTCAATGGCGTGGATAGA
TATTCGCCTCCGCCGACGGAGCCTATGAAGCTTCTAGAACTGCGTGTTTCTGAACGGAAG
TGCGCGAAGCAGGCCGAGATGATACGAGCGGCGTTGGCCGAGCTCGGCTGCGAGGAGGCC
CCCAGCGGCTGTCCCGACCTGTCCGCTGATACAGTCGGCGTCGACTGTCTCGAGCCTGAT
GTACCGGATCGTTCGTCCCTCGGCTCGAAAGCGTCGATGGATACGTTAAAGTTGCATCTC
CACTCTATAGTACCACCAAAGCCATTCTTTCGTCTATTCACGCGTCCCTTCAGACGCAAG
TCCGCCCCCTCCAGCCCCGCGCTACCGCCAAAGACGTTCCGCAGTAGCAGCGAGGGCCCT
TCAGACTCGGTGAGTGTGAGCGAGACGGAACGATCCCTGGTGGACCAGGAGTGGTTCCAC
GGAGTACTGCCTAGGGAGGAAGTGGTCCGTCTGCTGAGAGCGGACGGAGATTACTTAGTA
CGAGAAACCACGAGGAACCACGCGCGGCAACTAGTGTTGTCCGTCTGCTGGGGACAACAC
AAACACTTCATAGTACAGACGACGCCAGAGGGTCACTATCGTTTCGAGGGTGCTTCGTTT
CCTTCAGTGGCTGAGCTGGTTGCCTGGCAGCGGGCGTCCGGTGTTCCCGTCACAGCTCGC
TCTGGAGCTCTACTAAGACGCGCCGTGCCTAGAGAGACCTGGGAGCTCAACAACGATCAC
GTGCAACTGCTAGATAAGATCGGACGGGGTAACTTTGGCGACGTGTACAAAGCTCGCCTC
AAGACGACGGGCCAGGAGGTCGCCGTGAAGACGTGTCGGGTAGCTCTGCCAGAAGAACAG
AAGCGGACCTTCCTACAGGAGGGTAGAATACTAAAACAATACCAACATCCCAATATAGTG
AGACTCATCGGTATAGCCGTCCAGAAACAACCCATCATGATTGTCATGGAACTCGTGTCA
GGCGGATCACTCTTGACGTTCCTTCGTACTCGGGCGTCCAACCTGAGCTTCAGGTCGCTC
CTGGCCATGTGTAGGGACGCGGCCGCTGGTATGAGATATCTGGAGTCCAAAAACTGCATT
CACAGAGACTTGGCTGCGAGGAACTGCTTGGTGGGAGACGATAATATAGTCAAGATATCA
GACTTCGGCATGTCCAGAGAGGAAGAAGAGTACATCGTGTCCGGAGGCATGAAACAAATA
CCTATCAAGTGGACTGCACCAGAGGCACTTAATTTTGGTAAATACACGTCTCTCTGTGAC
GTGTGGAGCTACGGCGTCCTCATGTGGGAGATCTTCTCGAAGGGCGACACGCCCTACGCT
GGTATGAGCAATTCACGGGCGAGAGAGAAAATTGATAATGGTTACCGCATGCCGGCACCA
GAGGGTTGTTCCGAGGATGTCTACGCACTGATGCTGCGCTGTTGGGAATACGAACCGGAG
AAAAGACCACATTTCCATCAGATATACACGCTCATAGACAACATTTACAACAGATGA

Protein sequence:

MGYASNGAGRAAHEALLARQDAELRLMETMKRSLQAKMKSDREYALALSAAAAQGQKMDK
CEELNGSVIASAWRTMTEEWENTSRLIKANAEALDSRALDRLNSLMTERRKARKVYHEDH
SKISSQFTQLSEEVQRKQSEYQKCLESYKQMRAKFEENYLKCPTSRGGRKLDETRDKYGK
ACRRLHRAHNEYVLLLAEATECERALRTASLPALLDQQRRQGEATASSWKGILLEVVQRA
DFTSEKFREIQNKVESVVQNLRPQEEYKEFIEKYKSPPQMPITFNFDESLVEDTSGKLQP
NQLTVDNLTVEWLKEKLTELENILQENRDKQTTLQGPEIIGNGVCKNNNTGINGVNGVDR
YSPPPTEPMKLLELRVSERKCAKQAEMIRAALAELGCEEAPSGCPDLSADTVGVDCLEPD
VPDRSSLGSKASMDTLKLHLHSIVPPKPFFRLFTRPFRRKSAPSSPALPPKTFRSSSEGP
SDSVSVSETERSLVDQEWFHGVLPREEVVRLLRADGDYLVRETTRNHARQLVLSVCWGQH
KHFIVQTTPEGHYRFEGASFPSVAELVAWQRASGVPVTARSGALLRRAVPRETWELNNDH
VQLLDKIGRGNFGDVYKARLKTTGQEVAVKTCRVALPEEQKRTFLQEGRILKQYQHPNIV
RLIGIAVQKQPIMIVMELVSGGSLLTFLRTRASNLSFRSLLAMCRDAAAGMRYLESKNCI
HRDLAARNCLVGDDNIVKISDFGMSREEEEYIVSGGMKQIPIKWTAPEALNFGKYTSLCD
VWSYGVLMWEIFSKGDTPYAGMSNSRAREKIDNGYRMPAPEGCSEDVYALMLRCWEYEPE
KRPHFHQIYTLIDNIYNR