DPGLEAN06647 in OGS1.0

New model in OGS2.0DPOGS200830 
Genomic Positionscaffold1567:+ 16858-24016
See gene structure
CDS Length3192
Paired RNAseq reads  4354
Single RNAseq reads  10776
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA009883 (1e-16)
Best Drosophila hit  CG7597, isoform A (2e-162)
Best Human hitcell division protein kinase 13 isoform 2 (8e-147)
Best NR hit (blastp)  PREDICTED: similar to CG7597-PA, isoform A [Apis mellifera] (0.0)
Best NR hit (blastx)  PREDICTED: similar to CG7597-PA, isoform A [Apis mellifera] (1e-179)
GeneOntology terms




  
GO:0008353 RNA polymerase II carboxy-terminal domain kinase activity
GO:0006468 protein amino acid phosphorylation
GO:0004693 cyclin-dependent protein kinase activity
GO:0005575 cellular_component
GO:0005524 ATP binding
GO:0006911 phagocytosis, engulfment
InterPro families





  
IPR017441 Protein kinase, ATP binding site
IPR008271 Serine/threonine-protein kinase, active site
IPR011009 Protein kinase-like domain
IPR017442 Serine/threonine-protein kinase-like domain
IPR000719 Protein kinase, catalytic domain
IPR002290 Serine/threonine-protein kinase domain
IPR020635 Tyrosine-protein kinase, catalytic domain
Orthology groupMCL11758

Nucleotide sequence:

ATGGAGCGCGGTTACGAGAAGAGACACAGTAAGCACCACAAGGAAAAGCATAAGAAGCGA
GCACACAAGAAGTATAGAAGTCATTCGGGCTCCCATGAACAGTCATATGCTACTAGTAAC
TCCATGAAACCCCTGGTGGAATATTCCGATGTCAGCTCCGAGGATCTGTCGGCCCCGGAG
GCCGGTGAGATAGAGAGTGAAGCCAGTTCCATCGGTCGGCACATTGCAGACGACTTGGAT
AGAACGAGAAAATCCCACCCTATACGTACTTTCCTTGATAACAAGATATCTGTGACGACT
AGCAGCCGACGTGTGGTCGAGGAATACAGCCTCACTCGTGATTCCTCTTCCACGGCGAGG
AAGAGACGAGGCGTTGAATACGAGGAGCCAGTCCCAGACTTCGATGATCATTCCCGATAT
AAGAAGAAGAAGGAGAAGCGAAAGAAAGACAAGAAGAAGAAAAAGAAGAAGTCCAAACAC
AGGTCACGCTCAGCCAGTCTAGAGAGCTTGTCTCCTGGTGATATTGTACCAACTTGTACA
CCACTTCGGCCCCTGACTCCACAACGGTACACCCAGGTGCCAGTCAGCGAATGGGAAAAG
GGATCATCTCCATTACGGAACGGTTCCTGTTCCCCAGTGTCGCCCTCAACACCTCCGCTG
AGGCATGAGGTGTCACCTCGACACCGACCCATTCACCGTGAGCCATCTATGCACAATGTT
ATCCCATTCTCGCCACATCCAGAGAGATCTCCTATTATTAGCAGGAGGAGGCAGAAGTCC
ACAACTCCTCACACACCCCTGGCGCCTCCCTACCATGAGACCGTTACTATTGATTCAGAT
ACAGAACAGGACTACGACAGGAGACGCGACTACTGGCACGAACAGCGCATCTCTAATGAA
ATCATGGTTATATCTGACTCCCCGGTCCACGAATCGCGAAGCCGCGACTACAGTCCGAGA
CGTCATCGCCGCCGCTCGCCTCGGAGACGACGCAGTAGAGACAGGGAGAGACACAGGGAG
AAGGTCCATCATCGATCTCACAGTAGGTCGTCATTGAAGCGTCGCCGTTCTGGGTCACGT
TCGAGACGCCGTTCATCATCACCTCCACGACATCGTCCGAGACATTCAGACATCCCCAGA
GACAGGCACAGGAGTAAACATGAATCCCCGAGTCCACCCACACTGTTACAGAGAAAAATT
GACTTCAAAGAAAAGATTAGCGATACCAGTTTGTTTGCGGAACTTGTTAAAGACAAACAC
AAGCGTGCCAAGAAACTACAAGAAATATTAGAACAGAAAGAAGAATCACAGGGAGCTGCC
TCAAGTAGTGCTTCAGTTACCAACACTGATGCGCCAACAGAAGACCTGGCCGATAATGCG
GATAGCTTGCAGAGTTCAAAGGAGAATGGTGAGAACTCTAAGTCGGATGTAGTTGACATT
CCAATGCCCGTGCAGACTTCAGAGGAATCCAATGCTCAACCAGCAGAGGCTGAGCCCCAA
GCTGCCCAGCAAACAAACGCAACCACCAATGGGGAAGCTAATGAGAATTCTAATAGCTCT
GAGACTCAAAAAGTTCCCGTCACACCGCCCGCCCCCCTACCAAGCGCCGCGTCCACTTTA
CCGCCCCTCCCAAAGGTGGGTGGGAGCGTGGTAGAGGGCGTCTTTATGAATAGCCAACAA
CCGCCTCCGCCGAAACCAAAAAGCCTTACGAAACTGCCCATGCCGCCCAATACTCAGGTC
GAAGATTTAAAGACCCTCGCCAACGACAGTCCATTGAGCACGCCGTCCCCAAGTCCCGTC
AAGAAACCGGAAAAACCTAAGAGAACCGGTATTATGAATTTACCTATGCCGCCAGTGATC
CCTGGGTCAGAGGAGCTGAGCGGGGACGAATTGGACGGGTCCACTCCGCCCGCGCGTCCT
CCCACACACAGGGATCAATACTCACACGTGTTCAGCGGCAGGAGGCAGGCCGATGCGGGA
TCTAAACTGAAGAGGCCGCGTATTCTGAAACGCCGAGGATCTAAGGTCGTCCCGGTAGCG
ACTCCGACACACCACGCCAAGGACTGGGGCGAGAAATGTGTGGACGGTTTCCAGGTAATA
ACTCAAATAGGCGAAGGCACGTACGGTCAAGTATATAAGGCGCGCGACAAAAACACGGCC
CAACTCGTAGCACTGAAGAAGGTGAGGTTGGAGAACGAAAAAGAAGGCTTTCCCATCACC
GCTGTCAGAGAAATTAAGATTCTGAGACAACTCAATCACAAGAACATTGTTAATCTGAGA
GAGATCGTTACCGATAAACAGGACGCCATGGACTTTAGAAAGGACAAGGGCTCGTTCTAT
CTGGTGTTTGAATACATGGACCACGACCTCATGGGTCTCTTGGAGTCGAAGATGGTAGAC
TTCACCGAGTCACATAACGCATCCATAATGCGTCAGCTGTTAGACGGACTGGCGTACTGT
CATCGCAAGAATTTTCTACATCGAGACATCAAATGCAGTAATATATTAATGAACAATAAA
GGGGAAGTAAAGCTAGGGGATTTCGGTCTGGCTCGACTGTGGTCAGCCGAGGACAGAGCT
CGGCCATACACTAACAAAGTTATCACACTGTGGTATAGACCACCAGAGCTATTGTTAGGA
GAGGAACGATATGGCCCCGCTGTGGATGTGTGGTCCATGGGATGTATACTGGGCGAGCTG
TTCCTGAAACACCCGCTGTTCCAGGCTAACACGGAGATGATGCAATTGGAGATGATATCT
CGCATCTGTGGCACTCCAGCGCCGGGAGTGTGGCCCAACGTGGTGAAGCTGCCCCTCTGG
CACACACTGCGGCCCAAACGTTTCCACAAGCGGTGTGTTCGGGAACAGTTTGCGTTTATG
CCACCAGCGGCTCTACAGCTGTTGGATCGTATGCTGGAGTTAGATCCAGACAAGCGAATC
ACGGCGGACGACGCGCTGAAGAGCGTGTGGCTGAAGAATGTTGTCCCGGATCAAATGCCG
GCCCCGGAGCTACCGACATGGCAAGACTGTCACGAGCTCTGGTCGAAGCAACGCCGGCGG
CAGCAGAGGGAACAGGAACAGTCGTCGAAACCTAAATCCTATACATTCACACAGGATGAG
AACTCTGAATCGAGTTTCAAAAGTGATATCAAGTCAGAGACTAACCCGGAACCTGTCGGA
CAAGTTAAATAG

Protein sequence:

MERGYEKRHSKHHKEKHKKRAHKKYRSHSGSHEQSYATSNSMKPLVEYSDVSSEDLSAPE
AGEIESEASSIGRHIADDLDRTRKSHPIRTFLDNKISVTTSSRRVVEEYSLTRDSSSTAR
KRRGVEYEEPVPDFDDHSRYKKKKEKRKKDKKKKKKKSKHRSRSASLESLSPGDIVPTCT
PLRPLTPQRYTQVPVSEWEKGSSPLRNGSCSPVSPSTPPLRHEVSPRHRPIHREPSMHNV
IPFSPHPERSPIISRRRQKSTTPHTPLAPPYHETVTIDSDTEQDYDRRRDYWHEQRISNE
IMVISDSPVHESRSRDYSPRRHRRRSPRRRRSRDRERHREKVHHRSHSRSSLKRRRSGSR
SRRRSSSPPRHRPRHSDIPRDRHRSKHESPSPPTLLQRKIDFKEKISDTSLFAELVKDKH
KRAKKLQEILEQKEESQGAASSSASVTNTDAPTEDLADNADSLQSSKENGENSKSDVVDI
PMPVQTSEESNAQPAEAEPQAAQQTNATTNGEANENSNSSETQKVPVTPPAPLPSAASTL
PPLPKVGGSVVEGVFMNSQQPPPPKPKSLTKLPMPPNTQVEDLKTLANDSPLSTPSPSPV
KKPEKPKRTGIMNLPMPPVIPGSEELSGDELDGSTPPARPPTHRDQYSHVFSGRRQADAG
SKLKRPRILKRRGSKVVPVATPTHHAKDWGEKCVDGFQVITQIGEGTYGQVYKARDKNTA
QLVALKKVRLENEKEGFPITAVREIKILRQLNHKNIVNLREIVTDKQDAMDFRKDKGSFY
LVFEYMDHDLMGLLESKMVDFTESHNASIMRQLLDGLAYCHRKNFLHRDIKCSNILMNNK
GEVKLGDFGLARLWSAEDRARPYTNKVITLWYRPPELLLGEERYGPAVDVWSMGCILGEL
FLKHPLFQANTEMMQLEMISRICGTPAPGVWPNVVKLPLWHTLRPKRFHKRCVREQFAFM
PPAALQLLDRMLELDPDKRITADDALKSVWLKNVVPDQMPAPELPTWQDCHELWSKQRRR
QQREQEQSSKPKSYTFTQDENSESSFKSDIKSETNPEPVGQVK