DPGLEAN15582 in OGS1.0

New model in OGS2.0DPOGS207034 
Genomic Positionscaffold1:+ 1270878-1283095
See gene structure
CDS Length3051
Paired RNAseq reads  2606
Single RNAseq reads  6315
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012976 (0.0)
Best Drosophila hit  Tao-1, isoform D (6e-171)
Best Human hitserine/threonine-protein kinase TAO1 isoform 1 (1e-142)
Best NR hit (blastp)  serine/threonine protein kinase [Aedes aegypti] (0.0)
Best NR hit (blastx)  PREDICTED: similar to serine/threonine protein kinase TAO1 [Apis mellifera] (7e-178)
GeneOntology terms



  
GO:0006468 protein amino acid phosphorylation
GO:0004702 receptor signaling protein serine/threonine kinase activity
GO:0005524 ATP binding
GO:0006915 apoptosis
GO:0004674 protein serine/threonine kinase activity
InterPro families




  
IPR002290 Serine/threonine-protein kinase domain
IPR020635 Tyrosine-protein kinase, catalytic domain
IPR000719 Protein kinase, catalytic domain
IPR017442 Serine/threonine-protein kinase-like domain
IPR017441 Protein kinase, ATP binding site
IPR011009 Protein kinase-like domain
Orthology groupMCL10634

Nucleotide sequence:

ATGTCAAATTTTCGTTATGCGATAAAAGAGGGCGCTTGTGGCTACAGTCACAGAGTGAGA
TTATTTTCCACTGGCTTGCAGCTGGCACAAGCAAGGAAGATGCCACGTCCGGGAAGTTTG
AAGGATCCGGACATTGCGGAGTTATTCAGCAAACATGATCCAGAGAAGATTTTTGAAGAT
CTCCGTGAGATTGGTCATGGCTCCTTCGGAGCCGTGTACTATGCTCGTTGCAATCTCACC
AAAGAGATAGTAGCTATCAAGAAGATGTCCTACCTCGGGAAGCAGAGTGAAGAGAAATGG
CAGGACATTTTGAAAGAGATAAAGTTTCTCAAGCAATTGGATCACCCGAACACTATAGAA
TACAAGGGTTGCTACAAACGGGAGCACACCGCCTGGCTGGTCATGGAGTACTGTGTCGGC
TCGGCTTCTGACATTATTGAGGTTCACAAGCGTCCTCTGCGCGAGGAGGAGATAGCTGCG
ATATGCGAGGGCGTCGTGTGCGGTCTGTCTTACTTGCATTCCTTGGGGCGGATCCATCGC
GACATCAAGGCTGGTAATATCCTACTGACGGAGAACGGGACGGTAAAGCTAGCTGACTTC
GGCAGCGCCAGCATCAAGTGCCCAGCCAACAGCTTTGTCGGCACTCCCTACTGGATGGCG
CCGGAGGTCATCCTCGCCATGGACGAGGGCCAGTACGATGGAAAGGTTGATGTGTGGTCG
CTCGGTATAACGTGCATTGAACTGGCTGAGAGGAAGCCTCCGTATTTCAACATGAACGCC
ATGTCCGCTTTGTACCATATCGCGCAGAACGATTCACCTCCATTGCAAGCACCTGAATGG
ACCGATACATTCCGTTACTTCGTGGAGGCGTGCCTACAAAAGAATCCACAGGATCGTCCG
TCGTCTACCAAACTTCTGTCCCACCCTTACATCACGCGGCCGAGATCACCGAACGTGCTG
GTCGACCTTATACAGAGAACAAAAGCGGCTGTGAGGGATCTAGACAATTTGAATTACAGG
AAGATGAAGAAAATACTCATGGTGGACGCCGATAATGAGAGCGCTATGGGTGATAATGAA
GAGACGGCTGAGGAGCGTTCGGGTGGAGATAGCTCCAAATCCAATTCCGCTACATCAGAG
CACAGTGCGGCCGGAGCATCCAGTCAGAGTTCCTCGTCAGGCAGTCTACGGCGCAGGCCT
ATCGCGCTGAACGCTAATCACAATCAGCAACAGCAGCAATACCAGCAATACAACCACCAA
CCAGCCCACCCCACCAGAGACGAGAGCCCTATACCGGGCTCCCACGACGACTACGTGAAC
AGGGAAGTTCTACGTGATTACGCGAACAGGGATTCCCTGTGTGACGATTACGGCGATGAT
TACGCGAATCGGGAGGCGATACGGGAGGCTCAGAGGGAAAGGGAGAGGGAACGCCAGGCT
CTAGAATACAGGGAGTACGTTAACGCGCCCTCCACGTGGCAACAGGACAGCGGCGACGAC
AATAGAAACACGCAGAGAAGACGTGTAAGCAACAACGTTTGTGCCGCTATATCTCTCGTG
TCCGAACACGGTGCGAACAACTTTGCTACTATACGGACTACTAGCATTGTCACAAAACAG
CAAAAGGAGCATAATCAGGAAATGCATGAACAGATGTCCGGTTATAAACGCATGCGTCGC
GAACATCAGGCTGCACTTCTCAAGTTGGAGGAGCGCTGTAAGGCGGACGTCGAGGCTCAT
AAGTCGCAGCTCGATCGTGAATATGACTCGCTGCTGCAACAGCTGTCTAGAGACCTGGAA
AGGCTGCAGACTAAACACGCTCAAGAGCTGGAACGGAAACAGAAACAGAATTCGACAGCC
GAGAAGAAGCTGATAAAGGATATAACATCACGCCAAGAACAAGAGAGGAAAGCCTTCGAG
ACACAGCGCAAGAGAGAGTACAAGGCGAATAAGGAGAGGTGGAAAAAGGAATTGAGTATG
GACGACGCCACGCCCAAACGGCAAAGGGATGCTACATTACAGTCCCATAAGGACAATCTG
AAGCAGGCGGAGGCGGCAGAGGAGGCGCGTTTGGTGCGCTCTCAGCGAGAATACCTCGAA
TTGGAACTGAGAAGATTCAGGAGGAGACGAATGCTGGCCCTGCATCATAAGGAACAGGAA
CTACTCAGAGAGGAGTTAAACAAGCGTCAGACTCAGTTGGAGCAGGCGCATGCGATGCTG
TTACGTCATCACGAGAAGACTCAGGAGTTGGAGTACAGACAACAGAAGGGCGTACATGCT
CTCAGGGAGGAACAGTTGTCCAACCAGCACGCGACCGAGTTGGCCAATCAGAGGGATTAT
ATGCAGAGAGCTGAACATGAGCTCAGGAAGAAACACGCGCTGCAACTCAAACAACAGCCC
AAGAGTCTTAAGCAAAAAGAAATGCTGATCCGCAAACAGTTCCGTGAGACTTGCAAGATA
CAAACGCGCCAGTACAAAGCACTCAAAGCACAAATCTTGCAAATGACACCTAAGGAGCAA
CAAAAGGAAGTCATCAAATCGCTGAAGGATGAGAAGCGTCGCAAGCTCGTGTTGTTAGGA
GAACAGTATGATCAGAGTATTAGCGAGATGTTGCAGAAACAGACAGTGCGCCTTGATGAG
AGCCAAATGATGGAGTGCCAGCAATTGAAGATGCAGTTGGAACACGAGCTGGATATGTTG
ACGGCCTACCAGAGCAAGAGTAAGATGCAGGCGGAGGCTCAGAGGAACAGAGAGCGACGG
GAGCTGGAGGAGAGAGTCGCTGTCAGGAGGGCGCTGTTAGAGCAAAAGATGGAGTGCGAG
TGTGGTCAGTTTGTCGCTGAACGAGCGGAGAGGACTCGCATGTTGCACGAGCGGCACGAG
AGAGACCTCGATCACTTCGACAACGAGAGCGCGAGACTTGGATTCAGTGCAATGGCGATC
GCCGAAGGTAGCAGGGAGGGTTACGGGGAGGAGGAGCAGTCACTCTCCGGGTCGATGCTG
TCCCTCGCCCACAGCAACTCGTCGGCCAGTTTCCCGGCCGGCTCCCTCTAA

Protein sequence:

MSNFRYAIKEGACGYSHRVRLFSTGLQLAQARKMPRPGSLKDPDIAELFSKHDPEKIFED
LREIGHGSFGAVYYARCNLTKEIVAIKKMSYLGKQSEEKWQDILKEIKFLKQLDHPNTIE
YKGCYKREHTAWLVMEYCVGSASDIIEVHKRPLREEEIAAICEGVVCGLSYLHSLGRIHR
DIKAGNILLTENGTVKLADFGSASIKCPANSFVGTPYWMAPEVILAMDEGQYDGKVDVWS
LGITCIELAERKPPYFNMNAMSALYHIAQNDSPPLQAPEWTDTFRYFVEACLQKNPQDRP
SSTKLLSHPYITRPRSPNVLVDLIQRTKAAVRDLDNLNYRKMKKILMVDADNESAMGDNE
ETAEERSGGDSSKSNSATSEHSAAGASSQSSSSGSLRRRPIALNANHNQQQQQYQQYNHQ
PAHPTRDESPIPGSHDDYVNREVLRDYANRDSLCDDYGDDYANREAIREAQRERERERQA
LEYREYVNAPSTWQQDSGDDNRNTQRRRVSNNVCAAISLVSEHGANNFATIRTTSIVTKQ
QKEHNQEMHEQMSGYKRMRREHQAALLKLEERCKADVEAHKSQLDREYDSLLQQLSRDLE
RLQTKHAQELERKQKQNSTAEKKLIKDITSRQEQERKAFETQRKREYKANKERWKKELSM
DDATPKRQRDATLQSHKDNLKQAEAAEEARLVRSQREYLELELRRFRRRRMLALHHKEQE
LLREELNKRQTQLEQAHAMLLRHHEKTQELEYRQQKGVHALREEQLSNQHATELANQRDY
MQRAEHELRKKHALQLKQQPKSLKQKEMLIRKQFRETCKIQTRQYKALKAQILQMTPKEQ
QKEVIKSLKDEKRRKLVLLGEQYDQSISEMLQKQTVRLDESQMMECQQLKMQLEHELDML
TAYQSKSKMQAEAQRNRERRELEERVAVRRALLEQKMECECGQFVAERAERTRMLHERHE
RDLDHFDNESARLGFSAMAIAEGSREGYGEEEQSLSGSMLSLAHSNSSASFPAGSL