DPGLEAN04849 in OGS1.0

New model in OGS2.0  DPOGS214322
Genomic Position  scaffold7106:+ 2861-10701
CDS Length  3582
Paired RNAseq reads  1902
Single RNAseq reads  4573
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA004137 (0.0)
Best Drosophila hit  receptor mediated endocytosis 8 (1e-158)
Best Human hit  dnaJ homolog subfamily C member 13 (0.0)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC008934 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC008934 [Tribolium castaneum] (0.0)
GeneOntology terms
GO:0005488 binding
GO:0031072 heat shock protein binding
InterPro families
IPR001623 Heat shock protein DnaJ, N-terminal
IPR011989 Armadillo-like helical
IPR016024 Armadillo-type fold
Orthology group  MCL11349

Nucleotide sequence:

ATGGACGAAGAGTTTTTCGACGGGGTGTTCGACAGTTACCCGAAAAATGGGAAAACTAGT
GACAAGCTACAAGTATTTTCAATAATGGACGAAGAAGACGCAATAATACGTCCGTTGCCG
AGAGTTAAGAGACTTCTCTCCGAACCCGCGTGCTTAGCTCATGTGGTGCAATTGTTACTG
ACGTTCGATCCTATACTGGTAGAGAAGGTGGCAACTTTGTTATATGAAATAATGCAAGAC
AACCCTGAGATCTCTAAGCTGTACTTGACCGGCGTGTTTTACTTCATGCTGCTGTACACG
GGCTCCAACCTGTTGCCCATAGCGAGGTTTCTGAGGTTGACGCACATGAAACAAGCGTTC
CGGGCCGACCAAACGAGCTCTGACATAATGCAGCGATCGATATTAGGACAACTGTTACCG
GAGGCGATGGTCTGTTATTTGGAAAACCATGGAGCTGAGAAGTTTGCTCAAATATTCCTC
GGCGAATGGGACACACCGGAGGCTATTTGGAATGCCGAAATGAGACGTATGCTGATCATG
AAGGTGTCTGCTCACATCGGGGAGTTCACGCCCCGCCTGCGCGCGCACGTGGCCGCCCGC
TACCCCTACCTCGCGATCCCCGCCGTCCGCTACCCGCAGCTGCAGAGAGAGCTGTTCTGC
AACATGTTCTACCTGAGACACCTGTGTGACACGCAGAGGTTCCCCGACTGGCCCATACCT
GACCCGGTGGGTCTTCTAAAGGACGTGCTCGAGGCGTGGAGGCGAGAAGTGGACAAGAAG
CCGTCGTCGATGACCGCGGAGCAGGCCTACACGGCGCTGGGACTCGAACCGACGACCCAC
GACGAGGCGGCCGTCAGGAAGGCCTACTACAGACTCGCACAGCAGTTCCACCCAGACAAG
AACCCAGAAGGACGGGACCGTTTCGAAGCAGTTAACCAAGCGTATGAGTTCCTGTGCAGT
CGCAACGTATGGACGGGCGATGGACCGAACACTAATAACATTCTACTCATTCTTCGCACA
CAGACTATACTATTTCAGAGATATTCTGAAGTGTTGTCTCCGTACAAGTACGCGGGATAC
TCAGCTTTGCTCCGCACTGCGCGGCTGGAGGCGGCCGCGGACACGTTGTTCTCGAGCGAG
GCCGCATTGTTACCGGCCGCCTGCGAACTGGCCCACGCTACACTCGCCTGCTCAGCGCTC
AACGCACAGGAACTGTGTCGAGAACGCGGCCTCGAGGTGCTCGAAGAGGCTCTTTCCCGC
TGCGTGTCGGTCCTAGGAGGCAGCGCGGCCGGCAGCACGGCGGCGGCGGTGTGCACACAC
TGCGCGAGGTGCTTTGCTGTTGCTGCACGCTTCCCCGTCTGTAGGGACGCTGCGGCGGAA
CTACCCACACTGTGTAGTGATATCGTACCACTACTGAGACGACCGGAGTTGGGTGAAACG
GCGTGTGCGGGAGCTGAGGCGGCAGCGGCGCTGGCAGCGGACCCGCGGTGCCGCGACCGT
CTCGCCCGCGCTCACGTCATCCACGCCCTCCTGCCGCCCGCCCTGCAGTACGACTACACG
CTCAAGGAGTCTGGCGTCTCCACCGAGGGAGATAACAAGCAGGAAGTGGCAAATCGTCTA
GCGGTTCAATGCGTTGCAGCGCTGTCAGCACTGTACGGACCGCACGAGGAGGCGGGCGAG
GAGGACGAGCGAGTCCAGGCGGCGCTCAGGATGCTGCTCACACCGTACATCTGTGATAAA
CTAGCGACTGCGGACCCGCACGAGCTACTTAAAACGCTGACGTCCAACTGGCGGACGCCG
TACCTTGTATGGGATAACAGCACTCGCGCGGAGCTTCGTGAAGCGCTCCGCTCGCGGCCT
CCTGAAGACACGCTGCTACAGGACGTGTATTACACCGCACACGAGGGCCTGCTCACCGTG
GGCGGAGTCTACCTCGACATATACAACGAGCAACCCGAGTTCCTTATAGAGAATCCCCAA
CAATTTGTTTTGGATCTGCTCCACTTCATCAAGGAACAAACGAAAGTCGCCAAATCAGAG
GAGACCGAGGAGCGCTTAACTTTAGCATTGAATGCTCTTGCCAATTGTATTATTAAGAAT
CCCGGCGTGGAGATCCAGTGCATCGGTCACTTCGCGGTGATATTCGGCCTGATGAGTGGC
GGCGTGTCGCGGGTGGTGGCGGGCGCGCTCCGCGTGTCGCTGGCGTGTTCCCGCAGCAGG
GCGTGTGTGGAGGAGGTGTCGGCGGGCGGCCTGCTGGGCCACGTGCTGCCTCTCCTCGCG
CCGCCCGCCCACAGAGAGGCGCTCGACACGCTCTCCGCCCTGCTCACCTGCACGCCGCTG
GTTAGAGAGGCGCTGGCTAAAGGAGCTGTGATATATCTGTTAGATCTCTTCTGTAACTGT
AAGACGCCGGAGATGCGAGAGATGGCGGTGGAACTACTCTCGCGGATGATGGCTGATAAA
TTACATGGACCCAAGGTGAGATTAACGATCTGCCGCTACGTGCCGGGCGTGTTCGCGGAC
GCAATGCGCGAGGCGGGCGGGGCGGGCGGCGCGGGGGGAGCGGGCGCGGCCGCCGCCATG
CACGCCTTCGACAACAACCACGAACATCCCGAGCTGGTGTGGGCGGACGAGCTGCGGCAG
AGGGTGAGGGCTGGCCTCGTGCAGCGGAGAGACAGGCTGTATACTAGTCAAATCCGTGAT
CCGACGATCCAATACGAGGAACGTGAAAAGGACACGGGCGAAGTGTCGTGGGCGCCGCCC
GGGGAGGTGGTGGTAGGAGGCGTGTACCTCAAGCTGTTCCTACAGAACCCCACCTGGAGT
CTGCGGAATCCCAAAAGCTTTCTGCAGGACCTCGTCTCCGAAACCCTGTCGGCACTCAAC
AAGGATTCATCCGAAGGTGGACGCGGGGACACGTGCGCGCGTGCTCTGACCGCCTTGCTC
CGTGCCCGCCCGGCGCTGTGCGAGGCGTGTGCCGCACTGGGCGAGCTGCCGCGCCTCGCC
CGCCTGCTGCCCGCCTGCCCCAGACACGCCGTGCCCGTGCTGGCGGCACTCGCCCACACG
CAGTCGTGTGTGGTGGCGCTGGTCCAGACGGACGCGATGGTGGGTCTCAAGACGGCGGTG
AAGACCTGTCGCGAGGTGGTGGGCCCCGCCTGCGACGCTCTCACCGCGATATTCAGCTCT
CCGGTCAACACGGACAAACTAGTGCTACAGGCTCTTGAATGCGACTTGATCACTGAGTTG
CTCTCGATGCTCGAGGGTCGCGGGTCGGGCGTCGGGCTAGAGGGCGGGAGCGTCGCCCGC
GTCGTCGGAGCCCTAAAAGCTATGAGTCGCGCCGGCTTACACGCGGAAAGAGTGAAAAAT
ATACTAGCAAGATCACCCGTCTGGGAACACTACGCCGCTCAGAGACACGACCTGTTTATA
TCGGCGCCGCAACATCACTCAATAACTGGTATACAATATATGAGTAGTTGCATCATTTTC
CACGCCACTAAAGCCGACCCAGTCCTTACTAGGTTAAACTGTGCTTCTGGTTTATCGACT
CAATCTCTTATAAAATTTACATACCTTGCGGATAGTCGATAA

Protein sequence:

MDEEFFDGVFDSYPKNGKTSDKLQVFSIMDEEDAIIRPLPRVKRLLSEPACLAHVVQLLL
TFDPILVEKVATLLYEIMQDNPEISKLYLTGVFYFMLLYTGSNLLPIARFLRLTHMKQAF
RADQTSSDIMQRSILGQLLPEAMVCYLENHGAEKFAQIFLGEWDTPEAIWNAEMRRMLIM
KVSAHIGEFTPRLRAHVAARYPYLAIPAVRYPQLQRELFCNMFYLRHLCDTQRFPDWPIP
DPVGLLKDVLEAWRREVDKKPSSMTAEQAYTALGLEPTTHDEAAVRKAYYRLAQQFHPDK
NPEGRDRFEAVNQAYEFLCSRNVWTGDGPNTNNILLILRTQTILFQRYSEVLSPYKYAGY
SALLRTARLEAAADTLFSSEAALLPAACELAHATLACSALNAQELCRERGLEVLEEALSR
CVSVLGGSAAGSTAAAVCTHCARCFAVAARFPVCRDAAAELPTLCSDIVPLLRRPELGET
ACAGAEAAAALAADPRCRDRLARAHVIHALLPPALQYDYTLKESGVSTEGDNKQEVANRL
AVQCVAALSALYGPHEEAGEEDERVQAALRMLLTPYICDKLATADPHELLKTLTSNWRTP
YLVWDNSTRAELREALRSRPPEDTLLQDVYYTAHEGLLTVGGVYLDIYNEQPEFLIENPQ
QFVLDLLHFIKEQTKVAKSEETEERLTLALNALANCIIKNPGVEIQCIGHFAVIFGLMSG
GVSRVVAGALRVSLACSRSRACVEEVSAGGLLGHVLPLLAPPAHREALDTLSALLTCTPL
VREALAKGAVIYLLDLFCNCKTPEMREMAVELLSRMMADKLHGPKVRLTICRYVPGVFAD
AMREAGGAGGAGGAGAAAAMHAFDNNHEHPELVWADELRQRVRAGLVQRRDRLYTSQIRD
PTIQYEEREKDTGEVSWAPPGEVVVGGVYLKLFLQNPTWSLRNPKSFLQDLVSETLSALN
KDSSEGGRGDTCARALTALLRARPALCEACAALGELPRLARLLPACPRHAVPVLAALAHT
QSCVVALVQTDAMVGLKTAVKTCREVVGPACDALTAIFSSPVNTDKLVLQALECDLITEL
LSMLEGRGSGVGLEGGSVARVVGALKAMSRAGLHAERVKNILARSPVWEHYAAQRHDLFI
SAPQHHSITGIQYMSSCIIFHATKADPVLTRLNCASGLSTQSLIKFTYLADSR
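
Consistency check (sketch):

The record lists a CDS of 3582 nt and a protein of 1193 residues, which agrees with 3582 = 3 x (1193 + 1) when the final codon (TAA) is a stop. The Python sketch below shows one way such a check could be done by translating the nucleotide sequence and comparing it with the protein sequence. It assumes the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the sequence can be pasted
    as it appears in the record.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nt and last 15 nt of the nucleotide sequence above.
print(translate("ATGGACGAAGAGTTTTTCGACGGG"))
print(translate("GCGGATAGTCGATAA"))
```

Passing the full nucleotide sequence to `translate` and comparing the result with the protein sequence (after removing line breaks) would extend the same check to the whole record.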