DPGLEAN02063 in OGS1.0

New model in OGS2.0  DPOGS214147
Genomic Position  scaffold323:+ 60586-63637
CDS Length  1482
Paired RNAseq reads  2957
Single RNAseq reads  7735
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA006193 (0.0)
Best Drosophila hit  tetratricopeptide repeat protein 2, isoform A (7e-111)
Best Human hit  dnaJ homolog subfamily C member 7 isoform 1 (6e-107)
Best NR hit (blastp)  DnaJ (Hsp40) homolog 9 [Bombyx mori] (0.0)
Best NR hit (blastx)  DnaJ (Hsp40) homolog 9 [Bombyx mori] (0.0)
GeneOntology terms  GO:0031072 heat shock protein binding
InterPro families
IPR018253 Heat shock protein DnaJ, conserved site
IPR011990 Tetratricopeptide-like helical
IPR001623 Heat shock protein DnaJ, N-terminal
IPR003095 Heat shock protein DnaJ
IPR019734 Tetratricopeptide repeat
IPR013026 Tetratricopeptide repeat-containing
IPR013105 Tetratricopeptide TPR2
IPR001440 Tetratricopeptide TPR-1
Orthology group  MCL12371

Nucleotide sequence:

ATGGCTGAGCCAGAAGTAGTGGATTTGGATCTAACAATCGATGATTTAGTTCCCAAAAGT
CCAGAAAGACTGGCTGAGGAAAAAAAGGAGAGCGGAAACCATCTCTATAAATTCAAAAAT
TATAAGGGGGCATTGGCCATGTATGAAGATGCAATCAAACTCTGTCCTGAAAATGCAGCC
TATTATGGCAACAGATCTGCCTGCTACATGATGCTGGGGATGTATAAAAAAGCTTTAGAG
GATGCTCAAAAAGCTGTAGCTCTGGACCCAACATTCACTAAAGGATATATTCGTATGGCT
AAATGTCATATTGCTGTAGGTGATATATCTGGTGCAGAACAGGCGGTTCGTAGTGCAAGC
GAACTCGGTGGGCCAGATTGTGCATCGAACGAACGTCGTGCATTAGAATCACTGCGACGG
TTACATGAAGACGCACAGCGTGCCATGGAGGCAGGAGACTACCGTCGTGTGGTCTTCTGC
ATGGACCGCTGTTTAGAATACAGTCCTTCAAGTATAAAGGCAAAACTTATCAAAGCCGAG
TGCCTTGCAATGATTGGACGCTGTCAGGAAGCTCAGGAAATAGCAAATGATTCACTAAGA
TTTGATAGTTTAGACACAGAGGCAATATATGTACGTGGGTTGTGCCTTTATTTTGAGGAC
AAAGACGAGCAAGCCTTCAAACACTTCCAGCAGGTTTTGAGACTTGCACCAGATCACAAG
AAATCCCTTGAGACTTATAAAAAGGCCAAGCTACTAAAACAAAAGAAAGAGGAAGGCAAT
GAGGCGTTTAAAATGGGTAGATGGCAACAAGCTTTAAATCTGTATAACGAAGCACTGACT
ATTGATAAAAATAACAGAAAAGTCAACGCCAAACTATATTTTAATAAAGCCACTGTGTGC
TCAAAGTTGAATCAAATAGAAGAAGCAGCAGAGGCTTGCACAGCCGCATTGGAGTTAGAT
GAGAACTATGTTAAAGCTTTGTTGCGTCGTGCCAAATGTTACGCCGAACTGGGGAATCAC
GAAGACGCTGTCAAGGACTACGAGAAGCTTTATAAGATCGACAAAAATAAGGAACACAAA
CAGTTACTCCACGAGGCAAAATTGGCTTTAAAGAAATCCAAACGCAAAGACTACTATAAG
ATTTTGGGCATTGAAAAAACAGCATCAGAAGACGATATCAAGAAAGCTTATAGAAAGCGC
GCTCTAGTTCACCATCCGGACAGACACGCGGGGGCTCCGGACAACGAGCGCAGGGAACAG
GAGCGTCGCTTCAAGGAAGTGGGGGAGGCGTATGAAGTGCTCAGTGACCCCAAGAAACGA
GCCCGTTACGATCACGGACAGGACCTTGATGATGGTTCCGGTGGTATTAATATTGATCCA
AATATGATGTTCCAAACCTATTTTAACGGCGGTGGACAAGGTTTTGACTTTTCTTCAGGT
GGAGGCTTCCCGGGATCAGCTTTTAGCTTTCAATTTGGATAG

Protein sequence:

MAEPEVVDLDLTIDDLVPKSPERLAEEKKESGNHLYKFKNYKGALAMYEDAIKLCPENAA
YYGNRSACYMMLGMYKKALEDAQKAVALDPTFTKGYIRMAKCHIAVGDISGAEQAVRSAS
ELGGPDCASNERRALESLRRLHEDAQRAMEAGDYRRVVFCMDRCLEYSPSSIKAKLIKAE
CLAMIGRCQEAQEIANDSLRFDSLDTEAIYVRGLCLYFEDKDEQAFKHFQQVLRLAPDHK
KSLETYKKAKLLKQKKEEGNEAFKMGRWQQALNLYNEALTIDKNNRKVNAKLYFNKATVC
SKLNQIEEAAEACTAALELDENYVKALLRRAKCYAELGNHEDAVKDYEKLYKIDKNKEHK
QLLHEAKLALKKSKRKDYYKILGIEKTASEDDIKKAYRKRALVHHPDRHAGAPDNERREQ
ERRFKEVGEAYEVLSDPKKRARYDHGQDLDDGSGGINIDPNMMFQTYFNGGGQGFDFSSG
GGFPGSAFSFQFG
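
Consistency check (illustrative sketch, not part of the original record):

The CDS length of 1482 nt agrees with the 493-residue protein plus one stop codon (3 x 494 = 1482). The Python sketch below translates the first and last lines of the nucleotide sequence and compares them with the corresponding protein fragments. It assumes the standard genetic code and reading frame 1. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example only.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumed here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the nucleotide sequence listed above.
first_line = "ATGGCTGAGCCAGAAGTAGTGGATTTGGATCTAACAATCGATGATTTAGTTCCCAAAAGT"
last_line = "GGAGGCTTCCCGGGATCAGCTTTTAGCTTTCAATTTGGATAG"

print(translate(first_line))  # MAEPEVVDLDLTIDDLVPKS
print(translate(last_line))   # GGFPGSAFSFQFG

# Length bookkeeping: 493 residues plus one stop codon account for the CDS.
cds_length = 1482
protein_length = 493
print(cds_length == 3 * (protein_length + 1))  # True
```

The last line can be translated on its own because it starts at nucleotide 1441, which is in frame (1440 is divisible by 3).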