DPGLEAN14397 in OGS1.0

New model in OGS2.0DPOGS210525 
Genomic Positionscaffold296:+ 77601-79079
See gene structure
CDS Length1320
Paired RNAseq reads  2675
Single RNAseq reads  6214
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012627 (5e-132)
Best Drosophila hit  CG10565 (1e-86)
Best Human hitdnaJ homolog subfamily C member 2 isoform 1 (2e-66)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC013259 [Tribolium castaneum] (3e-147)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC013259 [Tribolium castaneum] (2e-116)
GeneOntology terms


  
GO:0006457 protein folding
GO:0051082 unfolded protein binding
GO:0031072 heat shock protein binding
GO:0003677 DNA binding
InterPro families




  
IPR014778 Myb, DNA-binding
IPR009057 Homeodomain-like
IPR012287 Homeodomain-related
IPR001005 SANT domain, DNA binding
IPR017877 MYB-like
IPR017884 SANT, eukarya
Orthology groupMCL15720

Nucleotide sequence:

ATGAATGCGCGGTGGTCAGAGAAAAAACATGTGCCTTTGTTGGGAGACGAGAACAGTTCT
CGGGAGTTTGTAGAAAAATTTTACTCTTTTTGGTATGAATTTGACTCGTGGCGTGAGTTT
TCTTATTTGGATGAGGAAGAGAAAGAGAAGGGGTCTGACAGAGAGGAGCGGAGGTGGATA
GAGAAACAAAACAAAGTGGCCAGAGCTAAACTGAAGAAGGAAGAAATGACGAGAATACGC
AGTCTTGTTGACTTAGCGTACGCTAACGACCCCAGGATACAGAGATTCAAACAGGAAGAC
AAAGATAAAAAGATAGCTGCGAAACGTGCCCGCCAGGATGCAGTCCAAGCTAAAAAAGCT
GAGGAAGAGAGATTAATTAAAGAGGCCCAAATTGCTAAACAAAAAGCTGAGGCGGCGGAG
AGGGCGAGGATGGAGGCGGCTCGAGCTGAGAGGGAACTGCAAAAGAAGAATCTGCGCAAG
GAACGAAAGTCACTTAGAGATTTATGTAAAAGTAAAAACTATTTTGCTAAAAACGAAGAC
GAAACCGTCAGTAATATGGCAGCCGTCGAAAAGATTTGTGAACTGTTGAAAGCCACAGAG
ATCCAAGCCCTGATCAAAGATATCGAGTCTAGTGGCCGGGATGCATTCATAAAGGCTATC
ACAGAGTCCGAAGAGAAGCTTGAAGCTGAACGCAGGGCTTTGTTTGAAAATAAAAGAGCT
GAGGAGCAAAAAGCAAAGAAAAATGCAGCCCTTAAGGTTCCCATAGAATGGTCTCCCGAA
ATGATGCAGTTGCTCATCAAAGCTGTCAACCTATTCCCTGCCGGTACAAATGCAAGATGG
GACGTCGTCGCTAACTTCCTGAATCAACACGGAACATTTACTGATGAAAGGCGTTTCAAT
GCTAAAGAAGTTTTAAATAAAGCTAAGGACTTGCAGAGTTCAGATTTCTCGAAGAGCATC
CTAAAGAAAGCTGCGAATGAAGAAGCTTTCGATCAGTTCGAAAAAGACAAAAAGAAGGTT
GTCAACTCGGTGGATGACAATAGTATATCCAAGAATGACACTCCCAAATTAGTGAATGGG
ATCTCCAAACCTAAAATGAATGGGGACGTCAAGGAATCCAAGGAAGAAAAGCCTTGGACC
AAGACCGAGCAGGAACTTCTGGAGCAGGCCATCAAAACATTCCCCGTCAGCACTTCGGAG
AGGTGGGACAAGATCGCCGAATGTATACCGAACCGCTCCAAGAAAGACTGCATGAAGAGG
TACAAAGAGCTGGTAGAATTAGTCAAAGCCAAGAAACAAGCGGCCAACATCTCGAAATAG

Protein sequence:

MNARWSEKKHVPLLGDENSSREFVEKFYSFWYEFDSWREFSYLDEEEKEKGSDREERRWI
EKQNKVARAKLKKEEMTRIRSLVDLAYANDPRIQRFKQEDKDKKIAAKRARQDAVQAKKA
EEERLIKEAQIAKQKAEAAERARMEAARAERELQKKNLRKERKSLRDLCKSKNYFAKNED
ETVSNMAAVEKICELLKATEIQALIKDIESSGRDAFIKAITESEEKLEAERRALFENKRA
EEQKAKKNAALKVPIEWSPEMMQLLIKAVNLFPAGTNARWDVVANFLNQHGTFTDERRFN
AKEVLNKAKDLQSSDFSKSILKKAANEEAFDQFEKDKKKVVNSVDDNSISKNDTPKLVNG
ISKPKMNGDVKESKEEKPWTKTEQELLEQAIKTFPVSTSERWDKIAECIPNRSKKDCMKR
YKELVELVKAKKQAANISK