DPGLEAN21476 in OGS1.0

New model in OGS2.0  DPOGS215981
Genomic Position  scaffold734:+ 136523-146183
CDS Length  3630
Paired RNAseq reads  4243
Single RNAseq reads  12388
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA001211 (3e-125)
Best Drosophila hit  helix loop helix protein 106, isoform C (3e-54)
Best Human hit  sterol regulatory element-binding protein 1 isoform b (6e-58)
Best NR hit (blastp)  PREDICTED: similar to sterol regulatory element-binding protein 1 [Tribolium castaneum] (4e-103)
Best NR hit (blastx)  PREDICTED: similar to sterol regulatory element-binding protein 1 [Tribolium castaneum] (4e-87)
GeneOntology terms
GO:0030528 transcription regulator activity
GO:0005634 nucleus
GO:0045449 regulation of transcription
InterPro families
IPR011598 Helix-loop-helix DNA-binding
IPR001092 Helix-loop-helix DNA-binding domain
Orthology group  MCL11243

Nucleotide sequence:

ATGGATCCGATGGAGCCTTTAATAAACAATGATGTTTTTAATGTCAACGAAATAGCTGAA
ATAGAAGATTTCTTGAATGGCTGTGACGGAGATTTTATGAAAAAGTTAGAAGAAGAACTA
GTATTTGCAGATAATGACACTGGTCTGTTGAGCGTGGACACTAAATTTAGTACGAATGTT
TCATCACCCCAAGATTCTCCTTATTACACTGCTCCGGGGAACCCATTGGTTTCTCAACAT
CCGCAAAAGAGAAAACTGCCTCCATTTACTCAACAACCGAAAACAAGCCCAGTTTTAGGG
GTGTATGGCAGAGAAGAAAGTTATAATTTGGAAGTTAAAGCAGAAAGCCATCAGTTGCCT
GAGGTGCTACTTCAAAAAGCTCAAAACCAGACAGTTCAAACTCCGATGTTTGTGCAACAA
GTTATTCCAAAACCTCAATATGTTGCCTTAGGAAGCTTACAAAAATTACCCGATGGGGTG
GCATCTCTTGTTCAATTAGATTCTCCAATAAATCAAAATACTAAAGCCCAACCCGCAGCC
AAGCCATTGCTGCTCCCAAATAATGCAAAAGGGGTTACACCAGTTATTTTGAAGAGCAGC
GACTCAAATTTTTCACCTGTGATATTACAGTCAAATATTCTCAATCCTGAAACTCAGACT
TTGATGTACACCAGTGCTCCTGTACAAGGAACAACTCAAAGTATTATTACCAATTCTGGT
GCTGAAAGTCAGCCTGTACACACTTTCTTTGCTAGTAATAATGGCCCTACATTGGTTACT
GGCATACCATTGGTCCTGGATGGTGACAAAATTTCCTTCAACCAATCTTCGAATGGAAGT
CCTCCTAAAGTTAAAGAAGTAAAGAGAAGTGCCCACAATGCTATCGAAAGAAGATATAGA
ACTAGCATAAACGATAGAATAGTGGAGCTTAAGAATATGTTAGTAGGTGAAGAAGCAAAG
TTAAATAAGTCAGCAATATTAAGAAAAACAATAGATTACATAAAGTACCTGCAAAATCAG
AATACTAGACTTAAACAAGAGAATATAGCTCTGAAACTGGTGTGTCAGAAGTCTGGAGTG
AAGGATGTTGTGTTTGATGGAGCCTACACCCCACCACATAGTGACATATCATCCCCCTAC
CACTCTCCCCATGGTATGGATAGCACTCCTTCATCACCAGAGAGTAAGGTCGAAGAAAAA
TACTCGAAAATTGTTATTGGAATGGGAGATCATTCTCGTCTAGCATTATGTGCCTTTATG
ATAGGGCTGATTGCCTTCAATCCATTCAGTGCTTTCTTTGGCAGTTTTATGTCCGAATCC
TCTTATGATTACAACGCTCGACTTGATCAACGAAGAATACTTTCCGAAGATAGTTTTGGT
GCTGGAAATGTATCGTGGGGAGCCTGGCTATTCAATATGTTTTTGATATATTTGGTAAAT
ACAATAATTTTGGGAGGTTGTCTCATCAAACTTCTAGTGTATGGGGATTCTGTACCAAAA
TCACAATCTAAGGAAGCCGGCCTATTTTACAAACACAAGCAACAAGCTAATAATCATTTA
AAGAAGAATGATCTGGAGAATGCTCGGAGTGAGCTGAATCGTGCTCTGTCGGTATGTGGT
CGTAGTGTCCAGGCGGGTGGGTGGGGGCGTTACTCAGCGCTCACTGCTGCTGTAATGAGA
CAGATACTGCAACGACTTCCCTTGGGAGGCTTCCTGGCAAGACGAGCTGGAGATCTGTGG
GGTGACAGTCCAGCGAGACGCGCCACTCAGCACTGGGCCAAAGAAGTGTCTATGGTGTCT
CACAAATTGGCCCAATTGGAAATATTATCTAATCAGACGAGTGGCAGTAAATGTGTACTA
CTGGCGTTACAAGCTGTCAACCTTGCTGAAGTCACGAGCGATAAGCAGTTACTCGCCGAG
ACTTATGTTACCGCGGCGTTAGTCTTTAAGGACTATATGCCAAATTTTGGAAAATGGCTA
TGTGGATACTACCTGCGTCTATGTACATATTGGTGTTGGGAGACGATCCCTGAGGGTAAT
CCGCGTGTACGGTGGGCGACCAGCTCACGGGGACAGGACTTCCTCAGAACCCGTCGCTGG
GTGTATGAACAGAAACCTGCTTACCAACTGTTTTCCAGACTACCCACGCTCACCGATCCA
CTGGCTTATGCTATGAGGGCCTACCACCTGGAACTGCTGCAAACGAGCCTGCAAATGCTG
CTTTGTGCTGACGAACGCAGCAGCACACGAGATGTCCTCGACCTGGTGAAGCTGATTATT
GATGACGTGTCCACAGACGCGCCCCATCACTCAGGTTGCTGGGACCCGGTGTTAGAGTGG
TGGGCTAGCGTCGTTGGCGCTGCGGCAGCCTGCTTGCTGGCCGACGCGCCCGCCATCGCC
GACCTCGCCGACAAACTGGCCGTTCTGCCGGACGAACTCGCCACCAGTGAGGATCCGCTG
CCGGGATACTACCTGCGTCTATGTACATATTGGTGTTGGGAGACGATCCCTGAGGGTAAT
CCGCGTGTACGGTGGGCGACCAGCTCACGGGGACAGGACTTCCTCAGAACCCGTCGCTGG
GTGTATGAACAGAAACCTGCTTACCAACTGTTTTCCAGACTACCCACGCTCACCGATCCA
CTGGCTTATGCTATGAGGGCCTACCACCTGGAACTGCTGCAAACGAGCCTGCAAATGCTG
CTTTGTGCTGACGAACGCAGCAGCACACGAGATGTCCTCGACCTGGTGAAGCTGATTATT
GATGACGTGTCCACAGACGCGCCCCATCACTCAGGTTGCTGGGACCCGGTGTTAGAGTGG
TGGGCTAGCGTCGTTGGCGCTGCGGCAGCCTGCTTGCTGGCCGACGCGCCGGCCATCGCC
GACCTCGCCGACAAACTGGCCGTTCTGCCGGACGAACTCGCCACCAGTGAGGATCCGCTG
CCGGGTTCGCTGGACATGGCCTACAAAAGCCGGCGCGGGCTGCTCGCACTAGCTCACTGC
TCAGATGAAGACAAGCACTCGAAGACCACTCACACGCTGCTCAAGGTTTGTGATGTCGCC
GGAGCCCGGCTAGCGGATTCCTTGGCGTATTACTGCTGCCGGAAGCCGACACAGCTCATG
ATGCTGATGCAGGTCCTATGCTGTGATTGGGTGCTGGAGGTGAGAGCGGGGGTGTGGGAG
GCGCGCGGCGCGGGAGGGGGCGGGTCGCCCGTCCACAACCAGCTGGCTGGCTTCCAGAGG
GATTTACATTCTTTGAGGAGGCTGTCGCAGAACTTACCGTGGGTGACGTCAGCCCACAAG
GACGTGAGGCGGCACTGCCGCATGATGGCGGGCGCGGCGCCGCGGCGCACGCAACAACTG
CTGGACGGGAGCCTCAGACCCAGGTCTAACAGGACCTCGCTGATATGCGGCAAGGAGCGT
GCGTTAGAGGGCGGGGGTGGGGAGGGCGAACGTGCGGTAGCTTTATACATGGCGTGCAAG
CATCTCCCGGCGGCGGTGCTAGCGACCCCCGGCGAGAGGGCCGGCATGTTGGCGCAAGCT
GCAGCTACGCTACAGAAGATAGGCCATCGTTCAAGACTACCACACTGCTACCACCTCATG
AAGACCTTTGGCACTCTGCCCGCGCCTTGA

Protein sequence:

MDPMEPLINNDVFNVNEIAEIEDFLNGCDGDFMKKLEEELVFADNDTGLLSVDTKFSTNV
SSPQDSPYYTAPGNPLVSQHPQKRKLPPFTQQPKTSPVLGVYGREESYNLEVKAESHQLP
EVLLQKAQNQTVQTPMFVQQVIPKPQYVALGSLQKLPDGVASLVQLDSPINQNTKAQPAA
KPLLLPNNAKGVTPVILKSSDSNFSPVILQSNILNPETQTLMYTSAPVQGTTQSIITNSG
AESQPVHTFFASNNGPTLVTGIPLVLDGDKISFNQSSNGSPPKVKEVKRSAHNAIERRYR
TSINDRIVELKNMLVGEEAKLNKSAILRKTIDYIKYLQNQNTRLKQENIALKLVCQKSGV
KDVVFDGAYTPPHSDISSPYHSPHGMDSTPSSPESKVEEKYSKIVIGMGDHSRLALCAFM
IGLIAFNPFSAFFGSFMSESSYDYNARLDQRRILSEDSFGAGNVSWGAWLFNMFLIYLVN
TIILGGCLIKLLVYGDSVPKSQSKEAGLFYKHKQQANNHLKKNDLENARSELNRALSVCG
RSVQAGGWGRYSALTAAVMRQILQRLPLGGFLARRAGDLWGDSPARRATQHWAKEVSMVS
HKLAQLEILSNQTSGSKCVLLALQAVNLAEVTSDKQLLAETYVTAALVFKDYMPNFGKWL
CGYYLRLCTYWCWETIPEGNPRVRWATSSRGQDFLRTRRWVYEQKPAYQLFSRLPTLTDP
LAYAMRAYHLELLQTSLQMLLCADERSSTRDVLDLVKLIIDDVSTDAPHHSGCWDPVLEW
WASVVGAAAACLLADAPAIADLADKLAVLPDELATSEDPLPGYYLRLCTYWCWETIPEGN
PRVRWATSSRGQDFLRTRRWVYEQKPAYQLFSRLPTLTDPLAYAMRAYHLELLQTSLQML
LCADERSSTRDVLDLVKLIIDDVSTDAPHHSGCWDPVLEWWASVVGAAAACLLADAPAIA
DLADKLAVLPDELATSEDPLPGSLDMAYKSRRGLLALAHCSDEDKHSKTTHTLLKVCDVA
GARLADSLAYYCCRKPTQLMMLMQVLCCDWVLEVRAGVWEARGAGGGGSPVHNQLAGFQR
DLHSLRRLSQNLPWVTSAHKDVRRHCRMMAGAAPRRTQQLLDGSLRPRSNRTSLICGKER
ALEGGGGEGERAVALYMACKHLPAAVLATPGERAGMLAQAAATLQKIGHRSRLPHCYHLM
KTFGTLPAP
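
Consistency check (illustrative sketch, not part of the original record):

The nucleotide and protein sequences above should be related by translation with the standard genetic code, and a CDS Length of 3630 corresponds to 1209 residues plus one stop codon. The following minimal Python sketch shows one way to check this. The function names translate and expected_protein_length are hypothetical helpers introduced here for illustration, and the use of the standard genetic code (NCBI translation table 1) is an assumption, since the record does not state which table was used.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), written in
# the conventional TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, ignoring whitespace."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int) -> int:
    """Residue count implied by a CDS length that includes one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGATCCGATGGAGCCTTTAATAAACAATGATGTTTTTAATGTCAACGAAATAGCTGAA"
last_line = "AAGACCTTTGGCACTCTGCCCGCGCCTTGA"

print(translate(first_line))          # MDPMEPLINNDVFNVNEIAE
print(translate(last_line))           # KTFGTLPAP*
print(expected_protein_length(3630))  # 1209
```

To check the whole record, the full nucleotide block can be passed to translate as one string (line breaks are ignored) and the result, minus the trailing "*", compared with the protein block.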