New model in OGS2.0 | DPOGS215981  |
---|---|
Genomic Position | scaffold734:+ 136523-146183 |
See gene structure | |
CDS Length | 3630 |
Paired RNAseq reads   | 4243 |
Single RNAseq reads   | 12388 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA001211 (3e-125) |
Best Drosophila hit   | helix loop helix protein 106, isoform C (3e-54) |
Best Human hit | sterol regulatory element-binding protein 1 isoform b (6e-58) |
Best NR hit (blastp)   | PREDICTED: similar to sterol regulatory element-binding protein 1 [Tribolium castaneum] (4e-103) |
Best NR hit (blastx)   | PREDICTED: similar to sterol regulatory element-binding protein 1 [Tribolium castaneum] (4e-87) |
GeneOntology terms    | GO:0030528 transcription regulator activity GO:0005634 nucleus GO:0045449 regulation of transcription |
InterPro families    | IPR011598 Helix-loop-helix DNA-binding IPR001092 Helix-loop-helix DNA-binding domain |
Orthology group | MCL11243 |
Nucleotide sequence:
ATGGATCCGATGGAGCCTTTAATAAACAATGATGTTTTTAATGTCAACGAAATAGCTGAA
ATAGAAGATTTCTTGAATGGCTGTGACGGAGATTTTATGAAAAAGTTAGAAGAAGAACTA
GTATTTGCAGATAATGACACTGGTCTGTTGAGCGTGGACACTAAATTTAGTACGAATGTT
TCATCACCCCAAGATTCTCCTTATTACACTGCTCCGGGGAACCCATTGGTTTCTCAACAT
CCGCAAAAGAGAAAACTGCCTCCATTTACTCAACAACCGAAAACAAGCCCAGTTTTAGGG
GTGTATGGCAGAGAAGAAAGTTATAATTTGGAAGTTAAAGCAGAAAGCCATCAGTTGCCT
GAGGTGCTACTTCAAAAAGCTCAAAACCAGACAGTTCAAACTCCGATGTTTGTGCAACAA
GTTATTCCAAAACCTCAATATGTTGCCTTAGGAAGCTTACAAAAATTACCCGATGGGGTG
GCATCTCTTGTTCAATTAGATTCTCCAATAAATCAAAATACTAAAGCCCAACCCGCAGCC
AAGCCATTGCTGCTCCCAAATAATGCAAAAGGGGTTACACCAGTTATTTTGAAGAGCAGC
GACTCAAATTTTTCACCTGTGATATTACAGTCAAATATTCTCAATCCTGAAACTCAGACT
TTGATGTACACCAGTGCTCCTGTACAAGGAACAACTCAAAGTATTATTACCAATTCTGGT
GCTGAAAGTCAGCCTGTACACACTTTCTTTGCTAGTAATAATGGCCCTACATTGGTTACT
GGCATACCATTGGTCCTGGATGGTGACAAAATTTCCTTCAACCAATCTTCGAATGGAAGT
CCTCCTAAAGTTAAAGAAGTAAAGAGAAGTGCCCACAATGCTATCGAAAGAAGATATAGA
ACTAGCATAAACGATAGAATAGTGGAGCTTAAGAATATGTTAGTAGGTGAAGAAGCAAAG
TTAAATAAGTCAGCAATATTAAGAAAAACAATAGATTACATAAAGTACCTGCAAAATCAG
AATACTAGACTTAAACAAGAGAATATAGCTCTGAAACTGGTGTGTCAGAAGTCTGGAGTG
AAGGATGTTGTGTTTGATGGAGCCTACACCCCACCACATAGTGACATATCATCCCCCTAC
CACTCTCCCCATGGTATGGATAGCACTCCTTCATCACCAGAGAGTAAGGTCGAAGAAAAA
TACTCGAAAATTGTTATTGGAATGGGAGATCATTCTCGTCTAGCATTATGTGCCTTTATG
ATAGGGCTGATTGCCTTCAATCCATTCAGTGCTTTCTTTGGCAGTTTTATGTCCGAATCC
TCTTATGATTACAACGCTCGACTTGATCAACGAAGAATACTTTCCGAAGATAGTTTTGGT
GCTGGAAATGTATCGTGGGGAGCCTGGCTATTCAATATGTTTTTGATATATTTGGTAAAT
ACAATAATTTTGGGAGGTTGTCTCATCAAACTTCTAGTGTATGGGGATTCTGTACCAAAA
TCACAATCTAAGGAAGCCGGCCTATTTTACAAACACAAGCAACAAGCTAATAATCATTTA
AAGAAGAATGATCTGGAGAATGCTCGGAGTGAGCTGAATCGTGCTCTGTCGGTATGTGGT
CGTAGTGTCCAGGCGGGTGGGTGGGGGCGTTACTCAGCGCTCACTGCTGCTGTAATGAGA
CAGATACTGCAACGACTTCCCTTGGGAGGCTTCCTGGCAAGACGAGCTGGAGATCTGTGG
GGTGACAGTCCAGCGAGACGCGCCACTCAGCACTGGGCCAAAGAAGTGTCTATGGTGTCT
CACAAATTGGCCCAATTGGAAATATTATCTAATCAGACGAGTGGCAGTAAATGTGTACTA
CTGGCGTTACAAGCTGTCAACCTTGCTGAAGTCACGAGCGATAAGCAGTTACTCGCCGAG
ACTTATGTTACCGCGGCGTTAGTCTTTAAGGACTATATGCCAAATTTTGGAAAATGGCTA
TGTGGATACTACCTGCGTCTATGTACATATTGGTGTTGGGAGACGATCCCTGAGGGTAAT
CCGCGTGTACGGTGGGCGACCAGCTCACGGGGACAGGACTTCCTCAGAACCCGTCGCTGG
GTGTATGAACAGAAACCTGCTTACCAACTGTTTTCCAGACTACCCACGCTCACCGATCCA
CTGGCTTATGCTATGAGGGCCTACCACCTGGAACTGCTGCAAACGAGCCTGCAAATGCTG
CTTTGTGCTGACGAACGCAGCAGCACACGAGATGTCCTCGACCTGGTGAAGCTGATTATT
GATGACGTGTCCACAGACGCGCCCCATCACTCAGGTTGCTGGGACCCGGTGTTAGAGTGG
TGGGCTAGCGTCGTTGGCGCTGCGGCAGCCTGCTTGCTGGCCGACGCGCCCGCCATCGCC
GACCTCGCCGACAAACTGGCCGTTCTGCCGGACGAACTCGCCACCAGTGAGGATCCGCTG
CCGGGATACTACCTGCGTCTATGTACATATTGGTGTTGGGAGACGATCCCTGAGGGTAAT
CCGCGTGTACGGTGGGCGACCAGCTCACGGGGACAGGACTTCCTCAGAACCCGTCGCTGG
GTGTATGAACAGAAACCTGCTTACCAACTGTTTTCCAGACTACCCACGCTCACCGATCCA
CTGGCTTATGCTATGAGGGCCTACCACCTGGAACTGCTGCAAACGAGCCTGCAAATGCTG
CTTTGTGCTGACGAACGCAGCAGCACACGAGATGTCCTCGACCTGGTGAAGCTGATTATT
GATGACGTGTCCACAGACGCGCCCCATCACTCAGGTTGCTGGGACCCGGTGTTAGAGTGG
TGGGCTAGCGTCGTTGGCGCTGCGGCAGCCTGCTTGCTGGCCGACGCGCCGGCCATCGCC
GACCTCGCCGACAAACTGGCCGTTCTGCCGGACGAACTCGCCACCAGTGAGGATCCGCTG
CCGGGTTCGCTGGACATGGCCTACAAAAGCCGGCGCGGGCTGCTCGCACTAGCTCACTGC
TCAGATGAAGACAAGCACTCGAAGACCACTCACACGCTGCTCAAGGTTTGTGATGTCGCC
GGAGCCCGGCTAGCGGATTCCTTGGCGTATTACTGCTGCCGGAAGCCGACACAGCTCATG
ATGCTGATGCAGGTCCTATGCTGTGATTGGGTGCTGGAGGTGAGAGCGGGGGTGTGGGAG
GCGCGCGGCGCGGGAGGGGGCGGGTCGCCCGTCCACAACCAGCTGGCTGGCTTCCAGAGG
GATTTACATTCTTTGAGGAGGCTGTCGCAGAACTTACCGTGGGTGACGTCAGCCCACAAG
GACGTGAGGCGGCACTGCCGCATGATGGCGGGCGCGGCGCCGCGGCGCACGCAACAACTG
CTGGACGGGAGCCTCAGACCCAGGTCTAACAGGACCTCGCTGATATGCGGCAAGGAGCGT
GCGTTAGAGGGCGGGGGTGGGGAGGGCGAACGTGCGGTAGCTTTATACATGGCGTGCAAG
CATCTCCCGGCGGCGGTGCTAGCGACCCCCGGCGAGAGGGCCGGCATGTTGGCGCAAGCT
GCAGCTACGCTACAGAAGATAGGCCATCGTTCAAGACTACCACACTGCTACCACCTCATG
AAGACCTTTGGCACTCTGCCCGCGCCTTGA
Protein sequence:
MDPMEPLINNDVFNVNEIAEIEDFLNGCDGDFMKKLEEELVFADNDTGLLSVDTKFSTNV
SSPQDSPYYTAPGNPLVSQHPQKRKLPPFTQQPKTSPVLGVYGREESYNLEVKAESHQLP
EVLLQKAQNQTVQTPMFVQQVIPKPQYVALGSLQKLPDGVASLVQLDSPINQNTKAQPAA
KPLLLPNNAKGVTPVILKSSDSNFSPVILQSNILNPETQTLMYTSAPVQGTTQSIITNSG
AESQPVHTFFASNNGPTLVTGIPLVLDGDKISFNQSSNGSPPKVKEVKRSAHNAIERRYR
TSINDRIVELKNMLVGEEAKLNKSAILRKTIDYIKYLQNQNTRLKQENIALKLVCQKSGV
KDVVFDGAYTPPHSDISSPYHSPHGMDSTPSSPESKVEEKYSKIVIGMGDHSRLALCAFM
IGLIAFNPFSAFFGSFMSESSYDYNARLDQRRILSEDSFGAGNVSWGAWLFNMFLIYLVN
TIILGGCLIKLLVYGDSVPKSQSKEAGLFYKHKQQANNHLKKNDLENARSELNRALSVCG
RSVQAGGWGRYSALTAAVMRQILQRLPLGGFLARRAGDLWGDSPARRATQHWAKEVSMVS
HKLAQLEILSNQTSGSKCVLLALQAVNLAEVTSDKQLLAETYVTAALVFKDYMPNFGKWL
CGYYLRLCTYWCWETIPEGNPRVRWATSSRGQDFLRTRRWVYEQKPAYQLFSRLPTLTDP
LAYAMRAYHLELLQTSLQMLLCADERSSTRDVLDLVKLIIDDVSTDAPHHSGCWDPVLEW
WASVVGAAAACLLADAPAIADLADKLAVLPDELATSEDPLPGYYLRLCTYWCWETIPEGN
PRVRWATSSRGQDFLRTRRWVYEQKPAYQLFSRLPTLTDPLAYAMRAYHLELLQTSLQML
LCADERSSTRDVLDLVKLIIDDVSTDAPHHSGCWDPVLEWWASVVGAAAACLLADAPAIA
DLADKLAVLPDELATSEDPLPGSLDMAYKSRRGLLALAHCSDEDKHSKTTHTLLKVCDVA
GARLADSLAYYCCRKPTQLMMLMQVLCCDWVLEVRAGVWEARGAGGGGSPVHNQLAGFQR
DLHSLRRLSQNLPWVTSAHKDVRRHCRMMAGAAPRRTQQLLDGSLRPRSNRTSLICGKER
ALEGGGGEGERAVALYMACKHLPAAVLATPGERAGMLAQAAATLQKIGHRSRLPHCYHLM
KTFGTLPAP