New model in OGS2.0 | DPOGS207522  |
---|---|
Genomic Position | scaffold134:+ 28172-34226 |
See gene structure | |
CDS Length | 2244 |
Paired RNAseq reads   | 654 |
Single RNAseq reads   | 1661 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA001884 (7e-69) |
Best Drosophila hit   | CG9305 (3e-22) |
Best Human hit | transcription factor TFIIIB component B'' homolog (7e-14) |
Best NR hit (blastp)   | PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum] (2e-39) |
Best NR hit (blastx)   | AGAP008674-PA [Anopheles gambiae str. PEST] (9e-26) |
GeneOntology terms    | GO:0003700 sequence-specific DNA binding transcription factor activity GO:0005634 nucleus |
InterPro families    | IPR009057 Homeodomain-like IPR001005 SANT domain, DNA binding |
Orthology group | MCL19445 |
Nucleotide sequence:
ATGTCTACACGGAGAGCAAGAATTAAAGCTGTTAATTTTTTGCCACTGCGACGAAAAAAT
ACTGAAACATCTGAGAATAAAAATAAAGTCTCTGATTCAAAAGACAATGCTGAAAAAAAT
TCCAAAGATTCTCAAACTCCTCTGCCATCAGCCTCCGCAAATCAAGAAGAAAATACGAGT
GTAATAAAAAATACACCGGAAAACCAAAACTCAAGTACGAACACGCCGTCAAAAGAACAG
AGCTTCAATAAATCGCCGGCCACCAACAGTACCACACCCGGCGAAACTCCTCATAACGCA
AACAATTCAACGTCATTCAAAGATAAAGCACAATCAATTAATTCAATAAAAGATACCAAC
ATTTTTGTATCCCCTTTGAGACGAAATAGTCCAAAAAAGACATTCGCATCTCCAATAGTC
CCTTCCCCTAAAGTCATAAGAAATGTGGAACAAAAATTAACCACAACACTCATTGGTAGG
AAGACTCCTGTAGCACAAAAAATCAATGAAAGTAATGAACTCCAAATATCAGGAAGTGTA
ATTAATATCACTCACAATATTACCAATGACAGCCGAAGGAATGAACAAGCCGGACCTGGC
ACACCGACCACAGCAAATGATGAAGCGATGGATGGCATAATTCCCTTGCAATCAGCATCA
ACTGCACCTAAGCCAATAGAGTTGTTGAAGAGTGAAATTATATCGGAAAATGCTGAGGTT
CTTTTTGATCCAATCGTCCCTCTCCCCTCACCGAGTAAGGTGAGACCTAAATTGCGTCCA
GCTCCAAGATTGGGGCCTCATCGACGGAACAGTGTACAGGGTAGTGCAAGTGAATCCGAG
GATGAAAGTCGAAGGAGTCTTCTGTCTGGAGGCAATACACCAGCTCATCAAAGACAAAGG
CATGACTCTCAAATGTCTCATAACACACTTGTGTCCTTACCCAACAGGAGATCCCGTAAT
GAACGTCGTTTGAACGCCATGAGACGTCGAGAAAACGTTAAACGTGACTCCCTGACGATG
TACGACCTCATCTTCTACAATCCAACCTCGAACCCTATTGTTCCGGACGACGACGAAATA
AATGCTAAGGAAGCGAACGAGAAGGAGATTCTGGAGAGTAACAAAAAGGAAGAGACCAAA
ACGGATAACCCGAAGGAGAACGCCGCGCCCGTGCCGCAGATAAAACTTGGACCTAACGGG
ACAATAATGATCGACGAGGAAAGCTTGGTCATAAAACAGACTGAGTCCGACCGTAAAGTG
TCTTCAGTGGTGCACGAGGGGTCCTGGTCAAAGAGCAGCGGGTATAAGGCGAAACATCTC
CGCTCTAGAGACTGGAGCTCCGCCGAGACTGTCAGGTTCTATAGAGCCTTGGCTGTTATT
GGAACAGATTTCTCGTTGATGGCACCTCTATTCCCTGATAGGACGAGACGAGAACTTAAA
TTTAAGTTCAAGAAAGAAGAAAGGATGAACGGCGCCCAAGTTGATAAGGCGTTGCGTTCG
ACCATCGAATGGGATGTGCTTAGACTCAAAGAAGAGTTCAAAGAAGAAAGAGCCCTGGCC
GCTAAACAAGCGGAGAGAGAAAGACAGTCACTGATAGAAGAGAAGAAGTTGGAGAGAGAA
AGACTGAAGGCTGCCAGAGAAACACGTGTTAGATCTAGCCGAGGTTCAAAAGCTCTGTCA
TCAAACATGTTGCCGGGGCTGAATAAAGTTCACAACGAAGTGTTCACAGCAGATGGTATT
ATAGAAAGAGCTAATCGGAGACCACATAAAAACAAACATGGAACAATCCAGTCATCGGAT
AAAAATAAAGATAATCAATCAAATGTAAGCCAGAACACACAAAATCAAGACGTAGCTGTA
CTAACAAAATTACCACCAATGCAAACTAAAACGCCCGAAGCCGTCAATACTTTGCCACCT
ATTCCACCGAATATTGAAACCGGCTCTTTGGTAGTTTTAACGGTCGATGACCCCTCTTCA
CCCGCCAAAAAAATGCTACAGACATACATCGCTCGTGGCCCGGGTCAGTTGACGCCAGTC
GCATTACCCACAACCTTCTTAAACTCCGTGGTAGGGTATATGAAGAAAAACAAAGGTCAA
GGGTCACCACAAATCATGTCGCCGGGCAGCGCTGCAAGCTACGACAGCAGATCGAGCGGA
ACTCCCGGAGTCCCAAACATTTCAGTGTTGCCAAGTCCAGCAAAAAGACAAAGACACAGC
TCATTCACCATAACGCAGCTTTGA
Protein sequence:
MSTRRARIKAVNFLPLRRKNTETSENKNKVSDSKDNAEKNSKDSQTPLPSASANQEENTS
VIKNTPENQNSSTNTPSKEQSFNKSPATNSTTPGETPHNANNSTSFKDKAQSINSIKDTN
IFVSPLRRNSPKKTFASPIVPSPKVIRNVEQKLTTTLIGRKTPVAQKINESNELQISGSV
INITHNITNDSRRNEQAGPGTPTTANDEAMDGIIPLQSASTAPKPIELLKSEIISENAEV
LFDPIVPLPSPSKVRPKLRPAPRLGPHRRNSVQGSASESEDESRRSLLSGGNTPAHQRQR
HDSQMSHNTLVSLPNRRSRNERRLNAMRRRENVKRDSLTMYDLIFYNPTSNPIVPDDDEI
NAKEANEKEILESNKKEETKTDNPKENAAPVPQIKLGPNGTIMIDEESLVIKQTESDRKV
SSVVHEGSWSKSSGYKAKHLRSRDWSSAETVRFYRALAVIGTDFSLMAPLFPDRTRRELK
FKFKKEERMNGAQVDKALRSTIEWDVLRLKEEFKEERALAAKQAERERQSLIEEKKLERE
RLKAARETRVRSSRGSKALSSNMLPGLNKVHNEVFTADGIIERANRRPHKNKHGTIQSSD
KNKDNQSNVSQNTQNQDVAVLTKLPPMQTKTPEAVNTLPPIPPNIETGSLVVLTVDDPSS
PAKKMLQTYIARGPGQLTPVALPTTFLNSVVGYMKKNKGQGSPQIMSPGSAASYDSRSSG
TPGVPNISVLPSPAKRQRHSSFTITQL