New model in OGS2.0 | DPOGS213641  |
---|---|
Genomic Position | scaffold543:- 67073-89796 |
CDS Length | 1992 |
Paired RNAseq reads   | 428 |
Single RNAseq reads   | 1309 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA004582 (8e-100) |
Best Drosophila hit   | CG16899 (3e-64) |
Best Human hit | forkhead box protein P1 isoform 1 (2e-64) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC010836 [Tribolium castaneum] (3e-97) |
Best NR hit (blastx)   | PREDICTED: similar to FoxP protein [Tribolium castaneum] (1e-79) |
GeneOntology terms    | GO:0006355 regulation of transcription, DNA-dependent; GO:0043565 sequence-specific DNA binding; GO:0005634 nucleus; GO:0003700 sequence-specific DNA binding transcription factor activity; GO:0046872 metal ion binding; GO:0005622 intracellular; GO:0008270 zinc ion binding |
InterPro families    | IPR011991 Winged helix-turn-helix transcription repressor DNA-binding; IPR001766 Transcription factor, fork head; IPR018122 Transcription factor, fork head, conserved site |
Orthology group | MCL12830 |
Nucleotide sequence:
ATGCCCGTACGAGCAACAGGCGTGACGGAGGAAGGTTCCGCTAACGGTACACAGGTGGCG
CTGTACACGGAATACTCCAGGCACCCGCTCTGGATTGTATTCAAAACATTAGGAGGCATA
CCGCGACAGTCGCCCGACCGGCTCGGACCGCAGTGGCGGCGCTCCTCGCCCGACTCCCGG
AGCACCTGTGACTCCTGGATGTGTCTTCAGCGTGACGAAGATGACGTACACATAGATGAA
GCGAGACCAATGGTGCAGCTACCACTAACGGGTCCGGAGCCGGCACTGAACTCAAGTCCG
GGGGGCGCAGCCAAGCCGCCGCCTCTCAACTCATTCGAGCTGGACCGCTGTTTGGATAGG
GAGGAGTCCGGTCGTTGGGGCGAGGCGGCTCGTCGTCGCCGGCGCAGTTCCCCGCGCCGT
TCGCCCCCGCCCGGAAATGAACCCCTGTCCCTCGCCAGGGCGTCCGCCCCGGCCTCTCCC
CTGGTGACGTCATCTCCATCGTCCCCGTCTCCCGCTCGCTCGCCGCTGCTGGCGCCGGAC
CTGCTGGCGATGCAGCTCCTCGACCAGCACTCCCAGCTGCAGGCGCTCATGAAGCAGAGA
CTCTTCCACCAGCACCACCTGCAGAAACAGCACATGTCGTCGGAGGCAGCTAAGCGTCAG
TTGGAACAGTCCCGCCTCCAGGACCAGATCAACCTGAACCTTCTCTCTCAGTCTCACCTC
CAGCCGCCGGAGACTTCTCCCAGTCTCCAGCAGCAGCAGTTGGTCCAGCAGCTCCAGGCG
GTCCAGCGGCAGTATCTCATGCACGCTCCTATGTCCGTCCCACCAAACGCTCCTCCAGAC
TACGACACGGGCAGCGAGTTGGAGGAGCACCCTCTGTTCGGTCGCGGGGTGTGCAAGTGG
CCGGGCTGCGACGCGCTCGCTGAGGACTTCCAAGCCTTCCTCAAACACCTGGAGGCGGCC
CACACCCTGGACGACCGGTCGGCGGCGCAGGCGCGGGTCCAGATGCAGGTGGTCGCCCAG
CTGGAACTCCAGCTGAGGCGGGAGAGGGACCGCCTGGCCGCCATGATGAGACACCTGCAC
GCCGCCAGGGACAACCACAACAAGATGCACGGTCCGGCCAGCGAGGGCTCCTCCCCCGGG
CCCGTGCGCCGCCGCGTGTCGGACAAGTCCGGAGTCGCCATCGCTGGAGAGATACAGAGG
AACCGGGAGTTCTACAAGACGGCCGACGTGCGGCCGCCCTTCACCTACGCGAGTCTCATA
CGGCAGGCCATCATCGAGTCCCCGGACAAACAGCTGACTCTCAACGAGATCTACAACTGG
TTCCAGTCCACCTTCTGCTACTTCCGACGGAACGCCGCCACCTGGAAGAACGCCGTCCGC
CACAACCTGTCGCTCCACAAGTGCTTCATGAGGGTGGAGAACGTGAAGGGGGCCGTGTGG
ACCGTGGACGAGGTGGAGTTCTACAAGAGGAGGCCCCAGCGGGCCCACGCCGCCATCCAC
ACCGGATTCATGGGAGCTCAGAGTCCGCCGATGATGACCAGCCCTCACAGTTACAATGAA
GCTCTGAAGAGGAATCTACAGGGTATGATGGAGGATTGTAACCTGTCGTACATGTCAGGT
GACGACCACGTGATGCAGACTGAAGAGTATCCATCCTCACACGACGACTACAGTCAGAGT
CGTCCTCCGCTGCTGAACGGCTACGGCGGCGGCGGCTCCTCGCACGAGCCCAAGCCCGAG
GACCTGAGCGCGGGCGAGGCGCAGGACGTCAAGCCTAACCTGTACGCGCTCAAGCACGAG
CTGCACGCCTCCGACTACTCCAACAAAGGTGACTACCAGACCTCCGGCAAGGAGCCCGGC
AGCCGCTCCTGTGAGAGCGCTCCCCACGACGAATACTCCAGAGACGACCGATACCGACCC
GGCCACGGACAGCCCGCCCGCGACGACGAGGCCGAGAACCTGTCGCTCAACGAGCAGAGG
TCCGACAAGTAG
Protein sequence:
MPVRATGVTEEGSANGTQVALYTEYSRHPLWIVFKTLGGIPRQSPDRLGPQWRRSSPDSR
STCDSWMCLQRDEDDVHIDEARPMVQLPLTGPEPALNSSPGGAAKPPPLNSFELDRCLDR
EESGRWGEAARRRRRSSPRRSPPPGNEPLSLARASAPASPLVTSSPSSPSPARSPLLAPD
LLAMQLLDQHSQLQALMKQRLFHQHHLQKQHMSSEAAKRQLEQSRLQDQINLNLLSQSHL
QPPETSPSLQQQQLVQQLQAVQRQYLMHAPMSVPPNAPPDYDTGSELEEHPLFGRGVCKW
PGCDALAEDFQAFLKHLEAAHTLDDRSAAQARVQMQVVAQLELQLRRERDRLAAMMRHLH
AARDNHNKMHGPASEGSSPGPVRRRVSDKSGVAIAGEIQRNREFYKTADVRPPFTYASLI
RQAIIESPDKQLTLNEIYNWFQSTFCYFRRNAATWKNAVRHNLSLHKCFMRVENVKGAVW
TVDEVEFYKRRPQRAHAAIHTGFMGAQSPPMMTSPHSYNEALKRNLQGMMEDCNLSYMSG
DDHVMQTEEYPSSHDDYSQSRPPLLNGYGGGGSSHEPKPEDLSAGEAQDVKPNLYALKHE
LHASDYSNKGDYQTSGKEPGSRSCESAPHDEYSRDDRYRPGHGQPARDDEAENLSLNEQR
SDK