New model in OGS2.0 | DPOGS207171  |
---|---|
Genomic Position | scaffold7:- 780775-813376 |
See gene structure | |
CDS Length | 1314 |
Paired RNAseq reads   | 216 |
Single RNAseq reads   | 531 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA000635 (2e-52) |
Best Drosophila hit   | crocodile (3e-30) |
Best Human hit | forkhead box protein L2 (4e-36) |
Best NR hit (blastp)   | PREDICTED: similar to forkhead protein/ forkhead protein domain [Nasonia vitripennis] (2e-50) |
Best NR hit (blastx)   | forkhead protein/ forkhead protein domain [Culex quinquefasciatus] (6e-43) |
GeneOntology terms    | GO:0043565 sequence-specific DNA binding GO:0003700 sequence-specific DNA binding transcription factor activity GO:0006355 regulation of transcription, DNA-dependent GO:0005634 nucleus GO:0045449 regulation of transcription GO:0006350 transcription GO:0003677 DNA binding |
InterPro families    | IPR001766 Transcription factor, fork head IPR011991 Winged helix-turn-helix transcription repressor DNA-binding IPR018122 Transcription factor, fork head, conserved site |
Orthology group | MCL16668 |
Nucleotide sequence:
ATGAACGAATTGTCCGTCTCTTATACCGATCCGTCAAGAATGCAAATCTACTCGCAGCAC
CCGACGGAACTATCTATTCACGCTGTGTCCACTGCCGAATTGTTGAAACCAAAAGAGGAA
CCGGTATACAACCTGTCACCGTCTGCTTTATGCACCGTTACCCTACCGCATACTGGAGGT
CTGTCCTACGATAGTCGCATGTGTCTCCAGGAGTCAAACACTCCAGACTCCTTCAATCAG
CCAGAGATGAAGGAAGCTGAAGATTTCTCTCGGGTCTATCAGACCCTTACCCTCTCTGCG
CTTTCCAGAGACGATTCTGCTAATTCACCTACAAGCAGCGAGAATAAGCCCAAGCCAAAG
CCTACCCCGGCAGCTTGTCCTGGCAGTTCGCCTGAAATGAACCCACAGTCTACTACACCG
TCTTCACAGGCACTCACAAAACCGCCATACTCTTACGTGGCTCTGATTGCTATGGCTATC
ACCAATAGCCAGAATAAGCGCGCAACCCTAAGTGAAATATACGCTTACATTACCAAAAAA
TTTCCTTTCTTCGAGAAGGATAAAAAGGGCTGGCAAAATTCAATCAGACACAATCTGAGC
CTGAACGAATGCTTCATTAAAGTACGCAGAGAGGGCGGAAGTGAAAGCAAGGGAAATTAT
TGGACACTTGATCCGCAATGCGGAGACATGTTCGTGAATGGCAACTTCAGGCGGCGACGT
CGTATGAAGAGACCATTCAGGGCCGCTCCATATAAGACAATGTTCGACGGCTACGTCGCC
CACGGTGGTCAACATCCCCACATGCCCATCCAGCTCGGGCACAGGAACTACTTCGGTTCT
AGTACACCCTATCCTCCGTCTTACCCGAGATATGATGCATGGCTGAGTCAGCCGACAGGC
GGATTGGGTTACCCTGCTCCGATAGCTCGCAGTCCCCCTGGTTGCTCCCCCCAGGCGTCT
AACGTGAACCCCTTCTCCACCCACCAAAACCAAGGACAGTTACAGAGCCCGTTGCAATCC
ATGCAACCGATGACAATGAATTACAATACGCTCAATGTTGCCGCCATAGGTGAGTTTGAT
GGCTCTTCTAGTCCTGGATCTGGTTACGCCGCTGGTAGCTTCTCTCCAAATCGTCATCAT
GATATTGTCACTTTATCTGATGCTGTTTCTCGTTTTTCTTTTTGGCCCGAAGGTGGATCG
TCAAGTCCCAACTCTGGATATGTCCCAACTAACTTCTCCCCCCGCAGACATGAAGCTGTC
TCCTCCTCCGATGCTGCTGGTCGCTACTCTTTCTGGCCTGACGGTACTTTTTAA
Protein sequence:
MNELSVSYTDPSRMQIYSQHPTELSIHAVSTAELLKPKEEPVYNLSPSALCTVTLPHTGG
LSYDSRMCLQESNTPDSFNQPEMKEAEDFSRVYQTLTLSALSRDDSANSPTSSENKPKPK
PTPAACPGSSPEMNPQSTTPSSQALTKPPYSYVALIAMAITNSQNKRATLSEIYAYITKK
FPFFEKDKKGWQNSIRHNLSLNECFIKVRREGGSESKGNYWTLDPQCGDMFVNGNFRRRR
RMKRPFRAAPYKTMFDGYVAHGGQHPHMPIQLGHRNYFGSSTPYPPSYPRYDAWLSQPTG
GLGYPAPIARSPPGCSPQASNVNPFSTHQNQGQLQSPLQSMQPMTMNYNTLNVAAIGEFD
GSSSPGSGYAAGSFSPNRHHDIVTLSDAVSRFSFWPEGGSSSPNSGYVPTNFSPRRHEAV
SSSDAAGRYSFWPDGTF