DPGLEAN15160 in OGS1.0

New model in OGS2.0DPOGS207171 
Genomic Positionscaffold7:- 780775-813376
See gene structure
CDS Length1314
Paired RNAseq reads  216
Single RNAseq reads  531
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000635 (2e-52)
Best Drosophila hit  crocodile (3e-30)
Best Human hitforkhead box protein L2 (4e-36)
Best NR hit (blastp)  PREDICTED: similar to forkhead protein/ forkhead protein domain [Nasonia vitripennis] (2e-50)
Best NR hit (blastx)  forkhead protein/ forkhead protein domain [Culex quinquefasciatus] (6e-43)
GeneOntology terms





  
GO:0043565 sequence-specific DNA binding
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0006355 regulation of transcription, DNA-dependent
GO:0005634 nucleus
GO:0045449 regulation of transcription
GO:0006350 transcription
GO:0003677 DNA binding
InterPro families

  
IPR001766 Transcription factor, fork head
IPR011991 Winged helix-turn-helix transcription repressor DNA-binding
IPR018122 Transcription factor, fork head, conserved site
Orthology groupMCL16668

Nucleotide sequence:

ATGAACGAATTGTCCGTCTCTTATACCGATCCGTCAAGAATGCAAATCTACTCGCAGCAC
CCGACGGAACTATCTATTCACGCTGTGTCCACTGCCGAATTGTTGAAACCAAAAGAGGAA
CCGGTATACAACCTGTCACCGTCTGCTTTATGCACCGTTACCCTACCGCATACTGGAGGT
CTGTCCTACGATAGTCGCATGTGTCTCCAGGAGTCAAACACTCCAGACTCCTTCAATCAG
CCAGAGATGAAGGAAGCTGAAGATTTCTCTCGGGTCTATCAGACCCTTACCCTCTCTGCG
CTTTCCAGAGACGATTCTGCTAATTCACCTACAAGCAGCGAGAATAAGCCCAAGCCAAAG
CCTACCCCGGCAGCTTGTCCTGGCAGTTCGCCTGAAATGAACCCACAGTCTACTACACCG
TCTTCACAGGCACTCACAAAACCGCCATACTCTTACGTGGCTCTGATTGCTATGGCTATC
ACCAATAGCCAGAATAAGCGCGCAACCCTAAGTGAAATATACGCTTACATTACCAAAAAA
TTTCCTTTCTTCGAGAAGGATAAAAAGGGCTGGCAAAATTCAATCAGACACAATCTGAGC
CTGAACGAATGCTTCATTAAAGTACGCAGAGAGGGCGGAAGTGAAAGCAAGGGAAATTAT
TGGACACTTGATCCGCAATGCGGAGACATGTTCGTGAATGGCAACTTCAGGCGGCGACGT
CGTATGAAGAGACCATTCAGGGCCGCTCCATATAAGACAATGTTCGACGGCTACGTCGCC
CACGGTGGTCAACATCCCCACATGCCCATCCAGCTCGGGCACAGGAACTACTTCGGTTCT
AGTACACCCTATCCTCCGTCTTACCCGAGATATGATGCATGGCTGAGTCAGCCGACAGGC
GGATTGGGTTACCCTGCTCCGATAGCTCGCAGTCCCCCTGGTTGCTCCCCCCAGGCGTCT
AACGTGAACCCCTTCTCCACCCACCAAAACCAAGGACAGTTACAGAGCCCGTTGCAATCC
ATGCAACCGATGACAATGAATTACAATACGCTCAATGTTGCCGCCATAGGTGAGTTTGAT
GGCTCTTCTAGTCCTGGATCTGGTTACGCCGCTGGTAGCTTCTCTCCAAATCGTCATCAT
GATATTGTCACTTTATCTGATGCTGTTTCTCGTTTTTCTTTTTGGCCCGAAGGTGGATCG
TCAAGTCCCAACTCTGGATATGTCCCAACTAACTTCTCCCCCCGCAGACATGAAGCTGTC
TCCTCCTCCGATGCTGCTGGTCGCTACTCTTTCTGGCCTGACGGTACTTTTTAA

Protein sequence:

MNELSVSYTDPSRMQIYSQHPTELSIHAVSTAELLKPKEEPVYNLSPSALCTVTLPHTGG
LSYDSRMCLQESNTPDSFNQPEMKEAEDFSRVYQTLTLSALSRDDSANSPTSSENKPKPK
PTPAACPGSSPEMNPQSTTPSSQALTKPPYSYVALIAMAITNSQNKRATLSEIYAYITKK
FPFFEKDKKGWQNSIRHNLSLNECFIKVRREGGSESKGNYWTLDPQCGDMFVNGNFRRRR
RMKRPFRAAPYKTMFDGYVAHGGQHPHMPIQLGHRNYFGSSTPYPPSYPRYDAWLSQPTG
GLGYPAPIARSPPGCSPQASNVNPFSTHQNQGQLQSPLQSMQPMTMNYNTLNVAAIGEFD
GSSSPGSGYAAGSFSPNRHHDIVTLSDAVSRFSFWPEGGSSSPNSGYVPTNFSPRRHEAV
SSSDAAGRYSFWPDGTF