DPGLEAN18551 in OGS1.0

New model in OGS2.0  DPOGS209081
Genomic Position  scaffold888:+ 33907-53363
CDS Length  1827
Paired RNAseq reads  2114
Single RNAseq reads  5379
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA014423 (5e-49)
Best Drosophila hit  Mnf, isoform I (1e-44)
Best Human hit  forkhead box protein K1 (4e-96)
Best NR hit (blastp)  PREDICTED: similar to forkhead box K1 [Apis mellifera] (3e-141)
Best NR hit (blastx)  PREDICTED: similar to forkhead box K1 [Apis mellifera] (5e-131)
GeneOntology terms
GO:0043565 sequence-specific DNA binding
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0006355 regulation of transcription, DNA-dependent
GO:0005634 nucleus
GO:0003677 DNA binding
InterPro families
IPR000253 Forkhead-associated (FHA) domain
IPR011991 Winged helix-turn-helix transcription repressor DNA-binding
IPR001766 Transcription factor, fork head
IPR018122 Transcription factor, fork head, conserved site
IPR008984 SMAD/FHA domain
Orthology group  MCL11878
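
The Genomic Position field in this record (scaffold888:+ 33907-53363) packs scaffold, strand, and coordinates into one string. A minimal Python sketch for splitting it apart follows. The names Locus and parse_locus are hypothetical helpers, not part of any database tooling, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """Hypothetical container for a 'scaffold:strand start-end' position."""
    scaffold: str
    strand: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Parse a position string such as 'scaffold888:+ 33907-53363'."""
    match = re.fullmatch(r"\s*(\S+?):([+-])\s*(\d+)-(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return Locus(scaffold, strand, int(start), int(end))


locus = parse_locus("scaffold888:+ 33907-53363")
print(locus.scaffold, locus.strand, locus.span)  # scaffold888 + 19457
```

Under that assumption the locus spans 19457 bp while the CDS Length is 1827, which would be consistent with a gene model that contains introns.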

Nucleotide sequence:

ATGGCTCAGACTCAGATATCGCGGAGCTCGGAGAGCGATGCCTGGACGCTCCTCTCCCTC
AAGTCGGCGCCTCCGTCGCCCTCGAAAGTGCAATGGGCACAGGAGCCGGCTCCGACTGCC
ATCGCGCGGCTCGATGGCCGCGATTTCGAATACATGATCCGCCAAAAGAAGGTGATAATC
GGACGGAATTCTAGCCGGGGTCAAGTAGACGTAAACATGGGACATTCTAGTTTTATATCG
CGGCGGCATCTTGAACTGTTCTACGACCACCCGGAGTTCTATTTGACTTGCAACAGTAAG
AACGGCGTGCTCGTCGATGGTGTGTTCCAACGGAAGGGAGCGGCCGCGATGCTACTTCCA
AAGAGATGTACTCTCCGTTTTCCGTCTACCAACATCCGCCTGGAGTTTCAGTCGTTGGTT
GAGGAGAGTGGCGTCGGGTCTGGGGGAGCAGGCCCACCCCTGCCCCCGCTACGTATTTCC
ATACCAGTGGACAACGATGGACGGAGTCCTGCGCCCTCGCCGACAGGTACGATAAGCGCC
ACCAACAGCTGTCCCACGTCGCCCCGGGGCGCCGGCTCCTCCGGCCGACGACACCCGGAC
CTCGGCCTGGTGGCGCAGTACGCCGCCCTGGCCGACCACCAGCGACCGAACTCGAACGGA
ACCGCGGCTTCGTCTACGTCGGACTCCGGCTACAGTTCCCGGGACGCCCGTGATGCCAGG
GAGCATCGCGAGGGTCGGGACGAGGCTAAACCACCGTACAGCTACGCCCAGCTGATAGTC
CAAGCTGTTGCTTCGGCGGCGGACAAGCAGCTCACGCTGAGCGGCATCTACAGCTACATC
ACCAAGCACTACCCCTACTACCGGACCGCCGACAAGGGCTGGCAGAACTCGATCCGACAC
AACCTGTCGCTCAACCGTTACTTCATCAAGGTTCCTCGTAGTCAGGAAGAGCCGGGCAAG
GGCAGCTTTTGGCGTATCGACCCACAGAGTGAAGGGAAACTCATCGAGCTGGCCTTCAGA
CCTCGCCGCCCGAGGGGAGTTCAGTTCAGGGCACCCTTCGGACTCTCCTCAAGGAGCGCT
CCTACTTCTCCGTCTCAAGTCGGCGTCTCCGGGCTGGTCACGCCTGAGGAGTTGTCGCGA
GAACCCACGCCTGACCTCTTCACCGCCGAGGAACATGAACAACAGCAGTCCGGCCAACAA
CGCTTGTCATCATCGTCACAATATCTGTTCCCGCAGAGAAGTGGGGTCAGTCAGAGCGCG
CCCGGATCACCTGGTCACGGCGTGTACGCGGGCGGCAGCGGCTTAGTGATGGCCGGACAT
CAGATAACGGTTGTCACCAACGGAGCTGGAGGGGAGAGAGAAGAGAAGTACGTGGTGGGC
ACGTCGGGGGGTGGGCTGGTGTCGATACCCGAGGAGGAGGTCCAGGCTGCCAACCTACTG
CTTCATCAGCACTCACCTTACTACGCCGGGTACAGCGGTGACGAGAACTGCGCGCTGGGT
GGAGAGTTGGTTATAGAGGAGGCGCCCGACGACCCGCCACACAAGAGACCCAAGCATCAT
AAGATCGACGCGCGTATTGACCGCGCGACGCACGGGCGCCTCGCGTGGGCCGACAGTCCT
CCCATTGGTGGAGACGTCTCAAACAGCAGCGTCGCGTCATCAACCGAGCCCGTGATTGGT
CAGCGAGGAGTCAGAACCGCAGCACAGTCACAACTTTGGCGGGAAAAAGACGCGCGTCAC
GACAACACCGCGTCATCGCATAGAAGCATATGGCCGACCGGCGTTACCGCGCGTTCTGAC
GAAGCAACCGATCTTAACGTACAATGA

Protein sequence:

MAQTQISRSSESDAWTLLSLKSAPPSPSKVQWAQEPAPTAIARLDGRDFEYMIRQKKVII
GRNSSRGQVDVNMGHSSFISRRHLELFYDHPEFYLTCNSKNGVLVDGVFQRKGAAAMLLP
KRCTLRFPSTNIRLEFQSLVEESGVGSGGAGPPLPPLRISIPVDNDGRSPAPSPTGTISA
TNSCPTSPRGAGSSGRRHPDLGLVAQYAALADHQRPNSNGTAASSTSDSGYSSRDARDAR
EHREGRDEAKPPYSYAQLIVQAVASAADKQLTLSGIYSYITKHYPYYRTADKGWQNSIRH
NLSLNRYFIKVPRSQEEPGKGSFWRIDPQSEGKLIELAFRPRRPRGVQFRAPFGLSSRSA
PTSPSQVGVSGLVTPEELSREPTPDLFTAEEHEQQQSGQQRLSSSSQYLFPQRSGVSQSA
PGSPGHGVYAGGSGLVMAGHQITVVTNGAGGEREEKYVVGTSGGGLVSIPEEEVQAANLL
LHQHSPYYAGYSGDENCALGGELVIEEAPDDPPHKRPKHHKIDARIDRATHGRLAWADSP
PIGGDVSNSSVASSTEPVIGQRGVRTAAQSQLWREKDARHDNTASSHRSIWPTGVTARSD
EATDLNVQ
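
The protein sequence can be checked against the nucleotide sequence by translating the CDS codon by codon. The sketch below is a minimal Python example that assumes the standard genetic code (the record does not say which translation table was used); translate is a hypothetical helper, and only the first and last CDS lines from this record are included as sample input.

```python
from itertools import product

# Standard genetic code (assumed), listed in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the nucleotide sequence in this record.
FIRST_CDS_LINE = "ATGGCTCAGACTCAGATATCGCGGAGCTCGGAGAGCGATGCCTGGACGCTCCTCTCCCTC"
LAST_CDS_LINE = "GAAGCAACCGATCTTAACGTACAATGA"

print(translate(FIRST_CDS_LINE))  # MAQTQISRSSESDAWTLLSL
print(translate(LAST_CDS_LINE))   # EATDLNVQ
print(1827 // 3 - 1)              # 608 residues expected from CDS Length 1827
```

Both fragments agree with the start and end of the protein sequence above, and a CDS of 1827 nt corresponds to 608 residues plus one stop codon, matching the length of the listed protein.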