New model in OGS2.0 | DPOGS209081  |
---|---|
Genomic Position | scaffold888:+ 33907-53363 |
See gene structure | |
CDS Length | 1827 |
Paired RNAseq reads   | 2114 |
Single RNAseq reads   | 5379 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA014423 (5e-49) |
Best Drosophila hit   | Mnf, isoform I (1e-44) |
Best Human hit | forkhead box protein K1 (4e-96) |
Best NR hit (blastp)   | PREDICTED: similar to forkhead box K1 [Apis mellifera] (3e-141) |
Best NR hit (blastx)   | PREDICTED: similar to forkhead box K1 [Apis mellifera] (5e-131) |
GeneOntology terms    | GO:0043565 sequence-specific DNA binding GO:0003700 sequence-specific DNA binding transcription factor activity GO:0006355 regulation of transcription, DNA-dependent GO:0005634 nucleus GO:0003677 DNA binding |
InterPro families    | IPR000253 Forkhead-associated (FHA) domain IPR011991 Winged helix-turn-helix transcription repressor DNA-binding IPR001766 Transcription factor, fork head IPR018122 Transcription factor, fork head, conserved site IPR008984 SMAD/FHA domain |
Orthology group | MCL11878 |
Nucleotide sequence:
ATGGCTCAGACTCAGATATCGCGGAGCTCGGAGAGCGATGCCTGGACGCTCCTCTCCCTC
AAGTCGGCGCCTCCGTCGCCCTCGAAAGTGCAATGGGCACAGGAGCCGGCTCCGACTGCC
ATCGCGCGGCTCGATGGCCGCGATTTCGAATACATGATCCGCCAAAAGAAGGTGATAATC
GGACGGAATTCTAGCCGGGGTCAAGTAGACGTAAACATGGGACATTCTAGTTTTATATCG
CGGCGGCATCTTGAACTGTTCTACGACCACCCGGAGTTCTATTTGACTTGCAACAGTAAG
AACGGCGTGCTCGTCGATGGTGTGTTCCAACGGAAGGGAGCGGCCGCGATGCTACTTCCA
AAGAGATGTACTCTCCGTTTTCCGTCTACCAACATCCGCCTGGAGTTTCAGTCGTTGGTT
GAGGAGAGTGGCGTCGGGTCTGGGGGAGCAGGCCCACCCCTGCCCCCGCTACGTATTTCC
ATACCAGTGGACAACGATGGACGGAGTCCTGCGCCCTCGCCGACAGGTACGATAAGCGCC
ACCAACAGCTGTCCCACGTCGCCCCGGGGCGCCGGCTCCTCCGGCCGACGACACCCGGAC
CTCGGCCTGGTGGCGCAGTACGCCGCCCTGGCCGACCACCAGCGACCGAACTCGAACGGA
ACCGCGGCTTCGTCTACGTCGGACTCCGGCTACAGTTCCCGGGACGCCCGTGATGCCAGG
GAGCATCGCGAGGGTCGGGACGAGGCTAAACCACCGTACAGCTACGCCCAGCTGATAGTC
CAAGCTGTTGCTTCGGCGGCGGACAAGCAGCTCACGCTGAGCGGCATCTACAGCTACATC
ACCAAGCACTACCCCTACTACCGGACCGCCGACAAGGGCTGGCAGAACTCGATCCGACAC
AACCTGTCGCTCAACCGTTACTTCATCAAGGTTCCTCGTAGTCAGGAAGAGCCGGGCAAG
GGCAGCTTTTGGCGTATCGACCCACAGAGTGAAGGGAAACTCATCGAGCTGGCCTTCAGA
CCTCGCCGCCCGAGGGGAGTTCAGTTCAGGGCACCCTTCGGACTCTCCTCAAGGAGCGCT
CCTACTTCTCCGTCTCAAGTCGGCGTCTCCGGGCTGGTCACGCCTGAGGAGTTGTCGCGA
GAACCCACGCCTGACCTCTTCACCGCCGAGGAACATGAACAACAGCAGTCCGGCCAACAA
CGCTTGTCATCATCGTCACAATATCTGTTCCCGCAGAGAAGTGGGGTCAGTCAGAGCGCG
CCCGGATCACCTGGTCACGGCGTGTACGCGGGCGGCAGCGGCTTAGTGATGGCCGGACAT
CAGATAACGGTTGTCACCAACGGAGCTGGAGGGGAGAGAGAAGAGAAGTACGTGGTGGGC
ACGTCGGGGGGTGGGCTGGTGTCGATACCCGAGGAGGAGGTCCAGGCTGCCAACCTACTG
CTTCATCAGCACTCACCTTACTACGCCGGGTACAGCGGTGACGAGAACTGCGCGCTGGGT
GGAGAGTTGGTTATAGAGGAGGCGCCCGACGACCCGCCACACAAGAGACCCAAGCATCAT
AAGATCGACGCGCGTATTGACCGCGCGACGCACGGGCGCCTCGCGTGGGCCGACAGTCCT
CCCATTGGTGGAGACGTCTCAAACAGCAGCGTCGCGTCATCAACCGAGCCCGTGATTGGT
CAGCGAGGAGTCAGAACCGCAGCACAGTCACAACTTTGGCGGGAAAAAGACGCGCGTCAC
GACAACACCGCGTCATCGCATAGAAGCATATGGCCGACCGGCGTTACCGCGCGTTCTGAC
GAAGCAACCGATCTTAACGTACAATGA
Protein sequence:
MAQTQISRSSESDAWTLLSLKSAPPSPSKVQWAQEPAPTAIARLDGRDFEYMIRQKKVII
GRNSSRGQVDVNMGHSSFISRRHLELFYDHPEFYLTCNSKNGVLVDGVFQRKGAAAMLLP
KRCTLRFPSTNIRLEFQSLVEESGVGSGGAGPPLPPLRISIPVDNDGRSPAPSPTGTISA
TNSCPTSPRGAGSSGRRHPDLGLVAQYAALADHQRPNSNGTAASSTSDSGYSSRDARDAR
EHREGRDEAKPPYSYAQLIVQAVASAADKQLTLSGIYSYITKHYPYYRTADKGWQNSIRH
NLSLNRYFIKVPRSQEEPGKGSFWRIDPQSEGKLIELAFRPRRPRGVQFRAPFGLSSRSA
PTSPSQVGVSGLVTPEELSREPTPDLFTAEEHEQQQSGQQRLSSSSQYLFPQRSGVSQSA
PGSPGHGVYAGGSGLVMAGHQITVVTNGAGGEREEKYVVGTSGGGLVSIPEEEVQAANLL
LHQHSPYYAGYSGDENCALGGELVIEEAPDDPPHKRPKHHKIDARIDRATHGRLAWADSP
PIGGDVSNSSVASSTEPVIGQRGVRTAAQSQLWREKDARHDNTASSHRSIWPTGVTARSD
EATDLNVQ