New model in OGS2.0 | DPOGS203949  |
---|---|
Genomic Position | scaffold65:+ 89247-90092 |
CDS Length | 846 |
Paired RNAseq reads   | 6 |
Single RNAseq reads   | 18 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA013330 (5e-100) |
Best Drosophila hit   | crocodile (4e-48) |
Best Human hit | forkhead box protein C2 (5e-47) |
Best NR hit (blastp)   | PREDICTED: similar to forkhead protein/ forkhead protein domain [Tribolium castaneum] (2e-65) |
Best NR hit (blastx)   | PREDICTED: similar to forkhead protein/ forkhead protein domain [Tribolium castaneum] (4e-58) |
GeneOntology terms    | GO:0005634 nucleus; GO:0007380 specification of segmental identity, head; GO:0003677 DNA binding; GO:0003702 RNA polymerase II transcription factor activity; GO:0006355 regulation of transcription, DNA-dependent; GO:0003700 sequence-specific DNA binding transcription factor activity; GO:0043565 sequence-specific DNA binding |
InterPro families    | IPR001766 Transcription factor, fork head; IPR011991 Winged helix-turn-helix transcription repressor DNA-binding; IPR018122 Transcription factor, fork head, conserved site |
Orthology group | MCL15024 |
Nucleotide sequence:
ATGGTCAAACCTCCCTATTCGTATATTGCACTGATAGCAATGGCGATTCAAAATGCACCC
GATAGGCGAATAACCCTCAATGGTATATACCAATTCATCATGGAACGTTTTCCTTATTAT
AGAGAAAATAAACAAGGGTGGCAAAACTCAATACGACACAACTTGAGTCTTAACGAATGT
TTTGTTAAGGTTGCAAGGGATGATAAAAAACCCGGCAAAGGAAGTTATTGGACGTTGGAT
CCAGATTCTTATAACATGTTTGATAACGGATCTTATTTGAGACGACGTCGCCGTTTCAAA
AAAAAGGATGCGCTTAAAGAAAAAGAAGAAGCTTTGAAACGCCAGCAACAATTACAACAA
GCGCAAGAACTGGCGGCACAGGAAGCTCTTAGCGCTGCTGATGCTTTAGGACAAGCGCGA
GACGTCAAGCCAGACGTAAAGCCTAGAATATTTGAATGTAGACCAAAACGAGAACCCGGT
GCGGATTGCACGCGGTATGACAAATTAAGTGAGCCTATAGACGAGTTTAGCGAGCCTCGA
TTACCGCCCTCCGCAGTTTACTGTTCGCCACAGCCATATTCTTTAGCAGCCGAGGAGTTT
CGAGCAGCTACAAGCGGCTGGTATTCTGCACCCGAACCATCCGCTGACCAGCTTCCGCCA
GCTTTTCGGGACCTCTTCGAACCGCCTAGTTGCCAGTTGGCGGGATACCGCGGCAGTTCA
CCGGCTCCCGACGCTTACCGCGCTTCACCTCCACCTCACCACCACTACCGTTCGCCAGCT
CCATCCTACTACCATCATCAAGCTTGCGTAGCCGCCGCACCCGCCTCTGCTCATAAATCC
TACTGA
Protein sequence:
MVKPPYSYIALIAMAIQNAPDRRITLNGIYQFIMERFPYYRENKQGWQNSIRHNLSLNEC
FVKVARDDKKPGKGSYWTLDPDSYNMFDNGSYLRRRRRFKKKDALKEKEEALKRQQQLQQ
AQELAAQEALSAADALGQARDVKPDVKPRIFECRPKREPGADCTRYDKLSEPIDEFSEPR
LPPSAVYCSPQPYSLAAEEFRAATSGWYSAPEPSADQLPPAFRDLFEPPSCQLAGYRGSS
PAPDAYRASPPPHHHYRSPAPSYYHHQACVAAAPASAHKSY
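
The coordinates, CDS length, and two sequences in this record can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and 1-based inclusive coordinates; the helper names `span_length` and `translate` are hypothetical and not part of any OGS2.0 tooling. Only the first 60-nucleotide line of the record is used here to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive genomic interval (assumed convention)."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# Values copied from the record above.
GENOMIC_START, GENOMIC_END = 89247, 90092
CDS_LENGTH = 846
FIRST_CDS_LINE = "ATGGTCAAACCTCCCTATTCGTATATTGCACTGATAGCAATGGCGATTCAAAATGCACCC"
FIRST_PROTEIN_RESIDUES = "MVKPPYSYIALIAMAIQNAP"

# The genomic span equals the CDS length, consistent with a single-exon model.
print(span_length(GENOMIC_START, GENOMIC_END) == CDS_LENGTH)

# The first 20 codons translate to the first 20 residues of the protein.
print(translate(FIRST_CDS_LINE) == FIRST_PROTEIN_RESIDUES)

# Codon count minus the terminal stop gives the expected protein length.
print(CDS_LENGTH // 3 - 1)
```

To check the whole record, concatenate all nucleotide lines into one string before calling `translate` and compare the result with the concatenated protein lines.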