New model in OGS2.0 | DPOGS214467  |
---|---|
Genomic Position | scaffold690:+ 9375-17749 |
CDS Length | 1098 |
Paired RNAseq reads   | 80 |
Single RNAseq reads   | 216 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA013353 (1e-104) |
Best Drosophila hit   | onecut (9e-72) |
Best Human hit | hepatocyte nuclear factor 6 (3e-71) |
Best NR hit (blastp)   | PREDICTED: similar to One cut domain family member 2 (Transcription factor ONECUT-2) (OC-2) [Apis mellifera] (4e-95) |
Best NR hit (blastx)   | PREDICTED: similar to One cut domain family member 2 (Transcription factor ONECUT-2) (OC-2) [Apis mellifera] (4e-83) |
GeneOntology terms | GO:0003702 RNA polymerase II transcription factor activity; GO:0006350 transcription; GO:0005634 nucleus; GO:0030528 transcription regulator activity; GO:0006355 regulation of transcription, DNA-dependent; GO:0003677 DNA binding; GO:0003700 sequence-specific DNA binding transcription factor activity; GO:0043565 sequence-specific DNA binding |
InterPro families | IPR003350 Homeodomain protein CUT; IPR001356 Homeobox; IPR010982 Lambda repressor-like, DNA-binding; IPR009057 Homeodomain-like; IPR012287 Homeodomain-related |
Orthology group | MCL11814 |
Nucleotide sequence:
ATGGACGATACACGGCTCAGGGAGCGAGCGCCTCTCACGGTCATAGTGGCGCCCAGTAAC
GTGTCTCCACCCCGGCTGTCCCCCGCGGACCTGCTGCCCGACGGAGACGCTGCCTTCCAC
CCGCTGTCCGCCGTCAACGGCCGCCTCACACCGCCCGGGCTCGAGCCCGCGTCCTACGCT
ACTCTGACGCCCCTGCTCCCTCTGCCCCCCATCAGCACCGTGTCCGACAAGTTCGCGTAC
CACGCGGGCGGCACTTTCACGGTCATCCAGCAGCAGCAGTCCTACGCGTCCTTGTCTCCG
ACCGCCTACAATGAGCCGCTGTCGCCGCAGTCCGCGTACAGTCGGCGGAGCGCGTCACCC
GGCTCGTACGAGCGCCGTTCCCCCTCGCCGCCGCTGCCCAGCCCGGGGCTGGACCTGAAC
GCGGCGCTTCTGGCCAGAGAGACGAGAGACGAGCAGGCGCAGCAGACGCAGCAGAACGAC
ACGGAGGAGATAAACACCAAGGAGCTCGCGCAGAGGATAAGCGGAGAACTGAAGAGGTAC
TCCATACCTCAGGCGATATTCGCTCAGAGGGTGCTGTGTCGGTCGCAGGGTACGCTCAGC
GACCTACTCAGGAACCCCAAGCCGTGGTCCAAGTTGAAGTCGGGCCGAGAAACCTTCAGG
CGGATGTGGAAATGGCTACAGGAACCCGAGTTTCAAAGGATGTCGGCCTTGAGACTTGCA
GCGGCCCAGATACCACAGAGAGGCAGCTGCAAGCGCAAGGAGGATATGGCCTCGGATACC
CTTCCTTCCCCGAAGAAGCCGCGTTTGGTTTTCACAGATCTGCAGCGGCGTACTCTGCAG
GCTATTTTTAAGGAAACAAAGAGACCATCAAAAGAGATGCAAGTAACGATAGCGCGTCAG
CTCGGTCTAGAGCCGACCACTGTTGGCAACTTCTTCATGAACGCTCGCAGACGATCCATG
GACAAATGGAAGGATGACGACGCACCATCCACAGACTTGGACCAGCAATGCGACGGGCAG
GATCTAGACCACGTCCCCAGTCTAGACTCGGCGGAGTCTGAGGAGGATCACGACGACCAA
GACGATCATCTATTGTGA
Protein sequence:
MDDTRLRERAPLTVIVAPSNVSPPRLSPADLLPDGDAAFHPLSAVNGRLTPPGLEPASYA
TLTPLLPLPPISTVSDKFAYHAGGTFTVIQQQQSYASLSPTAYNEPLSPQSAYSRRSASP
GSYERRSPSPPLPSPGLDLNAALLARETRDEQAQQTQQNDTEEINTKELAQRISGELKRY
SIPQAIFAQRVLCRSQGTLSDLLRNPKPWSKLKSGRETFRRMWKWLQEPEFQRMSALRLA
AAQIPQRGSCKRKEDMASDTLPSPKKPRLVFTDLQRRTLQAIFKETKRPSKEMQVTIARQ
LGLEPTTVGNFFMNARRRSMDKWKDDDAPSTDLDQQCDGQDLDHVPSLDSAESEEDHDDQ
DDHLL
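
The nucleotide and protein sequences above can be cross-checked against each other. The following is a minimal sketch, assuming the CDS is translated with the standard genetic code (the record does not state which translation table was used). The names `CODON_TABLE` and `translate` are illustrative and not part of any database tooling, and only the first and last lines of the nucleotide sequence are embedded to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# over the bases T, C, A, G. Use of this table is an assumption.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGACGATACACGGCTCAGGGAGCGAGCGCCTCTCACGGTCATAGTGGCGCCCAGTAAC"
last_line = "GACGATCATCTATTGTGA"

print(translate(first_line))  # MDDTRLRERAPLTVIVAPSN
print(translate(last_line))   # DDHLL

# A CDS of 1098 nt holds 366 codons; the last one is the stop codon.
cds_length = 1098
print(cds_length // 3 - 1)    # 365
```

Both translated fragments match the start and end of the listed protein sequence, and a 1098 nt CDS with a terminal stop codon corresponds to a 365-residue protein.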