New model in OGS2.0 | DPOGS205464  |
---|---|
Genomic Position | scaffold224:70220-72116 (- strand) |
CDS Length | 1554 |
Paired RNAseq reads   | 1012 |
Single RNAseq reads   | 2658 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA008274 (4e-178) |
Best Drosophila hit   | bobby sox, isoform C (2e-42) |
Best Human hit | HMG box transcription factor BBX isoform 2 (8e-23) |
Best NR hit (blastp)   | PREDICTED: similar to bobby sox CG1414-PC [Tribolium castaneum] (2e-60) |
Best NR hit (blastx)   | hypothetical protein AaeL_AAEL011161 [Aedes aegypti] (2e-56) |
GeneOntology terms    | GO:0003700 sequence-specific DNA binding transcription factor activity; GO:0006355 regulation of transcription, DNA-dependent; GO:0005634 nucleus; GO:0003677 DNA binding |
InterPro families    | IPR009071 High mobility group, superfamily; IPR000910 High mobility group, HMG1/HMG2 |
Orthology group | MCL17874 |
Nucleotide sequence:
ATGAACGCCTTTCTCATATTCTGCAAACGTCACCGTAGCGTCGTCAGAGACAAGTATCCA
AATTTAGAAAACAGATCTATAACTAAAATTCTCGGAGAATGGTGGGCCAATTTGGACCAG
GAAGAGAAAGCTTGTTATACAGGTCTTGCTAAACAGTACAAAGACGCGTTCTTCAACGCC
AACCCTGACTTCAAATGGTACAAGTTGCCCGCACCACCGCTACGTACACTATCCTCCCGG
CCTCGGGAGTCCACTGAGAAAGCAGAATCCTCCAACGAATATAACGACCATGAACTGGAA
AAAACAAATAATAATTCCACAAAATTCGACTACAATAAGAAACCAGACAGTGATCCAGAA
ACTGAAGCTAAACCACTATCCATGTTCACGCCTGGAAAGTTAGCCGATGAAGCTCAAATG
GGAGGTCTGAGTAGCCTCCTAGCTACCAAGACTGAAGTCCAAACTCCTAATCCATATTAT
TCTCCTCCTTCCTGTAAATTTAACGCGATCTCAACTCCAATCGAATACAGATCTCATGAT
TCAATAGACAGACCCAAAACCAAACGAGAAGACAGCATACGTGAGCTTCAAAATGCACTT
ACAGAAACAACTCGAATGTTCGAAGAAGATTTCGATGAAAAGGAACAATTACGTTATTAC
GGTGCAGCTAACGACCAATTCACGAACCAAGATGTTATCGACCAAATAGTTGATAAACGC
TATTCTAAAGATGATGAAGGATACCAGAGAAACTGGTCGGATGATGAGAAAAATTCCAAG
TCTGGTAGAACTTGCAAAGGGAAAAGATATCAAGAGTTCATGGCTGTTGGAGGACTGATA
GTGAATAAGAGGCCGAGGAGAGATTATCCCGATAGATTGTCGGACGAAGGCTACAATGCA
TCTTGTAGCTGGGATCCTGGATCTTCGCTCGAGGAATCAACAATGACAATGGCAGACGAG
TCGACGCCCGACACTAACTATATACAGCACGACATAACAGTCGAAAGCGAACCAAATGTG
GAACCAAACGACACGCCCGAAATAGACAATAACTGTAACAAATCATTTAAGGCTGCCGAC
TTCGATTTAGAAGCTAAAATAAGAGCTCTACCCTCCCTTAGTTTAGAGAAGTTTCAACAG
AAAAAACGCGAGAATAAACGTAAGAAAAAAAACGTTAGCCTGAGAACTAAATCCGTCAAG
TCATCGCAAATAATAAACTCGGTTCCGCGTCCGGTTATGGACGAGCGTCATGAAATGGCG
GAGAATTGGCGCGAGACCGTCATAGGGAGCCAGAAACGGAAACCGAGGAAAATAAGCATC
ACACGACTCGAAATCAACAGCCTCGTCTCCAGTAACATGAACGGCGGCAATAAAATCAGT
CCAGAAATAAAAATTGCCACAGAAGCTCCTTGCACCATCCAAAGCATGGACATATGCAAT
CAGAGCCATGGTAATGTCGACCTGTTCGCGTTGGCCACGTTGGCCGAGGTCGCTGCCAAC
ACGTCCAAAATAGAGCAGACCAATGCGGCAAGCGAAGACGCTTCCAAGGTATGA
Protein sequence:
MNAFLIFCKRHRSVVRDKYPNLENRSITKILGEWWANLDQEEKACYTGLAKQYKDAFFNA
NPDFKWYKLPAPPLRTLSSRPRESTEKAESSNEYNDHELEKTNNNSTKFDYNKKPDSDPE
TEAKPLSMFTPGKLADEAQMGGLSSLLATKTEVQTPNPYYSPPSCKFNAISTPIEYRSHD
SIDRPKTKREDSIRELQNALTETTRMFEEDFDEKEQLRYYGAANDQFTNQDVIDQIVDKR
YSKDDEGYQRNWSDDEKNSKSGRTCKGKRYQEFMAVGGLIVNKRPRRDYPDRLSDEGYNA
SCSWDPGSSLEESTMTMADESTPDTNYIQHDITVESEPNVEPNDTPEIDNNCNKSFKAAD
FDLEAKIRALPSLSLEKFQQKKRENKRKKKNVSLRTKSVKSSQIINSVPRPVMDERHEMA
ENWRETVIGSQKRKPRKISITRLEINSLVSSNMNGGNKISPEIKIATEAPCTIQSMDICN
QSHGNVDLFALATLAEVAANTSKIEQTNAASEDASKV
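
The record's figures can be cross-checked against each other. The sketch below is one possible way to do so, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The `translate` helper and the variable names are illustrative and are not part of the database. It translates the first and last lines of the nucleotide sequence shown above and recomputes the CDS length from the protein length.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Ambiguous bases such as N are not handled and raise KeyError.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence, copied from the record.
# Every full line holds 60 nt (a multiple of 3), so each line is in frame.
first_line = "ATGAACGCCTTTCTCATATTCTGCAAACGTCACCGTAGCGTCGTCAGAGACAAGTATCCA"
last_line = "ACGTCCAAAATAGAGCAGACCAATGCGGCAAGCGAAGACGCTTCCAAGGTATGA"

print(translate(first_line))  # MNAFLIFCKRHRSVVRDKYP
print(translate(last_line))   # TSKIEQTNAASEDASKV*

# The protein block has 8 full lines of 60 residues plus a final line of 37.
protein_length = 8 * 60 + 37
expected_cds_length = (protein_length + 1) * 3  # +1 for the stop codon
print(expected_cds_length)  # 1554

# Inclusive span of the genomic coordinates listed in the record.
genomic_span = 72116 - 70220 + 1
print(genomic_span)  # 1897
```

The recomputed CDS length matches the listed value of 1554. The genomic span is longer than the CDS, which would be expected if the gene model includes non-coding sequence such as introns; the record itself does not state the exon structure.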