New model in OGS2.0 | DPOGS214934  |
---|---|
Genomic Position | scaffold1723:- 19439-20947 |
See gene structure | |
CDS Length | 1509 |
Paired RNAseq reads   | 1400 |
Single RNAseq reads   | 3689 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004818 (2e-161) |
Best Drosophila hit   | distal antenna (2e-35) |
Best Human hit | tigger transposable element-derived protein 5 (2e-09) |
Best NR hit (blastp)   | RecName: Full=Protein distal antenna (1e-53) |
Best NR hit (blastx)   | GG11370 [Drosophila erecta] (2e-45) |
GeneOntology terms    | GO:0003700 sequence-specific DNA binding transcription factor activity GO:0005634 nucleus GO:0007379 segment specification GO:0007469 antennal development GO:0045449 regulation of transcription GO:0048749 compound eye development |
InterPro families    | IPR011526 Helix-turn-helix, Psq-like IPR009057 Homeodomain-like IPR012287 Homeodomain-related IPR006695 Centromere protein Cenp-B, DNA-binding domain 1 |
Orthology group | MCL18887 |
Nucleotide sequence:
ATGACAACGAAGGGAAAGCGTCCTATGCGCGCCCTCACACCCGGAGATAAGATCGAGGCC
ATACAGAGGGTCAACGACGGCGAGTCCAAAGCCTCGGTCGCTCGTGACATAGGAGTGCCC
GAGTCCACGCTGCGGGGCTGGTGCAAGAATGAGGACAAGCTCCGCTACATGACCTCGAGG
TTGTCCTCCCCCGACACCGACAAGAGCAACGACGGGGAGCCGCCGGACAAGCGCGCGCGC
ACCGAGTCCCCGACAGCTCCTCAGTCACCCATCAACACCGGCCTGGATCTTTCTAGCGCG
GTCTCCGTTACTCACAGCACCACCCAGCCTCCGCCGCAGGCCGATGTCCCCGTCGAGCTC
ACTACCAAGCGCAGCGAGCCCTCGCCCCCACTCCATCCGCCACGAGAGCGCCGACCGGAC
CCCGGAGCCAGCGTCTCCATGAGCGCCATCAGCCCGCTATCGGGATTGGCCCATCTACCA
GGACTTACACATTCTCACCTCGGATTGAGCTTCAATGAAATCGCAAACAACCTAACACTC
CTCGCTCAACTGAACCCTGGACTGTCGACGCTGTCGGCGCAGCCGGCGAGCAGAGCGCTG
CGGTCTGTGCGCTCGCCGAAGCCAGCTCACAACGGAGTGCTTAACTTGAACGAAAACAAA
CATCGCAGCAAATCAAACCACTCATCGGACCCGTACAGACACAGCGGGTCCAAGTCGAGT
CATCACACTACTTCGCAATCAGCGTCTCAGCCCGTCGACGACACGCTGTGGTACTGGCTC
AAAACTCAACAGGCCATGCTGGATTTGACTTCCCAAACAACGGCTCATCCGTTGCAATTA
GGAAAAACTAGCGATCCCACTCTGCCGCCTAAGCCCGTGGCGCCCACGCCGCCCGTCAGT
TCGCACCTCGACTACAACAGAAACTCTTGGCTGTGGCAGTACTACAAACAGTTCGGTGGA
GCCATGCCGGTTCCGGAAGACAAGCACAAGCCGGCGTCACAGGTACCGAAAGACAAGTCC
GGAGACATCTTGTTCTCGCATTTAACTAAAGCGAAGCCGGAGGACGACCGGAGCATCATT
AGTCCAGACCAGAGCCAAACTCTGTCGGCTAAAGTCCGCGAGACAGTCCCGCCGCCGCTC
CCGGCCGCTCCCGCAGAACCTCGAGTGGCCGAGCCCGCCTCGAGCCCGGACGTCGGCACG
GAGAATAAGGAACCCTCAGTAGAGAAACCCATCGAGTCCGGCAGAAGCCAAACCAAGGCC
AGAAACGTGCTCGACAACTTACTGTTCAACAGCAGCCAAGCGGCCAACGAAGAGAATAAG
AGCAACGGCTCCACGAACGGCGAGTGGGAGGCGGGCACGGTGGAGGCGCTGGAACACGGA
GACAAGTTCCTCGCGTGGCTGGAGGCCAGCGGCGACCCGAGCGTCACCCGCATGCACGTG
CATCAGCTCCGAGCACTGCTCCACAACCTCCGCACGCGCCGCGCCGCGCCCGACGCACGC
CGCAAGTAA
Protein sequence:
MTTKGKRPMRALTPGDKIEAIQRVNDGESKASVARDIGVPESTLRGWCKNEDKLRYMTSR
LSSPDTDKSNDGEPPDKRARTESPTAPQSPINTGLDLSSAVSVTHSTTQPPPQADVPVEL
TTKRSEPSPPLHPPRERRPDPGASVSMSAISPLSGLAHLPGLTHSHLGLSFNEIANNLTL
LAQLNPGLSTLSAQPASRALRSVRSPKPAHNGVLNLNENKHRSKSNHSSDPYRHSGSKSS
HHTTSQSASQPVDDTLWYWLKTQQAMLDLTSQTTAHPLQLGKTSDPTLPPKPVAPTPPVS
SHLDYNRNSWLWQYYKQFGGAMPVPEDKHKPASQVPKDKSGDILFSHLTKAKPEDDRSII
SPDQSQTLSAKVRETVPPPLPAAPAEPRVAEPASSPDVGTENKEPSVEKPIESGRSQTKA
RNVLDNLLFNSSQAANEENKSNGSTNGEWEAGTVEALEHGDKFLAWLEASGDPSVTRMHV
HQLRALLHNLRTRRAAPDARRK