DPGLEAN00077 in OGS1.0

New model in OGS2.0DPOGS214934 
Genomic Positionscaffold1723:- 19439-20947
See gene structure
CDS Length1509
Paired RNAseq reads  1400
Single RNAseq reads  3689
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004818 (2e-161)
Best Drosophila hit  distal antenna (2e-35)
Best Human hittigger transposable element-derived protein 5 (2e-09)
Best NR hit (blastp)  RecName: Full=Protein distal antenna (1e-53)
Best NR hit (blastx)  GG11370 [Drosophila erecta] (2e-45)
GeneOntology terms




  
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0005634 nucleus
GO:0007379 segment specification
GO:0007469 antennal development
GO:0045449 regulation of transcription
GO:0048749 compound eye development
InterPro families


  
IPR011526 Helix-turn-helix, Psq-like
IPR009057 Homeodomain-like
IPR012287 Homeodomain-related
IPR006695 Centromere protein Cenp-B, DNA-binding domain 1
Orthology groupMCL18887

Nucleotide sequence:

ATGACAACGAAGGGAAAGCGTCCTATGCGCGCCCTCACACCCGGAGATAAGATCGAGGCC
ATACAGAGGGTCAACGACGGCGAGTCCAAAGCCTCGGTCGCTCGTGACATAGGAGTGCCC
GAGTCCACGCTGCGGGGCTGGTGCAAGAATGAGGACAAGCTCCGCTACATGACCTCGAGG
TTGTCCTCCCCCGACACCGACAAGAGCAACGACGGGGAGCCGCCGGACAAGCGCGCGCGC
ACCGAGTCCCCGACAGCTCCTCAGTCACCCATCAACACCGGCCTGGATCTTTCTAGCGCG
GTCTCCGTTACTCACAGCACCACCCAGCCTCCGCCGCAGGCCGATGTCCCCGTCGAGCTC
ACTACCAAGCGCAGCGAGCCCTCGCCCCCACTCCATCCGCCACGAGAGCGCCGACCGGAC
CCCGGAGCCAGCGTCTCCATGAGCGCCATCAGCCCGCTATCGGGATTGGCCCATCTACCA
GGACTTACACATTCTCACCTCGGATTGAGCTTCAATGAAATCGCAAACAACCTAACACTC
CTCGCTCAACTGAACCCTGGACTGTCGACGCTGTCGGCGCAGCCGGCGAGCAGAGCGCTG
CGGTCTGTGCGCTCGCCGAAGCCAGCTCACAACGGAGTGCTTAACTTGAACGAAAACAAA
CATCGCAGCAAATCAAACCACTCATCGGACCCGTACAGACACAGCGGGTCCAAGTCGAGT
CATCACACTACTTCGCAATCAGCGTCTCAGCCCGTCGACGACACGCTGTGGTACTGGCTC
AAAACTCAACAGGCCATGCTGGATTTGACTTCCCAAACAACGGCTCATCCGTTGCAATTA
GGAAAAACTAGCGATCCCACTCTGCCGCCTAAGCCCGTGGCGCCCACGCCGCCCGTCAGT
TCGCACCTCGACTACAACAGAAACTCTTGGCTGTGGCAGTACTACAAACAGTTCGGTGGA
GCCATGCCGGTTCCGGAAGACAAGCACAAGCCGGCGTCACAGGTACCGAAAGACAAGTCC
GGAGACATCTTGTTCTCGCATTTAACTAAAGCGAAGCCGGAGGACGACCGGAGCATCATT
AGTCCAGACCAGAGCCAAACTCTGTCGGCTAAAGTCCGCGAGACAGTCCCGCCGCCGCTC
CCGGCCGCTCCCGCAGAACCTCGAGTGGCCGAGCCCGCCTCGAGCCCGGACGTCGGCACG
GAGAATAAGGAACCCTCAGTAGAGAAACCCATCGAGTCCGGCAGAAGCCAAACCAAGGCC
AGAAACGTGCTCGACAACTTACTGTTCAACAGCAGCCAAGCGGCCAACGAAGAGAATAAG
AGCAACGGCTCCACGAACGGCGAGTGGGAGGCGGGCACGGTGGAGGCGCTGGAACACGGA
GACAAGTTCCTCGCGTGGCTGGAGGCCAGCGGCGACCCGAGCGTCACCCGCATGCACGTG
CATCAGCTCCGAGCACTGCTCCACAACCTCCGCACGCGCCGCGCCGCGCCCGACGCACGC
CGCAAGTAA

Protein sequence:

MTTKGKRPMRALTPGDKIEAIQRVNDGESKASVARDIGVPESTLRGWCKNEDKLRYMTSR
LSSPDTDKSNDGEPPDKRARTESPTAPQSPINTGLDLSSAVSVTHSTTQPPPQADVPVEL
TTKRSEPSPPLHPPRERRPDPGASVSMSAISPLSGLAHLPGLTHSHLGLSFNEIANNLTL
LAQLNPGLSTLSAQPASRALRSVRSPKPAHNGVLNLNENKHRSKSNHSSDPYRHSGSKSS
HHTTSQSASQPVDDTLWYWLKTQQAMLDLTSQTTAHPLQLGKTSDPTLPPKPVAPTPPVS
SHLDYNRNSWLWQYYKQFGGAMPVPEDKHKPASQVPKDKSGDILFSHLTKAKPEDDRSII
SPDQSQTLSAKVRETVPPPLPAAPAEPRVAEPASSPDVGTENKEPSVEKPIESGRSQTKA
RNVLDNLLFNSSQAANEENKSNGSTNGEWEAGTVEALEHGDKFLAWLEASGDPSVTRMHV
HQLRALLHNLRTRRAAPDARRK