DPGLEAN21211 in OGS1.0

New model in OGS2.0: DPOGS208434
Genomic Position: scaffold359:- 14569-28576
CDS Length: 1122
Paired RNAseq reads: 72
Single RNAseq reads: 267
Migratory profiles: Query via corresponding ESTs
Best Bombyx hit: BGIBMGA008808 (5e-108)
Best Drosophila hit: brother of odd with entrails limited, isoform A (4e-28)
Best Human hit: protein odd-skipped-related 2 isoform b (8e-16)
Best NR hit (blastp): conserved hypothetical protein [Culex quinquefasciatus] (3e-67)
Best NR hit (blastx): conserved hypothetical protein [Culex quinquefasciatus] (1e-52)
Gene Ontology terms:
GO:0005634 nucleus
GO:0003702 RNA polymerase II transcription factor activity
GO:0007442 hindgut morphogenesis
GO:0007480 imaginal disc-derived leg morphogenesis
GO:0016348 imaginal disc-derived leg joint morphogenesis
GO:0008270 zinc ion binding
GO:0003676 nucleic acid binding
GO:0035220 wing disc development
InterPro families:
IPR013087 Zinc finger, C2H2-type/integrase, DNA-binding
IPR015880 Zinc finger, C2H2-like
IPR007087 Zinc finger, C2H2-type
Orthology group: MCL40336

Nucleotide sequence:

ATGTATATGGAAGAGGACCCTGAATTTGTAAACTTTGTGGGCGACTACCTGACTACTGGA
GGGTTGGTCGCATCGTTGGATCGATCGGACGATTCGTCGCCTTATGAATCTACTAACGTA
ATGTCTGAGAACGGAACGGTAGTTTCAAGGGCTTCCACTCCTCGCCCAGCAACAGATAGT
CCAGTGACCGCTCCTTTGCGCTCGAGACATGACCAATTTCTTGCAAGTGCAGCTGAACGA
GCCCAGAGACCATCATCAAGTGGTCGAAATGCAAATGGAAACTTTGATTCCCCTCCTATT
CCTCCTTATCATCGCCAAATAACTACAGGCACCCTTCTTGCTCCTTCTCATCGCGAATCT
TCTGCATTTGTACCTGTTGTGCCAACAAGAGCCATGCACCCTGTTATGTATCCAAACGAA
ATGCATCCTTCGCTTCTACCATCTGAAATGATAGAAAGGGAAAGAATGTTAGAAAGAGAC
AGAAGTGAACCAGCCAAGGGGTCACCTGATAGAACTTCAGGCAATTTTGCTAAAAGAAAT
TCTTTTGACCTTATGGCCATGATGGTAGAAAAGCGCAAAGAAGTGGCCTTGCGTGAGGCT
GCAGCAGCTATGCTTATTCCACACCACAGAGCTTCGTCAATAGATGGTATGATATCGGAC
GGATCATCACAACCTCCAATTTATGGTCCTCCTGGAGCATTTCTCGGGGCTCCAGGACCT
TCTCCAACTGCTGCTAACAGTTTCACTTTTCCTGGTGCCGGATTATTCCCTTCCGGGGCT
GGTCCCCATCAAATGCACCCTCATCTTGATAGACGATTGCTTAGAGCTCCTGGGAGGGCT
TCAAGACCGAAAAAACAATTTATCTGTAAATTCTGCAATCGCCAATTTACAAAATCCTAT
AACCTTCTTATCCACGAAAGGACGCACACGGATGAACGACCATATTCATGTGATATCTGC
GGAAAAGCTTTCAGAAGACAAGATCACTTGAGGGACCACAGGCACCTGATGCCAGGGCAA
AGTAAGCACAATAAATGTTTGCGTCAGTCAGAGGTGAAATTGAGTGTGAAGCGGCAGTGG
CGAGCCGTCGCTCCCATTATGCCCGGCCGACAAACCGGCTGA

Protein sequence:

MYMEEDPEFVNFVGDYLTTGGLVASLDRSDDSSPYESTNVMSENGTVVSRASTPRPATDS
PVTAPLRSRHDQFLASAAERAQRPSSSGRNANGNFDSPPIPPYHRQITTGTLLAPSHRES
SAFVPVVPTRAMHPVMYPNEMHPSLLPSEMIERERMLERDRSEPAKGSPDRTSGNFAKRN
SFDLMAMMVEKRKEVALREAAAAMLIPHHRASSIDGMISDGSSQPPIYGPPGAFLGAPGP
SPTAANSFTFPGAGLFPSGAGPHQMHPHLDRRLLRAPGRASRPKKQFICKFCNRQFTKSY
NLLIHERTHTDERPYSCDICGKAFRRQDHLRDHRHLMPGQSKHNKCLRQSEVKLSVKRQW
RAVAPIMPGRQTG
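
Consistency check (illustrative):

The record lists a CDS of 1122 nt, which corresponds to 374 codons, that is, 373 residues plus a stop codon. The protein sequence above is 373 residues long. A minimal sketch of how the nucleotide and protein sequences could be cross-checked is given below. It assumes the standard genetic code, which the record itself does not state, and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration only.

```python
from itertools import product

# Standard genetic code (assumed; the record does not name a translation table).
# Codons are enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1 and drop a single trailing stop codon."""
    cds = "".join(cds.split()).upper()  # remove line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 18 nt and last 12 nt of the nucleotide sequence in this record.
print(translate("ATGTATATGGAAGAGGAC"))  # MYMEED
print(translate("CAAACCGGCTGA"))        # QTG
```

Passing the full nucleotide sequence to `translate` and comparing the result with the protein sequence (after removing line breaks) would extend the same check to the whole record.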