DPGLEAN20422 in OGS1.0

Genomic Positionscaffold238:+ 31992-64202
See gene structure
CDS Length1026
Paired RNAseq reads  684
Single RNAseq reads  2253
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA004886 (3e-38)
Best Drosophila hit  homothorax, isoform F (4e-68)
Best Human hithomeobox protein Meis1 (5e-59)
Best NR hit (blastp)  homothorax, isoform F [Drosophila melanogaster] (3e-74)
Best NR hit (blastx)  homothorax homeobox protein [Aedes aegypti] (1e-72)
GeneOntology terms


















  
GO:0007422 peripheral nervous system development
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0005634 nucleus
GO:0006357 regulation of transcription from RNA polymerase II promoter
GO:0003677 DNA binding
GO:0007479 leg disc proximal/distal pattern formation
GO:0007420 brain development
GO:0048749 compound eye development
GO:0007432 salivary gland boundary specification
GO:0010092 specification of organ identity
GO:0009954 proximal/distal pattern formation
GO:0001752 compound eye photoreceptor fate commitment
GO:0043565 sequence-specific DNA binding
GO:0007380 specification of segmental identity, head
GO:0007480 imaginal disc-derived leg morphogenesis
GO:0035282 segmentation
GO:0048735 haltere morphogenesis
GO:0007476 imaginal disc-derived wing morphogenesis
GO:0007383 specification of segmental identity, antennal segment
GO:0010552 positive regulation of gene-specific transcription from RNA polymerase II promoter
InterPro families  ND
Orthology groupMCL10435

Nucleotide sequence:

ATGCCGTTTAATGTACATCAGCTGGCGGCGGCGACGCGTCGAGGAGTGGCGCGGTCGAGC
GGAGGGCGCCGGCGGGGTGGTGCGGCGGAGTGCGGCGAGCGGGGGGCGGGCACACGCCCC
GCGGTGGCCCGGCTCGCGTCAGCCGCGCCAGTGTCCGCTAGACTGCCGAGCGCGCCGGGC
CTCCCGCTCCCCGCGCCTCGCTCGGCCACGACGCGTCCCGTACGACACCTTCCAAACCTC
GACCGTCGCGAGTGCATTATGACACTAACAGCCCGGGACAACAAAACGTTCGTGGTGGCC
AGCGGAGCCTTGCACCCGGGCTCGGGCCGAGCTTTCATGGCTCAGCCTAGGTACGACGAG
AGCCTTCACGGCGGGGGCTACATGGAAGGCGGCGCCATGTACCACGAGCACCGTCTCTCC
CACCCTCACATCCCGCCGGTACACTACCCTCCGCCCGCTGCGCCGGCGCACGCGCTGCCA
GGCGAACCCCTCGTGCACAAACGAGACAAGGACGCCATTTATGGCCATCCCCTCTTCCCC
CTACTGGCGCTGATCTTTGAGAAGTGCGAGTTGGCGACTTGCACGCCTCGCGACCCTGGC
GTGGCGGGAGGAGATGTCTGCTCCTCAGAGTCCTTCAACGAAGATATCGCGGTCTTCAGT
AAACAGATACGCCAAGAAAAACCATATTACATTGCGGACCCAGAGGTTGACTCTTTAATG
GTGCAAGCGATACAAGTCTTACGGTTTCACTTGTTAGAATTAGAAAAAGTGCACGAGTTA
TGCGACAACTTCTGCCACCGCTACATCAGCTGTCTGAAGGGCAAGATGCCCATCGACCTG
GTGATCGACGAGCGAGAGACGCGGCCGCCGGACAACGGCGAGCGCTCCGCCCCTGACAGC
AGCCACGACGGCGCCTCCACTCCGGACGTCGTGAGTACCCCCTATAGCAAACACCTGATT
CAGAGGCGCGATCCCCAGCTAGCACCCAATTCGACCACGAGCTATCTCGTGCACCCCTGC
TCTTAG

Protein sequence:

MPFNVHQLAAATRRGVARSSGGRRRGGAAECGERGAGTRPAVARLASAAPVSARLPSAPG
LPLPAPRSATTRPVRHLPNLDRRECIMTLTARDNKTFVVASGALHPGSGRAFMAQPRYDE
SLHGGGYMEGGAMYHEHRLSHPHIPPVHYPPPAAPAHALPGEPLVHKRDKDAIYGHPLFP
LLALIFEKCELATCTPRDPGVAGGDVCSSESFNEDIAVFSKQIRQEKPYYIADPEVDSLM
VQAIQVLRFHLLELEKVHELCDNFCHRYISCLKGKMPIDLVIDERETRPPDNGERSAPDS
SHDGASTPDVVSTPYSKHLIQRRDPQLAPNSTTSYLVHPCS