DPGLEAN14276 in OGS1.0

New model in OGS2.0DPOGS206411 
Genomic Positionscaffold303:- 76755-85672
See gene structure
CDS Length822
Paired RNAseq reads  54
Single RNAseq reads  228
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA013872 (4e-65)
Best Drosophila hit  tailup, isoform B (7e-61)
Best Human hitinsulin gene enhancer protein ISL-2 (6e-48)
Best NR hit (blastp)  insulinprotein enhancer protein isl [Aedes aegypti] (1e-102)
Best NR hit (blastx)  insulinprotein enhancer protein isl [Aedes aegypti] (4e-86)
GeneOntology terms
















  
GO:0007362 terminal region determination
GO:0008293 torso signaling pathway
GO:0008258 head involution
GO:0046665 amnioserosa maintenance
GO:0007390 germ-band shortening
GO:0005634 nucleus
GO:0003704 specific RNA polymerase II transcription factor activity
GO:0006357 regulation of transcription from RNA polymerase II promoter
GO:0007391 dorsal closure
GO:0007399 nervous system development
GO:0008270 zinc ion binding
GO:0006355 regulation of transcription, DNA-dependent
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0043565 sequence-specific DNA binding
GO:0008045 motor axon guidance
GO:0008407 bristle morphogenesis
GO:0070983 dendrite guidance
GO:0035310 notum cell fate specification
InterPro families



  
IPR001781 Zinc finger, LIM-type
IPR012287 Homeodomain-related
IPR009057 Homeodomain-like
IPR001356 Homeobox
IPR017970 Homeobox, conserved site
Orthology groupMCL11887

Nucleotide sequence:

ATGAGAGCAAAGACGAAGATATATCATATAGACTGCTTCAGATGCTGCGCTTGCGCACGA
CAACTTATACCCGGTGACGAGTTCGCGTTGAGAGAAGGCGGAGCTTTATATTGTAGAGAA
GATCACGATGTATTAGAAAAGAGCGCTAACACAAGCGGCAGCAGCGCCGGCAACGCCGAG
AGCAACAACAACACAACACTCAGCAACAACAATTCGCATCACCCGCACGAGTTAGGATCT
ATGTCGGATTCAGGAAGTGAGTCTGGCTCGCATAAGAGTGGAAGAGCCAGGGCTGGCGCT
GCGGCTGATGGTAAACCCACCAGGGTGAGGACTGTCCTCAATGAGAAACAATTACACACA
CTAAGAACCTGTTATGCTGCGAATCCTAGACCTGACGCTCTCATGAAGGAACAGCTGGTT
GAAATGACAGGTCTTAGTCCTCGAGTGATAAGAGTGTGGTTCCAGAACAAGAGATGCAAA
GACAAGAAGAAGACTATACAGCTGAAGATGCAGATGCAGCAAGAGAAGGAAGGCCGCCGT
TTGGGCTATATGTCTATGGGAGTGCCGTTAGTGGCCGGTTCGCCTGTAAGACATGAGGCT
GGGTCTCTAGCTCTAGAGGTGACGGCGTATCAGCCGCCGTGGAAGGCCCTCAGCGACTTC
GCACTCCACGCGGACCTTGACAGGCCTCAACACAGCGCCGCCTTCCAACAGCTCGTGAAC
CAGATGCACGGTTACGACATCCCCTCTCTGCCCCCTCCACGTCACGAGGACAACTACGTC
ACCTATCTCGAGAGTGACGACAGTCTGCCGCCGTCACCCTAG

Protein sequence:

MRAKTKIYHIDCFRCCACARQLIPGDEFALREGGALYCREDHDVLEKSANTSGSSAGNAE
SNNNTTLSNNNSHHPHELGSMSDSGSESGSHKSGRARAGAAADGKPTRVRTVLNEKQLHT
LRTCYAANPRPDALMKEQLVEMTGLSPRVIRVWFQNKRCKDKKKTIQLKMQMQQEKEGRR
LGYMSMGVPLVAGSPVRHEAGSLALEVTAYQPPWKALSDFALHADLDRPQHSAAFQQLVN
QMHGYDIPSLPPPRHEDNYVTYLESDDSLPPSP