DPGLEAN08613 in OGS1.0

New model in OGS2.0DPOGS212692 
Genomic Positionscaffold3:- 595363-599236
See gene structure
CDS Length2931
Paired RNAseq reads  678
Single RNAseq reads  1657
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA013199 (0.0)
Best Drosophila hit  cut, isoform A (1e-89)
Best Human hithomeobox protein cut-like 2 (5e-34)
Best NR hit (blastp)  PREDICTED: similar to Homeobox protein cut [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to Homeobox protein cut [Tribolium castaneum] (0.0)
GeneOntology terms
























  
GO:0005634 nucleus
GO:0008587 imaginal disc-derived wing margin morphogenesis
GO:0003704 specific RNA polymerase II transcription factor activity
GO:0035277 spiracle morphogenesis, open tracheal system
GO:0048477 oogenesis
GO:0007443 Malpighian tubule morphogenesis
GO:0016360 sensory organ precursor cell fate determination
GO:0003677 DNA binding
GO:0008052 sensory organ boundary specification
GO:0045165 cell fate commitment
GO:0030713 ovarian follicle cell stalk formation
GO:0007424 open tracheal system development
GO:0007422 peripheral nervous system development
GO:0008585 female gonad development
GO:0007417 central nervous system development
GO:0007469 antennal development
GO:0048813 dendrite morphogenesis
GO:0043565 sequence-specific DNA binding
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0000278 mitotic cell cycle
GO:0045746 negative regulation of Notch signaling pathway
GO:0030707 ovarian follicle cell development
GO:0070983 dendrite guidance
GO:0007605 sensory perception of sound
GO:0048098 antennal joint development
GO:0032583 regulation of gene-specific transcription
InterPro families




  
IPR001356 Homeobox
IPR003350 Homeodomain protein CUT
IPR010982 Lambda repressor-like, DNA-binding
IPR009057 Homeodomain-like
IPR017970 Homeobox, conserved site
IPR012287 Homeodomain-related
Orthology groupMCL16599

Nucleotide sequence:

ATGTTTGAAAAGTGTGCCGGCAGATGTCGTACGTCAAACAATGTTTTAGGAAAGAGTGAA
AGCGAATTACGGCTCGTTTCCGTCACACGCGCGATATTCCTATACGCCGCCGGAACATAT
TGCTTCTATACAGACCTCTCGTTGGTCACAGTTCTGAATGTTGGAACAAAAGACGGTACA
ACCGGCCCCGGGTTCGGGAGGTCAGATGGTGACGGCGAGGAACGCCTGGCTCACATGCTC
AATGAAGCCTCACATATCATGAAGACACCGACGGGACAAGCCAACAACGATGACTCCAGG
AGCAACGAAGACTCCAGCTCACCGAGGACCCAGTGCCCGTCACCGTTTTCTAATAAGGAT
TCGAGTCAAAACAGACGGCTTAAGAAATACGAAAACGATGACATTCCTCAAGAAAAAGTA
GTGCGTATATACCAAGAAGAGCTGGCGAAGATAATGACGAGACGCGTGGAAGACATGCGC
CATAACAGAGACGGCTTCCCTGGCAGCGGCATGGCCCCGCACATGGAACGTCCTCCGGAA
GACATTAGGATGGCTCTGGAAGCGTATCACAGGGAACTAGCCAAAATACAACCGGGCGGA
AACATTCCGACCCTGCACAACTTGCCAGGGATGCCACCCTTCCCCAACCTGCTGGCCCTT
CAGCAGCAAGCCATGCAAGCACAAAGCCAGCACATCAACGGCTCCGGGGCAATCCAAGAT
CTCTCTCTGCCCAAAGAGAAAAATACCAAAATTAATGGAATGACTGATAGTGATAAGGAA
AGGTCTATGGACGCTGAAGAGGCCATCAGACACGCGGGAAGCGCTTTCTCGCTAGTTAGA
CCGAAATTAGAACCGGGACAGCAATCCACCGGCTCCTCGGCATCCAGCCCGCTGGGAAAT
GCTATTCTACCTCCCGCCATTACGCCGAATGAAGACTTCAGTAACTCGGCCGCAGCGAGT
CCATTACAAAGAATGGCTTCCATAACGAATAGTTTGATATCCCAGCCCCCGAATCCGCCA
CACCACGCGCCACCGCAGAGATCGATGAAGGCAGTCCTGCCACCGATAACTCAGCAACAG
TTCGATTTGTTCAACAATTTGAACACGGAGGAAATCGTGAAGAGAGTCAAAGAGGCTCTC
AGCCAGTATTCCATAAGCCAGAGATTGTTCGGCGAATCCGTGCTCGGCCTGTCTCAAGGA
TCCGTCAGCGATCTGCTAGCGAGACCGAAGCCATGGCACATGTTGACACAAAAGGGAAGA
GAGCCGTTCATTCGTATGAAAATGTTCTTGGAGGATGAAAACGCAGTGCACAAATTGGTT
GCGTCCCAATACAAAATCGCACCGGAGAAGCTGATGAGAACAGGAAACTATAGCGGAGCA
CCTTCATGTCCGCCAAATATGAACAAGCCGATGCCACCAACACAGAAGATGATCTCAGAT
GCCACGGTGCTCCTTAGCAAGATGCAACAGGAACAACTTCTAGGATCTGGACACTTAGGA
CATTTGGGACAACCGACCCCTCTCCTGTTGACTCCGCCTGGCTTCCCACCACATCACGCC
GTGACGCTGCCGCCTCAGCATCACGACAACAACAACAAGGAGAGGAAACCACCACCGCCT
CCACAACCCCATCACCAGCCGCCCGTGATGCGAGGCCTTCACCAGCACATGTCACCCAGC
GTCTACGAGATGGCAGCTCTGACGCAAGACCTCGACACTCAGACGATCACGACCAAAATA
AAGGAAGCGCTCCTCGCCAATAACATCGGACAGAAAATATTCGGCGAGGCCGTGTTGGGA
CTCTCCCAGGGATCGGTCAGTGAACTTCTATCGAAACCGAAACCCTGGCACATGTTGAGT
ATCAAAGGACGAGAGCCCTTCATCAGAATGCAGCTCTGGCTCAGCGATGCGCATAATATA
GATCGTCTCCAAGCGTTGAAGAATGAGAGACGCGAAGCTAACAAGAGACGGCGGTCGAGC
GGACCCGGTCAGGACAACTCCTCGGACACCTCATCGAATGATACGTCGGAGTTCTACCAC
TCCAGCTCGCCTGGACCGATACCCGGCGCGCCGTCCGCCAAGAAGCAGCGCGTGCTGTTC
TCGGAGGAACAGAAGGAAGCGCTGAGACTAGCCTTCGCTTTGGATCCCTACCCGAACATG
CCGACGATAGAATTCCTCGCTGCCGAGCTGGGCCTGTCCACCAGAACGATCACCAACTGG
TTCCACAACCATCGCATGCGGCTAAAGCAACAGGCGCCGCACGGCCTGCCCGCGGAACCT
CCAGCACGAGATCAGGCCTCCGCTCCCTTCGATCCCGTACAGTTCCGTCTCCTGCTCAAT
CAGAGGCTTCTGGAGCTGCAGAAGGAGAGGATGGGCCTGGCGGGGGTTCCTCTGCCGTAC
CCGCCCTACTTCGCCGCCAACTCCAACTTCGCCGCCCTCATCGGTCGCGGCCTGCTGCCC
ACCGACGAGCGCGTCAAGGACCCTGCCGCCGGACTCGACCTCTCGATGCCGCTGAAGCGT
GACCCTGACGGAGACGACTTCGAGGAGGACGACGTCGAGAGCAACCTCGGCTCCGAGGAC
TCCCTCGACGATGACTCCAAGACTGAGCCCAAGGCGGCCTCCACCCCCGCTGGTCGGTCC
AGCCGCCGCAAGCCCGCGGCGCCGCAGTGGGTCAACCCCGACTGGCAGGACGAGAAGCCG
CGCAACCCCGACGAGGTCATCATCAACGGCGTCTGCGTGATGCGCGCCGACGACTACCGT
CGCGAGGCCACGGAGACCGTGAGGGTGGAGCCATCCCCCGCCCCCCGCGAGAGCTCCCCC
GCCCCCCAGGACACGCCGCGCGCGCCTCGCACCCCCCGCACGCCGTCCCCGGACGTCCTG
CCCGAGGACAAGATCAAGACGGAGGCGGAAGACGACCGGTGGGAGTATTAA

Protein sequence:

MFEKCAGRCRTSNNVLGKSESELRLVSVTRAIFLYAAGTYCFYTDLSLVTVLNVGTKDGT
TGPGFGRSDGDGEERLAHMLNEASHIMKTPTGQANNDDSRSNEDSSSPRTQCPSPFSNKD
SSQNRRLKKYENDDIPQEKVVRIYQEELAKIMTRRVEDMRHNRDGFPGSGMAPHMERPPE
DIRMALEAYHRELAKIQPGGNIPTLHNLPGMPPFPNLLALQQQAMQAQSQHINGSGAIQD
LSLPKEKNTKINGMTDSDKERSMDAEEAIRHAGSAFSLVRPKLEPGQQSTGSSASSPLGN
AILPPAITPNEDFSNSAAASPLQRMASITNSLISQPPNPPHHAPPQRSMKAVLPPITQQQ
FDLFNNLNTEEIVKRVKEALSQYSISQRLFGESVLGLSQGSVSDLLARPKPWHMLTQKGR
EPFIRMKMFLEDENAVHKLVASQYKIAPEKLMRTGNYSGAPSCPPNMNKPMPPTQKMISD
ATVLLSKMQQEQLLGSGHLGHLGQPTPLLLTPPGFPPHHAVTLPPQHHDNNNKERKPPPP
PQPHHQPPVMRGLHQHMSPSVYEMAALTQDLDTQTITTKIKEALLANNIGQKIFGEAVLG
LSQGSVSELLSKPKPWHMLSIKGREPFIRMQLWLSDAHNIDRLQALKNERREANKRRRSS
GPGQDNSSDTSSNDTSEFYHSSSPGPIPGAPSAKKQRVLFSEEQKEALRLAFALDPYPNM
PTIEFLAAELGLSTRTITNWFHNHRMRLKQQAPHGLPAEPPARDQASAPFDPVQFRLLLN
QRLLELQKERMGLAGVPLPYPPYFAANSNFAALIGRGLLPTDERVKDPAAGLDLSMPLKR
DPDGDDFEEDDVESNLGSEDSLDDDSKTEPKAASTPAGRSSRRKPAAPQWVNPDWQDEKP
RNPDEVIINGVCVMRADDYRREATETVRVEPSPAPRESSPAPQDTPRAPRTPRTPSPDVL
PEDKIKTEAEDDRWEY