DPGLEAN09934 in OGS1.0

New model in OGS2.0DPOGS212254 
Genomic Positionscaffold97:- 310951-344209
See gene structure
CDS Length2082
Paired RNAseq reads  44
Single RNAseq reads  128
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011565 (2e-138)
Best Drosophila hit  knot, isoform C (0.0)
Best Human hittranscription factor COE1 (0.0)
Best NR hit (blastp)  PREDICTED: similar to knot CG10197-PB [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to knot CG10197-PB [Tribolium castaneum] (0.0)
GeneOntology terms
















  
GO:0006355 regulation of transcription, DNA-dependent
GO:0035291 specification of segmental identity, intercalary segment
GO:0035287 head segmentation
GO:0001700 embryonic development via the syncytial blastoderm
GO:0007389 pattern specification process
GO:0005634 nucleus
GO:0003704 specific RNA polymerase II transcription factor activity
GO:0007474 imaginal disc-derived wing vein specification
GO:0030528 transcription regulator activity
GO:0007476 imaginal disc-derived wing morphogenesis
GO:0045449 regulation of transcription
GO:0003677 DNA binding
GO:0007350 blastoderm segmentation
GO:0035288 anterior head segmentation
GO:0035289 posterior head segmentation
GO:0045087 innate immune response
GO:0035203 regulation of lamellocyte differentiation
GO:0009608 response to symbiont
InterPro families



  
IPR014756 Immunoglobulin E-set
IPR002909 Cell surface receptor IPT/TIG
IPR018350 Transcription factor COE, conserved site
IPR003523 Transcription factor COE
IPR013783 Immunoglobulin-like fold
Orthology groupMCL10609

Nucleotide sequence:

ATGACGTTGTCTCGCTTTAGTGGCGAGGGCGTGTTCCCAGGGCGCCTCCCCCAGTCCGCG
GGGACGGGGGCACGCGAAGCTCGAGCACTCCGCGGCCTGTTCCCGCGACGCTTTGATTGT
CGCCGAGCCTGTGGGGCTCGCGCGTGTTCGGTCGCGATTCGGTCTGACGAGTTGAACGCG
GCGGCGGGCGGGTACACCGGCTCGCCGTGGCCGCCGCTGGAGCTCGACCCGGTTGGATGG
GGAAGGAAACTCTACCCCACTGGAGCCCCCAGGTCATCAGGGGGGCTTATGTTCGGGCTC
CATCAGGAGGGGGTCCACGCCCAGCCTCGAGGGCCTGTCACCTCGCTGAAGGAGGAACCT
CTCACCAGGGCCTGGATGACACCAACCTCACTAGTCGACAATACAAATACGGTGGGTGTT
GGCCGCGCTCACTTCGAGAAACAACCGCCAAGCAACCTGCGCAAGAGCAACTTCTTCCAC
TTCGTAGTTGCTTTATACGACCGAGCCGGACAGCCCGTAGAGATAGAACGAACAGCTTTC
ATAGGATTCATCGAAAAGGATCAGGAAGCGGAAGGCCAAAAGACAAACAACGGTATCCAG
TACAGATTACAGTTACTTTACGCAAATGGTATTCGACAGGAGCAAGACATATTCGTTCGA
TTGATAGATTCCGTCACGAAACAGCCCATCGTATATGAAGGTCAAGATAAGAATCCTGAA
ATGTGCCGAGTCCTTCTCACGCATGAAGTTATGTGCAGTCGGTGCTGTGATAAGAAAAGT
TGCGGCAACAGAAACGAAACGCCATCAGATCCTGTTATCATAGATCGATTTTTCTTAAAG
TTCTTCTTAAAATGCAACCAGAATTGTTTGAAAAATGCCGGCAACCCGCGAGATATGAGA
CGATTTCAGGTGGTAATCTCGACGCAAGTGATGGTGGATGGTCCTCTGCTAGCGATATCG
GACAACATGTTTGTCCACAACAACAGCAAGCACGGTAGACGTGCCAAGAGATTAGACCCA
TCTGAAGGTATTGACGCCGCGCCCGACTCTAATTCGGGGCTATACCCACCGTTGCCCGTA
GCAACGCCATGCATCAAAGCAATATCTCCCAGCGAAGGCTGGACGTCAGGGGGCTCTACC
GTAATAATAGTGGGGGACAACTTTTTCGATGGACTTCAAGTTGTATTTGGAACTATGTTG
GTGTGGAGCGAGCTGATAACATCACATGCGATAAGAGTTCAAACTCCACCGCGGCACATA
CCTGGTGTAGTTGAAGTCACACTTTCATACAAGAGCAAGCAGTTCTGCAAAGGAGCGCCT
GGAAGATTCGTATATGTTTCAGCTCTCAACGAGCCTACAATAGATTACGGCTTTCAGAGA
CTACAGAAACTTATACCGCGGCATCCTGGTGATCCAGAGAAACTACCAAAGGAGATAATT
CTAAAGCGAGCAGCAGACCTCGCCGAGGCCTTGTATTCAATGCCTCGTAATAACCAACTG
GGTCTATCCGCTCCTCGCTCGCCCTCCAGTATGCCCTTCAACTCATACACCGGACAGTTG
GCGGTCAGCGTCCAAGATACTGCCGCCTCACAGTGGACTGAAGAAGAGTACGCACGCAGC
GGCGGTTCGGTATCGCCGCGGTATTGTTCTGCCGCGTCTACGCCGCACGCGCCCGCCGCC
TACCCACCGCAACACTACCCTGCACCACCTACCTCACTCTTCAATACCTCCTCGCTGTCT
CTAGGTCCCTACCACCCGGCCAACGTAAACGGACATACAGAATACAATGCTTACAAAGAA
ACCGAGCATTATACGGAAAGAAATGATGATAATAAAACCATCTATCAAAACACTCATACG
AAATGTATCGACACGAAAACGCATAAAGACAAGTCCCGAAGTGCGTTCGCAGTTGTCAGA
CAAAGTCCACCGCGTAATTTCCAACAGCAAAATTGGCAACATCTCGCTGTACAGTCAGGA
ATGGGCGGTCTGGTGTCATCGCCTTTTAGTGTGAATCCGTTCTCGCTGCCCACTTGCAGC
GCACAGCAATACGCGCAGACAGCGCCGCTTGCCTCCAAGTAA

Protein sequence:

MTLSRFSGEGVFPGRLPQSAGTGAREARALRGLFPRRFDCRRACGARACSVAIRSDELNA
AAGGYTGSPWPPLELDPVGWGRKLYPTGAPRSSGGLMFGLHQEGVHAQPRGPVTSLKEEP
LTRAWMTPTSLVDNTNTVGVGRAHFEKQPPSNLRKSNFFHFVVALYDRAGQPVEIERTAF
IGFIEKDQEAEGQKTNNGIQYRLQLLYANGIRQEQDIFVRLIDSVTKQPIVYEGQDKNPE
MCRVLLTHEVMCSRCCDKKSCGNRNETPSDPVIIDRFFLKFFLKCNQNCLKNAGNPRDMR
RFQVVISTQVMVDGPLLAISDNMFVHNNSKHGRRAKRLDPSEGIDAAPDSNSGLYPPLPV
ATPCIKAISPSEGWTSGGSTVIIVGDNFFDGLQVVFGTMLVWSELITSHAIRVQTPPRHI
PGVVEVTLSYKSKQFCKGAPGRFVYVSALNEPTIDYGFQRLQKLIPRHPGDPEKLPKEII
LKRAADLAEALYSMPRNNQLGLSAPRSPSSMPFNSYTGQLAVSVQDTAASQWTEEEYARS
GGSVSPRYCSAASTPHAPAAYPPQHYPAPPTSLFNTSSLSLGPYHPANVNGHTEYNAYKE
TEHYTERNDDNKTIYQNTHTKCIDTKTHKDKSRSAFAVVRQSPPRNFQQQNWQHLAVQSG
MGGLVSSPFSVNPFSLPTCSAQQYAQTAPLASK