New model in OGS2.0 | DPOGS212254  |
---|---|
Genomic Position | scaffold97:- 310951-344209 |
CDS Length | 2082 |
Paired RNAseq reads   | 44 |
Single RNAseq reads   | 128 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA011565 (2e-138) |
Best Drosophila hit   | knot, isoform C (0.0) |
Best Human hit | transcription factor COE1 (0.0) |
Best NR hit (blastp)   | PREDICTED: similar to knot CG10197-PB [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to knot CG10197-PB [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0006355 regulation of transcription, DNA-dependent; GO:0035291 specification of segmental identity, intercalary segment; GO:0035287 head segmentation; GO:0001700 embryonic development via the syncytial blastoderm; GO:0007389 pattern specification process; GO:0005634 nucleus; GO:0003704 specific RNA polymerase II transcription factor activity; GO:0007474 imaginal disc-derived wing vein specification; GO:0030528 transcription regulator activity; GO:0007476 imaginal disc-derived wing morphogenesis; GO:0045449 regulation of transcription; GO:0003677 DNA binding; GO:0007350 blastoderm segmentation; GO:0035288 anterior head segmentation; GO:0035289 posterior head segmentation; GO:0045087 innate immune response; GO:0035203 regulation of lamellocyte differentiation; GO:0009608 response to symbiont |
InterPro families    | IPR014756 Immunoglobulin E-set; IPR002909 Cell surface receptor IPT/TIG; IPR018350 Transcription factor COE, conserved site; IPR003523 Transcription factor COE; IPR013783 Immunoglobulin-like fold |
Orthology group | MCL10609 |
Nucleotide sequence:
ATGACGTTGTCTCGCTTTAGTGGCGAGGGCGTGTTCCCAGGGCGCCTCCCCCAGTCCGCG
GGGACGGGGGCACGCGAAGCTCGAGCACTCCGCGGCCTGTTCCCGCGACGCTTTGATTGT
CGCCGAGCCTGTGGGGCTCGCGCGTGTTCGGTCGCGATTCGGTCTGACGAGTTGAACGCG
GCGGCGGGCGGGTACACCGGCTCGCCGTGGCCGCCGCTGGAGCTCGACCCGGTTGGATGG
GGAAGGAAACTCTACCCCACTGGAGCCCCCAGGTCATCAGGGGGGCTTATGTTCGGGCTC
CATCAGGAGGGGGTCCACGCCCAGCCTCGAGGGCCTGTCACCTCGCTGAAGGAGGAACCT
CTCACCAGGGCCTGGATGACACCAACCTCACTAGTCGACAATACAAATACGGTGGGTGTT
GGCCGCGCTCACTTCGAGAAACAACCGCCAAGCAACCTGCGCAAGAGCAACTTCTTCCAC
TTCGTAGTTGCTTTATACGACCGAGCCGGACAGCCCGTAGAGATAGAACGAACAGCTTTC
ATAGGATTCATCGAAAAGGATCAGGAAGCGGAAGGCCAAAAGACAAACAACGGTATCCAG
TACAGATTACAGTTACTTTACGCAAATGGTATTCGACAGGAGCAAGACATATTCGTTCGA
TTGATAGATTCCGTCACGAAACAGCCCATCGTATATGAAGGTCAAGATAAGAATCCTGAA
ATGTGCCGAGTCCTTCTCACGCATGAAGTTATGTGCAGTCGGTGCTGTGATAAGAAAAGT
TGCGGCAACAGAAACGAAACGCCATCAGATCCTGTTATCATAGATCGATTTTTCTTAAAG
TTCTTCTTAAAATGCAACCAGAATTGTTTGAAAAATGCCGGCAACCCGCGAGATATGAGA
CGATTTCAGGTGGTAATCTCGACGCAAGTGATGGTGGATGGTCCTCTGCTAGCGATATCG
GACAACATGTTTGTCCACAACAACAGCAAGCACGGTAGACGTGCCAAGAGATTAGACCCA
TCTGAAGGTATTGACGCCGCGCCCGACTCTAATTCGGGGCTATACCCACCGTTGCCCGTA
GCAACGCCATGCATCAAAGCAATATCTCCCAGCGAAGGCTGGACGTCAGGGGGCTCTACC
GTAATAATAGTGGGGGACAACTTTTTCGATGGACTTCAAGTTGTATTTGGAACTATGTTG
GTGTGGAGCGAGCTGATAACATCACATGCGATAAGAGTTCAAACTCCACCGCGGCACATA
CCTGGTGTAGTTGAAGTCACACTTTCATACAAGAGCAAGCAGTTCTGCAAAGGAGCGCCT
GGAAGATTCGTATATGTTTCAGCTCTCAACGAGCCTACAATAGATTACGGCTTTCAGAGA
CTACAGAAACTTATACCGCGGCATCCTGGTGATCCAGAGAAACTACCAAAGGAGATAATT
CTAAAGCGAGCAGCAGACCTCGCCGAGGCCTTGTATTCAATGCCTCGTAATAACCAACTG
GGTCTATCCGCTCCTCGCTCGCCCTCCAGTATGCCCTTCAACTCATACACCGGACAGTTG
GCGGTCAGCGTCCAAGATACTGCCGCCTCACAGTGGACTGAAGAAGAGTACGCACGCAGC
GGCGGTTCGGTATCGCCGCGGTATTGTTCTGCCGCGTCTACGCCGCACGCGCCCGCCGCC
TACCCACCGCAACACTACCCTGCACCACCTACCTCACTCTTCAATACCTCCTCGCTGTCT
CTAGGTCCCTACCACCCGGCCAACGTAAACGGACATACAGAATACAATGCTTACAAAGAA
ACCGAGCATTATACGGAAAGAAATGATGATAATAAAACCATCTATCAAAACACTCATACG
AAATGTATCGACACGAAAACGCATAAAGACAAGTCCCGAAGTGCGTTCGCAGTTGTCAGA
CAAAGTCCACCGCGTAATTTCCAACAGCAAAATTGGCAACATCTCGCTGTACAGTCAGGA
ATGGGCGGTCTGGTGTCATCGCCTTTTAGTGTGAATCCGTTCTCGCTGCCCACTTGCAGC
GCACAGCAATACGCGCAGACAGCGCCGCTTGCCTCCAAGTAA
Protein sequence:
MTLSRFSGEGVFPGRLPQSAGTGAREARALRGLFPRRFDCRRACGARACSVAIRSDELNA
AAGGYTGSPWPPLELDPVGWGRKLYPTGAPRSSGGLMFGLHQEGVHAQPRGPVTSLKEEP
LTRAWMTPTSLVDNTNTVGVGRAHFEKQPPSNLRKSNFFHFVVALYDRAGQPVEIERTAF
IGFIEKDQEAEGQKTNNGIQYRLQLLYANGIRQEQDIFVRLIDSVTKQPIVYEGQDKNPE
MCRVLLTHEVMCSRCCDKKSCGNRNETPSDPVIIDRFFLKFFLKCNQNCLKNAGNPRDMR
RFQVVISTQVMVDGPLLAISDNMFVHNNSKHGRRAKRLDPSEGIDAAPDSNSGLYPPLPV
ATPCIKAISPSEGWTSGGSTVIIVGDNFFDGLQVVFGTMLVWSELITSHAIRVQTPPRHI
PGVVEVTLSYKSKQFCKGAPGRFVYVSALNEPTIDYGFQRLQKLIPRHPGDPEKLPKEII
LKRAADLAEALYSMPRNNQLGLSAPRSPSSMPFNSYTGQLAVSVQDTAASQWTEEEYARS
GGSVSPRYCSAASTPHAPAAYPPQHYPAPPTSLFNTSSLSLGPYHPANVNGHTEYNAYKE
TEHYTERNDDNKTIYQNTHTKCIDTKTHKDKSRSAFAVVRQSPPRNFQQQNWQHLAVQSG
MGGLVSSPFSVNPFSLPTCSAQQYAQTAPLASK
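
The figures in this record can be cross-checked with a few lines of Python. The sketch below is only an illustration: the function names are hypothetical, and it assumes that the CDS length includes the terminal stop codon and that the genomic coordinates are 1-based and inclusive, neither of which is stated in the record itself.

```python
# Minimal consistency check for the record's reported lengths.
# Function names are illustrative, not part of any database API.

def expected_protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS that ends in a stop codon.

    Assumption: the reported CDS length includes the stop codon.
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the terminal stop codon


def genomic_span(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


# Values copied from the record: CDS length and scaffold97 coordinates.
cds_length = 2082
protein_length = expected_protein_length(cds_length)
span = genomic_span(310951, 344209)
print(protein_length, span)
```

Under these assumptions, the 2082 nt CDS corresponds to 693 residues, which should equal the length of the protein sequence listed above, and the gene model spans 33259 bp on the scaffold, so most of the locus would be intronic.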