New model in OGS2.0 | DPOGS212692  |
---|---|
Genomic Position | scaffold3:- 595363-599236 |
See gene structure | |
CDS Length | 2931 |
Paired RNAseq reads   | 678 |
Single RNAseq reads   | 1657 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA013199 (0.0) |
Best Drosophila hit   | cut, isoform A (1e-89) |
Best Human hit | homeobox protein cut-like 2 (5e-34) |
Best NR hit (blastp)   | PREDICTED: similar to Homeobox protein cut [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to Homeobox protein cut [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0005634 nucleus GO:0008587 imaginal disc-derived wing margin morphogenesis GO:0003704 specific RNA polymerase II transcription factor activity GO:0035277 spiracle morphogenesis, open tracheal system GO:0048477 oogenesis GO:0007443 Malpighian tubule morphogenesis GO:0016360 sensory organ precursor cell fate determination GO:0003677 DNA binding GO:0008052 sensory organ boundary specification GO:0045165 cell fate commitment GO:0030713 ovarian follicle cell stalk formation GO:0007424 open tracheal system development GO:0007422 peripheral nervous system development GO:0008585 female gonad development GO:0007417 central nervous system development GO:0007469 antennal development GO:0048813 dendrite morphogenesis GO:0043565 sequence-specific DNA binding GO:0003700 sequence-specific DNA binding transcription factor activity GO:0000278 mitotic cell cycle GO:0045746 negative regulation of Notch signaling pathway GO:0030707 ovarian follicle cell development GO:0070983 dendrite guidance GO:0007605 sensory perception of sound GO:0048098 antennal joint development GO:0032583 regulation of gene-specific transcription |
InterPro families    | IPR001356 Homeobox IPR003350 Homeodomain protein CUT IPR010982 Lambda repressor-like, DNA-binding IPR009057 Homeodomain-like IPR017970 Homeobox, conserved site IPR012287 Homeodomain-related |
Orthology group | MCL16599 |
Nucleotide sequence:
ATGTTTGAAAAGTGTGCCGGCAGATGTCGTACGTCAAACAATGTTTTAGGAAAGAGTGAA
AGCGAATTACGGCTCGTTTCCGTCACACGCGCGATATTCCTATACGCCGCCGGAACATAT
TGCTTCTATACAGACCTCTCGTTGGTCACAGTTCTGAATGTTGGAACAAAAGACGGTACA
ACCGGCCCCGGGTTCGGGAGGTCAGATGGTGACGGCGAGGAACGCCTGGCTCACATGCTC
AATGAAGCCTCACATATCATGAAGACACCGACGGGACAAGCCAACAACGATGACTCCAGG
AGCAACGAAGACTCCAGCTCACCGAGGACCCAGTGCCCGTCACCGTTTTCTAATAAGGAT
TCGAGTCAAAACAGACGGCTTAAGAAATACGAAAACGATGACATTCCTCAAGAAAAAGTA
GTGCGTATATACCAAGAAGAGCTGGCGAAGATAATGACGAGACGCGTGGAAGACATGCGC
CATAACAGAGACGGCTTCCCTGGCAGCGGCATGGCCCCGCACATGGAACGTCCTCCGGAA
GACATTAGGATGGCTCTGGAAGCGTATCACAGGGAACTAGCCAAAATACAACCGGGCGGA
AACATTCCGACCCTGCACAACTTGCCAGGGATGCCACCCTTCCCCAACCTGCTGGCCCTT
CAGCAGCAAGCCATGCAAGCACAAAGCCAGCACATCAACGGCTCCGGGGCAATCCAAGAT
CTCTCTCTGCCCAAAGAGAAAAATACCAAAATTAATGGAATGACTGATAGTGATAAGGAA
AGGTCTATGGACGCTGAAGAGGCCATCAGACACGCGGGAAGCGCTTTCTCGCTAGTTAGA
CCGAAATTAGAACCGGGACAGCAATCCACCGGCTCCTCGGCATCCAGCCCGCTGGGAAAT
GCTATTCTACCTCCCGCCATTACGCCGAATGAAGACTTCAGTAACTCGGCCGCAGCGAGT
CCATTACAAAGAATGGCTTCCATAACGAATAGTTTGATATCCCAGCCCCCGAATCCGCCA
CACCACGCGCCACCGCAGAGATCGATGAAGGCAGTCCTGCCACCGATAACTCAGCAACAG
TTCGATTTGTTCAACAATTTGAACACGGAGGAAATCGTGAAGAGAGTCAAAGAGGCTCTC
AGCCAGTATTCCATAAGCCAGAGATTGTTCGGCGAATCCGTGCTCGGCCTGTCTCAAGGA
TCCGTCAGCGATCTGCTAGCGAGACCGAAGCCATGGCACATGTTGACACAAAAGGGAAGA
GAGCCGTTCATTCGTATGAAAATGTTCTTGGAGGATGAAAACGCAGTGCACAAATTGGTT
GCGTCCCAATACAAAATCGCACCGGAGAAGCTGATGAGAACAGGAAACTATAGCGGAGCA
CCTTCATGTCCGCCAAATATGAACAAGCCGATGCCACCAACACAGAAGATGATCTCAGAT
GCCACGGTGCTCCTTAGCAAGATGCAACAGGAACAACTTCTAGGATCTGGACACTTAGGA
CATTTGGGACAACCGACCCCTCTCCTGTTGACTCCGCCTGGCTTCCCACCACATCACGCC
GTGACGCTGCCGCCTCAGCATCACGACAACAACAACAAGGAGAGGAAACCACCACCGCCT
CCACAACCCCATCACCAGCCGCCCGTGATGCGAGGCCTTCACCAGCACATGTCACCCAGC
GTCTACGAGATGGCAGCTCTGACGCAAGACCTCGACACTCAGACGATCACGACCAAAATA
AAGGAAGCGCTCCTCGCCAATAACATCGGACAGAAAATATTCGGCGAGGCCGTGTTGGGA
CTCTCCCAGGGATCGGTCAGTGAACTTCTATCGAAACCGAAACCCTGGCACATGTTGAGT
ATCAAAGGACGAGAGCCCTTCATCAGAATGCAGCTCTGGCTCAGCGATGCGCATAATATA
GATCGTCTCCAAGCGTTGAAGAATGAGAGACGCGAAGCTAACAAGAGACGGCGGTCGAGC
GGACCCGGTCAGGACAACTCCTCGGACACCTCATCGAATGATACGTCGGAGTTCTACCAC
TCCAGCTCGCCTGGACCGATACCCGGCGCGCCGTCCGCCAAGAAGCAGCGCGTGCTGTTC
TCGGAGGAACAGAAGGAAGCGCTGAGACTAGCCTTCGCTTTGGATCCCTACCCGAACATG
CCGACGATAGAATTCCTCGCTGCCGAGCTGGGCCTGTCCACCAGAACGATCACCAACTGG
TTCCACAACCATCGCATGCGGCTAAAGCAACAGGCGCCGCACGGCCTGCCCGCGGAACCT
CCAGCACGAGATCAGGCCTCCGCTCCCTTCGATCCCGTACAGTTCCGTCTCCTGCTCAAT
CAGAGGCTTCTGGAGCTGCAGAAGGAGAGGATGGGCCTGGCGGGGGTTCCTCTGCCGTAC
CCGCCCTACTTCGCCGCCAACTCCAACTTCGCCGCCCTCATCGGTCGCGGCCTGCTGCCC
ACCGACGAGCGCGTCAAGGACCCTGCCGCCGGACTCGACCTCTCGATGCCGCTGAAGCGT
GACCCTGACGGAGACGACTTCGAGGAGGACGACGTCGAGAGCAACCTCGGCTCCGAGGAC
TCCCTCGACGATGACTCCAAGACTGAGCCCAAGGCGGCCTCCACCCCCGCTGGTCGGTCC
AGCCGCCGCAAGCCCGCGGCGCCGCAGTGGGTCAACCCCGACTGGCAGGACGAGAAGCCG
CGCAACCCCGACGAGGTCATCATCAACGGCGTCTGCGTGATGCGCGCCGACGACTACCGT
CGCGAGGCCACGGAGACCGTGAGGGTGGAGCCATCCCCCGCCCCCCGCGAGAGCTCCCCC
GCCCCCCAGGACACGCCGCGCGCGCCTCGCACCCCCCGCACGCCGTCCCCGGACGTCCTG
CCCGAGGACAAGATCAAGACGGAGGCGGAAGACGACCGGTGGGAGTATTAA
Protein sequence:
MFEKCAGRCRTSNNVLGKSESELRLVSVTRAIFLYAAGTYCFYTDLSLVTVLNVGTKDGT
TGPGFGRSDGDGEERLAHMLNEASHIMKTPTGQANNDDSRSNEDSSSPRTQCPSPFSNKD
SSQNRRLKKYENDDIPQEKVVRIYQEELAKIMTRRVEDMRHNRDGFPGSGMAPHMERPPE
DIRMALEAYHRELAKIQPGGNIPTLHNLPGMPPFPNLLALQQQAMQAQSQHINGSGAIQD
LSLPKEKNTKINGMTDSDKERSMDAEEAIRHAGSAFSLVRPKLEPGQQSTGSSASSPLGN
AILPPAITPNEDFSNSAAASPLQRMASITNSLISQPPNPPHHAPPQRSMKAVLPPITQQQ
FDLFNNLNTEEIVKRVKEALSQYSISQRLFGESVLGLSQGSVSDLLARPKPWHMLTQKGR
EPFIRMKMFLEDENAVHKLVASQYKIAPEKLMRTGNYSGAPSCPPNMNKPMPPTQKMISD
ATVLLSKMQQEQLLGSGHLGHLGQPTPLLLTPPGFPPHHAVTLPPQHHDNNNKERKPPPP
PQPHHQPPVMRGLHQHMSPSVYEMAALTQDLDTQTITTKIKEALLANNIGQKIFGEAVLG
LSQGSVSELLSKPKPWHMLSIKGREPFIRMQLWLSDAHNIDRLQALKNERREANKRRRSS
GPGQDNSSDTSSNDTSEFYHSSSPGPIPGAPSAKKQRVLFSEEQKEALRLAFALDPYPNM
PTIEFLAAELGLSTRTITNWFHNHRMRLKQQAPHGLPAEPPARDQASAPFDPVQFRLLLN
QRLLELQKERMGLAGVPLPYPPYFAANSNFAALIGRGLLPTDERVKDPAAGLDLSMPLKR
DPDGDDFEEDDVESNLGSEDSLDDDSKTEPKAASTPAGRSSRRKPAAPQWVNPDWQDEKP
RNPDEVIINGVCVMRADDYRREATETVRVEPSPAPRESSPAPQDTPRAPRTPRTPSPDVL
PEDKIKTEAEDDRWEY