New model in OGS2.0 | DPOGS201526  |
---|---|
Genomic Position | scaffold64:+ 9323-12070 |
See gene structure | |
CDS Length | 2748 |
Paired RNAseq reads   | 4175 |
Single RNAseq reads   | 11500 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA002705 (1e-86) |
Best Drosophila hit   | bunched, isoform A (1e-44) |
Best Human hit | ND |
Best NR hit (blastp)   | PREDICTED: similar to bunched CG5461-PA, isoform A [Apis mellifera] (4e-68) |
Best NR hit (blastx)   | PREDICTED: similar to bunched CG5461-PA, isoform A [Apis mellifera] (1e-64) |
GeneOntology terms    | GO:0005737 cytoplasm GO:0001751 compound eye photoreceptor cell differentiation GO:0008101 decapentaplegic receptor signaling pathway GO:0035282 segmentation GO:0042803 protein homodimerization activity GO:0046843 dorsal appendage formation GO:0007422 peripheral nervous system development GO:0005634 nucleus GO:0048749 compound eye development GO:0001709 cell fate determination GO:0003702 RNA polymerase II transcription factor activity GO:0048477 oogenesis GO:0007304 chorion-containing eggshell formation GO:0009996 negative regulation of cell fate specification GO:0030707 ovarian follicle cell development GO:0048102 autophagic cell death GO:0035071 salivary gland cell autophagic cell death GO:0003700 sequence-specific DNA binding transcription factor activity GO:0006355 regulation of transcription, DNA-dependent GO:0045746 negative regulation of Notch signaling pathway GO:0007297 ovarian follicle cell migration GO:0043066 negative regulation of apoptosis GO:0030307 positive regulation of cell growth GO:0008284 positive regulation of cell proliferation |
InterPro families   | IPR000580 TSC-22 / Dip / Bun |
Orthology group | MCL39686 |
Nucleotide sequence:
ATGGCTGACAATCTGATTCAAAAGTCACATAAAACTAGCGAAAAAAACAAGTATAACAAC
GTTGTTCATCGTACTACGAGTGAATCTCTTCGACTAAATGAGTCTGAGAAGGGAGTGACT
CACCCGACGAGTCTTCAATCCGCCCATAACCCAAGGAAAATATCTTCCTTTCAGATTACG
AGTGTGACTGTTGGGTCTCGGGTGAGCACTGATGCAGGGGAGGACTCCGCGGACGATCTA
GACGAATCTCACACCGATGACATCTCAAGGGTGACCGACATAGAGAATGAAACACCGAGC
TACTCTGAAGACACCTTTTCGAAAGACGACGTTTTCTACAACGCGTCAAGTGCATCGCTA
GGTTGTGCGCCCGTCATTCCGACCAGTTCGCAGTACGGACTCGCGATCGTCGGTCAGGAC
GCTAATACCAATCAAGTAGGAGGAGCTGTGCCAAATAGTAATAACACGGAAGTGAATGAC
ATGCACGTCAGTGTCACTAACGCCGGAACAGGCAGCATCATCAATCTTATAGGTAATTCT
AAGCCTCAAGAAGGCATGAAGGAGATCCAAGAACATGTCAGAAATGAGAGGTTTAAAGTT
GTTAAAATTGAAAGTACTGAACCTTTCCGCCGTGGCAGATGGATGTGCATGGACTATTTA
GATCACACTACTACACAACAAAATGCCCCAATAACTTTGAACAATAATTTGGATGTCACA
GAAACCAATGCTTTGCAAGCACCTGATAGTGGAGTTGTTATTAATGATAGTCAACATGAT
GATATGTGTAATGATTTGGCAAATAAAGTGCCTAATGATCAAGTGAGTGCTCCTATTCAA
CAAATGGATCAATGTGTGCAGAAACAATTTCCGATGGCATCTCCGGGGCAATCACTTACT
CAACCCATTAATATGGCCCAGCAACCTATGCCTGTCACTCAGTCAGTATCTGTACAATCT
CCACCATTAGAGATGCCACAGCAAATGATTAATCAGAATATGCAATCAAACCAACAAGTG
GCTCAACAACATCCTCAAAGTATGACACATATCACTATGCAGAATGCACCGCAGAGCCAT
CAACAGCCTACCCAACAGCAGCAGGTTCAACAGATTCCACAAAGTTTTCCACAGCATCAA
CTTCAACAAGTTATTGCACAGTCTCAAAGTATGGCAATGCAACAAATACCTATGCATCAA
CAAATGCCGCAGCAAATGCAGCAAATTCCCAATCAACAGATGCAACAAATGGGTCAACAT
TTGCCTCAAGCTCAGATAACTAATATGCAATTGCAGCAGCAGATTCCACAAATGCAAGGG
CTGTCTAATCAGGGGCAACTCTCTCAAATTCCGGGCCAGCAGCCACAAATACAACAAATG
CAGATGCAACCTATGCAGGGACAGCCAAATCCACAACAAATTCATCAGATGCAACAAGCC
CAGATGCCTAATATGGGTCAAGGACACCATCTTCAAGGTCAACAAGCTATGGGACCTCAG
CAAGCCCAACTGCAGCACTTGTCTAGTCAGGCACAAATTCAAGCACTGCAGAATCAACAG
ATGCCAAACCATATTCAGCAAATGCAATTACCAACTCAACTCGCTGCGACTAATCAACAA
ATATCACAAATGCAAACCCAAATGCATCAACTTCCATCACAGCCCCAACAATCAGTGGTA
CCAGGAATGCATCCACAAATGCAGCAGCAGGGAATGGCTGTGCCGCAGCCACAATACAAT
CAAGCGGTGAATCAGCAAAGTAACGCTCAGTCGATGATTAGTGGCAGTCTTCCATCATCA
CAGCAACCTATGGTTCAAACACAACACAATGTTCCTCAATACCATACCCAACAATCTCAG
ATGACACAGCAGGGCCAGACATTGCCACAAGAGGTGCTGACTAGTATTGTAACATCACAG
CAAGGTGCAACTCTTCCTACTAATTTGCAACCAATGGCCTCGCAGCCACAAGGATCAACT
TTACCAGCTAATCTCCAAAGTCTAACTGGGCAACAGCAGGCTCCCGTTCACACTATGGCC
CCACAACAGACTAATGTACCACAAGGCAATATACAGATTATGAGTGCTGCACCACAGAAT
ATGCAGATGACTTTAGATGGCAATCAGGCAGGAATGCAAATGCCAGCCTCTGATCCCATT
TACATGCAACCACCCAATGTTGGGCAACAGATGCCAGCACATATGACCCAACCGCAAACC
ATGTTGCAGCAACAGGTGAGTGGCCAACATACAACACAGATGGGTGGTGTGCAATATGTT
CCTAATCAGACAGTGCCTCAAGTGTCTATGGCACATCAGAACATTCCGATGTCTATGCAG
CAAAGCATGCCGATGGGAATGGGCGGAGGAGTTCATGCTAGTGTAGTGCAATCTCAGACG
AGCATGGGTCTGGGTTATGGGGGTGTGGTACCCGTGATGTCACAGACTAGCGTGGCGCCA
GTCGAGGCGACCGTGTCGGGGACTAACTCGCCAATAGTGTCAATGCCAGTAAATTCCACC
GCGTACGTGTCTAATGCTCCACAGCCGGGACATGACAGTCAGGGTTTCGGCAGCCCAGTG
AGTGCGGTGGTGTCTCACGCCATTAGTGGCTCGGTGGTGAGCAATGTGAATGTGAATGCG
TGTGATAGCAGTGCACCAGAATCGGTACCAGACGGAATGCAGGCTGGCGACACTGGCGAT
GGTAAAGAGGAGCCGCAACCAGCTGTGCAACCAGACGATGAAAGGTAA
Protein sequence:
MADNLIQKSHKTSEKNKYNNVVHRTTSESLRLNESEKGVTHPTSLQSAHNPRKISSFQIT
SVTVGSRVSTDAGEDSADDLDESHTDDISRVTDIENETPSYSEDTFSKDDVFYNASSASL
GCAPVIPTSSQYGLAIVGQDANTNQVGGAVPNSNNTEVNDMHVSVTNAGTGSIINLIGNS
KPQEGMKEIQEHVRNERFKVVKIESTEPFRRGRWMCMDYLDHTTTQQNAPITLNNNLDVT
ETNALQAPDSGVVINDSQHDDMCNDLANKVPNDQVSAPIQQMDQCVQKQFPMASPGQSLT
QPINMAQQPMPVTQSVSVQSPPLEMPQQMINQNMQSNQQVAQQHPQSMTHITMQNAPQSH
QQPTQQQQVQQIPQSFPQHQLQQVIAQSQSMAMQQIPMHQQMPQQMQQIPNQQMQQMGQH
LPQAQITNMQLQQQIPQMQGLSNQGQLSQIPGQQPQIQQMQMQPMQGQPNPQQIHQMQQA
QMPNMGQGHHLQGQQAMGPQQAQLQHLSSQAQIQALQNQQMPNHIQQMQLPTQLAATNQQ
ISQMQTQMHQLPSQPQQSVVPGMHPQMQQQGMAVPQPQYNQAVNQQSNAQSMISGSLPSS
QQPMVQTQHNVPQYHTQQSQMTQQGQTLPQEVLTSIVTSQQGATLPTNLQPMASQPQGST
LPANLQSLTGQQQAPVHTMAPQQTNVPQGNIQIMSAAPQNMQMTLDGNQAGMQMPASDPI
YMQPPNVGQQMPAHMTQPQTMLQQQVSGQHTTQMGGVQYVPNQTVPQVSMAHQNIPMSMQ
QSMPMGMGGGVHASVVQSQTSMGLGYGGVVPVMSQTSVAPVEATVSGTNSPIVSMPVNST
AYVSNAPQPGHDSQGFGSPVSAVVSHAISGSVVSNVNVNACDSSAPESVPDGMQAGDTGD
GKEEPQPAVQPDDER