DPGLEAN13699 in OGS1.0

New model in OGS2.0  DPOGS201526
Genomic Position  scaffold64:+ 9323-12070
CDS Length  2748
Paired RNAseq reads  4175
Single RNAseq reads  11500
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA002705 (1e-86)
Best Drosophila hit  bunched, isoform A (1e-44)
Best Human hit  ND
Best NR hit (blastp)  PREDICTED: similar to bunched CG5461-PA, isoform A [Apis mellifera] (4e-68)
Best NR hit (blastx)  PREDICTED: similar to bunched CG5461-PA, isoform A [Apis mellifera] (1e-64)
GeneOntology terms
GO:0005737 cytoplasm
GO:0001751 compound eye photoreceptor cell differentiation
GO:0008101 decapentaplegic receptor signaling pathway
GO:0035282 segmentation
GO:0042803 protein homodimerization activity
GO:0046843 dorsal appendage formation
GO:0007422 peripheral nervous system development
GO:0005634 nucleus
GO:0048749 compound eye development
GO:0001709 cell fate determination
GO:0003702 RNA polymerase II transcription factor activity
GO:0048477 oogenesis
GO:0007304 chorion-containing eggshell formation
GO:0009996 negative regulation of cell fate specification
GO:0030707 ovarian follicle cell development
GO:0048102 autophagic cell death
GO:0035071 salivary gland cell autophagic cell death
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0006355 regulation of transcription, DNA-dependent
GO:0045746 negative regulation of Notch signaling pathway
GO:0007297 ovarian follicle cell migration
GO:0043066 negative regulation of apoptosis
GO:0030307 positive regulation of cell growth
GO:0008284 positive regulation of cell proliferation
InterPro families  IPR000580 TSC-22 / Dip / Bun
Orthology group  MCL39686

Nucleotide sequence:

ATGGCTGACAATCTGATTCAAAAGTCACATAAAACTAGCGAAAAAAACAAGTATAACAAC
GTTGTTCATCGTACTACGAGTGAATCTCTTCGACTAAATGAGTCTGAGAAGGGAGTGACT
CACCCGACGAGTCTTCAATCCGCCCATAACCCAAGGAAAATATCTTCCTTTCAGATTACG
AGTGTGACTGTTGGGTCTCGGGTGAGCACTGATGCAGGGGAGGACTCCGCGGACGATCTA
GACGAATCTCACACCGATGACATCTCAAGGGTGACCGACATAGAGAATGAAACACCGAGC
TACTCTGAAGACACCTTTTCGAAAGACGACGTTTTCTACAACGCGTCAAGTGCATCGCTA
GGTTGTGCGCCCGTCATTCCGACCAGTTCGCAGTACGGACTCGCGATCGTCGGTCAGGAC
GCTAATACCAATCAAGTAGGAGGAGCTGTGCCAAATAGTAATAACACGGAAGTGAATGAC
ATGCACGTCAGTGTCACTAACGCCGGAACAGGCAGCATCATCAATCTTATAGGTAATTCT
AAGCCTCAAGAAGGCATGAAGGAGATCCAAGAACATGTCAGAAATGAGAGGTTTAAAGTT
GTTAAAATTGAAAGTACTGAACCTTTCCGCCGTGGCAGATGGATGTGCATGGACTATTTA
GATCACACTACTACACAACAAAATGCCCCAATAACTTTGAACAATAATTTGGATGTCACA
GAAACCAATGCTTTGCAAGCACCTGATAGTGGAGTTGTTATTAATGATAGTCAACATGAT
GATATGTGTAATGATTTGGCAAATAAAGTGCCTAATGATCAAGTGAGTGCTCCTATTCAA
CAAATGGATCAATGTGTGCAGAAACAATTTCCGATGGCATCTCCGGGGCAATCACTTACT
CAACCCATTAATATGGCCCAGCAACCTATGCCTGTCACTCAGTCAGTATCTGTACAATCT
CCACCATTAGAGATGCCACAGCAAATGATTAATCAGAATATGCAATCAAACCAACAAGTG
GCTCAACAACATCCTCAAAGTATGACACATATCACTATGCAGAATGCACCGCAGAGCCAT
CAACAGCCTACCCAACAGCAGCAGGTTCAACAGATTCCACAAAGTTTTCCACAGCATCAA
CTTCAACAAGTTATTGCACAGTCTCAAAGTATGGCAATGCAACAAATACCTATGCATCAA
CAAATGCCGCAGCAAATGCAGCAAATTCCCAATCAACAGATGCAACAAATGGGTCAACAT
TTGCCTCAAGCTCAGATAACTAATATGCAATTGCAGCAGCAGATTCCACAAATGCAAGGG
CTGTCTAATCAGGGGCAACTCTCTCAAATTCCGGGCCAGCAGCCACAAATACAACAAATG
CAGATGCAACCTATGCAGGGACAGCCAAATCCACAACAAATTCATCAGATGCAACAAGCC
CAGATGCCTAATATGGGTCAAGGACACCATCTTCAAGGTCAACAAGCTATGGGACCTCAG
CAAGCCCAACTGCAGCACTTGTCTAGTCAGGCACAAATTCAAGCACTGCAGAATCAACAG
ATGCCAAACCATATTCAGCAAATGCAATTACCAACTCAACTCGCTGCGACTAATCAACAA
ATATCACAAATGCAAACCCAAATGCATCAACTTCCATCACAGCCCCAACAATCAGTGGTA
CCAGGAATGCATCCACAAATGCAGCAGCAGGGAATGGCTGTGCCGCAGCCACAATACAAT
CAAGCGGTGAATCAGCAAAGTAACGCTCAGTCGATGATTAGTGGCAGTCTTCCATCATCA
CAGCAACCTATGGTTCAAACACAACACAATGTTCCTCAATACCATACCCAACAATCTCAG
ATGACACAGCAGGGCCAGACATTGCCACAAGAGGTGCTGACTAGTATTGTAACATCACAG
CAAGGTGCAACTCTTCCTACTAATTTGCAACCAATGGCCTCGCAGCCACAAGGATCAACT
TTACCAGCTAATCTCCAAAGTCTAACTGGGCAACAGCAGGCTCCCGTTCACACTATGGCC
CCACAACAGACTAATGTACCACAAGGCAATATACAGATTATGAGTGCTGCACCACAGAAT
ATGCAGATGACTTTAGATGGCAATCAGGCAGGAATGCAAATGCCAGCCTCTGATCCCATT
TACATGCAACCACCCAATGTTGGGCAACAGATGCCAGCACATATGACCCAACCGCAAACC
ATGTTGCAGCAACAGGTGAGTGGCCAACATACAACACAGATGGGTGGTGTGCAATATGTT
CCTAATCAGACAGTGCCTCAAGTGTCTATGGCACATCAGAACATTCCGATGTCTATGCAG
CAAAGCATGCCGATGGGAATGGGCGGAGGAGTTCATGCTAGTGTAGTGCAATCTCAGACG
AGCATGGGTCTGGGTTATGGGGGTGTGGTACCCGTGATGTCACAGACTAGCGTGGCGCCA
GTCGAGGCGACCGTGTCGGGGACTAACTCGCCAATAGTGTCAATGCCAGTAAATTCCACC
GCGTACGTGTCTAATGCTCCACAGCCGGGACATGACAGTCAGGGTTTCGGCAGCCCAGTG
AGTGCGGTGGTGTCTCACGCCATTAGTGGCTCGGTGGTGAGCAATGTGAATGTGAATGCG
TGTGATAGCAGTGCACCAGAATCGGTACCAGACGGAATGCAGGCTGGCGACACTGGCGAT
GGTAAAGAGGAGCCGCAACCAGCTGTGCAACCAGACGATGAAAGGTAA

Protein sequence:

MADNLIQKSHKTSEKNKYNNVVHRTTSESLRLNESEKGVTHPTSLQSAHNPRKISSFQIT
SVTVGSRVSTDAGEDSADDLDESHTDDISRVTDIENETPSYSEDTFSKDDVFYNASSASL
GCAPVIPTSSQYGLAIVGQDANTNQVGGAVPNSNNTEVNDMHVSVTNAGTGSIINLIGNS
KPQEGMKEIQEHVRNERFKVVKIESTEPFRRGRWMCMDYLDHTTTQQNAPITLNNNLDVT
ETNALQAPDSGVVINDSQHDDMCNDLANKVPNDQVSAPIQQMDQCVQKQFPMASPGQSLT
QPINMAQQPMPVTQSVSVQSPPLEMPQQMINQNMQSNQQVAQQHPQSMTHITMQNAPQSH
QQPTQQQQVQQIPQSFPQHQLQQVIAQSQSMAMQQIPMHQQMPQQMQQIPNQQMQQMGQH
LPQAQITNMQLQQQIPQMQGLSNQGQLSQIPGQQPQIQQMQMQPMQGQPNPQQIHQMQQA
QMPNMGQGHHLQGQQAMGPQQAQLQHLSSQAQIQALQNQQMPNHIQQMQLPTQLAATNQQ
ISQMQTQMHQLPSQPQQSVVPGMHPQMQQQGMAVPQPQYNQAVNQQSNAQSMISGSLPSS
QQPMVQTQHNVPQYHTQQSQMTQQGQTLPQEVLTSIVTSQQGATLPTNLQPMASQPQGST
LPANLQSLTGQQQAPVHTMAPQQTNVPQGNIQIMSAAPQNMQMTLDGNQAGMQMPASDPI
YMQPPNVGQQMPAHMTQPQTMLQQQVSGQHTTQMGGVQYVPNQTVPQVSMAHQNIPMSMQ
QSMPMGMGGGVHASVVQSQTSMGLGYGGVVPVMSQTSVAPVEATVSGTNSPIVSMPVNST
AYVSNAPQPGHDSQGFGSPVSAVVSHAISGSVVSNVNVNACDSSAPESVPDGMQAGDTGD
GKEEPQPAVQPDDER
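
Consistency check (illustrative):

The fields in this record can be cross-checked against each other. The sketch below is not part of the original record. It assumes that the genomic coordinates are 1-based and inclusive, and that the standard nuclear genetic code (NCBI translation table 1) applies. The names translate, CODON_TABLE, first_cds_line and last_cds_line are hypothetical and introduced only for this example. Only the first and last lines of the nucleotide sequence are translated, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated over T, C, A, G.
# Assumption: this table is appropriate for the gene model in this record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Values copied from the record above.
span_start, span_end = 9323, 12070
cds_length = 2748
first_cds_line = "ATGGCTGACAATCTGATTCAAAAGTCACATAAAACTAGCGAAAAAAACAAGTATAACAAC"
last_cds_line = "GGTAAAGAGGAGCCGCAACCAGCTGTGCAACCAGACGATGAAAGGTAA"

# Assumption: coordinates are 1-based and inclusive.
genomic_span = span_end - span_start + 1
# One codon is the stop codon, so it does not contribute a residue.
expected_protein_length = cds_length // 3 - 1

print(genomic_span == cds_length)   # True
print(expected_protein_length)      # 915
print(translate(first_cds_line))    # MADNLIQKSHKTSEKNKYNN
print(translate(last_cds_line))     # GKEEPQPAVQPDDER
```

Under these assumptions the genomic span equals the CDS length, which is consistent with a model that has no introns, and the translated fragments match the start and end of the protein sequence listed above. The last line can be translated on its own because the 45 preceding lines hold 2700 bases, a multiple of three, so the reading frame is preserved.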