DPGLEAN19653 in OGS1.0

New model in OGS2.0DPOGS207262 
Genomic Positionscaffold76:+ 78859-81836
See gene structure
CDS Length1275
Paired RNAseq reads  822
Single RNAseq reads  1994
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012127 (0.0)
Best Drosophila hit  lethal (2) 09851 (4e-151)
Best Human hitglutamate-rich WD repeat-containing protein 1 (2e-121)
Best NR hit (blastp)  PREDICTED: similar to GA11814-PA [Tribolium castaneum] (3e-169)
Best NR hit (blastx)  PREDICTED: similar to GA11814-PA [Tribolium castaneum] (1e-162)
GeneOntology terms
  
GO:0005634 nucleus
GO:0005730 nucleolus
InterPro families







  
IPR020472 G-protein beta WD-40 repeat
IPR019782 WD40 repeat 2
IPR017986 WD40-repeat-containing domain
IPR019781 WD40 repeat, subgroup
IPR022052 Histone-binding protein RBBP4
IPR001680 WD40 repeat
IPR015943 WD40/YVTN repeat-like-containing domain
IPR019775 WD40 repeat, conserved site
IPR011046 WD40 repeat-like-containing domain
Orthology groupMCL12066

Nucleotide sequence:

ATGGAAGAAGGCGACCAAACCGAAGAAGATTCACGGCCTAAGACCTATCTTCCCGACCAG
CCTTTACAAGAAGATGAGCATTTAATTTGTGATCAATCGGCTTATGTTATGTTGCACCAA
GCACAAACTGGTGCTCCATGCCTTAGTTTTGATATAATCACAGACAACCTAGGCAATGAT
CGTCAACAATTTCCCATGACAGCTTACTTGGTTGCCGGTACACAAGCCTCGAGCGCTCAC
TTAAATAATGTTTTAGTTGTAAAAATGTCAAATTTACACACCACCGCAAAACCAGAGGAT
GAAGAAGAATCCGATGAGGACGATGATGACGAAGAGGAAGATGAAGAAAAGAAACCACAA
ATGACTTTTGCATTTATTAAACACCAGGGCTGTGTGAATAGAATAAGAACCACAAACTAT
AAAAACTCAGTTTTGGCAGCAACATGGTCTGAATTAGGAAGGGTGGATGTGTGGAATATT
ACTCAGCAATTACAAGCAGTTGATGAACCAGCATTACTTGAGAGATACAATCTTGACACC
GTGTCTAATCCAGTGAAACCATTATATTCATTCAATGGACACCAACAAGAAGGATTTGGC
ATGGACTGGTGTCCAACTGAGCCAGGAGTATTAGCAACAGGTGATTGCAGAAGAGACATT
CATATATGGAAGCCGAATGAGGCTGGTACTTGGACAGTGGACCAAAGACCCTTAGTTGGA
CACACAAGTTCAGTGGAAGATATCCAATGGTCACCTAATGAAAAAAATGTCCTGGCTACC
TGCTCAGTTGATAGAACTATCAGAATATGGGACACAAGAGCACCACCACACAAAGCGTGT
ATGTTGACAGCTGAAAATGCTCACGAGAGAGATATTAATGTTATATCTTGGAATAGAAAA
GAACCATTTATAGCTAGCGGTGGCGATGATGGTTTTCTCCACATATGGGATCTCCGACAA
TTCACTCGCAGTACGCCTGTTGGTACTTTCAAACATCATACTGCGCCGATCACGTCAGTT
GAGTGGCACTGGACAGAGCCCAGTGTGCTTGCTTCAGCAGGAGAGGATAACCAAGTCGCT
CTGTGGGACCTTGCTGTTGAAAGAGATGATGAAGAAGTAGTGGAAGAAGAGTTAAAGAAT
TTACCACCACAATTGCTTTTTATTCATCAAGGACAAACAGATATTAAGGAACTTCATTGG
CACAAGCAAATTCCTGGCGTCATAGTGACAACCGCACATACAGGATTCAATATATTTAAA
ACTATAAGTGTATAA

Protein sequence:

MEEGDQTEEDSRPKTYLPDQPLQEDEHLICDQSAYVMLHQAQTGAPCLSFDIITDNLGND
RQQFPMTAYLVAGTQASSAHLNNVLVVKMSNLHTTAKPEDEEESDEDDDDEEEDEEKKPQ
MTFAFIKHQGCVNRIRTTNYKNSVLAATWSELGRVDVWNITQQLQAVDEPALLERYNLDT
VSNPVKPLYSFNGHQQEGFGMDWCPTEPGVLATGDCRRDIHIWKPNEAGTWTVDQRPLVG
HTSSVEDIQWSPNEKNVLATCSVDRTIRIWDTRAPPHKACMLTAENAHERDINVISWNRK
EPFIASGGDDGFLHIWDLRQFTRSTPVGTFKHHTAPITSVEWHWTEPSVLASAGEDNQVA
LWDLAVERDDEEVVEEELKNLPPQLLFIHQGQTDIKELHWHKQIPGVIVTTAHTGFNIFK
TISV