DPGLEAN09669 in OGS1.0

New model in OGS2.0DPOGS213986 
Genomic Positionscaffold295:+ 28395-32319
See gene structure
CDS Length2475
Paired RNAseq reads  4088
Single RNAseq reads  9644
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008256 (0.0)
Best Drosophila hit  CG5033, isoform A (0.0)
Best Human hitribosome biogenesis protein BOP1 (6e-166)
Best NR hit (blastp)  PREDICTED: similar to ribosome biogenesis protein bop1 (block of proliferation 1 protein) [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to ribosome biogenesis protein bop1 (block of proliferation 1 protein) [Tribolium castaneum] (0.0)
GeneOntology terms


  
GO:0043021 ribonucleoprotein binding
GO:0042254 ribosome biogenesis
GO:0005730 nucleolus
GO:0006364 rRNA processing
InterPro families






  
IPR012953 BOP1, N-terminal
IPR019781 WD40 repeat, subgroup
IPR019782 WD40 repeat 2
IPR017986 WD40-repeat-containing domain
IPR015943 WD40/YVTN repeat-like-containing domain
IPR019775 WD40 repeat, conserved site
IPR011046 WD40 repeat-like-containing domain
IPR001680 WD40 repeat
Orthology groupMCL11658

Nucleotide sequence:

ATGCCTCCTTACAAAGCAGGGTTGTTGAAAAGAAAAAATGAGTCCCAAGAGTCAAAAAAC
AGTAATTCAGAAGACAAGAACACCACCAGTGCTGATGAGGGAGAGGAGTTAGTGAAAGGC
TTCCTGGATGATGAGGACGATGCTGCAGACTCTGAGCCTGAGGAGGACGAACACTCAGAC
GGAGAAAGAGATGAAGTTTTGTTTGAAGAACATGATTCAACATTAGGTGATGATGAGCAG
GAAAATGAAACATCAGAAGCTGAATCTTCTGACCTGACCGATTCAGAGGAAGTTGAAGAA
ACTGATGAAGATGAAGGTGACAGTAGTGAGAATGAGGAACAGGTGAGCAGTGACAGTGGA
GCAGACACTGGCATGGCGCCGATAACAACAGGCAGCAGTGATGAGGCAGTTAGGAAGATA
GGAGGCAAGAAGAAAGTAAAACAAGTGACCAAGAAGACACAGGAAAAGAAGAAAAACAAT
AAAACAAGTATTGAAATAATGTCTGCTAAAATACAAGAAGACAAAGTGACAGCACCCGTA
CAACAAAAAGGTGATGAGTACGAGTCTGGAGACACATCGGACGAGGAGGACAGAACTAAC
ACCGTGGGTGACATTCCCATGTGGTGGTACAATGAGTACCCGCATATCGGATACACCTTG
GACGGCGAGAGGATCATCAAGCCACCACAGAGGGACCAGATTGATGAATTCCTTAAGAAG
TGTTCGGATCCTGATTTCTGGCGCACCGTAAAGGACCCACAGACGGGTCAGGACGTTGTG
CTGGCTCCCAGCGACTTGAGATTGCTGGAGAGACTCAGAGCCTCACGCCTGCCTTCCGAC
ACTCACGACGACTATGAGCCGTGGGTGGAGTGGTTCAGTCGCGAGGTGTTGGCGACGCCT
CTGCGCGCCTTCCCAGAACACAAGAGATCGTTCTTGCCGTCTCGCTCGGAGCAGCTCGCC
GTCTCCAAGCTTGTGCACGCCCTTAAGATGGGCTGGACGAAGACCAGAAAGGAGATGGCC
GCAGAGAGGAGGAAGAAGAAGGAGCGTGCCTTCTACGACTTATGGTCGTCGTCGGCCACA
TCTGCGACGGGCGGAGCGCGCGTGTTGCCCGCTCCGAAACGAGCGCTGCCGGGACACGCC
GAGAGTTACAACCCGCCGGCGGAGTATCTGCTGGATAACAAGGAGATGAAGGAGTGGGAC
TCGCTGGCGGAGACCCCTTGGAAACGAAAGTACACCTTCCTACCGCAGAAACACAGCTGC
TTGAGAGAGGTCGAGGCGTTCCCGCGGTTCATTCGGGAGAGATTCCTGCGTTGCTTAGAC
CTGTATCTGGCACCCAGAGCGATCCGTATGAGGCTGACCATTAACGCGGAGGACCTGGTG
CCCAAGCTGCCGTCCCCGCGAGACCTACAGCCCTTCCCTACGGCCGAGGTGCTCCAGTTC
CGAGGACATACGGACGTCGTCCGCAGCTGCGACTTCGACCCCTCGGGACAGTACGTCGTG
TCGGGAAGCGAGGACGGCACTCTCAAAGTGTGGGAGAGCAGTACTGGTCGCTGTGTGCGC
ACGGTGTCCCTGGGAGCGGCCGTGACGCGCGTGTCATGGAGTCCTGCAGCCGCGCTGTGC
CTGGTGGCGGCGGCGGCCGGACCTCGCGCGCTGCTACTAAACGTGCAGGCGGGCGCCGGC
GCGCACCGCGCCGCGCGTGCCACCGATCGCCTGCTGGCCGAGGCGCCTCCGCAGCACGAT
ACTCGCATGGACGAGCGCACGTCGTCGTGTGTGGAGTGGGAGGAGGTGGACGCGACGCAG
TGGGCGCACGGCATCAGGATCTCCATCAAACACTTCAAGCAGCTGGCGCACCTGTCGTGG
CACGCCCGAGGCGACTACCTGGCGGCCACGGTGACGGAGGGCGCATCTCGCGCGGTGGTG
GTGCACCAGCTGTCCCGACGCCGCTCGCAGCTCCCATTCCGACGAGCGTGCGGGCTGGTG
CAGGCGGCCGTGTTCCACCCACGCCGGCCGCTGCTGCTCGTGGCCACGCAGCGCGCCGTC
CGCATCTACGACCTCGTGAAGCAGGAACTGTCCCGCAAGCTGCGACCGGGCGCCCAGTGG
CTCAGCTCGCTGGCGGTGCATCCGGGCGGAGACCACCTGCTGCTCGGCTCCTACGACCGC
AAGCTGGTGTGGTTCGACCTGGAGCTGTCGGCTCGCCCCTACCGCACGCTGCGGGTGCAC
GGGGCGGCGGTACGCGCGGTGGCCTTCCACCCGCGGTACCCGCTGTTCGCGTCGGGTGGC
GACGACGCCTACCTGGTGGTGTGTCACGGCACGGTGTACAACGACCTGCTCACGAACCCG
CTGCTGGTGCCGCTGAAGCAGTTGGCGGCTCCCGCGGCGGCCGGCGCGCTCCGAGTGCTC
GACCTGCGCTGGCATCCTCACCAACCCTGGCTGCTGGCCGCGGGCGCCGACGGCACGCTG
CGCCTGTACTCGTGA

Protein sequence:

MPPYKAGLLKRKNESQESKNSNSEDKNTTSADEGEELVKGFLDDEDDAADSEPEEDEHSD
GERDEVLFEEHDSTLGDDEQENETSEAESSDLTDSEEVEETDEDEGDSSENEEQVSSDSG
ADTGMAPITTGSSDEAVRKIGGKKKVKQVTKKTQEKKKNNKTSIEIMSAKIQEDKVTAPV
QQKGDEYESGDTSDEEDRTNTVGDIPMWWYNEYPHIGYTLDGERIIKPPQRDQIDEFLKK
CSDPDFWRTVKDPQTGQDVVLAPSDLRLLERLRASRLPSDTHDDYEPWVEWFSREVLATP
LRAFPEHKRSFLPSRSEQLAVSKLVHALKMGWTKTRKEMAAERRKKKERAFYDLWSSSAT
SATGGARVLPAPKRALPGHAESYNPPAEYLLDNKEMKEWDSLAETPWKRKYTFLPQKHSC
LREVEAFPRFIRERFLRCLDLYLAPRAIRMRLTINAEDLVPKLPSPRDLQPFPTAEVLQF
RGHTDVVRSCDFDPSGQYVVSGSEDGTLKVWESSTGRCVRTVSLGAAVTRVSWSPAAALC
LVAAAAGPRALLLNVQAGAGAHRAARATDRLLAEAPPQHDTRMDERTSSCVEWEEVDATQ
WAHGIRISIKHFKQLAHLSWHARGDYLAATVTEGASRAVVVHQLSRRRSQLPFRRACGLV
QAAVFHPRRPLLLVATQRAVRIYDLVKQELSRKLRPGAQWLSSLAVHPGGDHLLLGSYDR
KLVWFDLELSARPYRTLRVHGAAVRAVAFHPRYPLFASGGDDAYLVVCHGTVYNDLLTNP
LLVPLKQLAAPAAAGALRVLDLRWHPHQPWLLAAGADGTLRLYS