DPGLEAN05848 in OGS1.0

New model in OGS2.0DPOGS209957 
Genomic Positionscaffold128:- 98769-102404
See gene structure
CDS Length3459
Paired RNAseq reads  1273
Single RNAseq reads  3129
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011274 (0.0)
Best Drosophila hit  CG7728 (0.0)
Best Human hitribosome biogenesis protein BMS1 homolog (9e-127)
Best NR hit (blastp)  GJ12043 [Drosophila virilis] (0.0)
Best NR hit (blastx)  GJ12043 [Drosophila virilis] (0.0)
GeneOntology terms


  
GO:0005634 nucleus
GO:0042254 ribosome biogenesis
GO:0003924 GTPase activity
GO:0005525 GTP binding
InterPro families
  
IPR007034 Ribosome biogenesis protein BMS1/TSR1, C-terminal
IPR012948 AARP2CN
Orthology groupMCL14006

Nucleotide sequence:

ATGGCCGATGAGGCTATTTTCGATAAAAAGAAGTCACACCGTGCAAAGCATGCCGGCAGA
AAAGCAGAGAAGAAAAAAAAGAAAAATCAAGTTGATCAGTCTAACCTAAGTGCGCGGCAA
AGAAATCCAAAAGCCTTCGCAATTAATTCAGCGGTACGAGCTGAGAGACAGTTTAGGCGT
CGCGAAGATGTTATATCTAAAAAGCAGCACATACCACTAGTGGATAAAACACCTTTGGAG
CCTCCACCTATCGTCGTCGCTGTAGTCGGGCCGCCGCGAGTCGGAAAAACTACAGTCATC
AATAACCTTATTAAAAGTTTTGTCAAAACAAATGTCACCAGTACCAATGGCCCCATAACG
ATAGTCACGTCTAAGAAAAGGCGTCTCACGTTAATAGAGTGTAACAACGATGTCAATTCT
ATGATAGATATCGCGAAGTGTGCGGATCTCGTCCTCTTGCTGTGTGATGCGAGCTTCGGG
TTTGAAATGGAAATATTCGAATTCCTGAATATATGCCAGGTACATGGGATGCCAAAGATT
ATGGGTGTGTTAACACATCTGGACATGATAAAGAATGCCAAGAAGTTAAAGATGACCAAG
AAAACATTGAAGCATCGTTTCTGGACCGAAGTGTATCCTGGCGCCAAATTATTTTATTTA
TCTGGTATCATTCATGGAGAGTACTTGAGAAATGAAATAAAAAATTTGTCCAGATTCATA
TCTGTTATGAAATTTAGGCCTCTGAGCTGGCGAATGACTCATGCTTATATTTTGGCTGAT
AGATTAGAAGATATCACTAGTCAGGATAGCATCAGGAAGGATCCAAAAATTAATAGAGAT
GTGGTCTTATATGGTTATGTGAGGGGTGTTCCTCTCATGAAGGATTCAATGGTGCATTTA
GCAGGTGTTGGTGATATGAAGATCAGTGAACTCTCATACCTGCCAGACCCATGTCCTCTA
CCCAGCAGTGAGAAAAAAAGACATTTGATGGAGAGGGAAAGACAAATCTATGCTCCCTTC
TCTGGTGTGGGAGGAATTGTCTATGACAAGGACGCAGTTTATATTGAACTCAAAGGATCT
CACTCACATAAACAAGAAGATGAAGAAACAAATGAGAAGCAAGCCTTACTTAAAAGTGTT
GTGGAGACAACAGAGACTGTTGATGAGCAAATGCAAGAATCTGGCTTCAGACTGTTCAGT
GGAGGAACTGTCATTTATCCAGATATGGTAAAAGATGATAAAGACTTGCAGAACAGAGAA
TCTCAGAATAATGAGTCCAGCTCTGATGAATCTGATTCTGGAGATGAAAATGATTCTGAC
AATGACAGCGGCATAGATCAAAGTGAGAAAGACACCACAGGAAAGAGTTCTAAACTGCCT
TGGGACAATAACTCAGAATCTGACGAAGATGATGAAGACGATGAAGATGACGGTAACGAT
GAAGCAAAAAACCTTCAAGTGGACAAACTGTACAGTCATGAAAATGATTCTGATGAAATT
GATGAAGAACATAAAAGTTCTGACGCAGAAGACTTTAGCATTAAGTGGAAGGAAAAATTA
GGTGAAAAAGCATCATTGGCCTATTTTGAAAGACAAAAGACTTCAAAAAATATCATGAAA
CTTGTTTATGGAGAATTTGAAATAGGAAATAAGAGAAAAGAAGATGAAGATGGGAATCGT
GATGAGGAGAGTGACGAAGAAGAAATAGGAGGTCTCTTCAAGAAAGTTACAAGTTCACAG
AAAAGAAAGCAGGCGGACAAGGAACAGTTGGATCTCGAGGAATGTTCGTATTTCTATAAC
AGAAATAAACCACATCTAAGAGATTGGACCACGGAAGAAAATAAAAAACTGATTATTAAT
TGTTTTGTGACTGGAAAGAGGGGCGCCGACGAGGACGCCGAGGAACTCTTGAAATTAGAC
GACGCCAGTGACGGGGAGGATGAAGTCTATGGAGACTTCGAAGATCTAGAGACGGGTGAA
AAGCATATTAGTAAACAGCAACCGGAAGATAAGACAGAAGAAGTAGGTTCAAAGCGCAAA
GCGGAGCCGACCAAATCGGATATTTTAGACAAAAAATTAAAATTAAAAGCTAAGTTTGAT
GCTGAGTATGACAACCCAGACGACCACAGAATCAAGGGGGATCATTCTTATTATGAAAGT
TTAAAAGCTGAAGCGCTCAAGCAGTCTCAACTAAACAAGTCAGTTTTCGAAAATTTAGAC
AACGGTTTGCGAGTCGAAGTCGAAGGCTTTCGACCCGGTCTGTACGTTCGTATGTTATTC
AAGAATATGCCCTCGGAGTTCGTCACAAACTTCGATTCAAGCTATCCTCTCCTCATAGGC
GCTTTAAATATGGCTGAGCAAAACATCGGCTTCGTCTCGTGTAAGGTTAAAAAACATAGA
TGGTACAAAAAGATATTGAAAACCAACGATCCTTTGATTATTTCCTTAGGGTGGCGGCGG
TTCCAGACGTTGCCGATCTATTCCAAAATAGAGGATGATTTAAAATGTAGGTATTTGAAA
TATACCCCGGAACACGTCACTTGTAACATGCACTTCTATGGTCCGATAACGCCACAGAAC
ACAGGGTTTCTGGCGTTACAGACGGTCAATAACAATTCAAACGAAATTAAACAGCTAGGG
TTCAGAATCGCCGCCACCGGCAGCGTCAATGAGATTAACAAGTCGACTCAAATCATGAAG
AAACTGAAATTGGTCGGGACTCCGTTAAAAATTTACAAGAAGACTGCCTTCATCAAGGAC
ATGTTTACGAGCACTCTCGAAGTTGCTAAGTTCGAAGGAGCGAAAGTTAAAACGGTGTCC
GGAATCAGAGGTCAAATCAAAAAAGCACTGAACAAACCCGAAGGTGCGTTCCGAGCGACG
TTCGAGGACAAGATCCTCATGAGCGATGTGATCTTCTGCAGGACATGGTTCAAAGTGGAC
GTATCAAAATTCTACGCACCCGTCGTGAATCTTCTTCTTCCTATCGGAGCGAAGAACGCC
TGGCAAGGCATGAAGACGAAAGGCCAATTGAAGAGAGAACGGAACATAAAGGTCGAAGCG
AACACGGATTCCATGTACACCGATATTGTGAGAGAACCGAAAGTGTTCAAGCCACTAGTT
ATACCCAAGGAACTACAAAAAGGTTTGCCTTACAAGTTAAAACCCAAAGAGAAGACGAGC
ACGCTGACCAAGAAGAACATTAAAGAAAAATTGTCCAATAGAGTCGCCGTCATAAAGAGT
CCGCACGAACAGAAAGTAGCGAATGTCATGAAGATGTTGAAGACCAACTTTGAAAAGAAG
AGGGAAGTGCAGAAGGCCTCGACCGCCGAGAGGCTGAAGAAGTTCAAGAAGCAGCAAGAG
GAAGAGGAGTGGAGGAAGATCAAGAGGCAGAAGGAACTCAAGAAGAAAATCTGCAGGCAT
CTGAGCAAAATATCCAACAAGAAACAGACACAGATGTGA

Protein sequence:

MADEAIFDKKKSHRAKHAGRKAEKKKKKNQVDQSNLSARQRNPKAFAINSAVRAERQFRR
REDVISKKQHIPLVDKTPLEPPPIVVAVVGPPRVGKTTVINNLIKSFVKTNVTSTNGPIT
IVTSKKRRLTLIECNNDVNSMIDIAKCADLVLLLCDASFGFEMEIFEFLNICQVHGMPKI
MGVLTHLDMIKNAKKLKMTKKTLKHRFWTEVYPGAKLFYLSGIIHGEYLRNEIKNLSRFI
SVMKFRPLSWRMTHAYILADRLEDITSQDSIRKDPKINRDVVLYGYVRGVPLMKDSMVHL
AGVGDMKISELSYLPDPCPLPSSEKKRHLMERERQIYAPFSGVGGIVYDKDAVYIELKGS
HSHKQEDEETNEKQALLKSVVETTETVDEQMQESGFRLFSGGTVIYPDMVKDDKDLQNRE
SQNNESSSDESDSGDENDSDNDSGIDQSEKDTTGKSSKLPWDNNSESDEDDEDDEDDGND
EAKNLQVDKLYSHENDSDEIDEEHKSSDAEDFSIKWKEKLGEKASLAYFERQKTSKNIMK
LVYGEFEIGNKRKEDEDGNRDEESDEEEIGGLFKKVTSSQKRKQADKEQLDLEECSYFYN
RNKPHLRDWTTEENKKLIINCFVTGKRGADEDAEELLKLDDASDGEDEVYGDFEDLETGE
KHISKQQPEDKTEEVGSKRKAEPTKSDILDKKLKLKAKFDAEYDNPDDHRIKGDHSYYES
LKAEALKQSQLNKSVFENLDNGLRVEVEGFRPGLYVRMLFKNMPSEFVTNFDSSYPLLIG
ALNMAEQNIGFVSCKVKKHRWYKKILKTNDPLIISLGWRRFQTLPIYSKIEDDLKCRYLK
YTPEHVTCNMHFYGPITPQNTGFLALQTVNNNSNEIKQLGFRIAATGSVNEINKSTQIMK
KLKLVGTPLKIYKKTAFIKDMFTSTLEVAKFEGAKVKTVSGIRGQIKKALNKPEGAFRAT
FEDKILMSDVIFCRTWFKVDVSKFYAPVVNLLLPIGAKNAWQGMKTKGQLKRERNIKVEA
NTDSMYTDIVREPKVFKPLVIPKELQKGLPYKLKPKEKTSTLTKKNIKEKLSNRVAVIKS
PHEQKVANVMKMLKTNFEKKREVQKASTAERLKKFKKQQEEEEWRKIKRQKELKKKICRH
LSKISNKKQTQM