New model in OGS2.0 | DPOGS209957  |
---|---|
Genomic Position | scaffold128:- 98769-102404 |
See gene structure | |
CDS Length | 3459 |
Paired RNAseq reads   | 1273 |
Single RNAseq reads   | 3129 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA011274 (0.0) |
Best Drosophila hit   | CG7728 (0.0) |
Best Human hit | ribosome biogenesis protein BMS1 homolog (9e-127) |
Best NR hit (blastp)   | GJ12043 [Drosophila virilis] (0.0) |
Best NR hit (blastx)   | GJ12043 [Drosophila virilis] (0.0) |
GeneOntology terms    | GO:0005634 nucleus GO:0042254 ribosome biogenesis GO:0003924 GTPase activity GO:0005525 GTP binding |
InterPro families    | IPR007034 Ribosome biogenesis protein BMS1/TSR1, C-terminal IPR012948 AARP2CN |
Orthology group | MCL14006 |
Nucleotide sequence:
ATGGCCGATGAGGCTATTTTCGATAAAAAGAAGTCACACCGTGCAAAGCATGCCGGCAGA
AAAGCAGAGAAGAAAAAAAAGAAAAATCAAGTTGATCAGTCTAACCTAAGTGCGCGGCAA
AGAAATCCAAAAGCCTTCGCAATTAATTCAGCGGTACGAGCTGAGAGACAGTTTAGGCGT
CGCGAAGATGTTATATCTAAAAAGCAGCACATACCACTAGTGGATAAAACACCTTTGGAG
CCTCCACCTATCGTCGTCGCTGTAGTCGGGCCGCCGCGAGTCGGAAAAACTACAGTCATC
AATAACCTTATTAAAAGTTTTGTCAAAACAAATGTCACCAGTACCAATGGCCCCATAACG
ATAGTCACGTCTAAGAAAAGGCGTCTCACGTTAATAGAGTGTAACAACGATGTCAATTCT
ATGATAGATATCGCGAAGTGTGCGGATCTCGTCCTCTTGCTGTGTGATGCGAGCTTCGGG
TTTGAAATGGAAATATTCGAATTCCTGAATATATGCCAGGTACATGGGATGCCAAAGATT
ATGGGTGTGTTAACACATCTGGACATGATAAAGAATGCCAAGAAGTTAAAGATGACCAAG
AAAACATTGAAGCATCGTTTCTGGACCGAAGTGTATCCTGGCGCCAAATTATTTTATTTA
TCTGGTATCATTCATGGAGAGTACTTGAGAAATGAAATAAAAAATTTGTCCAGATTCATA
TCTGTTATGAAATTTAGGCCTCTGAGCTGGCGAATGACTCATGCTTATATTTTGGCTGAT
AGATTAGAAGATATCACTAGTCAGGATAGCATCAGGAAGGATCCAAAAATTAATAGAGAT
GTGGTCTTATATGGTTATGTGAGGGGTGTTCCTCTCATGAAGGATTCAATGGTGCATTTA
GCAGGTGTTGGTGATATGAAGATCAGTGAACTCTCATACCTGCCAGACCCATGTCCTCTA
CCCAGCAGTGAGAAAAAAAGACATTTGATGGAGAGGGAAAGACAAATCTATGCTCCCTTC
TCTGGTGTGGGAGGAATTGTCTATGACAAGGACGCAGTTTATATTGAACTCAAAGGATCT
CACTCACATAAACAAGAAGATGAAGAAACAAATGAGAAGCAAGCCTTACTTAAAAGTGTT
GTGGAGACAACAGAGACTGTTGATGAGCAAATGCAAGAATCTGGCTTCAGACTGTTCAGT
GGAGGAACTGTCATTTATCCAGATATGGTAAAAGATGATAAAGACTTGCAGAACAGAGAA
TCTCAGAATAATGAGTCCAGCTCTGATGAATCTGATTCTGGAGATGAAAATGATTCTGAC
AATGACAGCGGCATAGATCAAAGTGAGAAAGACACCACAGGAAAGAGTTCTAAACTGCCT
TGGGACAATAACTCAGAATCTGACGAAGATGATGAAGACGATGAAGATGACGGTAACGAT
GAAGCAAAAAACCTTCAAGTGGACAAACTGTACAGTCATGAAAATGATTCTGATGAAATT
GATGAAGAACATAAAAGTTCTGACGCAGAAGACTTTAGCATTAAGTGGAAGGAAAAATTA
GGTGAAAAAGCATCATTGGCCTATTTTGAAAGACAAAAGACTTCAAAAAATATCATGAAA
CTTGTTTATGGAGAATTTGAAATAGGAAATAAGAGAAAAGAAGATGAAGATGGGAATCGT
GATGAGGAGAGTGACGAAGAAGAAATAGGAGGTCTCTTCAAGAAAGTTACAAGTTCACAG
AAAAGAAAGCAGGCGGACAAGGAACAGTTGGATCTCGAGGAATGTTCGTATTTCTATAAC
AGAAATAAACCACATCTAAGAGATTGGACCACGGAAGAAAATAAAAAACTGATTATTAAT
TGTTTTGTGACTGGAAAGAGGGGCGCCGACGAGGACGCCGAGGAACTCTTGAAATTAGAC
GACGCCAGTGACGGGGAGGATGAAGTCTATGGAGACTTCGAAGATCTAGAGACGGGTGAA
AAGCATATTAGTAAACAGCAACCGGAAGATAAGACAGAAGAAGTAGGTTCAAAGCGCAAA
GCGGAGCCGACCAAATCGGATATTTTAGACAAAAAATTAAAATTAAAAGCTAAGTTTGAT
GCTGAGTATGACAACCCAGACGACCACAGAATCAAGGGGGATCATTCTTATTATGAAAGT
TTAAAAGCTGAAGCGCTCAAGCAGTCTCAACTAAACAAGTCAGTTTTCGAAAATTTAGAC
AACGGTTTGCGAGTCGAAGTCGAAGGCTTTCGACCCGGTCTGTACGTTCGTATGTTATTC
AAGAATATGCCCTCGGAGTTCGTCACAAACTTCGATTCAAGCTATCCTCTCCTCATAGGC
GCTTTAAATATGGCTGAGCAAAACATCGGCTTCGTCTCGTGTAAGGTTAAAAAACATAGA
TGGTACAAAAAGATATTGAAAACCAACGATCCTTTGATTATTTCCTTAGGGTGGCGGCGG
TTCCAGACGTTGCCGATCTATTCCAAAATAGAGGATGATTTAAAATGTAGGTATTTGAAA
TATACCCCGGAACACGTCACTTGTAACATGCACTTCTATGGTCCGATAACGCCACAGAAC
ACAGGGTTTCTGGCGTTACAGACGGTCAATAACAATTCAAACGAAATTAAACAGCTAGGG
TTCAGAATCGCCGCCACCGGCAGCGTCAATGAGATTAACAAGTCGACTCAAATCATGAAG
AAACTGAAATTGGTCGGGACTCCGTTAAAAATTTACAAGAAGACTGCCTTCATCAAGGAC
ATGTTTACGAGCACTCTCGAAGTTGCTAAGTTCGAAGGAGCGAAAGTTAAAACGGTGTCC
GGAATCAGAGGTCAAATCAAAAAAGCACTGAACAAACCCGAAGGTGCGTTCCGAGCGACG
TTCGAGGACAAGATCCTCATGAGCGATGTGATCTTCTGCAGGACATGGTTCAAAGTGGAC
GTATCAAAATTCTACGCACCCGTCGTGAATCTTCTTCTTCCTATCGGAGCGAAGAACGCC
TGGCAAGGCATGAAGACGAAAGGCCAATTGAAGAGAGAACGGAACATAAAGGTCGAAGCG
AACACGGATTCCATGTACACCGATATTGTGAGAGAACCGAAAGTGTTCAAGCCACTAGTT
ATACCCAAGGAACTACAAAAAGGTTTGCCTTACAAGTTAAAACCCAAAGAGAAGACGAGC
ACGCTGACCAAGAAGAACATTAAAGAAAAATTGTCCAATAGAGTCGCCGTCATAAAGAGT
CCGCACGAACAGAAAGTAGCGAATGTCATGAAGATGTTGAAGACCAACTTTGAAAAGAAG
AGGGAAGTGCAGAAGGCCTCGACCGCCGAGAGGCTGAAGAAGTTCAAGAAGCAGCAAGAG
GAAGAGGAGTGGAGGAAGATCAAGAGGCAGAAGGAACTCAAGAAGAAAATCTGCAGGCAT
CTGAGCAAAATATCCAACAAGAAACAGACACAGATGTGA
Protein sequence:
MADEAIFDKKKSHRAKHAGRKAEKKKKKNQVDQSNLSARQRNPKAFAINSAVRAERQFRR
REDVISKKQHIPLVDKTPLEPPPIVVAVVGPPRVGKTTVINNLIKSFVKTNVTSTNGPIT
IVTSKKRRLTLIECNNDVNSMIDIAKCADLVLLLCDASFGFEMEIFEFLNICQVHGMPKI
MGVLTHLDMIKNAKKLKMTKKTLKHRFWTEVYPGAKLFYLSGIIHGEYLRNEIKNLSRFI
SVMKFRPLSWRMTHAYILADRLEDITSQDSIRKDPKINRDVVLYGYVRGVPLMKDSMVHL
AGVGDMKISELSYLPDPCPLPSSEKKRHLMERERQIYAPFSGVGGIVYDKDAVYIELKGS
HSHKQEDEETNEKQALLKSVVETTETVDEQMQESGFRLFSGGTVIYPDMVKDDKDLQNRE
SQNNESSSDESDSGDENDSDNDSGIDQSEKDTTGKSSKLPWDNNSESDEDDEDDEDDGND
EAKNLQVDKLYSHENDSDEIDEEHKSSDAEDFSIKWKEKLGEKASLAYFERQKTSKNIMK
LVYGEFEIGNKRKEDEDGNRDEESDEEEIGGLFKKVTSSQKRKQADKEQLDLEECSYFYN
RNKPHLRDWTTEENKKLIINCFVTGKRGADEDAEELLKLDDASDGEDEVYGDFEDLETGE
KHISKQQPEDKTEEVGSKRKAEPTKSDILDKKLKLKAKFDAEYDNPDDHRIKGDHSYYES
LKAEALKQSQLNKSVFENLDNGLRVEVEGFRPGLYVRMLFKNMPSEFVTNFDSSYPLLIG
ALNMAEQNIGFVSCKVKKHRWYKKILKTNDPLIISLGWRRFQTLPIYSKIEDDLKCRYLK
YTPEHVTCNMHFYGPITPQNTGFLALQTVNNNSNEIKQLGFRIAATGSVNEINKSTQIMK
KLKLVGTPLKIYKKTAFIKDMFTSTLEVAKFEGAKVKTVSGIRGQIKKALNKPEGAFRAT
FEDKILMSDVIFCRTWFKVDVSKFYAPVVNLLLPIGAKNAWQGMKTKGQLKRERNIKVEA
NTDSMYTDIVREPKVFKPLVIPKELQKGLPYKLKPKEKTSTLTKKNIKEKLSNRVAVIKS
PHEQKVANVMKMLKTNFEKKREVQKASTAERLKKFKKQQEEEEWRKIKRQKELKKKICRH
LSKISNKKQTQM