New model in OGS2.0 | DPOGS208846  |
---|---|
Genomic Position | scaffold2102:- 11276-18833 |
CDS Length | 2925 |
Paired RNAseq reads   | 2757 |
Single RNAseq reads   | 6048 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA007954 (0.0) |
Best Drosophila hit   | CG4849 (0.0) |
Best Human hit | 116 kDa U5 small nuclear ribonucleoprotein component isoform a (0.0) |
Best NR hit (blastp)   | CG4849 [Drosophila melanogaster] (0.0) |
Best NR hit (blastx)   | 116 kDa U5 small nuclear ribonucleoprotein component, putative [Pediculus humanus corporis] (0.0) |
GeneOntology terms    | GO:0003746 translation elongation factor activity; GO:0006414 translational elongation; GO:0005829 cytosol; GO:0030532 small nuclear ribonucleoprotein complex; GO:0000398 nuclear mRNA splicing, via spliceosome; GO:0005682 U5 snRNP; GO:0003924 GTPase activity; GO:0005525 GTP binding; GO:0071013 catalytic step 2 spliceosome; GO:0071011 precatalytic spliceosome |
InterPro families    | IPR000795 Protein synthesis factor, GTP-binding; IPR020568 Ribosomal protein S5 domain 2-type fold; IPR009000 Translation elongation/initiation factor/Ribosomal, beta-barrel; IPR009022 Elongation factor G/III/V; IPR005517 Translation elongation factor EFG/EF2, domain IV; IPR000640 Translation elongation factor EFG/EF2, C-terminal; IPR004161 Translation elongation factor EFTu/EF1A, domain 2; IPR014721 Ribosomal protein S5 domain 2-type fold, subgroup |
Orthology group | MCL10253 |
Nucleotide sequence:
ATGGACGGGGATCTTTATGACGAATTCGGTAACTACATCGGCCCCGATTTAGAATCCGAC
TCTGATGATGAACAGAGTGTGTATGGCCAGGATAACCGCGATGGTGATGAAGATGCTATG
GAGGAAGACGAGGACGCTGACGCTGAGCCCGAAGTAGCTCCGATGTCCGTTGTATTGCAT
GAGGACAAGAGATATTATCCACAAGCCGTTGAAGTTTATGGTCCAGATGTGGAGACGGTG
GTACAAGAAGAGGATACACAGGCCCTGGACAAACCACTTGTAGAGCCGGTTAAACACAAG
AAGTTTCAAGTTCAAGAGCAACATTTGCCGGAGACCACTTATGACATGGAGTATTTAGCG
GACATGCTAGACAACACCAACCTCATGAGAAATGTTACTTTGATGGGACATTTACACAAT
GGTAAAACATCTTTTGTGGATTGCCTTATCCGTCAAACACATCCCGGTACCATCAACAAT
GAGACAACAATCCCCATGAGGTATACGGACACGCTGTTTGTTGAACAGGAGAGGGGAGTT
TCCATCAAATCAATGCCTGTCACACTACTCCTCAAGGATATCAAGGGAAAATCACATTTA
CTTAACATTATGGATACTCCGGGACATGTGAACTTCAGCGATGAAGTCACCGCTGCATTG
AGGATATCTGATGGTGCGGTGTTATTTGTGGACGCGGCGGAGGGTATAATGTTGAATACC
GAGCGGTTGTTACGGCACGCCGTCCAAGAACGAGTGCCGCTGACGCTGTGCATTAACAAG
ATAGACCGCCTGATACTGGAGTTGAAGCTGCCTCCCGCGGACGCGTACTACAAGCTGAGG
CACATCATAGACGAACTGAACACGATGCTTGAGACCAACCAGCCTCAGGACAACGCCGAT
GAGCCGCCCATTGTGTTTTCACCCTTACTAGGCAACGTTTGCTTCGCGTCGTCGCTGTAC
GACGTGTGTTTCACGTTGGAGTCGTTCGCTGCTATGTACGCGCGGTCTCACGACGGCTTT
CGGGCCGGCGACATGTCTCGCTGGCTGTGGGGCGACATGTACTTCAATAATAAGACGCGG
CGCTTCACTAAGAAACAGCCGCACGCCTCAGCTCAGAGGAGCTTCGTGGAGTTCATACTG
GAACCGCTGTATAAGATATTCGCTCAGGTGGTGGGTGATGTAGACGACACCCTGCTCACA
GTACTGGCCGAGCTGGGCATCAAGCTCACCAAGCAGGAAGCTAAACTCAACGTGAGACCG
CTCTTGAGACTGGTCTGCAGCCGATTCTTTGGTGATTTTTGCGGTTTCGTGGACATGTTG
GTCCGGCACGTGCCCTCCCCGCTGGACGCCGCTCCCCGCAAGGTGCAGCACTGCTACCGC
GGCGCCAGCGGCCCGCTCTACGACGACATGATGACCTGCGACCAGTCCGGCAGACTCGTA
GCTCACACCACCAAGATGTACCCCACAGACGACTGCACTTTCTTCCTTGTGCTGGCTCGC
ATAATGTCCGGCACACTGTATGCGGGTCAGACTGTGCGTGTGCTGGGGGAGAACTACTCC
TCGCAGGATGAAGAGGACTCGAGGATTATGAATGTTGGTCGTCTATGGATCTATGAAGCC
AGATACAAGGTGGAACTAAACAGAGTTCCGGCTGGTTGCTGGGCTCTCATAGAAGGTATA
GACCAGCCCATAGTGAAGACTTGCACTGTGGTCTCCGCTGATGAGGAAGAGGAGCTTCAC
ACCTTCAAGCCGTTGAGATTCAACACACAAGCCGTTGTCAAGATAGCTGTTGAGCCAGTC
AACCCATCCGAGCTGCCTAAAATGTTGGACGGGCTTAGAAAGGTGAACAAGTCTTATCCG
GTGTTGTCAACTCGTGTGGAGGAAAGCGGAGAACACGTTGTGCTGGGGACAGGGGAGTTG
TATCTAGACTGTGTGATGCATGACCTTAGAGATATGTACTCGGAGATCGATATTAAAGTT
GCAGATCCAGTAGTTTCCTTTTGTGAGACGGTGGTAGAGACTTCATCTCTCAAGTGTTTT
GCGGAGACTCCCAACAAGAGGAACAAGCTGACCATGATCGCTGAACCTCTAGAGAGAGGA
CTCGCTGAAGATATTGAGGCGGGAGCTGTTTGTGTTACCTGGGACAGAAGGAGACTGGGA
GAGTTCTTCCAGACCAAATACGACTGGGATCTGTTGGCGGCTCGGTCCATCTGGGCGTTC
GGTCCAGACGCCGCGGGTCCTAATATACTGGTCGACGACACGCTGCCCTCTGAGGTCGAC
AAACATCTGCTGGCCTCTGTCAAGGACAGTATTGTGCAAGGCTTCCAGTGGGGCACTCGC
GAGGGTCCTCTGTGCGAGGAACCCATCAGGAACGTGAAGTTTAAGATCCTGGACGCCGTG
ATCGCCAACGAGCCGCTCCATCGCGGCGGCGGGCAGATCATCCCCACCGCTAGACGGGTG
GCGTATTCTGCGTTCCTCATGGCGACTCCCCGCCTGATGGAGCCCTACCTGTTCGTGGAG
GTACAGGCGCCCGCCGACTGCGTGTCCGCCGTCTACACCGTGCTCGCTAAGAGGAGAGGT
CACGTGACCCAGGACGCCCCCGTCCCAGGATCGCCTCTGTACACGATCAAGGCCTTCGTC
CCCGCGATCGACTCGTTCGGCTTCGAGACGGACCTGAGGACGCACACGCAGGGCCAGGCC
TTCTGCCTGCAGGTGTTCCATCACTGGCAGATCGTGCCCGGCGATCCTCTGGATAAGAGC
ATCGTTATCAGACCTCTAGAACCTCAGCCGGCGACGCACCTCGCCCGTGAGTTCATGATA
AAGACGAGGAGACGGAAGGGTCTCAGCGAGGACGTGTCCATCAATAAATTCTTCGACGAC
CCCATGTTGTTGGAGCTAGCGAGACAAGATGTACAATTCAATTAA
Protein sequence:
MDGDLYDEFGNYIGPDLESDSDDEQSVYGQDNRDGDEDAMEEDEDADAEPEVAPMSVVLH
EDKRYYPQAVEVYGPDVETVVQEEDTQALDKPLVEPVKHKKFQVQEQHLPETTYDMEYLA
DMLDNTNLMRNVTLMGHLHNGKTSFVDCLIRQTHPGTINNETTIPMRYTDTLFVEQERGV
SIKSMPVTLLLKDIKGKSHLLNIMDTPGHVNFSDEVTAALRISDGAVLFVDAAEGIMLNT
ERLLRHAVQERVPLTLCINKIDRLILELKLPPADAYYKLRHIIDELNTMLETNQPQDNAD
EPPIVFSPLLGNVCFASSLYDVCFTLESFAAMYARSHDGFRAGDMSRWLWGDMYFNNKTR
RFTKKQPHASAQRSFVEFILEPLYKIFAQVVGDVDDTLLTVLAELGIKLTKQEAKLNVRP
LLRLVCSRFFGDFCGFVDMLVRHVPSPLDAAPRKVQHCYRGASGPLYDDMMTCDQSGRLV
AHTTKMYPTDDCTFFLVLARIMSGTLYAGQTVRVLGENYSSQDEEDSRIMNVGRLWIYEA
RYKVELNRVPAGCWALIEGIDQPIVKTCTVVSADEEEELHTFKPLRFNTQAVVKIAVEPV
NPSELPKMLDGLRKVNKSYPVLSTRVEESGEHVVLGTGELYLDCVMHDLRDMYSEIDIKV
ADPVVSFCETVVETSSLKCFAETPNKRNKLTMIAEPLERGLAEDIEAGAVCVTWDRRRLG
EFFQTKYDWDLLAARSIWAFGPDAAGPNILVDDTLPSEVDKHLLASVKDSIVQGFQWGTR
EGPLCEEPIRNVKFKILDAVIANEPLHRGGGQIIPTARRVAYSAFLMATPRLMEPYLFVE
VQAPADCVSAVYTVLAKRRGHVTQDAPVPGSPLYTIKAFVPAIDSFGFETDLRTHTQGQA
FCLQVFHHWQIVPGDPLDKSIVIRPLEPQPATHLAREFMIKTRRRKGLSEDVSINKFFDD
PMLLELARQDVQFN
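
Consistency check (illustrative sketch, not part of the original record): a CDS length of 2925 nt corresponds to 975 codons, that is, 974 residues plus a stop codon. The Python sketch below assumes the standard genetic code (NCBI translation table 1) and translates only the first and last lines of the nucleotide sequence above, so that they can be compared with the start and end of the protein sequence. The names `translate` and `expected_protein_length` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a nucleotide string in frame 1; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a complete CDS that ends in a single stop codon."""
    return cds_length // 3 - 1


# First and last lines of the nucleotide sequence listed in this record.
first_line = "ATGGACGGGGATCTTTATGACGAATTCGGTAACTACATCGGCCCCGATTTAGAATCCGAC"
last_line = "CCCATGTTGTTGGAGCTAGCGAGACAAGATGTACAATTCAATTAA"

print(translate(first_line))          # compare with the start of the protein
print(translate(last_line))           # compare with the end of the protein
print(expected_protein_length(2925))  # residues implied by the CDS length
```

Both lines stay in frame because every preceding nucleotide line is 60 nt long, a multiple of three. To check the full record, the complete nucleotide sequence can be passed to `translate` in the same way.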