DPGLEAN22170 in OGS1.0

New model in OGS2.0DPOGS208846 
Genomic Positionscaffold2102:- 11276-18833
See gene structure
CDS Length2925
Paired RNAseq reads  2757
Single RNAseq reads  6048
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA007954 (0.0)
Best Drosophila hit  CG4849 (0.0)
Best Human hit116 kDa U5 small nuclear ribonucleoprotein component isoform a (0.0)
Best NR hit (blastp)  CG4849 [Drosophila melanogaster] (0.0)
Best NR hit (blastx)  116 kDa U5 small nuclear ribonucleoprotein component, putative [Pediculus humanus corporis] (0.0)
GeneOntology terms








  
GO:0003746 translation elongation factor activity
GO:0006414 translational elongation
GO:0005829 cytosol
GO:0030532 small nuclear ribonucleoprotein complex
GO:0000398 nuclear mRNA splicing, via spliceosome
GO:0005682 U5 snRNP
GO:0003924 GTPase activity
GO:0005525 GTP binding
GO:0071013 catalytic step 2 spliceosome
GO:0071011 precatalytic spliceosome
InterPro families






  
IPR000795 Protein synthesis factor, GTP-binding
IPR020568 Ribosomal protein S5 domain 2-type fold
IPR009000 Translation elongation/initiation factor/Ribosomal, beta-barrel
IPR009022 Elongation factor G/III/V
IPR005517 Translation elongation factor EFG/EF2, domain IV
IPR000640 Translation elongation factor EFG/EF2, C-terminal
IPR004161 Translation elongation factor EFTu/EF1A, domain 2
IPR014721 Ribosomal protein S5 domain 2-type fold, subgroup
Orthology groupMCL10253

Nucleotide sequence:

ATGGACGGGGATCTTTATGACGAATTCGGTAACTACATCGGCCCCGATTTAGAATCCGAC
TCTGATGATGAACAGAGTGTGTATGGCCAGGATAACCGCGATGGTGATGAAGATGCTATG
GAGGAAGACGAGGACGCTGACGCTGAGCCCGAAGTAGCTCCGATGTCCGTTGTATTGCAT
GAGGACAAGAGATATTATCCACAAGCCGTTGAAGTTTATGGTCCAGATGTGGAGACGGTG
GTACAAGAAGAGGATACACAGGCCCTGGACAAACCACTTGTAGAGCCGGTTAAACACAAG
AAGTTTCAAGTTCAAGAGCAACATTTGCCGGAGACCACTTATGACATGGAGTATTTAGCG
GACATGCTAGACAACACCAACCTCATGAGAAATGTTACTTTGATGGGACATTTACACAAT
GGTAAAACATCTTTTGTGGATTGCCTTATCCGTCAAACACATCCCGGTACCATCAACAAT
GAGACAACAATCCCCATGAGGTATACGGACACGCTGTTTGTTGAACAGGAGAGGGGAGTT
TCCATCAAATCAATGCCTGTCACACTACTCCTCAAGGATATCAAGGGAAAATCACATTTA
CTTAACATTATGGATACTCCGGGACATGTGAACTTCAGCGATGAAGTCACCGCTGCATTG
AGGATATCTGATGGTGCGGTGTTATTTGTGGACGCGGCGGAGGGTATAATGTTGAATACC
GAGCGGTTGTTACGGCACGCCGTCCAAGAACGAGTGCCGCTGACGCTGTGCATTAACAAG
ATAGACCGCCTGATACTGGAGTTGAAGCTGCCTCCCGCGGACGCGTACTACAAGCTGAGG
CACATCATAGACGAACTGAACACGATGCTTGAGACCAACCAGCCTCAGGACAACGCCGAT
GAGCCGCCCATTGTGTTTTCACCCTTACTAGGCAACGTTTGCTTCGCGTCGTCGCTGTAC
GACGTGTGTTTCACGTTGGAGTCGTTCGCTGCTATGTACGCGCGGTCTCACGACGGCTTT
CGGGCCGGCGACATGTCTCGCTGGCTGTGGGGCGACATGTACTTCAATAATAAGACGCGG
CGCTTCACTAAGAAACAGCCGCACGCCTCAGCTCAGAGGAGCTTCGTGGAGTTCATACTG
GAACCGCTGTATAAGATATTCGCTCAGGTGGTGGGTGATGTAGACGACACCCTGCTCACA
GTACTGGCCGAGCTGGGCATCAAGCTCACCAAGCAGGAAGCTAAACTCAACGTGAGACCG
CTCTTGAGACTGGTCTGCAGCCGATTCTTTGGTGATTTTTGCGGTTTCGTGGACATGTTG
GTCCGGCACGTGCCCTCCCCGCTGGACGCCGCTCCCCGCAAGGTGCAGCACTGCTACCGC
GGCGCCAGCGGCCCGCTCTACGACGACATGATGACCTGCGACCAGTCCGGCAGACTCGTA
GCTCACACCACCAAGATGTACCCCACAGACGACTGCACTTTCTTCCTTGTGCTGGCTCGC
ATAATGTCCGGCACACTGTATGCGGGTCAGACTGTGCGTGTGCTGGGGGAGAACTACTCC
TCGCAGGATGAAGAGGACTCGAGGATTATGAATGTTGGTCGTCTATGGATCTATGAAGCC
AGATACAAGGTGGAACTAAACAGAGTTCCGGCTGGTTGCTGGGCTCTCATAGAAGGTATA
GACCAGCCCATAGTGAAGACTTGCACTGTGGTCTCCGCTGATGAGGAAGAGGAGCTTCAC
ACCTTCAAGCCGTTGAGATTCAACACACAAGCCGTTGTCAAGATAGCTGTTGAGCCAGTC
AACCCATCCGAGCTGCCTAAAATGTTGGACGGGCTTAGAAAGGTGAACAAGTCTTATCCG
GTGTTGTCAACTCGTGTGGAGGAAAGCGGAGAACACGTTGTGCTGGGGACAGGGGAGTTG
TATCTAGACTGTGTGATGCATGACCTTAGAGATATGTACTCGGAGATCGATATTAAAGTT
GCAGATCCAGTAGTTTCCTTTTGTGAGACGGTGGTAGAGACTTCATCTCTCAAGTGTTTT
GCGGAGACTCCCAACAAGAGGAACAAGCTGACCATGATCGCTGAACCTCTAGAGAGAGGA
CTCGCTGAAGATATTGAGGCGGGAGCTGTTTGTGTTACCTGGGACAGAAGGAGACTGGGA
GAGTTCTTCCAGACCAAATACGACTGGGATCTGTTGGCGGCTCGGTCCATCTGGGCGTTC
GGTCCAGACGCCGCGGGTCCTAATATACTGGTCGACGACACGCTGCCCTCTGAGGTCGAC
AAACATCTGCTGGCCTCTGTCAAGGACAGTATTGTGCAAGGCTTCCAGTGGGGCACTCGC
GAGGGTCCTCTGTGCGAGGAACCCATCAGGAACGTGAAGTTTAAGATCCTGGACGCCGTG
ATCGCCAACGAGCCGCTCCATCGCGGCGGCGGGCAGATCATCCCCACCGCTAGACGGGTG
GCGTATTCTGCGTTCCTCATGGCGACTCCCCGCCTGATGGAGCCCTACCTGTTCGTGGAG
GTACAGGCGCCCGCCGACTGCGTGTCCGCCGTCTACACCGTGCTCGCTAAGAGGAGAGGT
CACGTGACCCAGGACGCCCCCGTCCCAGGATCGCCTCTGTACACGATCAAGGCCTTCGTC
CCCGCGATCGACTCGTTCGGCTTCGAGACGGACCTGAGGACGCACACGCAGGGCCAGGCC
TTCTGCCTGCAGGTGTTCCATCACTGGCAGATCGTGCCCGGCGATCCTCTGGATAAGAGC
ATCGTTATCAGACCTCTAGAACCTCAGCCGGCGACGCACCTCGCCCGTGAGTTCATGATA
AAGACGAGGAGACGGAAGGGTCTCAGCGAGGACGTGTCCATCAATAAATTCTTCGACGAC
CCCATGTTGTTGGAGCTAGCGAGACAAGATGTACAATTCAATTAA

Protein sequence:

MDGDLYDEFGNYIGPDLESDSDDEQSVYGQDNRDGDEDAMEEDEDADAEPEVAPMSVVLH
EDKRYYPQAVEVYGPDVETVVQEEDTQALDKPLVEPVKHKKFQVQEQHLPETTYDMEYLA
DMLDNTNLMRNVTLMGHLHNGKTSFVDCLIRQTHPGTINNETTIPMRYTDTLFVEQERGV
SIKSMPVTLLLKDIKGKSHLLNIMDTPGHVNFSDEVTAALRISDGAVLFVDAAEGIMLNT
ERLLRHAVQERVPLTLCINKIDRLILELKLPPADAYYKLRHIIDELNTMLETNQPQDNAD
EPPIVFSPLLGNVCFASSLYDVCFTLESFAAMYARSHDGFRAGDMSRWLWGDMYFNNKTR
RFTKKQPHASAQRSFVEFILEPLYKIFAQVVGDVDDTLLTVLAELGIKLTKQEAKLNVRP
LLRLVCSRFFGDFCGFVDMLVRHVPSPLDAAPRKVQHCYRGASGPLYDDMMTCDQSGRLV
AHTTKMYPTDDCTFFLVLARIMSGTLYAGQTVRVLGENYSSQDEEDSRIMNVGRLWIYEA
RYKVELNRVPAGCWALIEGIDQPIVKTCTVVSADEEEELHTFKPLRFNTQAVVKIAVEPV
NPSELPKMLDGLRKVNKSYPVLSTRVEESGEHVVLGTGELYLDCVMHDLRDMYSEIDIKV
ADPVVSFCETVVETSSLKCFAETPNKRNKLTMIAEPLERGLAEDIEAGAVCVTWDRRRLG
EFFQTKYDWDLLAARSIWAFGPDAAGPNILVDDTLPSEVDKHLLASVKDSIVQGFQWGTR
EGPLCEEPIRNVKFKILDAVIANEPLHRGGGQIIPTARRVAYSAFLMATPRLMEPYLFVE
VQAPADCVSAVYTVLAKRRGHVTQDAPVPGSPLYTIKAFVPAIDSFGFETDLRTHTQGQA
FCLQVFHHWQIVPGDPLDKSIVIRPLEPQPATHLAREFMIKTRRRKGLSEDVSINKFFDD
PMLLELARQDVQFN