Monarch geneset OGS2.0

DPOGS208846
TranscriptDPOGS208846-TA2925 bp
ProteinDPOGS208846-PA974 aa
Genomic positionDPSCF300036 + 910339-917896
RNAseq coverage551x (Rank: top 23%)
Annotation
HeliconiusHMEL0154130.095.49% 
BombyxBGIBMGA007954-TA0.092.42% 
DrosophilaCG4849-PA0.076.43% 
EBI UniRef50UniRef50_Q9VAX80.076.43%CG4849 n=18 Tax=Eukaryota RepID=Q9VAX8_DROME
NCBI RefSeqNP_651605.10.076.43%CG4849 [Drosophila melanogaster]
NCBI nr blastpgi|213577430.076.43%CG4849 [Drosophila melanogaster]
NCBI nr blastxgi|1953917280.076.22%GJ22766 [Drosophila virilis]
Group
Gene OntologyGO:00055251.3e-40GTP binding
GO:00039241.3e-40GTPase activity
KEGG pathwaydme:Dmel_CG48490.0 
 K12852 (EFTUD2)maps-> Spliceosome
InterPro domain[675-833] IPR0147211.3e-70Ribosomal protein S5 domain 2-type fold, subgroup
[665-833] IPR0205684.9e-66Ribosomal protein S5 domain 2-type fold
[129-306] IPR0007951.3e-40Protein synthesis factor, GTP-binding
[834-948] IPR0006408.2e-36Translation elongation factor EFG/EF2, C-terminal
[446-582] IPR0090005.1e-30Translation elongation/initiation factor/Ribosomal, beta-barrel
[834-921] IPR0090222.3e-28Elongation factor G/III/V
[713-828] IPR0055172.6e-26Translation elongation factor EFG/EF2, domain IV
[130-287] IPR0052252.6e-12Small GTP-binding protein domain
[496-570] IPR0041617.8e-09Translation elongation factor EFTu/EF1A, domain 2
Orthology groupMCL10187 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208846-TA
ATGGACGGGGATCTTTATGACGAATTCGGTAACTACATCGGCCCCGATTTAGAATCCGACTCTGATGATGAACAGAGTGTGTATGGCCAGGATAACCGCGATGGTGATGAAGATGCTATGGAGGAAGACGAGGACGCTGACGCTGAGCCCGAAGTAGCTCCGATGTCCGTTGTATTGCATGAGGACAAGAGATATTATCCACAAGCCGTTGAAGTTTATGGTCCAGATGTGGAGACGGTGGTACAAGAAGAGGATACACAGGCCCTGGACAAACCACTTGTAGAGCCGGTTAAACACAAGAAGTTTCAAGTTCAAGAGCAACATTTGCCGGAGACCACTTATGACATGGAGTATTTAGCGGACATGCTAGACAACACCAACCTCATGAGAAATGTTACTTTGATGGGACATTTACACAATGGTAAAACATCTTTTGTGGATTGCCTTATCCGTCAAACACATCCCGGTACCATCAACAATGAGACAACAATCCCCATGAGGTATACGGACACGCTGTTTGTTGAACAGGAGAGGGGAGTTTCCATCAAATCAATGCCTGTCACACTACTCCTCAAGGATATCAAGGGAAAATCACATTTACTTAACATTATGGATACTCCGGGACATGTGAACTTCAGCGATGAAGTCACCGCTGCATTGAGGATATCTGATGGTGCGGTGTTATTTGTGGACGCGGCGGAGGGTATAATGTTGAATACCGAGCGGTTGTTACGGCACGCCGTCCAAGAACGAGTGCCGCTGACGCTGTGCATTAACAAGATAGACCGCCTGATACTGGAGTTGAAGCTGCCTCCCGCGGACGCGTACTACAAGCTGAGGCACATCATAGACGAACTGAACACGATGCTTGAGACCAACCAGCCTCAGGACAACGCCGATGAGCCGCCCATTGTGTTTTCACCCTTACTAGGCAACGTTTGCTTCGCGTCGTCGCTGTACGACGTGTGTTTCACGTTGGAGTCGTTCGCTGCTATGTACGCGCGGTCTCACGACGGCTTTCGGGCCGGCGACATGTCTCGCTGGCTGTGGGGCGACATGTACTTCAATAATAAGACGCGGCGCTTCACTAAGAAACAGCCGCACGCCTCAGCTCAGAGGAGCTTCGTGGAGTTCATACTGGAACCGCTGTATAAGATATTCGCTCAGGTGGTGGGTGATGTAGACGACACCCTGCTCACAGTACTGGCCGAGCTGGGCATCAAGCTCACCAAGCAGGAAGCTAAACTCAACGTGAGACCGCTCTTGAGACTGGTCTGCAGCCGATTCTTTGGTGATTTTTGCGGTTTCGTGGACATGTTGGTCCGGCACGTGCCCTCCCCGCTGGACGCCGCTCCCCGCAAGGTGCAGCACTGCTACCGCGGCGCCAGCGGCCCGCTCTACGACGACATGATGACCTGCGACCAGTCCGGCAGACTCGTAGCTCACACCACCAAGATGTACCCCACAGACGACTGCACTTTCTTCCTTGTGCTGGCTCGCATAATGTCCGGCACACTGTATGCGGGTCAGACTGTGCGTGTGCTGGGGGAGAACTACTCCTCGCAGGATGAAGAGGACTCGAGGATTATGAATGTTGGTCGTCTATGGATCTATGAAGCCAGATACAAGGTGGAACTAAACAGAGTTCCGGCTGGTTGCTGGGCTCTCATAGAAGGTATAGACCAGCCCATAGTGAAGACTTGCACTGTGGTCTCCGCTGATGAGGAAGAGGAGCTTCACACCTTCAAGCCGTTGAGATTCAACACACAAGCCGTTGTCAAGATAGCTGTTGAGCCAGTCAACCCATCCGAGCTGCCTAAAATGTTGGACGGGCTTAGAAAGGTGAACAAGTCTTATCCGGTGTTGTCAACTCGTGTGGAGGAAAGCGGAGAACACGTTGTGCTGGGGACAGGGGAGTTGTATCTAGACTGTGTGATGCATGACCTTAGAGATATGTACTCGGAGATCGATATTAAAGTTGCAGATCCAGTAGTTTCCTTTTGTGAGACGGTGGTAGAGACTTCATCTCTCAAGTGTTTTGCGGAGACTCCCAACAAGAGGAACAAGCTGACCATGATCGCTGAACCTCTAGAGAGAGGACTCGCTGAAGATATTGAGGCGGGAGCTGTTTGTGTTACCTGGGACAGAAGGAGACTGGGAGAGTTCTTCCAGACCAAATACGACTGGGATCTGTTGGCGGCTCGGTCCATCTGGGCGTTCGGTCCAGACGCCGCGGGTCCTAATATACTGGTCGACGACACGCTGCCCTCTGAGGTCGACAAACATCTGCTGGCCTCTGTCAAGGACAGTATTGTGCAAGGCTTCCAGTGGGGCACTCGCGAGGGTCCTCTGTGCGAGGAACCCATCAGGAACGTGAAGTTTAAGATCCTGGACGCCGTGATCGCCAACGAGCCGCTCCATCGCGGCGGCGGGCAGATCATCCCCACCGCTAGACGGGTGGCGTATTCTGCGTTCCTCATGGCGACTCCCCGCCTGATGGAGCCCTACCTGTTCGTGGAGGTACAGGCGCCCGCCGACTGCGTGTCCGCCGTCTACACCGTGCTCGCTAAGAGGAGAGGTCACGTGACCCAGGACGCCCCCGTCCCAGGATCGCCTCTGTACACGATCAAGGCCTTCGTCCCCGCGATCGACTCGTTCGGCTTCGAGACGGACCTGAGGACGCACACGCAGGGCCAGGCCTTCTGCCTGCAGGTGTTCCATCACTGGCAGATCGTGCCCGGCGATCCTCTGGATAAGAGCATCGTTATCAGACCTCTAGAACCTCAGCCGGCGACGCACCTCGCCCGTGAGTTCATGATAAAGACGAGGAGACGGAAGGGTCTCAGCGAGGACGTGTCCATCAATAAATTCTTCGACGACCCCATGTTGTTGGAGCTAGCGAGACAAGATGTACAATTCAATTAA

Protein sequence:

>DPOGS208846-PA
MDGDLYDEFGNYIGPDLESDSDDEQSVYGQDNRDGDEDAMEEDEDADAEPEVAPMSVVLHEDKRYYPQAVEVYGPDVETVVQEEDTQALDKPLVEPVKHKKFQVQEQHLPETTYDMEYLADMLDNTNLMRNVTLMGHLHNGKTSFVDCLIRQTHPGTINNETTIPMRYTDTLFVEQERGVSIKSMPVTLLLKDIKGKSHLLNIMDTPGHVNFSDEVTAALRISDGAVLFVDAAEGIMLNTERLLRHAVQERVPLTLCINKIDRLILELKLPPADAYYKLRHIIDELNTMLETNQPQDNADEPPIVFSPLLGNVCFASSLYDVCFTLESFAAMYARSHDGFRAGDMSRWLWGDMYFNNKTRRFTKKQPHASAQRSFVEFILEPLYKIFAQVVGDVDDTLLTVLAELGIKLTKQEAKLNVRPLLRLVCSRFFGDFCGFVDMLVRHVPSPLDAAPRKVQHCYRGASGPLYDDMMTCDQSGRLVAHTTKMYPTDDCTFFLVLARIMSGTLYAGQTVRVLGENYSSQDEEDSRIMNVGRLWIYEARYKVELNRVPAGCWALIEGIDQPIVKTCTVVSADEEEELHTFKPLRFNTQAVVKIAVEPVNPSELPKMLDGLRKVNKSYPVLSTRVEESGEHVVLGTGELYLDCVMHDLRDMYSEIDIKVADPVVSFCETVVETSSLKCFAETPNKRNKLTMIAEPLERGLAEDIEAGAVCVTWDRRRLGEFFQTKYDWDLLAARSIWAFGPDAAGPNILVDDTLPSEVDKHLLASVKDSIVQGFQWGTREGPLCEEPIRNVKFKILDAVIANEPLHRGGGQIIPTARRVAYSAFLMATPRLMEPYLFVEVQAPADCVSAVYTVLAKRRGHVTQDAPVPGSPLYTIKAFVPAIDSFGFETDLRTHTQGQAFCLQVFHHWQIVPGDPLDKSIVIRPLEPQPATHLAREFMIKTRRRKGLSEDVSINKFFDDPMLLELARQDVQFN-