Monarch geneset OGS2.0

DPOGS203345
TranscriptDPOGS203345-TA2001 bp
ProteinDPOGS203345-PA666 aa
Genomic positionDPSCF300003 - 106985-113275
RNAseq coverage885x (Rank: top 14%)
Annotation
HeliconiusHMEL0226570.069.99% 
BombyxBGIBMGA011993-TA0.053.64% 
DrosophilaCG2017-PA2e-12849.49% 
EBI UniRef50UniRef50_Q960F73e-12649.49%CG2017, isoform A n=25 Tax=Coelomata RepID=Q960F7_DROME
NCBI RefSeqXP_002429073.12e-12752.22%GTP-binding protein, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420171874e-12652.22%GTP-binding protein, putative [Pediculus humanus corporis]
NCBI nr blastxgi|246444624e-12249.49%CG2017, isoform B [Drosophila melanogaster]
Group
Gene OntologyGO:00055258.1e-20GTP binding
GO:00039248.1e-20GTPase activity
KEGG pathway 
InterPro domain[156-381] IPR0007958.1e-20Protein synthesis factor, GTP-binding
[564-657] IPR0090011.2e-12Translation elongation factor EF1A/initiation factor IF2gamma, C-terminal
[392-485] IPR0090003.2e-10Translation elongation/initiation factor/Ribosomal, beta-barrel
Orthology groupMCL14086 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203345-TA
ATGTACCTGGCCGGCGACCTGAGCGCGGAGGACCTTGAGAGCGACTTCTTTTACAGTTCAGACGACATGGACGACTTCTGTGACGATACAACCTCGGAGGCGTCTGACTCTGGCGAGGAATGCTACGGCAGTCTTCCCCCAGAACCTCGTTTCGGTAACGTTGAGTACAAACTGCAGCTGGTGTCGCCTTGCGAGAGAAGATTCCAGCATTTGGTTACACAGCTGAAATGGCGTCTCCGCTCTGGCGGTGGGAGCGCGGTGTACGTGGTAGGGGTCCGGGATAATGGAGCGCTAGTAGGGCTCCATGCGGGTGCGCTCCGAGCCTCGCTGTGCTCGTTGAGGGACATGGCGAGAGCTCTGGGAGCTGTGATCGTTAGCGCGCGCGCGAGACGAGTGACCAGTAGCCGGGCTGTCGCTGAGGTTTATATACGGAAGTTAGCGGACACTCAACAAAGTGTGGAACTGCGTGTGGCTGTGATGGGGGCTATAGAAGCTGGGAAATCTACCCTCATTGGGGTCCTAACACAAGGTGAATTAGATAATGGCAGAGGCAGCGCTCGTCTGAATATGTTCAGACATCTCCATGAAGTCAGAAGCGGAAGGACGTCTTCGCTCAGCCACGAGATACTCGGGTTCGACTCTCAGGGTAACGTGGTGAATTATGGCTGTTCTGAGCTGATGACGGCGGAGCGTATCGGAGAGAGGAGTTCCAAGCTGGTGTCTTTCTTAGACCTCGCGGGACACAGCAAGTATCAGCGGACCACGGTGTACGGTCTCACGGGATACTCGCCGCATTACGCCATGATAGTGATATCAGCAACGGCTGGGATAACACCGATAACAGAAGAACACATAGGTCTACTTCTTGCCCTGGAACTGCCTTTTTTCGCTGTTATTAATAAGACGGAGCTAGCTTCCAGCACTAAGGAGCTGGTGGATAGGCTCGGAGAAATACTTTCGACGGCGAACAAGAAACCTCTTCTCATAACGGACGAGAACCTCGCGAGGAATTGTATAGCGCCGTCCATATTGGACTCCATTGATAATGAGGATAAGGAAAATGAAGGATCCTTCATACCTGTGTTCCCTGTTAGCTGTGTTCGTGGAGTTGGTCTCAACTCATTGCACGCGTACCTCCTCGCTCTCAGACCACCCGCTGGCGGCGTAGAGACTACAAGGGAAGATGAGACCTGCGAGTTCCAAATAGACGAGATCTTCCACGTGGCGTCCGGGGCTCCGGTCGTTGGAGGTCTCCTGGCTCGGGGGGCGCTCAACGAGGGCGACACGCTGTTAGTGGGTCCATTAGACAGCGGTCAATTCGTCAAAACGACTGTGTTATCCATATATCGTAATCGGGTTCCTTGCGCGTCCGTCCGCGCCGGACAGTCTGCCTCGCTGGGGCTCCGCCCCGGGCCAGTTCTGAGGCCCGGTATGGTGCTCCTCGCTATACCAGAAGACTATGGCACGGGGGCCCGGCCCGCTTTCGGCGGCTGTGGGGGACTACGGTGCGGGGGGAGGGAGATATCGGAACTTGTGAAATCGTCGCAGGAGAAGAACCGAAAGAACGCGAGGCGCAACAAGAACATTAAGGAGATCAACATAACGGACAAACATACAGACAAACTCACAGACGCTTTGGGGGACGGAGACTGCGTGTGCAGCGACGTGGTACCACTAGAAGACCCGAACGACCCAAGAGGTTGTATTTACTTCCAGGCTAGCGTCCACCTCCTCCGACACTCCACCTCCATATCTCCAGGGTTCCAGTGTTCCGTACACGTGGGGAACGTGAGGCAGACAGCCATCATAGAGGGTATACTGTCAGCGATGTCTTCGCTCCGGCCCGGTCAGAGCGCGTGCGTGTTGTTCAGGTTTGCGCGCTGTCCGGAGTATTTGAGGAAGGGCAGGAGGCTGCTGTTCACCGCCGGACTTGGGACCAGAGCCATCGGAGTCGTGACGCAGACGTTCCCGTACATACCGCAGCCGAAAGATAATTTATAA

Protein sequence:

>DPOGS203345-PA
MYLAGDLSAEDLESDFFYSSDDMDDFCDDTTSEASDSGEECYGSLPPEPRFGNVEYKLQLVSPCERRFQHLVTQLKWRLRSGGGSAVYVVGVRDNGALVGLHAGALRASLCSLRDMARALGAVIVSARARRVTSSRAVAEVYIRKLADTQQSVELRVAVMGAIEAGKSTLIGVLTQGELDNGRGSARLNMFRHLHEVRSGRTSSLSHEILGFDSQGNVVNYGCSELMTAERIGERSSKLVSFLDLAGHSKYQRTTVYGLTGYSPHYAMIVISATAGITPITEEHIGLLLALELPFFAVINKTELASSTKELVDRLGEILSTANKKPLLITDENLARNCIAPSILDSIDNEDKENEGSFIPVFPVSCVRGVGLNSLHAYLLALRPPAGGVETTREDETCEFQIDEIFHVASGAPVVGGLLARGALNEGDTLLVGPLDSGQFVKTTVLSIYRNRVPCASVRAGQSASLGLRPGPVLRPGMVLLAIPEDYGTGARPAFGGCGGLRCGGREISELVKSSQEKNRKNARRNKNIKEINITDKHTDKLTDALGDGDCVCSDVVPLEDPNDPRGCIYFQASVHLLRHSTSISPGFQCSVHVGNVRQTAIIEGILSAMSSLRPGQSACVLFRFARCPEYLRKGRRLLFTAGLGTRAIGVVTQTFPYIPQPKDNL-