Monarch geneset OGS2.0

DPOGS215203
TranscriptDPOGS215203-TA1257 bp
ProteinDPOGS215203-PA418 aa
Genomic positionDPSCF300143 - 53743-60525
RNAseq coverage17754x (Rank: top 1%)
Annotation
HeliconiusHMEL0096640.086.63% 
BombyxBGIBMGA008682-TA3e-5889.47% 
Drosophilaexba-PC4e-16664.04% 
EBI UniRef50UniRef50_Q9VNE26e-16464.04%Protein extra bases n=89 Tax=Eukaryota RepID=EXBA_DROME
NCBI RefSeqNP_001091797.10.088.24%eukaryotic initiation factor 5C [Bombyx mori]
NCBI nr blastpgi|1890312760.092.12%eukaryotic initiation factor 5C [Helicoverpa armigera]
NCBI nr blastxgi|1890312760.092.36%eukaryotic initiation factor 5C [Helicoverpa armigera]
Group
Gene OntologyGO:00160707.8e-45RNA metabolic process
GO:00054881e-38binding
KEGG pathway 
InterPro domain[254-416] IPR0160217.8e-45MIF4-like, type 1/2/3
[252-408] IPR0160241e-38Armadillo-type fold
[325-410] IPR0033071e-26eIF4-gamma/eIF5/eIF2-epsilon
Orthology groupMCL10854 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215203-TA
ATGAGTCAGAAGGTAGAAAAACCAGTATTATCGGGTCAACGGATCAAGACCAGAAAAAGAGATGAGAAAGAGAAGTATGACCCTAACGGGTTCCGCGACGCGTTGGTGTCGGGTCTGGAGCGAGCGGGGGACCTGGACGCGGCCTACAAGTACCTAGACGCGGCCGGGTCCAAGCTCGACTACCGCCGCTATGGCGAGGTCATCTTCGACGTGCTCATCGCCGGAGGGCTACTGCTGCCCGGCGGCTCGGTGTCTATGGACGGGGAAACTCCCAAGACTAACACTTGCATCTTCAACGCCAGCGAGGATATGGAGTCTATGAGGAACTTTGAACAGGTATTTGTCAAACTGATGCGTCGTTACAAATATCTCGAGAAGATGTTCGAGGAGGAAATGAAGAAGGTGCTGGTATACCTCAAGGGATTCGAACCTCTACAACGCATCAAATTGGCACGAATGACTGCTCTGTGGATCGGCAACGGCTGTGTGCCCCCCTCGGTGTTGCTGGTGCTGGTGAACGAGCATCTGCTGAAGGAGAATCTGGCGCTGGAGTTTGTGCTGGAGGTGTTCGCTACTGTTAAGGCGGAGAAGGGAGTCGCCAGTTTGGTCACCGCGCTCAAGAGAGGACAACTAGAGGGCAGACTGTTAGAGTTCCTCCCTCTGAACCGGCGCAGTGAGGACGTGTTGGCTAGCGCCTTCGCATCCCGCGGTCTCGCAGAGCTCTTGAGGCTGCACCGGGCTCAGGCGTCCCAGGAGGCTCGCCGCGAGCTGACCCAGGCGCTGCAGGAACAGCTGGCGGACGAGCGACCCGTCAGGGACCTCATCACAGACCTCCGAGACATGGCGCAGAGGCTCGACATACCTGACCACGAGGTCGTCGCTATTACCTGGCAATGCGTGATGTCCCGCGGCGAGTGGAACAAGAAGGAGGAACTGCTAGCGGAGCAGGCCGCCAAACATCTCCGACATTACACGCCGCTACTGGCAGCGTTCGCTCAGTCCGCGAAGGCTGAGATAGCTCTGCTCACTAAGGTTCAAGAGTACTGCTACGAGAATATGAGCTTCATGAGGGCCTTCAGTAAGCTGGTGCTGATGCTGTACAAGAGTAACGTGCTGAGTGAGGAGGTGATCCTCAAGTGGTACAGAGACCCCAACTCCAGCAAGGGGAAGGTCATGTTCCTTGACCAGATGAAGAAGTTTGTGGAGTGGCTTCAGAGCGCCGAGGAGGAATCGGAGAGCGGCGAGGAGGAAGATTAG

Protein sequence:

>DPOGS215203-PA
MSQKVEKPVLSGQRIKTRKRDEKEKYDPNGFRDALVSGLERAGDLDAAYKYLDAAGSKLDYRRYGEVIFDVLIAGGLLLPGGSVSMDGETPKTNTCIFNASEDMESMRNFEQVFVKLMRRYKYLEKMFEEEMKKVLVYLKGFEPLQRIKLARMTALWIGNGCVPPSVLLVLVNEHLLKENLALEFVLEVFATVKAEKGVASLVTALKRGQLEGRLLEFLPLNRRSEDVLASAFASRGLAELLRLHRAQASQEARRELTQALQEQLADERPVRDLITDLRDMAQRLDIPDHEVVAITWQCVMSRGEWNKKEELLAEQAAKHLRHYTPLLAAFAQSAKAEIALLTKVQEYCYENMSFMRAFSKLVLMLYKSNVLSEEVILKWYRDPNSSKGKVMFLDQMKKFVEWLQSAEEESESGEEED-