Monarch geneset OGS2.0

DPOGS211607
TranscriptDPOGS211607-TA1446 bp
ProteinDPOGS211607-PA481 aa
Genomic positionDPSCF300232 - 23742-27986
RNAseq coverage3236x (Rank: top 4%)
Annotation
HeliconiusHMEL0143120.077.60% 
BombyxBGIBMGA008226-TA4e-10590.10% 
DrosophilaeIF5-PF1e-12050.11% 
EBI UniRef50UniRef50_B4JN792e-12247.90%GH24190 n=23 Tax=Opisthokonta RepID=B4JN79_DROGR
NCBI RefSeqNP_001037662.17e-17872.82%eukaryotic translation initiation factor 5 [Bombyx mori]
NCBI nr blastpgi|1129832061e-17672.82%eukaryotic translation initiation factor 5 [Bombyx mori]
NCBI nr blastxgi|1129832060.073.43%eukaryotic translation initiation factor 5 [Bombyx mori]
Group
Gene OntologyGO:00064133.8e-60translational initiation
GO:00037433.8e-60translation initiation factor activity
GO:00160701.6e-29RNA metabolic process
GO:00054885.1e-22binding
KEGG pathway 
InterPro domain[14-129] IPR0027353.8e-60Translation initiation factor IF2/IF5
[273-439] IPR0160211.6e-29MIF4-like, type 1/2/3
[4-100] IPR0161899e-27Translation initiation factor IF2/IF5, N-terminal
[358-438] IPR0033075.1e-22eIF4-gamma/eIF5/eIF2-epsilon
[278-432] IPR0160243.4e-21Armadillo-type fold
[97-135] IPR0161904.6e-06Translation initiation factor IF2/IF5, zinc-binding
Orthology groupMCL13793 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211607-TA
ATGGGGTCGGTGAACGTGAACCGCAACGTCGCGGACGCCTTCTACCGGTACAAGATGCCGCGCATCTGCGCCAAGGTCGAGGGGAAGGGGAACGGCATCAAGACGGTCATCGTCAACATGCCCGAGGTGGCCAAGGCGCTCGGCAGACCGGCGACCTATCCGACCAAGTACTTCGGCTGCGAGCTGGGCGCCCAGACTCAGTTCGACTTCAAAAACGAGCGCTTCATAGTGAACGGCAGCCACGACTCCGCCAAGCTACAGGATCTGCTGGACGGCTTCATCCGCAAGTTCGTGCTGTGCCCGGAGTGTGACAACCCTGAGACGGAGCTGATCGTGTCCACCAAGCGGAACACCATCTCCCAGGGCTGCAAGGCGTGCGGTTACCACGGCACCCTGGACTTCAACCACAAGCTGAACACCTTCATCCTCAAGAACCCGCCCGCCGCCGACCCGGCCATGCAGGGCTCGTCTCTGACGGAGGGCAATCGCGGCAAGAGATCCAAGAGGAGCGGGCCGGCTAACGGGAACCACGACGGGGAACAACACGACGCTAAGAACGAGGGAGAAGCTCCCGTCACGCCCACAGCTCCTCAACCGAAGAGCAAGAAGGAAAAGAAGAAGGCCGAGGACGACGACGACGACGGGAACTGGACCGTGGACGTCAGCGAGGAGGCTGTGAGGGCCAGGATGCAAGGTGGGGAGATAGATACAGCTGACTCACTGCTTGATGCGGAGTGCTGCAGCTTGACGTACAAAAATGACTCGTATGGAGATGGCATCACGAATCTGACCGAAGGCGCCAAGAGCATGACGTTGTCGGAGGACAGCGAGAAAAACGAGAAGCAGAGGATGGACCTGTTCTACGCGTTCCTCAAACAGCGAGCTGACGCCGGAGACGTGGAGGGCACGCGGGCCATCAGCGACATCCTGCACGAGGTGGAGAGGCTGGACGTGAAGTCGAAGGCATTGCTGGTCGCTTTCGAGGTGCTGGTGGGCGCCAACACGCTGGCGGCGGACGTGAAGAAGCACCGCATGCTGCTGATACGCCTGGCGCGCGCCGACGCCAAGGCTCCGAGGGCGGCGCTCCACGCGCTCACCGCCCTGGCCCACACCAGCCCCGCCCTGCTGCAGCGGGTGCCCGCCGTGCTGAAGCTGCTGTACGACCTGGACGTGGTCGAGGAGAAGACCATACTGGAGTGGGCCGCGAAGCCCTCCAAGAAATACGCGCCGCGGGAGACCGTGGCCGACGTGGTGAGGCGTGCGCAACCCTTCATTGACTGGCTGCAGCAGGCCGACGAGGAGGACTCCAGCTCGCAGGAGGAGGACATTGAGATCCAGTATGACGACCGCGCTAAGGCGACCCCCATCAAGGCGGTGGTGGCGCCCTCTGCTCCGCGACAGAAGCCCGAGGAAGACGACATCGACGTCGACATAGACGCCATATAG

Protein sequence:

>DPOGS211607-PA
MGSVNVNRNVADAFYRYKMPRICAKVEGKGNGIKTVIVNMPEVAKALGRPATYPTKYFGCELGAQTQFDFKNERFIVNGSHDSAKLQDLLDGFIRKFVLCPECDNPETELIVSTKRNTISQGCKACGYHGTLDFNHKLNTFILKNPPAADPAMQGSSLTEGNRGKRSKRSGPANGNHDGEQHDAKNEGEAPVTPTAPQPKSKKEKKKAEDDDDDGNWTVDVSEEAVRARMQGGEIDTADSLLDAECCSLTYKNDSYGDGITNLTEGAKSMTLSEDSEKNEKQRMDLFYAFLKQRADAGDVEGTRAISDILHEVERLDVKSKALLVAFEVLVGANTLAADVKKHRMLLIRLARADAKAPRAALHALTALAHTSPALLQRVPAVLKLLYDLDVVEEKTILEWAAKPSKKYAPRETVADVVRRAQPFIDWLQQADEEDSSSQEEDIEIQYDDRAKATPIKAVVAPSAPRQKPEEDDIDVDIDAI-