Monarch geneset OGS2.0

DPOGS204523
TranscriptDPOGS204523-TA999 bp
ProteinDPOGS204523-PA332 aa
Genomic positionDPSCF300297 - 419287-421858
RNAseq coverage1144x (Rank: top 11%)
Annotation
HeliconiusHMEL0087300.096.69% 
BombyxBGIBMGA004302-TA0.093.37% 
DrosophilaeIF-2alpha-PA3e-13173.83% 
EBI UniRef50UniRef50_E2B0631e-15583.95%Eukaryotic translation initiation factor 2 subunit 1 n=13 Tax=Bilateria RepID=E2B063_CAMFO
NCBI RefSeqNP_001037516.10.093.07%eukaryotic translation initiation factor 2 alpha subunit [Bombyx mori]
NCBI nr blastpgi|274625920.093.96%eIF2 alpha subunit [Spodoptera frugiperda]
NCBI nr blastxgi|274625924e-17393.96%eIF2 alpha subunit [Spodoptera frugiperda]
Group
Gene OntologyGO:00037436.5e-39translation initiation factor activity
GO:00037236.5e-39RNA binding
GO:00058506.5e-39eukaryotic translation initiation factor 2 complex
KEGG pathwaynvi:1001196272e-156 
 K03237 (eIF-2A, EIF2S1)maps-> Protein processing in endoplasmic reticulum
InterPro domain[186-308] IPR0240557.8e-51Translation initiation factor 2, alpha subunit, C-terminal
[128-242] IPR0114886.5e-39Translation initiation factor 2, alpha subunit
[5-91] IPR0123403.3e-37Nucleic acid-binding, OB-fold
[92-180] IPR0240549.2e-35Translation initiation factor 2, alpha subunit, middle domain
[2-88] IPR0160271.5e-20Nucleic acid-binding, OB-fold-like
[13-87] IPR0030299.2e-18Ribosomal protein S1, RNA-binding domain
[14-87] IPR0229671.1e-14RNA-binding domain, S1
Orthology groupMCL13464 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204523-TA
ATGCCGTTGTCGTGTCGATTTTATCAAGAAAAATATCCAGAAGTAGAAGATGTGGTGATGGTGAACGTTAGGTCTATAGCTGAGATGGGTGCATACGTTCACCTGCTCGAATATAACAATATAGAGGGCATGATTTTGCTTTCTGAATTGTCCAGAAGACGTATCCGTTCCATAAACAAGTTGATTAGAGTAGGGAAGACAGAACCGGTCGTAGTTATCAGAGTAGACAAAGAAAAAGGTTACATTGATTTGTCGAAACGTCGTGTATCAGCCGAGGATATCGACAAATGTACTGAGAGGTACGCCAAAGCTAAAGCTGTGAACTCCATTTTGCGTCACGTAGCAGAACTGCTGCACTACCAAAGCTCGGAACAGCTGGAAGAGCTTTATAAACAGACAGCTTGGCACTTCGAAGAGAAATATAAAAAGAAAGCATCTGCTTACGACTTCTTTAAACAAGCCGCTGTAGATCCCTCTGTGCTCAATGAATGTGGTCTGGATGAGAAAACCAAGGAAGTGCTATTAGCTAACATCAAACGGAAACTGACTTCCCAGGCTGTGAAGATCCGAGCTGATATTGAATGTGCCTGTTACGGCTATGAGGGTATTGACGCCGTGAAGGAGGCATTGAAAGCTGGCTTGTCTCTCTCAACCCCAGATATGCCTATTAAAATCAACCTCATAGCCCCACCTTTGTATGTAATGACAACTTCTACCCCGGAGAAGACCGATGGGCTCAAAGCTTTGCAAGACGCAATCGATAAAATCAAAGATACAATCACCACTGCCGGTGGAGTGTTCAACATTCAGATGGCTCCTAAGGTTGTCACCGCAACGGATGAGGCAGAGTTGGCAAGACAGATGGAACGTGCAGAGGCTGAGAATGCAGAGGTCGCTGGTGACTCAGCAGAGGAAGATGCCGACCAAGGCATGGGCGATGCTGGTATGGATGAAGAACCACAACAGAACGGGGCTTCGGATGACACTGATGAGAATTGA

Protein sequence:

>DPOGS204523-PA
MPLSCRFYQEKYPEVEDVVMVNVRSIAEMGAYVHLLEYNNIEGMILLSELSRRRIRSINKLIRVGKTEPVVVIRVDKEKGYIDLSKRRVSAEDIDKCTERYAKAKAVNSILRHVAELLHYQSSEQLEELYKQTAWHFEEKYKKKASAYDFFKQAAVDPSVLNECGLDEKTKEVLLANIKRKLTSQAVKIRADIECACYGYEGIDAVKEALKAGLSLSTPDMPIKINLIAPPLYVMTTSTPEKTDGLKALQDAIDKIKDTITTAGGVFNIQMAPKVVTATDEAELARQMERAEAENAEVAGDSAEEDADQGMGDAGMDEEPQQNGASDDTDEN-