Monarch geneset OGS2.0

DPOGS203443
TranscriptDPOGS203443-TA1926 bp
ProteinDPOGS203443-PA641 aa
Genomic positionDPSCF300242 - 92236-98879
RNAseq coverage2100x (Rank: top 6%)
Annotation
HeliconiusHMEL0095132e-3782.50% 
BombyxBGIBMGA011143-TA2e-8035.58% 
DrosophilaGadd34-PA7e-1143.66% 
EBI UniRef50UniRef50_Q9EML35e-1956.72%AMV193 n=1 Tax=Amsacta moorei entomopoxvirus 'L' RepID=Q9EML3_AMEPV
NCBI RefSeqXP_314343.42e-1352.94%AGAP004848-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|99645072e-1856.72%protein phosphatase 1, regulatory subunit 15A [Amsacta moorei entomopoxvirus 'L']
NCBI nr blastxgi|1163267722e-1946.53%hypothetical protein TNAV2c_gp086 [Trichoplusia ni ascovirus 2c]
Group
KEGG pathwayoaa:1000871595e-06 
 K14019 (PPP1R15A, GADD34)maps-> Protein processing in endoplasmic reticulum
InterPro domain[466-543] IPR0195234.8e-14Protein phosphatase 1, regulatory subunit 15A/B, C-terminal
Orthology groupMCL30816 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203443-TA
ATGAATAATTTCTGTGGTGAGCGCAATAGGCGCCGTTTCTTAGATTTTGGCGTGTTTCCTACATTTCAATATCCAATATCAAATTATTCGACGAGTTCCCAAACGAAGACAGAGATGAAGAATACGTTCGAAGACAACTTTTTCGCGAACAATTTGCATCTCATCCAAGATCCTTTGAAAAAGAACCTCGCTAATACTGTGTTAACAGGCGAAGATAAGAAAACAAACATGGGGAAAACAAAGGTAACGGTGACAAAGGAAGAGGAAATACCTCAGAAGGATTACTATTCATTCCATTCTGGCATCACGGGAGTTCTCACAGGGATATTAAACTTCATGGGGTTATCTCAGACGCGTTCCACTTCGTCCAAACACCACGAGACTGATTACAACGATCGCCAGATGGCGTCAGGGAACTGGCACTACCCCAAATGCGAGGTTCAAAATAAAAATGAACGGGACCGTAATACAAATATGGATATAGAAGTCTGCAGATTAACTTCTGAACACTGTGATAATAAAAATAATTGCACCAATCTCTTCTCAGAATTCCAGCCTTCAGTTGCAAGAGAGTCTTTTATTCCAGGTGACTCGGAAGATATTGATATAGGCCTATTCAATGAGAGCTGTATAGAATATTTTTGTCCAAACACTTATCAAGAACAGGAACTGTGTGAAACAGCTGCACAGGGAGTAAAGACTGAGATATTAAACATATGTGTAGACATCGGAGACACACAAGCATATCGTAAGACTGAATTAAATGAGAGTATGGAGATAGAGAAGCCTTTACCAACGAATGGGCTGGAGATAATAGGAAAAACAAGAGAGGCAGCATCATCATGTGAGGACAAAATGTCTAAATTAAAAGCATTGCTTAAATGCCGTCAAAACAAAAACTATGAATCATCTCTACCGAGTCATGATAAGACAAAACCGGTCGATATTCCAAGTAAATATTCCGAAACAGTACAGGATACTTTGTTTGCAAACGAAATAGCTTCTTCAAATCTAACCAACAGCTTTAATGAAGTCTCAGGTAAATTCTGTTCATCATCGGTGGACAGTGAAGATTCCTTCCAAATAGTGTTCACCGACAGCCCACAGAATTTCGAAAGACGTCGCATTTCATCGGACTGCGAGTCTGAGGATTCGTTTATCGTTTTCGAAGAAACACCAGAAAACTGCTACACCAATAATGATGTGTTTGGTGATGAGAGCTCGGACTCGGATTCAGTGGTCGAGGAGTCTATGTGCATAGCACAGTTGTCACCGAACCTGTCAAAGACATTCAGTGACCTAACAGACACCAGCCTATACAGTGAGGATGTTGTGGACTTTGCTGAGAAGTGTGACAATGAGGACGATGTCCATCAGCCATTCACAGGACTTCTGATCAATGAGACACGGAAACAGGAGAAGTTACGGCAGCCTAAAAAAAGGGTCCGCTTCTCATCGCAGCCTCCCAAGGTCCACGTGATGAGGGTCTGGGCCTTCGCAGCCAGGCAGGCCCGCGCCGGCCACTGGGAGAGACACGCGCTGGACAGGGAGAGATTCAAAAGAAGGATAGCGGATGTAGAGATGGCTATATCCTGGGTCCTTAAACCTCAGCACCGATCTAGGATAGTCTTCCAGCGATTCATGCCCTGGTGGAACGCTGAGAGACGAAGGGAGTTAGCGGAGAAGAAGAGAGAGGAAGACACAAGGAGAGAAGCGGAAGAAGTGGAGAAGATGGCTATCCGTCGACAGAATGATGAGAATTGCAATGAAGTAGAAACTGAAAGCGATTTGATGAACAATGAATCAAAACAAAGTGATGATAGTGATGGAATAAGAGGAAAAGATAACGATGAATCAGCCGCGAGGGTAGTTCACATCGCCGAGACAAAACGACCAGGAAACAATTTGACAATAGTTGATACTTGA

Protein sequence:

>DPOGS203443-PA
MNNFCGERNRRRFLDFGVFPTFQYPISNYSTSSQTKTEMKNTFEDNFFANNLHLIQDPLKKNLANTVLTGEDKKTNMGKTKVTVTKEEEIPQKDYYSFHSGITGVLTGILNFMGLSQTRSTSSKHHETDYNDRQMASGNWHYPKCEVQNKNERDRNTNMDIEVCRLTSEHCDNKNNCTNLFSEFQPSVARESFIPGDSEDIDIGLFNESCIEYFCPNTYQEQELCETAAQGVKTEILNICVDIGDTQAYRKTELNESMEIEKPLPTNGLEIIGKTREAASSCEDKMSKLKALLKCRQNKNYESSLPSHDKTKPVDIPSKYSETVQDTLFANEIASSNLTNSFNEVSGKFCSSSVDSEDSFQIVFTDSPQNFERRRISSDCESEDSFIVFEETPENCYTNNDVFGDESSDSDSVVEESMCIAQLSPNLSKTFSDLTDTSLYSEDVVDFAEKCDNEDDVHQPFTGLLINETRKQEKLRQPKKRVRFSSQPPKVHVMRVWAFAARQARAGHWERHALDRERFKRRIADVEMAISWVLKPQHRSRIVFQRFMPWWNAERRRELAEKKREEDTRREAEEVEKMAIRRQNDENCNEVETESDLMNNESKQSDDSDGIRGKDNDESAARVVHIAETKRPGNNLTIVDT-