Monarch geneset OGS2.0

DPOGS212059
Transcript: DPOGS212059-TA (1863 bp)
Protein: DPOGS212059-PA (620 aa)
Genomic position: DPSCF300317 - 142091-144506
RNAseq coverage: 315x (Rank: top 36%)
Annotation
Source            Hit                E-value   Identity   Description
Heliconius        HMEL009355         0.0       86.49%
Bombyx            BGIBMGA009637-TA   0.0       83.22%
Drosophila        CG11534-PA         8e-93     42.33%
EBI UniRef50      UniRef50_D6WGG5    3e-103    44.08%     Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WGG5_TRICA
NCBI RefSeq       XP_969464.1        6e-104    44.08%     PREDICTED: similar to CG11534 CG11534-PA [Tribolium castaneum]
NCBI nr blastp    gi|91078782        1e-102    44.08%     PREDICTED: similar to CG11534 CG11534-PA [Tribolium castaneum]
NCBI nr blastx    gi|91078782        1e-105    44.74%     PREDICTED: similar to CG11534 CG11534-PA [Tribolium castaneum]
Group
KEGG pathway: xla:414479 (E-value 5e-07)
    K04350 (RASGRP1) maps-> MAPK signaling pathway
                            T cell receptor signaling pathway
Orthology group: MCL14936 (Single-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212059-TA
ATGGCCACAACCCTGAACGATTCGCCAAAGTATCGTAATAGTTTAATCTGCTGCAGCCCCAGGAGTGTCGATGTGACTTCCAGATCATCGTCTTCCACATCCGGATGCGTCTCTGCTGACGAGAGTTACGATTCAAACTACATTCCGAAGTCTATAGCGAATAAGAAACTCAAAATACACAGCTCGGCTACGAAAGAGGAAGTGGAGAAAGCTATCAATCAATGCAAAGAGCTAGTTTTAAATAGCCCACAGTGTTCCGATGAACGCAAGTGGCTCGTGCGATATTTAGTTGAATTGAGACTGCGTTTAGAAGATTTAAAAGAAAACGACGGTCAACTGAGGTCTCGCGTGGCAATTAAGGGTCACCACTTCGAACAACAGACAACGACTAGCAATCGTAAACAATACTGTGATCATTGCAGCGGGGTAATATGGAGTATTGTTCAAAGTTCATATATATGTAAGGACTGCGGCTACATTTGTCATTACAAATGCGTCGACGATATCTGTAGAGTGTGTGCGCATGTTGTTATGACAGAAAAGGGACAGTTCGAGATGAATATATGTCCAGAGAAAGGTTTAGCGGCACAGGAGTACAACCCCAGGAGTGTGGATGTGACTTCCAGATCATCGTCTTCCACATCCGGATGCGTCTCCGCTGACGAGAGTTACGATTCAAACTACATTCCGAAGTCTATAGCGAATAAGAAACTCAAAATACACAGCTCAGCTACGAAAGAGGAAGTGGAGAAAGCTATCAATCAATGCAAAGAGCTAGTTTTAAATAGCCCACAGTGTTCCGATGAACGCAAGTGGCTCGTGCGATATTTAGTTGAATTGAGACTGCGTTTAGAAGATTTAAAAGAAAACGACGGTCAACTGAGGTCTCGCGTGGCAATTAAGGGTCACCACTTCGAACAACAGACAACGACTAGCAATCGTAAACAATACTGTGATCATTGCAGCGGGGTAATATGGAGTATTGTTCAAAGTTCATATATATGTAAGGACTGCGGCTACATTTGTCATTACAAATGCGTCGACGATATCTGTAGAGTGTGTGCGCATGTTGTTATGACAGAAAAGGGACAGTTCGAGATGAATATATGTCCAGAGAAAGGTTTAGCGGCACAGGAGTACAAGTGTGCAGAATGCAATACACCATTAACATTCAAAGACTCATGGAATGAGCCTCGTCTCTGTGATTACACCGGGATGTATTTTTGTGGAACCTGTCATTGGAATGATCTATCCCCGATACCGGCTAGGGTCGTCCACAATTGGGATTGGGAGAAAAGATACATATCGAGACTAGCTTACCAGATGCTGACATTATCTTGGTCTCGGCCCTACATTGATGTAGAAAATGTTAATTCCAAACTATTCAGCTTCATTGCTGAACTTGAATGGGTGCACAAGATGCGGAAAGATCTGGAATGGATGAAACGTTACTTGTGCGCTTGTAGTGAAGGCTCAAATCTACTATCACCGTTATTCGTTCAGCTCGGTGATGTCAACAGGAAATACAGCATGTCACATTTACAAGCCATCAACGACGGCAGCCTCGAGACACAGCTAACGGAACTAACGGAAGTATGCAGGTTGCACATCACCAACTGCTCTTTATGTTCAGGCAAAGGTTATTTATGTGAGGTATGCGGCAACAATGAGGTTTTATATCCATTCGACAGCGGAGCAATAATGTGTGACAAGTGCAATTCGATGTACCACAGAGGCTGCTGGCTCAGGAAGGGGCAGAATTGCTTGAAATGTTTGAGATTGGGAGAAAGAAAGAAAAATATTAACACCACAGATGATGTAGACACTAACCTAGAATATGATATCGATAAATTTGTTAAATAA

Protein sequence:

>DPOGS212059-PA
MATTLNDSPKYRNSLICCSPRSVDVTSRSSSSTSGCVSADESYDSNYIPKSIANKKLKIHSSATKEEVEKAINQCKELVLNSPQCSDERKWLVRYLVELRLRLEDLKENDGQLRSRVAIKGHHFEQQTTTSNRKQYCDHCSGVIWSIVQSSYICKDCGYICHYKCVDDICRVCAHVVMTEKGQFEMNICPEKGLAAQEYNPRSVDVTSRSSSSTSGCVSADESYDSNYIPKSIANKKLKIHSSATKEEVEKAINQCKELVLNSPQCSDERKWLVRYLVELRLRLEDLKENDGQLRSRVAIKGHHFEQQTTTSNRKQYCDHCSGVIWSIVQSSYICKDCGYICHYKCVDDICRVCAHVVMTEKGQFEMNICPEKGLAAQEYKCAECNTPLTFKDSWNEPRLCDYTGMYFCGTCHWNDLSPIPARVVHNWDWEKRYISRLAYQMLTLSWSRPYIDVENVNSKLFSFIAELEWVHKMRKDLEWMKRYLCACSEGSNLLSPLFVQLGDVNRKYSMSHLQAINDGSLETQLTELTEVCRLHITNCSLCSGKGYLCEVCGNNEVLYPFDSGAIMCDKCNSMYHRGCWLRKGQNCLKCLRLGERKKNINTTDDVDTNLEYDIDKFVK-
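
The transcript and protein entries above can be checked against each other. The following is a minimal Python sketch, not part of the database record: it assumes the standard genetic code (NCBI translation table 1), and the helper names `translate` and `expected_cds_length` are hypothetical. It writes the stop codon as `*`, whereas the protein record above marks it with a trailing `-`.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with bases cycling through T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence in frame 1; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X")
                   for i in range(0, usable, 3))


def expected_cds_length(protein_length, has_stop=True):
    """Nucleotide length implied by a protein length (hypothetical helper)."""
    return 3 * (protein_length + (1 if has_stop else 0))


# First 30 nucleotides of DPOGS212059-TA, copied from the record above.
print(translate("ATGGCCACAACCCTGAACGATTCGCCAAAG"))
print(expected_cds_length(620))
```

Under these assumptions, the first ten codons translate to the opening residues of DPOGS212059-PA, and a 620 aa protein plus one stop codon corresponds to the 1863 bp listed for the transcript.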