Monarch geneset OGS2.0

DPOGS207552
TranscriptDPOGS207552-TA1035 bp
ProteinDPOGS207552-PA344 aa
Genomic positionDPSCF300072 - 927093-939274
RNAseq coverage515x (Rank: top 24%)
Annotation
HeliconiusHMEL0180311e-10394.71% 
BombyxBGIBMGA004701-TA1e-9492.51% 
DrosophilaMkp3-PB3e-8752.10% 
EBI UniRef50UniRef50_D6WZI54e-9852.73%Dual specificity protein phosphatase n=5 Tax=Coelomata RepID=D6WZI5_TRICA
NCBI RefSeqXP_971654.22e-9952.29%PREDICTED: similar to AGAP012237-PA [Tribolium castaneum]
NCBI nr blastpgi|1892412245e-9852.29%PREDICTED: similar to AGAP012237-PA [Tribolium castaneum]
NCBI nr blastxgi|1892412242e-9752.83%PREDICTED: similar to AGAP012237-PA [Tribolium castaneum]
Group
Gene OntologyGO:00081382.1e-50protein tyrosine/serine/threonine phosphatase activity
GO:00064702.1e-50protein dephosphorylation
GO:00170172.8e-10MAP kinase tyrosine/serine/threonine phosphatase activity
KEGG pathwaytca:6603187e-99 
 K04459 (DUSP, MKP)maps-> MAPK signaling pathway
InterPro domain[185-326] IPR0204222.1e-50Dual specificity phosphatase, subgroup, catalytic domain
[193-325] IPR0003401.8e-37Dual specificity phosphatase, catalytic domain
[8-139] IPR0017635.2e-32Rhodanese-like
[29-39] IPR0083432.8e-10MAP kinase phosphatase
Orthology groupMCL13486 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207552-TA
ATGCCTATAGATTCAGAGTGCGATTATGACTTGGTGACAAAAGAGTGGCTTTTAGCCAAATTGCGTTCAGACGAGAGAGATACAATTTTGATTGACTGTCGAGGATCTAATGAATATGCCGTATCCCACATTCGTTCTGCAGTGAACTTCTCGATTCCGAGTATAATGTTGCGACGATTATCTGCTGGAAAGATCGAACTAGCTTCTACTGTTCAATGTAAAGAACTAAAGGCTCGTATCACGCATTGCTGTTCGAGAGGAACGTTCGTGTTGTACGGTGACGGAGCCCCGCGGGATCCAGACTCTGTGCACGGCATCCTGCTGAAACGGCTCAAGCAAGATGGCGTGCAGGTGGTTTGCTTAGAAGGTGATTTCTGCGAGTTCCGTCGTGCGTATCCTGAGTGGTGCAGCGAGGCCGGGGCGCAGCAGGTGCCGCACCTGCCGCTGATGGGACTGCGTTCCCTTCGTATATCTGGTTCGGGTTGTGACGACGCGCTCTCATCTGGCTCGTCTTCCGAGTGTGAGGACATGCACACACACGCGCCCCAGGACTTCCCCATAGAGATCCTGCCTAACCTGTACCTCGGGAACTCTAACAACAGCGAAGATTGTGAAGCGCTGGCCAGACACAATATTAAGTACGTGTTGAATGTGACTCCGGACTTGCCGAACACGTTCGAGGCTGACGGGTGCGGCATCAACTATCTCAAGATACCCATCGCGGACCACTGGAGCCAGAACCTCGCCGTGCATTTCCCTCAGGCCATACGATTCATTGAGGAGGCGATGTCAGCCGAGTGCGGCGTGTTGGTGCACTGTGTGGCGGGCGTGTCTCGCTCGGTGACGGTGACGCTGGCCTACCTCATGCAGCGCCACCGGCTCTGTCTGCGGGACGCCTTCGAGCTGGTCCGCAGCCGCAAGACGGACATCGCGCCCAACTTCCACTTCATGAGACAACTGCACTCGTTCGAGAGAGACCTCGGACTGCACGAGCGCAGCGCCAGCCTCGCCAAGGTCTGCCTCACCAGCTGTTAG

Protein sequence:

>DPOGS207552-PA
MPIDSECDYDLVTKEWLLAKLRSDERDTILIDCRGSNEYAVSHIRSAVNFSIPSIMLRRLSAGKIELASTVQCKELKARITHCCSRGTFVLYGDGAPRDPDSVHGILLKRLKQDGVQVVCLEGDFCEFRRAYPEWCSEAGAQQVPHLPLMGLRSLRISGSGCDDALSSGSSSECEDMHTHAPQDFPIEILPNLYLGNSNNSEDCEALARHNIKYVLNVTPDLPNTFEADGCGINYLKIPIADHWSQNLAVHFPQAIRFIEEAMSAECGVLVHCVAGVSRSVTVTLAYLMQRHRLCLRDAFELVRSRKTDIAPNFHFMRQLHSFERDLGLHERSASLAKVCLTSC-