Monarch geneset OGS2.0

DPOGS214749
TranscriptDPOGS214749-TA1059 bp
ProteinDPOGS214749-PA352 aa
Genomic positionDPSCF300022 + 708110-710531
RNAseq coverage1491x (Rank: top 9%)
Annotation
HeliconiusHMEL0060100.093.15% 
BombyxBGIBMGA004743-TA2e-17587.27% 
Drosophilaalph-PE2e-11563.31% 
EBI UniRef50UniRef50_Q17FN99e-11965.05%Protein phosphatase 2c n=7 Tax=Eumetazoa RepID=Q17FN9_AEDAE
NCBI RefSeqXP_966581.11e-13171.20%PREDICTED: similar to phosphatase 2C beta [Tribolium castaneum]
NCBI nr blastpgi|3800251059e-13573.79%PREDICTED: protein phosphatase 1B-like [Apis florea]
NCBI nr blastxgi|3800251053e-13173.79%PREDICTED: protein phosphatase 1B-like [Apis florea]
Group
Gene OntologyGO:00038246.3e-99catalytic activity
GO:00002873.6e-07magnesium ion binding
GO:00047213.6e-07phosphoprotein phosphatase activity
GO:00301453.6e-07manganese ion binding
KEGG pathwayptr:4703655e-104 
 K04461 (PPM1B, PP2CB)maps-> MAPK signaling pathway
InterPro domain[1-308] IPR0156551.7e-166Protein phosphatase 2C
[1-285] IPR0019326.3e-99Protein phosphatase 2C-like
[278-310] IPR0129113.6e-07Protein serine/threonine phosphatase 2C, C-terminal
Orthology groupMCL14975 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214749-TA
ATGGGGGCCTTTTTAAACAAGCCCGAAACTAAGAAGTACAACGAGAGTGGCGAAGGTAACGGACTACGCTATGGCGTGGCGTCGATGCAGGGCTGGCGCATGGAGATGGAAGATGCTCATCATGCGCAGCTTACTTTAAACGGTACATTATCGGACTGGTCATATTTCGGCGTATTTGATGGCCACGCGGGGGCCAAGGTGTCAGCACACTGTGCCGAAAATTTGCTGGAATGTATCTTGCAGACCGAAGAATTTCGGAGAGACGATATAGTGGAGGCGATTCGTACTGGTTTCCTAGATCTCGACATGAAAATGCGAGAATTGCCGGAACTTTCTAACGGCGCGGAGAAGTCAGGTTCGACAGCGGTATGCGCTTTTGTCTCACCAAAGCAGATATACATAGCAAATTGTGGCGATTCGCGCGCGGTTTTGGCACGGAACGGTGCTCCGATTTTTGCAACTCGGGATCATAAGCCGGAGCTGCCATCTGAGAAGTCTCGTATCGTCCAGGCTGGAGGTTCGGTCATGATACATCGTGTCAACGGCAGTCTGGCGGTGTCGCGAGCTCTAGGAGATTACGAGTACAAAAAGGTTCTGGACCTCGGACCTTGCGAGCAGCTGGTTTCTCCGGAGCCAGAGGTGTCCGTACACGAGCGTCTGGATGTGGAAGACGAGTTCCTCGTGCTAGCCTGTGACGGAGTATGGGATGTGATGAGCAACGAGGCATTGTGTGCCTACATCCACTCACTACTGCTACTGACGGATGACCTTGTTGCTATCACTAATCAAGTCATTGACACTTGTCTTTATAAGGGTAGTAAGGACAACATGAGCATTGTGCTGGTGGTGTTCCCGGCTGCTCCCAAGCCGAGTCCGGAGGCGCAACGTGCGGACCGAGAGCTAGACGACACGCTGCGACAGCGGCTCACAGGTTGCTATTACTCTTTACTGCATGCGTGCGCGCACTCGCCTCGGCCACACGGCTCCGACACGCAGACAGACAGACAGACACACACGCTAACAGAACATAACTTAGACCAGAGCGAAGTCAGGGGATGA

Protein sequence:

>DPOGS214749-PA
MGAFLNKPETKKYNESGEGNGLRYGVASMQGWRMEMEDAHHAQLTLNGTLSDWSYFGVFDGHAGAKVSAHCAENLLECILQTEEFRRDDIVEAIRTGFLDLDMKMRELPELSNGAEKSGSTAVCAFVSPKQIYIANCGDSRAVLARNGAPIFATRDHKPELPSEKSRIVQAGGSVMIHRVNGSLAVSRALGDYEYKKVLDLGPCEQLVSPEPEVSVHERLDVEDEFLVLACDGVWDVMSNEALCAYIHSLLLLTDDLVAITNQVIDTCLYKGSKDNMSIVLVVFPAAPKPSPEAQRADRELDDTLRQRLTGCYYSLLHACAHSPRPHGSDTQTDRQTHTLTEHNLDQSEVRG-