Monarch geneset OGS2.0

DPOGS207062
TranscriptDPOGS207062-TA924 bp
ProteinDPOGS207062-PA307 aa
Genomic positionDPSCF300001 + 2257053-2260851
RNAseq coverage536x (Rank: top 23%)
Annotation
HeliconiusHMEL0040149e-8569.15% 
BombyxBGIBMGA013010-TA4e-17896.09% 
DrosophilaPp4-19C-PE8e-17191.86% 
EBI UniRef50UniRef50_P605109e-17292.83%Serine/threonine-protein phosphatase 4 catalytic subunit n=547 Tax=root RepID=PP4C_HUMAN
NCBI RefSeqXP_002429697.14e-17493.49%serine/threonine-protein phosphatase PP-V, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3407273506e-17695.44%PREDICTED: LOW QUALITY PROTEIN: serine/threonine-protein phosphatase 4 catalytic subunit-like [Bombus terrestris]
NCBI nr blastxgi|3072078013e-17495.44%Serine/threonine-protein phosphatase 4 catalytic subunit [Harpegnathos saltator]
Group
Gene OntologyGO:00167871.4e-144hydrolase activity
KEGG pathway 
InterPro domain[20-290] IPR0061861.4e-144Serine/threonine-specific protein phosphatase/bis(5-nucleosyl)-tetraphosphatase
[48-240] IPR0048432.6e-45Metallophosphoesterase domain
Orthology groupMCL16003 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207062-TA
ATGTCTGATACGAGTGATTTGGACCGTCAAATTGAAAGGCTGAAAAGATGTAAGCTAATATTGGAGGCAGAGGTTAAAGCCCTCTGTGCGAAAGCCCGCGAGATCCTTGTCGAAGAAAGCAACGTCCAACGAGTTGACTCCCCTGTGACTGTTTGTGGTGATATCCATGGACAATTCTATGACCTCAAAGAGCTGTTCAAAGTTGGTGGAGATGTACCTGAGACAAACTATCTTTTCATGGGTGACTTTGTAGACAGGGGCTTCTATTCTGTTGAAACATTTTTGTTACTACTCGCATTAAAGGTGCGTTACCCAGACCGCATCACGTTGATAAGAGGCAATCATGAGTCACGCCAGATAACTCAAGTTTATGGATTCTATGATGAATGCATAAGAAAATATGGCTCAATCACAGTATGGAGATATTGCACTGAAATTTTTGACTATCTGTCTCTATCCGCGATCATTGATGGACGGATATTTTGCGTCCATGGAGGACTTAGTCCCTCTATACAAACATTGGATCAAATTCGGACTATTGATCGCAAACAAGAGGTCCCCCATGATGGACCCATGTGCGACCTGCTTTGGAGTGACCCCGAAGATACACAAGGCTGGGGCGTGTCTCCCCGTGGAGCTGGTTATCTATTTGGTTCGGATGTCGTGGCTCAATTCAATGTATCCAATGACATTGACATGATCTGTCGGGCTCACCAGCTTGTTATGGAAGGATACAAGTGGCATTTCAATGAGACCGTACTCACTGTATGGTCGGCTCCCAATTACTGCTATCGATGTGGAAATGTGGCGGCCATATTGGAGCTGAATGAGACACTTCAGCGAGAATTCACCATATTTGAAGCTGCTCCACAGGAATCCAGGGGTATTCCATCTAAAAAACCTCAAGCAGACTACTTCTTATAA

Protein sequence:

>DPOGS207062-PA
MSDTSDLDRQIERLKRCKLILEAEVKALCAKAREILVEESNVQRVDSPVTVCGDIHGQFYDLKELFKVGGDVPETNYLFMGDFVDRGFYSVETFLLLLALKVRYPDRITLIRGNHESRQITQVYGFYDECIRKYGSITVWRYCTEIFDYLSLSAIIDGRIFCVHGGLSPSIQTLDQIRTIDRKQEVPHDGPMCDLLWSDPEDTQGWGVSPRGAGYLFGSDVVAQFNVSNDIDMICRAHQLVMEGYKWHFNETVLTVWSAPNYCYRCGNVAAILELNETLQREFTIFEAAPQESRGIPSKKPQADYFL-