Monarch geneset OGS2.0

DPOGS203540
Transcript: DPOGS203540-TA (1641 bp)
Protein: DPOGS203540-PA (546 aa)
Genomic position: DPSCF300055 + 252472-266868
RNAseq coverage: 18x (Rank: top 80%)
Annotation
Heliconius: HMEL013203, 8e-77, 49.30%
Bombyx: BGIBMGA004341-TA, 3e-57, 71.14%
Drosophila: (no hit listed)
EBI UniRef50: UniRef50_E2B7E8, 2e-87, 51.14%, Protein tyrosine phosphatase domain-containing protein 1 n=4 Tax=Formicidae RepID=E2B7E8_HARSA
NCBI RefSeq: XP_393922.2, 3e-85, 50.16%, PREDICTED: similar to protein tyrosine phosphatase domain containing 1 protein isoform 2 [Apis mellifera]
NCBI nr blastp: gi|332019490, 6e-87, 51.66%, Protein tyrosine phosphatase domain-containing protein 1 [Acromyrmex echinatior]
NCBI nr blastx: gi|189233706, 4e-86, 62.07%, PREDICTED: similar to Protein tyrosine phosphatase domain-containing protein 1 [Tribolium castaneum]
Group: (none listed)
Gene Ontology: GO:0008138, 1.2e-12, protein tyrosine/serine/threonine phosphatase activity
               GO:0006470, 1.2e-12, protein dephosphorylation
KEGG pathway: (none listed)
InterPro domain: [134-258] IPR000340, 1.2e-12, Dual specificity phosphatase, catalytic domain
                 [168-260] IPR003595, 4.2e-08, Protein-tyrosine phosphatase, catalytic
Orthology group: MCL18998, Patchy

Nucleotide sequence:

>DPOGS203540-TA
ATGTCCGTTGTGGAAGAGGCTCAGCTGTGTGCGAGCAGCGACTTACTGTGGGAGGAAATTAAGAAAGGTGGCGCCCGAAGAGGGACTCCGGGTCAAAACGAAGAGCAGGGACAGCCGTCGCCGTACGCATCGACCACCCACTGCCTTATACAACAGGTTCAGCGAGAAGATGCTGTCGAGTACACCACCAGCTCTGCAATGTGCGCTGTTCTGTGGGGGAAGACGCTGTCAATACGAGAGACCAGAGAACTGCAACTCCGCCATACAAGGCCTGTACTCGGATTGTTGATACAGCTCCGTCATGTGTCCAGGGTCACTGACGACATACTCGCGATGGCTCGGCCCAGCACAGCGAGCATCGCCGCCAGAAATATTATACAGCAGTTCCACAGTTGGGGTATCCGTACGGTGATCAATCTTCAGACAGCCGGTGAACACGCGAGCTGTGGCCCCCCACTCACCACGTCCGGGTTCACTTACGACCCCAATATATTTATGACTAATGATATCTATTACTACAACTTCGCTTGGCCTGACTACGGCGAGGCGTCTCTCAGCAGTCTACTGGACATGGTCAAAGTTCTGTCGTTCGCGTTCCAGGAGGGAAGGGTCGCCATACACTGCCACGCTGGTTTAGGTCGTACAGGTGTTCTGATCGCATGCTACCTCGTCTATGCGCTCCGGATCAAGGCTGACGACGCCATCCGATTGGTCCGTAAGAGAAGGCCCCGCTCCGTCCAGATGTCTTGTCAGATCTTCTGCGTACAGCAGTTCCAGCACTACCTGCTTCCGCAGACTGTTGTCTTCTCGACGGAGCAGTCCCGTCAGACCAAAGAACCCAGAACGTCAGAGTTCACACTGCGACAGTATCTATACAGACAGATGGTGACGCTGCACGGAGCTGATGAGAGAGCTTTCAGAGAAATACCCAAGATAGTATATTGTCTCTGCGAGCGTCTCCTCTATCTCTGCGGATGCAGTCAGAATAGCGGTCTGGACCTCCGAGTGAGAAACAGACCATTCTACAAGAATTTTATAAGCCACAGACTCAAACAGAAAAGGCCACCGGAAGTCAACGAAGACATCCAGACCGGACAGGTGACGTCACTTCCGATGGTTGAATGGCGGGACCCCGTCGAGGAGGAAGTCGAACGGAACTTGGAGTCAGTCAGCCGGCTGGCAGGAACACGGTCTGGAGTCATACCAGCTATACTCGTTTATGAGGCGTTCATGACGGATCATCAGAGTCTTCCAGAAGAGAGACGGAAACACCTGGAGAGAGTGAGAGCTCGCATCAACCAGAGGAGTGACTTTGATAAGAATATTGATGAAGAGGAGGACGTGATTCTCCTGGAGTATCTCCTGCGTGTGGTGCTCCGTCTCCGTCCCCTCACAGCCTCTAGGACATTGGACGTCATCAGGCGTGTGGCGGCGTCCCTCACTCATCAGACCCTCATCATCAACGACACTCGCCTCCCGTCCAAGGAACTTCCAGCTCTAAGAGACGGAACACACCAGCACCTTATAAACTTCATGCTGAAATTCCTTATAGAAATACAGAAGGACATCGCAAGACCCGGAGCTGACGACCTGGAGGTAGTCGCGCCACAGAAGACATACAGAATAAAAAAGTGGAAATGA

Protein sequence:

>DPOGS203540-PA
MSVVEEAQLCASSDLLWEEIKKGGARRGTPGQNEEQGQPSPYASTTHCLIQQVQREDAVEYTTSSAMCAVLWGKTLSIRETRELQLRHTRPVLGLLIQLRHVSRVTDDILAMARPSTASIAARNIIQQFHSWGIRTVINLQTAGEHASCGPPLTTSGFTYDPNIFMTNDIYYYNFAWPDYGEASLSSLLDMVKVLSFAFQEGRVAIHCHAGLGRTGVLIACYLVYALRIKADDAIRLVRKRRPRSVQMSCQIFCVQQFQHYLLPQTVVFSTEQSRQTKEPRTSEFTLRQYLYRQMVTLHGADERAFREIPKIVYCLCERLLYLCGCSQNSGLDLRVRNRPFYKNFISHRLKQKRPPEVNEDIQTGQVTSLPMVEWRDPVEEEVERNLESVSRLAGTRSGVIPAILVYEAFMTDHQSLPEERRKHLERVRARINQRSDFDKNIDEEEDVILLEYLLRVVLRLRPLTASRTLDVIRRVAASLTHQTLIINDTRLPSKELPALRDGTHQHLINFMLKFLIEIQKDIARPGADDLEVVAPQKTYRIKKWK-
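
The transcript and protein lengths in this record are consistent with each other: 1641 bp is 547 codons, that is 546 amino acids plus one stop codon. A minimal sketch of how that check, and a translation of the transcript, could be done is given below. It assumes the standard genetic code and assumes that the trailing "-" in the protein record marks the stop codon; the names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (assumption: the transcript uses the standard nuclear code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` ("-" here, which is
    assumed to match the convention used in the protein record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Length check using the figures stated in the record.
transcript_bp = 1641
protein_aa = 546
assert transcript_bp == (protein_aa + 1) * 3  # 546 residues + 1 stop codon

# First six and last five codons of DPOGS203540-TA, copied from the record.
print(translate("ATGTCCGTTGTGGAAGAG"))  # MSVVEE
print(translate("AAAAAGTGGAAATGA"))     # KKWK-
```

Passing the full DPOGS203540-TA sequence to `translate` and comparing the result with the DPOGS203540-PA record would extend the same check to the whole gene model.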