Monarch geneset OGS2.0

DPOGS208025
TranscriptDPOGS208025-TA2106 bp
ProteinDPOGS208025-PA701 aa
Genomic positionDPSCF300203 - 210012-256406
RNAseq coverage1433x (Rank: top 9%)
Annotation
HeliconiusHMEL0108954e-14179.31% 
BombyxBGIBMGA006582-TA5e-4439.15% 
Drosophilal(1)G0232-PA3e-11258.88% 
EBI UniRef50UniRef50_Q179W10.053.13%Protein-tyrosine phosphatase n9 n=1 Tax=Aedes aegypti RepID=Q179W1_AEDAE
NCBI RefSeqXP_001650958.10.053.13%protein-tyrosine phosphatase n9 [Aedes aegypti]
NCBI nr blastpgi|1571101120.053.13%protein-tyrosine phosphatase n9 [Aedes aegypti]
NCBI nr blastxgi|3800230570.053.60%PREDICTED: tyrosine-protein phosphatase non-receptor type 9-like [Apis florea]
Group
Gene OntologyGO:00064702.5e-119protein dephosphorylation
GO:00047252.5e-119protein tyrosine phosphatase activity
KEGG pathway 
InterPro domain[403-679] IPR0002422.5e-119Protein-tyrosine phosphatase, receptor/non-receptor type
[560-676] IPR0035952.7e-33Protein-tyrosine phosphatase, catalytic
[70-288] IPR0012512.2e-27Cellular retinaldehyde-binding/triple function, C-terminal
[5-62] IPR0110746.9e-06Phosphatidylinositol transfer protein-like, N-terminal
Orthology groupMCL14473 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208025-TA
ATGGCGGTGGCCTTGTCACCAGATCAAGAGTTGGCCACTAAGGCGTTTTTGGACGCATTGAGCGAGTCGGGTGCTAGTTGCGTGTCTCGGGCTACAGCCATCAAATTTCTTCTGGCGAGGAAGTTTGATGTATCCCGAGCGCACACACTGTGGAGGCAGCATGAGGCGACCAGGCGGAGAGAGGGACTCATCAGGTTCGAACCTTTTGAGGACCCGCTGAAGACTGAACTGGAGACTGGGAAGTTTACTATACTGCCGACGAGGGACGCGACCGGTGCAGCTATAGCAGTGTTCACAGCCAACAAACACTTCCCCAGTCTGACTTCACACCAGACCACCTTGCAGGGTGTAGTGTACCAACTGGACTGCGCCCTCCAGTCGCTGGAGACCCAGCGGGCCGGGCTGGTCTTCGTCTATGATATGACTGACTCCAAATACACCAACTTTGATTACGAACTATCGCAGAAAATACTCACCATGTTGAAGCTGCAGTCATCTCGTTGTAAGCAATGCATAGACGTGTTGCGACGGATGTACCGAGTGTTGAAGAGCCCTCGTTTGAAGCAGCTTGAGATCAGCTCACAGTTCACTGGCGGTTATCCTGCTAAGTTGAAAAAGGTGTTAATAGTGACGGCACCGTTGTGGTTCAAGGCTCCGTTCCGTATCCTCCGACTGTTCGTGCGCGAGAAGCTCCGTGAGCGCGTGCACACGGTGAGCGCCCCGCAACTGGGCGCCCACGTGCCGCGGGCCAGTCTGCCCAAACAACTGGGCGGACAGTTAGAACCAGACCACGCAGCCTGGCTGGAACACTGCAGGAAATGTTACACGAACAACGTGAACACAAAACTGGACGGTGTCATTGATGACTATGTGATTACTAATCATATACCACCCATCAAGACACAGAATGGTATAATAGATGCGGATGTGTCGTGTGTCATGAACGATAGAGAAGACATCATGCACATAAGCGACGCGCCGTACACTTGCGACCCGAGCCCCTTGAGACACACACACGGCTACAATGGCGACATGAAGACAGGACACATAAATTCCGGCGATGAAGACGACTTGGATTTAATAACTCGCGGCACTTGGTCGTCGGGCGAGGCGTCTCCGGGGGCTTCGGACGAGGAGTGCGGTGGGGGCGGCACACCCGAGACCCCGGCCGCCATACTGGCGAGGGTCAGCTCGCTCGGCCGGAGGGGGCTTTGCGCCGAGTACGACGAGATACGGGCCAGACCTCTAACCGGGACCTTCCATCACGCCAAACTGCCGGCGAACCTGTCCAAGAACCGCTACACGGACGTGCTGTGCTACGACCACTCGCGCGTGGTGCTGTCCACGGTGGACGAGGACCCTCACTCCGACTACATCAACGCTAACTACGTGGACGGCTACAAGCAGAGGAACGCGTTCATATCCACACAAGGTCCGTTACCAAAAACATTCGGCGACTTCTGGCGCATGGTGTGGGAGCAGGGCTGTCTGGTGATAGTCATGACCACCAGGACGGTGGAGCGCGGCCGCGTCAAGTGCGGCCAATACTGGCCGGGGGTCGCCGGACAGAGCTCCGTGTACGGCGGACTGTCCGTGCACACGGAGGCCGTGGACGAGGGTGACCACTATACGGTGACCCACCTCGTGCTCACCGACACCAGGACCGACCAGCGGAGGAGGATCTGGCACGGGCAGTACACGCGCTGGCCGGACTACGGCGTGCCGGGCGGGGGGCGAGCCGCGCCCGTACTGGCCTTCCTCGAGGATGTACGGAGGGCGCAACAGCGGGCCAGGAACGAGCTGGGCGACGCGTGGGCGGGTCACAGGCGCGGGCCGCCCATAGTGGTGCACTGCTCGGCGGGCATCGGCCGCACCGGCACCTTCATCACCCTGGACGTGTGCAGCTCGCGGCTCCGAGCCGAGGGCGGGGCCGACGTCCGCGCCGCGGTCGAGGCGGTGCGGGCCCAGCGAGCGCACTCCATACAGATGCCCGACCAATACGTGTTTTGTCACCTGGCGCTGCTCGAGTACGCGGTGATGCACGGCTACCTGGAGTCGGCCGAGCTGACGGGCTTCGACGACGAGAACGACGACGAGTCCGAATGA

Protein sequence:

>DPOGS208025-PA
MAVALSPDQELATKAFLDALSESGASCVSRATAIKFLLARKFDVSRAHTLWRQHEATRRREGLIRFEPFEDPLKTELETGKFTILPTRDATGAAIAVFTANKHFPSLTSHQTTLQGVVYQLDCALQSLETQRAGLVFVYDMTDSKYTNFDYELSQKILTMLKLQSSRCKQCIDVLRRMYRVLKSPRLKQLEISSQFTGGYPAKLKKVLIVTAPLWFKAPFRILRLFVREKLRERVHTVSAPQLGAHVPRASLPKQLGGQLEPDHAAWLEHCRKCYTNNVNTKLDGVIDDYVITNHIPPIKTQNGIIDADVSCVMNDREDIMHISDAPYTCDPSPLRHTHGYNGDMKTGHINSGDEDDLDLITRGTWSSGEASPGASDEECGGGGTPETPAAILARVSSLGRRGLCAEYDEIRARPLTGTFHHAKLPANLSKNRYTDVLCYDHSRVVLSTVDEDPHSDYINANYVDGYKQRNAFISTQGPLPKTFGDFWRMVWEQGCLVIVMTTRTVERGRVKCGQYWPGVAGQSSVYGGLSVHTEAVDEGDHYTVTHLVLTDTRTDQRRRIWHGQYTRWPDYGVPGGGRAAPVLAFLEDVRRAQQRARNELGDAWAGHRRGPPIVVHCSAGIGRTGTFITLDVCSSRLRAEGGADVRAAVEAVRAQRAHSIQMPDQYVFCHLALLEYAVMHGYLESAELTGFDDENDDESE-