Monarch geneset OGS2.0

DPOGS214816
Transcript: DPOGS214816-TA, 1602 bp
Protein: DPOGS214816-PA, 533 aa
Genomic position: DPSCF300059 + 674827-678832
RNAseq coverage: 282x (Rank: top 39%)
Annotation
Heliconius: HMEL006336; 0.0; 97.42%
Bombyx: BGIBMGA004514-TA; 2e-88; 34.24%
Drosophila: Lar-PE; 0.0; 81.53%
EBI UniRef50: UniRef50_E2BS16; 0.0; 87.31%; Tyrosine-protein phosphatase Lar n=1 Tax=Harpegnathos saltator RepID=E2BS16_HARSA
NCBI RefSeq: XP_971078.2; 0.0; 87.52%; PREDICTED: similar to receptor tyrosine phosphatase type r2a [Tribolium castaneum]
NCBI nr blastp: gi|270004034; 0.0; 87.52%; hypothetical protein TcasGA2_TC003342 [Tribolium castaneum]
NCBI nr blastx: gi|270004034; 0.0; 87.52%; hypothetical protein TcasGA2_TC003342 [Tribolium castaneum]
Group
Gene Ontology: GO:0006470; 1.1e-130; protein dephosphorylation
               GO:0004725; 1.1e-130; protein tyrosine phosphatase activity
KEGG pathway 
InterPro domain: [267-526] IPR000242; 1.1e-130; Protein-tyrosine phosphatase, receptor/non-receptor type
                 [134-235] IPR003595; 9.5e-45; Protein-tyrosine phosphatase, catalytic
Orthology group: MCL10719; Multiple-copy universal gene

Nucleotide sequence:

>DPOGS214816-TA
ATGGACGTTAACAAACCTAAAAACCGCTACGCCAACGTCATCGCATACGATCATAGCCGAGTTATTCTACAACCTATAGACGGAATACTAGGAAGTGATTATATAAATGCCAACTACTGTGACGGATACCGTAAACATAACGCATACGTCGCTACCCAGGGACCGTTACAAGAAACGTTCACCGACTTCTGGCGAATGTGCTGGGAACTGAGAACGTCTACTATAGTGATGATGACCAAGTTGGAGGAGCGGACGAGAATCAAGTGCGATCAATATTGGCCCAGTCGAGGCAGCGAGTCGTATGGCATGATGACGGTCTCTATAGCAGAGGTTCAGGAACTTGCCACCTACTGTATCCGAACATTCCAAGTGACTAGAAACGGAGGTGGTGAGCGGAGAGAAATCAAACAGTTGCAATTCACCGCTTGGCCGGATCACGGGGTCCCCGATCACCCGGCACCGTTCTTACAGTTCTTGCGTCGTGTTCGTGCTCTCAACCCACCGGATGCCGGGCCTCTAGTAGTTCATTGCTCTGCGGGCGTCGGCCGGACGGGCTGCTTCATAGTCATCGATTCGATGCTCGAAAGGGCGAGGCATGAACGCACCGTAGACATTTACGGACACGTGACCTGTTTACGGGCACAACGAAATTATATGGTGCAAACCGAAGATCAATACATCTTCATTCACGACGCTTTACTCGAAGCTGTGATCTGCGGCGACACGGAAGTGCCGGCTCGTAACTTGCACTCTCACATCCAGAAGCTCATGCGTATCGACACCATAGAGAACATAATCGGCATGGAATTCGAATTTAAGAAACTTGCCAACATGAAAGCCGATTCCAGCCGTTTCGTGTCAGCCAGCCTACCCTGCAATAAACATAAGAACAGGCTGGTGCACATCCTGCCTTACGAGTCTACTCGCGTCTGTCTGACTCCCAGAGACGGCTCGGATTACATCAACGCTTCGTTTGTGGATGGATACAAATACCGAGCGGCTTACATCGCAACCCAGGGTCCCTTGCCGGATACCACTGACGACTTCTGGCGAATGTTGTGGGAACATAATTCCACAATTATAGTCATGTTAACCAAATTGAAGGAGATGGGCAGAGAGAAATGTCATCAATATTGGCCATCTGACCGCTCGGTGCGCTATCAGTACTTCGTGGTCGATCCTATCGCCGAGTACAATATGCCTCAGTACATCTTGAGGGAATTCAAGTTCACAGATGCTCGCGATGGTTCATCTCGCACCGTCCGCCAGTTCCAGTTCACCGATTGGCCGGAACAGGGCGTGCCGAAGAGCGGGGAAGGCTTCATAGACTTCCTCGGACAGGTTCACAAGACCAAGGAACAATTTGGACAAGACGGACCCATCACTGTCCATTGCAGTGCGGGCGTTGGTCGTACCGGCGTGTTCATAACGCTGTCGACCGTGTTGGAGCGGATGCAGTACGAGGGTGTGGTCGACGTGTTCCAGACGGTCCGCACGCTCAGGACACAGAGACCGGCCATGGTCCAGACTGCGGATCAATACGACTTCTGTTATCGCGCCGCCCTGGAGTACCTGGGCTCCTTCGATCACTACGCCAACTGA

Protein sequence:

>DPOGS214816-PA
MDVNKPKNRYANVIAYDHSRVILQPIDGILGSDYINANYCDGYRKHNAYVATQGPLQETFTDFWRMCWELRTSTIVMMTKLEERTRIKCDQYWPSRGSESYGMMTVSIAEVQELATYCIRTFQVTRNGGGERREIKQLQFTAWPDHGVPDHPAPFLQFLRRVRALNPPDAGPLVVHCSAGVGRTGCFIVIDSMLERARHERTVDIYGHVTCLRAQRNYMVQTEDQYIFIHDALLEAVICGDTEVPARNLHSHIQKLMRIDTIENIIGMEFEFKKLANMKADSSRFVSASLPCNKHKNRLVHILPYESTRVCLTPRDGSDYINASFVDGYKYRAAYIATQGPLPDTTDDFWRMLWEHNSTIIVMLTKLKEMGREKCHQYWPSDRSVRYQYFVVDPIAEYNMPQYILREFKFTDARDGSSRTVRQFQFTDWPEQGVPKSGEGFIDFLGQVHKTKEQFGQDGPITVHCSAGVGRTGVFITLSTVLERMQYEGVVDVFQTVRTLRTQRPAMVQTADQYDFCYRAALEYLGSFDHYAN-
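
Consistency check (illustrative):

The transcript and protein lengths in this record agree with each other: 1602 bp is 534 codons, which gives 533 amino acids plus one stop codon. The Python sketch below checks that arithmetic and translates the first and last codons of DPOGS214816-TA. It assumes the standard genetic code and follows the record in writing the stop codon as "-". The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example and are not part of any database tooling.

```python
from itertools import product

# Standard genetic code (an assumption for this nuclear gene), listed in
# TCAG order with the third codon position varying fastest; "*" marks a stop.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Lengths as stated in the record.
transcript_bp = 1602
protein_aa = 533
# 533 residues plus one stop codon account for all 534 codons.
lengths_agree = transcript_bp == 3 * (protein_aa + 1)

# First 8 codons and last 5 codons, copied from the nucleotide sequence above.
start_peptide = translate("ATGGACGTTAACAAACCTAAAAAC")
end_peptide = translate("CACTACGCCAACTGA")

print(lengths_agree, start_peptide, end_peptide)
```

The two translated fragments can be compared against the start (MDVNKPKN) and end (HYAN-) of DPOGS214816-PA. Passing the full transcript to `translate` extends the same check to the whole protein.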