Monarch geneset OGS2.0

DPOGS210446
TranscriptDPOGS210446-TA1944 bp
ProteinDPOGS210446-PA647 aa
Genomic positionDPSCF300062 - 19244-32425
RNAseq coverage284x (Rank: top 39%)
Annotation
HeliconiusHMEL0063363e-7133.73% 
BombyxBGIBMGA001962-TA0.094.02% 
DrosophilaCG7180-PA0.077.12% 
EBI UniRef50UniRef50_E0VQ180.083.25%Receptor protein tyrosine phosphatase, putative n=16 Tax=Pancrustacea RepID=E0VQ18_PEDHC
NCBI RefSeqXP_001606669.10.081.96%PREDICTED: similar to ENSANGP00000011584 [Nasonia vitripennis]
NCBI nr blastpgi|1187893890.082.50%AGAP008077-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3838586960.084.64%PREDICTED: receptor-type tyrosine-protein phosphatase T-like [Megachile rotundata]
Group
Gene OntologyGO:00064701.6e-88protein dephosphorylation
GO:00047251.6e-88protein tyrosine phosphatase activity
KEGG pathway 
InterPro domain[64-332] IPR0002421.6e-88Protein-tyrosine phosphatase, receptor/non-receptor type
[223-329] IPR0035954.1e-28Protein-tyrosine phosphatase, catalytic
Orthology groupMCL14518 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210446-TA
ATGCCGGTAATTTGTAAGACGCTCGCTATCATTTTCTCATCTACGCCGTATTCCAGAACTCATCAATTCATCGTTAGGTTGACTAATAATGCTGACGAGAACGGATCAATATCGGAGACTATCCCTGACCGGCCTGTGGAGCTGAAAAACTTCCCCAAGCTCTGCGAACAGAGGAGGAAATTCCCTGTGCTATACAAACTTGAGTTTCAGACAGCCATAAAGGTGGAGACGCACGCATGCCGCCACGCCCAAAAAAAAACCAATTCCCACAAAAACCAGAACCAAAAAGTTACTCCCTACGATTACAATAGAGTGGTTCTACAGACAGTCGATAGAGAACCCGATTCGGATTACATAAACGCTTCCTACATAGATAGTATTTTAAAACCTAACGCATACATAGTGACTCAGGGGCCAACAGAGGAGACAGTGGTGTCATTCTGGCGAATGATCTGGCAGGAGAGAGCTGCTGCAATAGTCATGTTAACTAAGACATTCGATTTTATAAAAGTGATGTGCGTACAGTATTGGCCTCCCAGTAAGGACAAGGACGAAACTTACGGCGAGATAAGTGTAGGTATAGTTCAAGAGGAGGAACTAGCGAATTTCCACATACGCACTTTCCGATTGTACAAGATGGAGAAAGATGTAGTGGTAGAAGAAAGGTTCATTCTTCAATTCCATTACACGCAATGGCATTCCCATACATGCCCATTCAGCAATGCCTTGCTGGAGTTCAGACGTCGAGTGCGAGCGGTGGTTGGGAGAAGACTCGCTACTAATAACGTCACAGGACCTATGGTTGTCCACTGCAATGATGGAGGTGGACGATCTGGCGTGTACTTAGCTATTGACGCGAATTTAGAATTAGCTGAAGAAGAAGACTGCTTTGATGTTTTCGGATATTTGAAAAAATTGAGACAATCAAGAAAAGGCCTTATAGAGAATGAAGAGCAATACAAATTCGTGTATGACACGTTAGAGGAGCATGTAGTCTGTGGCGTATCTTGGTTTCCTGTCTCGGAACTGTCACAAAGACTGAAACAGAAGTCCCAAAGGGATCCCGTGACGAAGTTAAATGAATATCAAAAGGAATATCAGCAGATTTGCAAACAAACACCCAGATTTACCATCGGGGACTGTGCGGGGGGACACAGAGGGGATAACAGAGAAAAAAATAGAGATGTCCTCGTTGTGCCGCCGGACAATTTTCGTCCATATCTAACATCCTTTCAAGGGAATAGTTTCACTGATTACATTAACGCTGTATTCGTTGACGGTTACACAAAACCTCGTGAATACATTGTGACAGAGTGGCCCTTAATACGGACGCAAGGAGAATTTTGGTCTCTAGTGTATGATTATGAATGTGCCGCTGTTGTAGTACTTTGTGTTCCACCTAAAAACTCTCAACAATATCCACCATTTTGGCCTGAAGGACGCCACTCTAAGAAATACGGACCTGTCTTTACAATAGATCATGTTTCGCACAACCATTATACCAACATCAAGACGTGGATATTCAGAATTAACAAGAAAATCGTGTCTCTGACGGAATTGATGGCCGGATTGAAAGCTCCTCCGAAAACAGTACAACTGTTCCAATTGACGTGTTGGCCGATGGGCCATAAAGTGCCTTCGTCCACTAACTCACTAGTTGAACTTATGAATATGGTCGAGCGGTGGCGACAACGTACCGATTATGGGCCTGTTTGCGTTGTTTCACCGGATGGTCGTAGTCGTGCTGGGGTTTATTGCGCCGCTAACGCCTGCATAGAACAAGTTATTCAACATGGAGAAGTTGACGTATTTCAGGCTGTGAAAACAGTTCGACGACATCGACCTCAATTGGTAGAAAACATGACTGAATACAAATACTGCTACGACTTAGTTCTTCATTACGTACTACATTATTTAAATAAAGATATGAATGAGAAGAAGTGA

Protein sequence:

>DPOGS210446-PA
MPVICKTLAIIFSSTPYSRTHQFIVRLTNNADENGSISETIPDRPVELKNFPKLCEQRRKFPVLYKLEFQTAIKVETHACRHAQKKTNSHKNQNQKVTPYDYNRVVLQTVDREPDSDYINASYIDSILKPNAYIVTQGPTEETVVSFWRMIWQERAAAIVMLTKTFDFIKVMCVQYWPPSKDKDETYGEISVGIVQEEELANFHIRTFRLYKMEKDVVVEERFILQFHYTQWHSHTCPFSNALLEFRRRVRAVVGRRLATNNVTGPMVVHCNDGGGRSGVYLAIDANLELAEEEDCFDVFGYLKKLRQSRKGLIENEEQYKFVYDTLEEHVVCGVSWFPVSELSQRLKQKSQRDPVTKLNEYQKEYQQICKQTPRFTIGDCAGGHRGDNREKNRDVLVVPPDNFRPYLTSFQGNSFTDYINAVFVDGYTKPREYIVTEWPLIRTQGEFWSLVYDYECAAVVVLCVPPKNSQQYPPFWPEGRHSKKYGPVFTIDHVSHNHYTNIKTWIFRINKKIVSLTELMAGLKAPPKTVQLFQLTCWPMGHKVPSSTNSLVELMNMVERWRQRTDYGPVCVVSPDGRSRAGVYCAANACIEQVIQHGEVDVFQAVKTVRRHRPQLVENMTEYKYCYDLVLHYVLHYLNKDMNEKK-