Monarch geneset OGS2.0

DPOGS203407
TranscriptDPOGS203407-TA1599 bp
ProteinDPOGS203407-PA532 aa
Genomic positionDPSCF300003 + 1269897-1285658
RNAseq coverage285x (Rank: top 38%)
Annotation
HeliconiusHMEL0063761e-13952.78% 
BombyxBGIBMGA012343-TA3e-14464.03% 
Drosophilacsw-PB2e-6350.97% 
EBI UniRef50UniRef50_P293503e-9038.70%Tyrosine-protein phosphatase non-receptor type 6 n=47 Tax=Eutheria RepID=PTN6_HUMAN
NCBI RefSeqXP_971440.25e-11445.11%PREDICTED: similar to protein tyrosine phosphatase, non-receptor type 11 [Tribolium castaneum]
NCBI nr blastpgi|2700156169e-11345.11%hypothetical protein TcasGA2_TC012910 [Tribolium castaneum]
NCBI nr blastxgi|3479678676e-10843.74%AGAP002438-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00064706.3e-100protein dephosphorylation
GO:00047256.3e-100protein tyrosine phosphatase activity
GO:00055152.2e-27protein binding
KEGG pathwaytca:6600851e-113 
 K07293 (PTPN11)maps-> MAPK signaling pathway - fly
    Renal cell carcinoma
    Adipocytokine signaling pathway
    Natural killer cell mediated cytotoxicity
    Leukocyte transendothelial migration
    Neurotrophin signaling pathway
    Jak-STAT signaling pathway
    Chronic myeloid leukemia
    Epithelial cell signaling in Helicobacter pylori infection
InterPro domain[175-510] IPR0002426.3e-100Protein-tyrosine phosphatase, receptor/non-receptor type
[395-507] IPR0035951.5e-36Protein-tyrosine phosphatase, catalytic
[51-147] IPR0009802.2e-27SH2 motif
Orthology groupMCL25950 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203407-TA
ATGTCCAAGGATAGCCCCAGGCTGGGCAGGCTGGGTGTGTTAGCGCCCGCCGCGCTATCAGCTTGCATGGATGCTTCTGTGTTGGCTGTGCCTGCAGCTGGTGCTATAGTATCGACTGTCCTCAAGGAACTCAGCAAGGCTTTCATGTTAAAAAAATGGTTCCACGGCGTGATGTCAGCTAAGGAAGCCGAGCATCTGATGATGGAGAAGGGTCGGAACGGGTCGTTTCTGGTGCGAGAGTCCCAGGCTCACCCTGGCGAGTACGTGCTGTCTGTTCGCGTGCGAGGTCGGGTCAGTCATGTTATGATCAGGAAACAGCAAAACAAATACGACGTAGGCAGCGGTGAGCAGTTCGATGATCTGGTGGGTTTGATAGAACATTTCCGATCCTATCCCATGATAGAGACCTCTGGTGACGTTCTACGTCTTCTGCAGCCTGTCAGCGGAACCTGTCTCCGCGTGCATGATATCGATCAAAAAGTACAGCAAATGGATGATTTCCAAAAACCCGATCAGAGAAACGGTTTCGACGGTGAATTTCAATCGTTGAAGATGGTTGAGGACATGCACGTTTTTACAACGACCGAAGGCATGAAGGCGGAAAATTTTAATAAAAACAGATATCGAAACATTTTACCTTATGACCAGACGCGTGTGTTACTGCGTGGACGCGACGGTCGCACAGAGTCAGATTACATTAATGCTAACTTCATTCGCTCGTCCAGGCTGAGCGATTCCTCCAGCTCAGTACAATCCTCCAACGAGAGTCTGAACAGTGTCAATTCTTTAATCCTTGGTATTGACCCAAAGAGAACAGTTCCGCTAGTCACCAAATCCCTATCGGAAGAGGCGCTGAGGGATGTTAAAAAGAGTATAAAATTGGACAGAATTAACAGAAATATTTACAGAAATATAGTGAAAGAGAAAATATACATCGCAAGTCAGGGTTGTCTCTCAAATACTGTAGACGATTTCTGGAGAATGCTATGGCAAGAGGACGTCAGGGTTATAGCAATGATCACAAACGAATTTGAAAAGGGAAAGAAAAAATGCGAGCGTTACTGGCCAGCATCAGGCCAAGAGGAGCGTTACGATGAGCTGATTGTAAAATCAATTTCTGAGACCTGCTACGAAGACTATACTTTGAGAGAATTTGATGTAAGCGATAAAAACATCTGCAGGACCATCTATCAGTACCAATATACGTGTTGGCCTGATCACGGCACACCAGCTGAACCTGAGGGGGTGCTTTCGTTCATGGAAGATATTAATAGGAAGATGTATCAAATATCCCAACAAAAGGATGCGCCGGAACAGAATGTGTTGTGTGTGCACTGCTCAGCTGGAGTTGGAAGAACTGGAACGTTCATAGTGTTAGATATGCTTATTGATAAAATAAAGCTAACTGGTTTTAACTGCGAAATAGACGTCCATAGCACGGTGAAGTTGGTGCGAGCTCAACGCAGTGGCATGGTTCAGAATAAGGCGCAGTACAGATTCATTTATCTCGCATTGCAAAGTTACATAGATAATAATAATAAAGTTAAATTAAAAAGGAAAGTAATTCTGTTACCATTCTTACGAAACATGGTTTTTTAA

Protein sequence:

>DPOGS203407-PA
MSKDSPRLGRLGVLAPAALSACMDASVLAVPAAGAIVSTVLKELSKAFMLKKWFHGVMSAKEAEHLMMEKGRNGSFLVRESQAHPGEYVLSVRVRGRVSHVMIRKQQNKYDVGSGEQFDDLVGLIEHFRSYPMIETSGDVLRLLQPVSGTCLRVHDIDQKVQQMDDFQKPDQRNGFDGEFQSLKMVEDMHVFTTTEGMKAENFNKNRYRNILPYDQTRVLLRGRDGRTESDYINANFIRSSRLSDSSSSVQSSNESLNSVNSLILGIDPKRTVPLVTKSLSEEALRDVKKSIKLDRINRNIYRNIVKEKIYIASQGCLSNTVDDFWRMLWQEDVRVIAMITNEFEKGKKKCERYWPASGQEERYDELIVKSISETCYEDYTLREFDVSDKNICRTIYQYQYTCWPDHGTPAEPEGVLSFMEDINRKMYQISQQKDAPEQNVLCVHCSAGVGRTGTFIVLDMLIDKIKLTGFNCEIDVHSTVKLVRAQRSGMVQNKAQYRFIYLALQSYIDNNNKVKLKRKVILLPFLRNMVF-