Monarch geneset OGS2.0

DPOGS215123
TranscriptDPOGS215123-TA1350 bp
ProteinDPOGS215123-PA449 aa
Genomic positionDPSCF300530 + 49-3603
RNAseq coverage62x (Rank: top 68%)
Annotation
HeliconiusHMEL0027254e-10550.22% 
BombyxBGIBMGA005255-TA4e-14564.96% 
DrosophilaPtpmeg-PK3e-7046.39% 
EBI UniRef50UniRef50_E0V9765e-8444.21%Tyrosine-protein phosphatase non-receptor type, putative n=1 Tax=Pediculus humanus corporis RepID=E0V976_PEDHC
NCBI RefSeqXP_002422670.19e-8544.21%tyrosine-protein phosphatase non-receptor type, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420032592e-8344.21%tyrosine-protein phosphatase non-receptor type, putative [Pediculus humanus corporis]
NCBI nr blastxgi|3454940782e-8144.62%PREDICTED: LOW QUALITY PROTEIN: tyrosine-protein phosphatase non-receptor type 4-like [Nasonia vitripennis]
Group
Gene OntologyGO:00064701.3e-50protein dephosphorylation
GO:00047251.3e-50protein tyrosine phosphatase activity
GO:00055155.7e-21protein binding
KEGG pathway 
InterPro domain[166-439] IPR0002421.3e-50Protein-tyrosine phosphatase, receptor/non-receptor type
[20-125] IPR0014785.7e-21PDZ/DHR/GLGF
Orthology groupMCL10659 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215123-TA
ATGTCAAATAATGTGTGCAGTAGCGACGGCGGCTTCCTGGAGCGCGTGTTCCGCGTGCCGTTGACTTATGTTGACGATTCAGAATCTGAGTCGGTCACGGAGCGAGCGGAAGAGGATCCGGCGGAAGGAGTCGTGTGCGTCAGACTCTACCCCGGGACCGACGGGCGGTACGGGTTCAATGTGAGGGGTGGAGGGGGGGGAGCGGCCGTACTCGTGTCCAGGGTCATGCCCAGGACGAGAGCTCATCTGCAGGAAGGAGACCAGGTCATATCAATAAACGGTACTGACGTCGAAGAAATGACCCACGAGCAAGTTGTTCAGACCATAAGAAATACAAAGGGAGTTCTAACTCTAATGGTCAAACCCAACGCGGTGTACGAGCCGGAGGTGTGTGTGGAGGAACCGGCCGTTTGTTTCGTGCCGCTCGGCGCCGGCACCTTCGAGGGTGATCTGCAGCAGTCGATGTTGCTGCTGGGAGACGGGCTGGCGTCGGGGGCGGCGCTGCGGCAGTACGACGCACTGCTGAGGCGAGCGGCGGACCGACCCGCCACCGCCGCCCGACTGCCCGCCAACCTCGCGAGGAACAGATACAGGGATATCGCTCCATATGATTCAAGTAGGGTGATATTAAAGAACGGTCCCAACGGTGATTACATCAACGCTTCCTACATCAACATGGAGATAGCTAACTCTGACTTAGTCCTCACGTACATCGCAACTCAAGGCCCGCTAGCGTCCACGGTTGGTGACTTCTGGCAAATGGTTTGGGAAAGTGAGAGCAGTTTGGTGGTGATGTTGACGGTGCTGGCCGAGCGAGGGCGGGCCAAGTGTCACCAATATTGGCCCAAAGTCGGGACCGCGCTCAAAGCGACCAATTCATTGACCGTGGTCACAAACAGCGAACAGAATTTAGGACATTACACGCAGAGGGAGATGAGTTTGAAGGATAGCAACGGTGCCAGTCGTGACGTCACTCAGCTGCAGTACACCGCCTGGCCCGACCACGGAGTGCCTGACGACCATCAACAGTTTATCAGCTTCATTAACTTTCGGAACGTGGAGAAGTCGGTCGTACGCCGGCGCGTCCGCACAGTTCGGTCCCTCGTTCGCAAGTGCCACACGACGACGACAGTTGACCGCTCGTCAATCTTCGTGTCGCCGGCGCATATTTCCGAATACCCCCAAGTCAATGTTGACTTGACGTTCCGATTTATAATCTCCGACACGGCCATCTTGACCAGTCACACGTCCCTGTGTGATGATCCTGAAAGAAGTCAATATAAGTTCGTATGTGAAAGCATTCAATCTGCGTACGCGCAGGGTCTGACGGACGAGGGCGGGGCCAATTGA

Protein sequence:

>DPOGS215123-PA
MSNNVCSSDGGFLERVFRVPLTYVDDSESESVTERAEEDPAEGVVCVRLYPGTDGRYGFNVRGGGGGAAVLVSRVMPRTRAHLQEGDQVISINGTDVEEMTHEQVVQTIRNTKGVLTLMVKPNAVYEPEVCVEEPAVCFVPLGAGTFEGDLQQSMLLLGDGLASGAALRQYDALLRRAADRPATAARLPANLARNRYRDIAPYDSSRVILKNGPNGDYINASYINMEIANSDLVLTYIATQGPLASTVGDFWQMVWESESSLVVMLTVLAERGRAKCHQYWPKVGTALKATNSLTVVTNSEQNLGHYTQREMSLKDSNGASRDVTQLQYTAWPDHGVPDDHQQFISFINFRNVEKSVVRRRVRTVRSLVRKCHTTTTVDRSSIFVSPAHISEYPQVNVDLTFRFIISDTAILTSHTSLCDDPERSQYKFVCESIQSAYAQGLTDEGGAN-