Monarch geneset OGS2.0

DPOGS206061
TranscriptDPOGS206061-TA2958 bp
ProteinDPOGS206061-PA985 aa
Genomic positionDPSCF300028 - 582123-596005
RNAseq coverage525x (Rank: top 24%)
Annotation
HeliconiusHMEL0140660.077.10% 
BombyxBGIBMGA006843-TA2e-18058.90% 
Drosophilassh-PB1e-17061.70% 
EBI UniRef50UniRef50_D6WZR90.057.81%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WZR9_TRICA
NCBI RefSeqXP_974392.10.058.23%PREDICTED: similar to slingshot dual specificity phosphatase [Tribolium castaneum]
NCBI nr blastpgi|910909360.058.23%PREDICTED: similar to slingshot dual specificity phosphatase [Tribolium castaneum]
NCBI nr blastxgi|3838541740.058.55%PREDICTED: uncharacterized protein LOC100877919 [Megachile rotundata]
Group
Gene OntologyGO:00081384.1e-45protein tyrosine/serine/threonine phosphatase activity
GO:00064704.1e-45protein dephosphorylation
KEGG pathwaytca:6632420.0 
 K05766 (SSH)maps-> Regulation of actin cytoskeleton
InterPro domain[327-465] IPR0204224.1e-45Dual specificity phosphatase, subgroup, catalytic domain
[335-463] IPR0003407.3e-35Dual specificity phosphatase, catalytic domain
[270-322] IPR0148761.1e-15DEK, C-terminal
Orthology groupMCL16476 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206061-TA
ATGATAAGGTTAATAAATGTAATCCCATATTACTCCAGCTTTCCCAGTGAAATTAAGGTTTACAGTGTCTGTGGCTTCAGTCAAATCATCGAATATGCCCACATTATTTTCAGTTCTCTATATGTCTTGGATGAGGAAGAAAAGTCAGCTGAAGTCACAGAAGAGGATGTCGGGAATCGAACCAGTAAGAGTCTCAATGAGTGTTACTTCGCTAACAAAGGCGCGGCGTTGGTGCTCGGTAGTACGGAGCAGGGTTGCGCAGACCGGCGCGCGTCTCCCGCCCGTGTGCACCCGCAGCCCGACATACATCACCACCTACAGTCCATGTTTTATCTGGTGCGCCCGGAGGAGACGCTCAAAATGGCGGTGAAGCTGGAGAGCGCCCACGCTGGTCGTACTCGTTACCTGGTGGTGGTTTGTCGATGTGATGAGGCAGCGCTGCTTGGGATAGATTGTAACGAGCGCACCACCGTGGGACTGGTGCTGAGGGTACTGGCCGATACCTCCATCAAACTCGATGGTGATGGCGGTTTCAGCGTTTGTGTATGCAATCAACAGCACATATTCAAACCAGTGTCCGTTCAGGCCATGTGGTCAGCGTTACAAACGCTTCACCGTGCGAGCGCCCGGGCCCGGGAGTTGAACCATTTCGCTGGTGGTTCGTCCCACGGCTGGTGTTCCCACTACGAGAGGGCTGTGGACTCCGACCGCTCCTGCCTCAACGAGTGGCACGCCATGGACAGTATCGAGTCGAGGCGACCACCGTCACCTGACTCGCTCAGGCATAGGCCGCGAGAGCGTGATGAAACGGAACGAGTGATCAGATGCACCCTCAAGGAAATTATGATGAGTGTGGATCTTGACGAAGTTACAAGCAAAGCTATCAGAGGACGGCTCGAGGAAGAATTAGACATGGACCTTACGGAGTTCAAGTCGTTCATAGACCAAGAGATGCTTACGATACTAGGACAAATGGACGCGCCGACGGAAATATTCGACCACGTGTACCTGGGTTCCGAGTGGAACGCCAGCAATTTGGAGGAATTGCAAAGAAACGGAGTTCGCCATATACTCAACGTGACAAGAGAAATAGATAATTTTTTTCCGGGTATGTTTGATTACCTAAATATTAGAGTTTATGACGACGAAAAGACTGATCTACTAAAACATTGGGATAACACATTTAAGTACATAAACAAAGCTAGAAATGAAGGTTCTAAGGTTCTCGTGCATTGCAAGATGGGAATAAGTAGGTCGGCTTCAGTCGTTATCGCTTACGCCATGAAGGCTTTCAACTGGAATTTCGATAAGGCTTTGAAGCACGTGAAGACTAAGAGGAGTTGTATCAAACCGAACATAAATTTCCTCAGTCAGCTGGAGACCTACCAGGGCATACTGGACGCCATGAAGAACAAGGAGAAGTTACAGCGTTCTAAATCTGAAACTAACTTAAAAGCTCCGATTTCATCATCAAAGAGTGAAAACAAAAATATGGAGCCGACGCCGTTGGTGCTGGCACTGACGGGGTCGTACTCGGGCCGGCCGCGGTCCTGGTCTCCCGACACTAAGCTGGCTGCCGAGTTACTGCCGCCTACTTCCGTGTCGCTGGAGAATCTCGCCTCCGAGACTAGACACATGCTCATGCCGTGCGCCAGCGGCTCCTACAGCGTCTCGCCAAACCAGATAATACGGCTCAAGGAGGAAGGCGCACCTTCAGTCAAACACATCGTTAACGAAATCGAGAGTGCCGCCTCGAGCGACAGAAAAGATATCCCCAAAAGAAACCACAGGTTGAATTTCGGAAATTCCGGGGACGTGATTTCTGGTCGATCATCAGAGAGCTCCGGTCCTGTGGAATCTAGTGGCAAAAATCAATCATCTCCAATACAGAATACAGTGAACCAGCCCGACCTAGACGTGGAGAAAATTCACACCTGGGATCCGGGGGAGACCGCTTGGTCGCGTTGCGAGGAGGTCCGGACAGTCTCGGACAGTGATTATATAGTTAAGAGTGACAGTGGTATCATAGACAAAATTAAATTGAGTGACATTATATACAATTCGTTAGAACGCAACGTAGAGTTGGAGGAGAGGAGGGGCGGCGAGGAGGACGCGCCGCCACCCAGCAGGCAGAGTTCGTGGAGCTCGTTCGACAGCGCAGTGGTCGCTGACCTGTCTCGACATTCGTCGTGGGGGTCATACGATACACGCGGAGCGAGACCACCGGTGGGCCCGCGAGAGGTCCGAGAGGAGCCTGCGCCTCCCGCGGACCTGGCAGTCATAAGGGAGCATACCGAGCGCACCCGCCCTCTTTCGAACATAGCCGCCAACGAGAGGAAGTTCTACGAGACCTGTGCCATACTGAAGGAGCTGGCGGCCGCGCGCTCTGGCGCCTGCACCTGGGGCGGCCGGCTGTCCGCCTCCGCGCCCGCCGACACGTGGCTGCGCGCCGGTCCGCGCCGCCGTCGCCTAGCTGCGTCTTCGCACGGAGACCTGCCACGAGCCGCGCCCGCCGGTCCGCCTCCGCCGCCAGCGCTGGGCCTCGTCAGCAACCTCAAAAAGGAGTTCGAGGCTCGTTCCGAGTCGGAGGTCCCCCGGCGTTCGGGGTCTCGCACGAGACAACCGCAAATTGAGGATTTGTCGGTGCGCGTGCTCGTGGACCGCTACGACCAACCCGGCCGGACGCGTTCCGAATCCGCGGCGGAACCGATTCGAGTGAAAGCGCCTCAAGAGTCTGTGTCCAAGAAGTGCAAGTTGGCGTCTGAGGTCGACAGCCGGGCTCGTATGCGCAACTCGTACTGCGCGGGTCTGGCGGGCGGCGCCGGGGGCGAGAGGCCGCCCGTGGTGCCGACCGTGGTCGCGCTCGCTCCCCTCGACTACTCTAATGTAGTGGTATCAACTGTGATGTCGAAAGCTCAAAATAAAAACAATTACAGCATGGGAAAACCCATCCGTTGA

Protein sequence:

>DPOGS206061-PA
MIRLINVIPYYSSFPSEIKVYSVCGFSQIIEYAHIIFSSLYVLDEEEKSAEVTEEDVGNRTSKSLNECYFANKGAALVLGSTEQGCADRRASPARVHPQPDIHHHLQSMFYLVRPEETLKMAVKLESAHAGRTRYLVVVCRCDEAALLGIDCNERTTVGLVLRVLADTSIKLDGDGGFSVCVCNQQHIFKPVSVQAMWSALQTLHRASARARELNHFAGGSSHGWCSHYERAVDSDRSCLNEWHAMDSIESRRPPSPDSLRHRPRERDETERVIRCTLKEIMMSVDLDEVTSKAIRGRLEEELDMDLTEFKSFIDQEMLTILGQMDAPTEIFDHVYLGSEWNASNLEELQRNGVRHILNVTREIDNFFPGMFDYLNIRVYDDEKTDLLKHWDNTFKYINKARNEGSKVLVHCKMGISRSASVVIAYAMKAFNWNFDKALKHVKTKRSCIKPNINFLSQLETYQGILDAMKNKEKLQRSKSETNLKAPISSSKSENKNMEPTPLVLALTGSYSGRPRSWSPDTKLAAELLPPTSVSLENLASETRHMLMPCASGSYSVSPNQIIRLKEEGAPSVKHIVNEIESAASSDRKDIPKRNHRLNFGNSGDVISGRSSESSGPVESSGKNQSSPIQNTVNQPDLDVEKIHTWDPGETAWSRCEEVRTVSDSDYIVKSDSGIIDKIKLSDIIYNSLERNVELEERRGGEEDAPPPSRQSSWSSFDSAVVADLSRHSSWGSYDTRGARPPVGPREVREEPAPPADLAVIREHTERTRPLSNIAANERKFYETCAILKELAAARSGACTWGGRLSASAPADTWLRAGPRRRRLAASSHGDLPRAAPAGPPPPPALGLVSNLKKEFEARSESEVPRRSGSRTRQPQIEDLSVRVLVDRYDQPGRTRSESAAEPIRVKAPQESVSKKCKLASEVDSRARMRNSYCAGLAGGAGGERPPVVPTVVALAPLDYSNVVVSTVMSKAQNKNNYSMGKPIR-