Monarch geneset OGS2.0

DPOGS215328
TranscriptDPOGS215328-TA1116 bp
ProteinDPOGS215328-PA371 aa
Genomic positionDPSCF300120 + 262686-266184
RNAseq coverage17x (Rank: top 81%)
Annotation
HeliconiusHMEL0100162e-17076.20% 
BombyxBGIBMGA007967-TA5e-14264.34% 
DrosophilaMKP-4-PB9e-7440.50% 
EBI UniRef50UniRef50_F5HMZ94e-7743.14%AGAP002108-PB n=3 Tax=Anopheles RepID=F5HMZ9_ANOGA
NCBI RefSeqXP_320933.42e-7644.10%AGAP002108-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3479671981e-7643.14%AGAP002108-PB [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1953457697e-7441.21%GM22974 [Drosophila sechellia]
Group
Gene OntologyGO:00081381.2e-52protein tyrosine/serine/threonine phosphatase activity
GO:00064701.2e-52protein dephosphorylation
KEGG pathway 
InterPro domain[1-371] IPR0162781.2e-52Tyrosine protein phosphatase, dual specificity, 12
[26-167] IPR0204225e-31Dual specificity phosphatase, subgroup, catalytic domain
[34-164] IPR0003403.9e-28Dual specificity phosphatase, catalytic domain
Orthology groupMCL12697 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215328-TA
ATGGCGAGACAGATCGCAGATGTTGAGATTCATGCGAGCGACACTCGAGACTTAGAAGAGGATAATCTCGATGTTAGTGTCGACCTCATTGACGATGGTCTGTATTTAGGTAACCTAGCATGTGCAAGAGATCACAAAACTCTGGAACAGCTGGGTGTGACTCACATCCTCACTGTTGACCTGGTGCCATTGCCCAGGTCGATATTGGATCAAACCAGTCTCATCATCAAATATATTAAATTGGCAGATGTGCCAAAAGAAGACCTCATCACTCACCTGCCAGAATGTAATGATTTTATTAAGGATTCCATAGCAAATGGCGGAAAAGTTTTAGTACATTGCTACTTTGGTGTGTCAAGGTCAGCGTCGGTGGTGATTGGATACATCATGGAGAAATATGGACTGTGCTATGAAGATGCCTTTGTACTGGTGAAGTCCAAGAGGAGGTTCATTGGCCCTAACAACGGGTTTGTGGCTCAGCTCAAGCTGTTCGGACATATGGAGTACCGACTCAACAGAGACGACCCCAGATATAAGCAGTTTAGGTTGAAAATGGCTGGGCAGAAATTGAAACAAGTCAAAATTCTGCCGCAGTGCTTCGCAGACTTGATAAAACCAGATCCGGGATTGATAAGGGAGCGGCCGGACCCAATAGTTTACCGCTGCAAGAAGTGTCGCAGAATCGTTGCAAGCCAGAGCAACATAATACCTCACATTCCCAAGCAGGTCAAGGTGGAACTCGCCAAGAAGAACATGAGACCGCCGCCCAGTAAACACACAGGGTTGAACTGTGCAGAAAATGGACAATTGTTGATAGAGAAGTTGAAGAATCTGGCATGCCAGATGATGGAGAGCAAACTAACGGCCGATGATAGTCCAGGGAGGAGCGAGGAGAGTGGACAGGACAGCGACGGGGCAGCTCATACTGGTGACGAATATATGGAGCATAACGTAGATGGCGCCACGTGTCGCCTGGGACCCAACGTCTGTAGACTGATGTGGTTCGTGGAGCCGATGTCCTGGATGAAGGCCACCAGCTCGCCCCAGGGGAGACTCGCCTGTCCACAGTGCGGCGCCAAGATAGGCAGCTACAGCTGGGTCATGGGTGAGAACTAA

Protein sequence:

>DPOGS215328-PA
MARQIADVEIHASDTRDLEEDNLDVSVDLIDDGLYLGNLACARDHKTLEQLGVTHILTVDLVPLPRSILDQTSLIIKYIKLADVPKEDLITHLPECNDFIKDSIANGGKVLVHCYFGVSRSASVVIGYIMEKYGLCYEDAFVLVKSKRRFIGPNNGFVAQLKLFGHMEYRLNRDDPRYKQFRLKMAGQKLKQVKILPQCFADLIKPDPGLIRERPDPIVYRCKKCRRIVASQSNIIPHIPKQVKVELAKKNMRPPPSKHTGLNCAENGQLLIEKLKNLACQMMESKLTADDSPGRSEESGQDSDGAAHTGDEYMEHNVDGATCRLGPNVCRLMWFVEPMSWMKATSSPQGRLACPQCGAKIGSYSWVMGEN-