Monarch geneset OGS2.0

DPOGS214085
Transcript: DPOGS214085-TA, 2340 bp
Protein: DPOGS214085-PA, 779 aa
Genomic position: DPSCF300014 - 2392732-2401261
RNAseq coverage: 116x (Rank: top 58%)
Annotation
Heliconius:       HMEL002725        2e-166  64.53%
Bombyx:           BGIBMGA005255-TA  0.0     74.46%
Drosophila:       Ptpmeg-PK         7e-169  42.94%
EBI UniRef50:     UniRef50_D6WV44   0.0     48.65%  Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WV44_TRICA
NCBI RefSeq:      XP_623968.2       0.0     47.93%  PREDICTED: similar to Tyrosine-protein phosphatase non-receptor type 4 (Protein-tyrosine phosphatase MEG1) (PTPase-MEG1) (MEG) [Apis mellifera]
NCBI nr blastp:   gi|270011334      0.0     48.65%  hypothetical protein TcasGA2_TC005339 [Tribolium castaneum]
NCBI nr blastx:   gi|332025788      0.0     49.51%  Tyrosine-protein phosphatase non-receptor type 4 [Acromyrmex echinatior]
Group
Gene Ontology:
GO:0006470  2.6e-49  protein dephosphorylation
GO:0004725  2.6e-49  protein tyrosine phosphatase activity
GO:0005515  5.4e-32  protein binding
GO:0005488  8.8e-32  binding
KEGG pathway 
InterPro domain:
[29-231]   IPR019749  2.1e-55  Band 4.1 domain
[597-766]  IPR000242  2.6e-49  Protein-tyrosine phosphatase, receptor/non-receptor type
[226-320]  IPR011993  5.4e-32  Pleckstrin homology-type
[118-224]  IPR014352  8.8e-32  FERM/acyl-CoA-binding protein, 3-helical bundle
[121-226]  IPR019748  1.1e-31  FERM central domain
[236-324]  IPR018980  6.4e-23  FERM, C-terminal PH-like domain
[426-531]  IPR001478  5.7e-21  PDZ/DHR/GLGF
[37-120]   IPR018979  1.4e-16  FERM, N-terminal
[66-78]    IPR019750  2.9e-15  Band 4.1 family
Orthology group: MCL10659, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214085-TA
ATGATCGAGAGTGTATCTCGTCGTGCGTTCAGCAGTTCGAGCGGCACCTACAACGTTCGTGCCTCGGAACTGGCCCGCGAACGAAAGCTTAATATCTTCAATGTCACCGTCGTGTTTTTGGACGATACCCAGCAATTATTTCAGATCGAGAAAAAAGCCAAGGGAAGTGTGCTATTAGAACAGGTTTTTCAAAACCTTGAATTGGTTGAAAAGGATTACTTTGGCTTGCAATTCACTGAAAATGGTTTACCTCCTAATGCTACAAATACAGAGCTAACACGTTGGCTTGACAGTGGCAAATCAGTTAAAAAGCAAGTCGGTGTCAATCCGCATTTCTGGCTGGCGGTACGCTTCTATCCTCCTGAGCCGAGTCGTTTAGCCGAAGAGTACACGAGATATCTTCTTTGTCTCCAACTGAGAAAACTGCTTCTGGATGGAAGAATGATAGCTCCGAAGAATACAGCACTTCTACTGGCTTCATTTACAGTACAAGCCGAACTCGGTGATTATAACGCTACCGAGCACCAGAACAACTATCTGTCGGAGCTGTGTCTATTACCGAAACAGAGTCCTGAGGACGAGAGACGTATCAAGGAACTTCACAAGCTTCATAAAGGTCAATCACCCGCTGATGCTGAAGCGAACTTTCTGGAACACGCGAAGCGCTTGGATTGCTACGGCGTTGAATCACATCCGGCTAAGGACTACAACGGGAAGGACATACTGATTGGCGTCACGTCAATCGGCATTGTCGTTTTTCAGAACAACATACGAGTCAACACTTTCTCTTGGAGCAAAATTGTTAAGATATCGTTCAAGAAGAAGCAGTTCTTTATACAGTTGAAGCGAGAAGCCTCTGAATCGTACGACACCGTGCTTGGCTTCAACATGCGCTCGAGTCGTGCGTCGAAGGCCCTCTGGAGGTGTAGTGTTGAGCGTCACGGCTTCTTCCGTCTGCGGGCCCCCCGTCGCAAGGCCTTCCTCGGGGCCTGGGGGGCCCTCGCCGGGGTCACCGCCCCCGCTGTCATAAGGACGGAGACCCAGGCCCTAGAAGACGCTAAACGATCACGGTCGATAAATAGAAGTTTTGTACGTCGCAGTAGTTCCCGTAACCGTGACAAATCATCCGGGGCCTCCCCCGGCGTAGTGAGCAGCGCCCGGGGGTCCCGCGTCACTGGTCACGACGAGCCCCCGCCGCGAGAGGCCTGGGGGGACGAATTGCACAACAACGAAGACGATAGCGACGGCGGCTTCCTGGAGCGCGTGTTCCGCGTGCCGTTGACTTATGTTGACGATTCCGAATCTGAGTCGGTCACGGAGCGAGCGGAAGAGGATCCGGCGGAAGGAGTTGTGTGCGTCAGACTCTACCCCGGGACCGACGGGCGGTACGGGTTCAATGTGAGGGGCGGAGGGGGGGGAGCGGCCGTACTCGTGTCCAGGGTCATGCCCAGGACGAGAGCTCATCTGCAGGAAGGAGACCAGGTCATATCAATAAACGGTACTGACGTCGAAGAAATGACCCACGAGCAAGTTGTTCAGACCATAAGAAATACAAAGGGAGTTCTAACTCTAATGGTCAAACCCAACGCGGTGTACGAGCCGGAGGTGTGTGTGGAGGAACCGGCCGTTTGTTTCGTGCCGCTCGGCGCCGGCACCTTCGAGGGTGATCTGCAGCAGTCGATGTTGCTGCTGGGAGACGGGCTGGCGTCGGGAGCGGCACTGCGGCAGTACGACGCACTGCTGAGGCGAGCGGCGGACCGACCCGCCACCGCCGCCCGACTGCCCGCCAACCTCGCGAGGAACAGATACAGGGATATCGCTCCATATGATTCAAGTAGAGTGATATTAAAGAACGGTCCCAACGGTGATTACATCAACGCTTCCTACATCAACATGGAGATAGCTAACTCTGACTTAGTCCTCACGTACATCGCAACCCAAGGTCCGCTAGCGTCCACGGTTGGTGACTTCTGGCAAATGGTTTGGGAAAGTGAGAGCAG
TCTGGTGGTGATGTTGACGGTGCTGGCCGAGCGAGGGCGGGCCAAGTGTCACCAATATTGGCCCAAAGTCGGGACCGCGCTCAAAGCGACCAATTCATTGACCGTGGTCACAAACAGCGAACAGAATTTAGGACATTACACGCAGAGGGAGATGAGTTTGAAGGATAGCAACGGTGCCAGTCGTGACGTCACTCAGCTGCAGTACACCGCCTGGCCCGACCACGGAGTGCCCGACGACCATCAACAGTTTATCAGCTTCGTGAGACTGTGCTCCCAGCTGAGGAACCATCGAGCCGGTGAGATATACAAACATATATATATATACAATTACACACTCTGA

Protein sequence:

>DPOGS214085-PA
MIESVSRRAFSSSSGTYNVRASELARERKLNIFNVTVVFLDDTQQLFQIEKKAKGSVLLEQVFQNLELVEKDYFGLQFTENGLPPNATNTELTRWLDSGKSVKKQVGVNPHFWLAVRFYPPEPSRLAEEYTRYLLCLQLRKLLLDGRMIAPKNTALLLASFTVQAELGDYNATEHQNNYLSELCLLPKQSPEDERRIKELHKLHKGQSPADAEANFLEHAKRLDCYGVESHPAKDYNGKDILIGVTSIGIVVFQNNIRVNTFSWSKIVKISFKKKQFFIQLKREASESYDTVLGFNMRSSRASKALWRCSVERHGFFRLRAPRRKAFLGAWGALAGVTAPAVIRTETQALEDAKRSRSINRSFVRRSSSRNRDKSSGASPGVVSSARGSRVTGHDEPPPREAWGDELHNNEDDSDGGFLERVFRVPLTYVDDSESESVTERAEEDPAEGVVCVRLYPGTDGRYGFNVRGGGGGAAVLVSRVMPRTRAHLQEGDQVISINGTDVEEMTHEQVVQTIRNTKGVLTLMVKPNAVYEPEVCVEEPAVCFVPLGAGTFEGDLQQSMLLLGDGLASGAALRQYDALLRRAADRPATAARLPANLARNRYRDIAPYDSSRVILKNGPNGDYINASYINMEIANSDLVLTYIATQGPLASTVGDFWQMVWESESSLVVMLTVLAERGRAKCHQYWPKVGTALKATNSLTVVTNSEQNLGHYTQREMSLKDSNGASRDVTQLQYTAWPDHGVPDDHQQFISFVRLCSQLRNHRAGEIYKHIYIYNYTL-
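
The transcript and protein records above can be cross-checked against each other: 2340 bp equals 3 x (779 + 1), which fits 779 codons plus one stop codon. The following is a minimal sketch of that check, assuming the standard nuclear genetic code applies and that the trailing "-" in the protein record marks the stop codon. The helper names `read_fasta` and `translate` are hypothetical and not part of any database tooling.

```python
# Standard genetic code, built from the conventional TCAG ordering.
# Assumption: the transcript is translated with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line
    return records


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol (assumed here to be "-",
    matching the protein record). Unknown codons become "X".
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First six and last four codons of DPOGS214085-TA, copied from the record.
print(translate("ATGATCGAGAGTGTATCT"))
print(translate("TACACACTCTGA"))
```

To check the whole gene, paste both FASTA records into strings, run `read_fasta` on each, and compare `translate(...)` of the DPOGS214085-TA sequence with the DPOGS214085-PA sequence.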