Monarch geneset OGS2.0

DPOGS215410
TranscriptDPOGS215410-TA4356 bp
ProteinDPOGS215410-PA1451 aa
Genomic positionDPSCF300088 + 515204-526685
RNAseq coverage432x (Rank: top 28%)
Annotation
HeliconiusHMEL0174250.049.50% 
BombyxBGIBMGA012369-TA7e-16745.06% 
DrosophilaCG42327-PE4e-4836.73% 
EBI UniRef50UniRef50_D6W9J01e-6342.05%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6W9J0_TRICA
NCBI RefSeqXP_972865.22e-6442.05%PREDICTED: similar to IP14232p [Tribolium castaneum]
NCBI nr blastpgi|1892345334e-6342.05%PREDICTED: similar to IP14232p [Tribolium castaneum]
NCBI nr blastxgi|2700017003e-6126.06%hypothetical protein TcasGA2_TC000572 [Tribolium castaneum]
Group
Gene OntologyGO:00064703.1e-41protein dephosphorylation
GO:00047253.1e-41protein tyrosine phosphatase activity
KEGG pathway 
InterPro domain[1176-1437] IPR0002423.1e-41Protein-tyrosine phosphatase, receptor/non-receptor type
[1268-1438] IPR0035951.7e-16Protein-tyrosine phosphatase, catalytic
Orthology groupMCL25956 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215410-TA
ATGATCATGGAGTCTCCAGTAAGAATATGCATCGGTGCGTTTGTTTTAAATTGCCAAAAAAGTGCAGCATTAGTAGTTCATCTACTACTAATCGTCACCACAACAGTATCTGCTAGCACATCACCGCCTGAAGCGACTTCTACAACTATAAAAAATGACAGGAATCCCAAGTTTGCTAGAGTAAGTGGCACAAACTGGTCAGCTAGAAATAAATCTAAGTTACCAGATTTATCAATAAATGAACTGAACCCAAAAGATTCGACCAGTGAAGAGCATAGCGAGAAAAATGCTAAATATAAAGACAGAGGAAGGGTTAAATTTAACATGCAATCCAAATCCACGACAGAAAGCACGTTGAGAAGAGTCTCACTTTCAACAAAAGCAGCTGGAGTGATAATAGTGACACCAACGCCTGAAGTTAAAAAACCAGCACAAATTATCGAAAGTATGCGTGCTTATAAAAAAACTAAAATACCGTCACTGACAGTTGTCAGTACGACAGAACAAGATGAATCAGATGCAAGCGATGAAGACGTCACAGAGGATGAGGACTACGATGATAAGGAACCGTTAAAACAGTCATCGTTGACAAAAATTAAAGAAAACAGCTTCTTTACTATACCTAGTTTTGACATAGAAGAAGAATTTAAATCAAATAGTAATCGTAATCGCGATAAGAACAATAACCAGGATTTCAGTACTCCGTCATTTGGGGGCTTGTCCAGTTTCTTTCCTAAAAGTGTATACAGTTCAAAGGAGGAGCCTTATAAACCTGACACATATTTCGACTTTGATTTAGATCTGACAACTCCTAAAAATGATTTCTTCGATAAAAAATATCATGAAGTAGCTTCTAGTATTTTAGACAATTTAGACGCGATTAAAACTAAAACATCACCAAATGAAACAAAAACGGAAAAGATTGATAAAAAAGGTTTCGGCTTTGAGAAACCCAGTAACGAGTCCCCTAATAATAAGTCGTCTGTTTATATAAAAAATACCAAAGAAATACGTTATCTAGATAATGACAAAGCTGGTTCCTCCAAAGGATTGCAGGATGTACAGGGTACGAGTATTTATTATGAAATGACCGTTTTATCTACTGAAACTTACAATATAACAGATGATCAAGATGAAGATGAGGATTGTGATAACGATAACCAAGCAGACCCTACTACAAACTCTAACAAAAAACTAGAGACCGTTTCAGTGAGAGTAACTTCCCCTACTGTAGATCATTCAACAAAGTCGAGCTTTTCTGATTCCGAGCCAAGTCCAGTGTCAATGTCCCCCATAATGGTAAACTTTTATCCTAGTAGTACGGCGCGATCATCTATTAATTCTTTTGGACCTATTTCTCTTAGTACTTATAATACTAAATTTGTAACGGAGAGCAGTGGAATGCCTTTCAGTACTCAGGGCAGAAATAGGAACTATTCAAAAAAACTTAATCTTTTCGGTAATAAAGAATCGTCTAATAGTGTATTTCCCACTTTACAATCAGGAAATAATTTCCAGTCTTCGACGAATCAAGGGTACAGACCGCAAACAAGGAGATTTCACTTTACCACTCCAAAAACTAAACCGATATGGATGGCACCGAAAAGAAATACCACTAGAAATTCGATACCCAGAGTTCCAACCACAATATATTACGAGCATTTCGATATCAAAGAAAAAGCTAACTCTCCAAGAGAACCCAACAGGATTTTAACTACTCAACCTTCGGATATAGATCCAGTATTACAAAGCGATGTCAGCGACTCAAAGAAAGTCGTTCATTCTCATGCGATATCCGATAATACAATACCGTCATTATGGAAACGAGGTTCAACAAAGTATTCATCACCAACTGTTGCAACAGCTGAAAATAATACTGGGACGGGTGAGATGGAGATTCCTCCAACGTTAACAGCGTGGGCGCTGGCCAGTATGCGTATTCCTCCGTCGTTGACAAGCCCTGTGGTGAACGTTACAGGATCACCACAAAAAAATATTGACGAAAACGAGCTGCAGAAAGTCGGAGAAGTTACAGACAAGAAAGAATCGTCCACAAGCTCAACAACGACGGCGACAACGATTTATCTTAACAACGACTTTGAAACAAAAAACATTACAGAGATATATCAAAACAAACTTCCTTTGAAGTCGGTTTCTTCAACTATATCGGACACTGACGGAACTGAAACGGAAGCTAAAGAAACTGGTTCTGACTCTAACTCTGAACGATTTCCTGCAGAAAGTGTTATTTCTCTTCAAAGCAGCAAAGAAATTCTCACCACTGAAGAAATGATTGCCGGTACAGATTACACAACTATATCAGAAGAAATAAAGACAAAAAATGGGTTGGAGCCTTTAATAAATGAAAGTGATCAAGTAACACCGAGTGAAACGAAGGAATCTTCTAAAGTAGAAACAATCCCTCCCCCTGATACTACTAGGGCAACTGGTTTCGAATCTATTACAAAACTACCAGCTGGTATAACCACTCCAAAAGATGAGACAATAGCTACTGAGGGAGACGTTGATCAAAATCCTTCTGATAGTGGTGTTGTAGTCACTGACTTTGAAATAACCACAATAAGGTTTTCATACGTACCGACTGAACCTACAACTGAACAATCTACACAAGCGGACATAAGCGATCAACATGTTGTTTTTCCCACACGAACTATGCCGTCGACGGCGAAAGAAACTTCCATTACAACATATAGACCAAAATTCTTTACTACTACAGAAGACGTAGAAACCTCCACAGTTATTATTGAAACCACTCCTCAGCTAGAGGTTTCTTCACAGGTTGTTGAAAATGTTACCAGAAGAGACACAACAACTTCTACTACAACGGAAATAACGGAAGAAAGCGAGACGACGACAAAGAAACCGGGTTTTGTCACAACAGAGATGATATTAGAAACTACTGAGAGAACAACAGAGCCTGAGAAAGAAAATGTTACTGAAATAATTACTGAAACACCTAAGGATGAAATGCCAGTAGAGACTACGACAATTGTGGTTACTGTTGTGACCGAAATCAATTCAGAAAGAACTCCTGTAACTGACGCTAATCCAACGACAGAACTATCGTCTGACTGTGAAGCTAGTCTAACAACAGAACTTTTGTCCGACTCAGAAACGACATCAGAAGAAAATAGCGACAGTAACGAAGTTATTGCCCCTCAACACAAAAAGACATCCATTAAACCAGAAACCACAACGACTTCCACCTCCACGACTACCACTACCACGACTACTACCGCTGCCCCTACTACTAGCACGACAACCAGACTAACACATCAAGAAGAACATATAGTCGAAACGACGACAGAAGATTTGGATTTAATGACGAAATCTGTAAATGATTTGGAGGATTTGACGAGTTATGCTGGCGATGTGACAACAGAGTCTTCTTCGAGGGTGTTGAGTGAGGAGTCAGGATCGGGAGCAGCCATCGCTATCGCTGTCAGTACTATAGGAGTCGTTGCTCTTATACTACTAGTCGGTTTGTTGGGCCCAGGTAACATCCACCGCTACTACATCGCTTGCCAGGCTCCTCTCGCCAACACCGTCGCGGACTTCTGGAGAATGATTTGGGAACAGAACTCACGGCTCATCGTCATGCTCACGGAATATATGGAGAATGGTGTTGAAAAGAGCTATGAATATCTCCCACCATCAGAAATATCGGACAACAAGAGAACCTTCGGCGACTTCCAGATTATATTGAAAAAACGAGAACAACGCGATAAATACGCGATATCTAGTATCCAGCTGATCAACATGGTGACGAGGACCTGGAGGGAGGTCACTCATCTCTGGTATTTTTGGCCGGCGAAGGGTGTCCCAGATGATTACGATTCCGTTATAGACTTCCTCTCAGAAATGAGGAGTTACATGAAGATATCACAAACAGCCAAGGAGTACGACGAAGACGGTGTGGAGGTCATATACGAAGACGAGAGTCGCTCGTCCTATCAGAACTTGTCGAAGCTTCGTGAAGAGAACGGCTCCAGCAACGGAGTGAACGTGTACTCACCAGCTCGAGCCGAACAACTACAGAGGAGACAAACAACCAACGGCACGCTCGGGAGGATGAAGACGGCCTCCGAGGTCGAAGGCGTTCGTCCGTGCGCGTGCGTGTGTGCGTCGGGCGCGGGGCGGTCGTGCGCGCTCATAGCGGCGGAGGTGTGTTCCCGGGCGCTGGCCGGCGGCCGCGTGGACGTGCCTCGGACTGTTCGCGCCCTCAGGCAACAACGGCCTCACGCGCTCGCCAACAGACACCACTACGTGTTCCTCTATAGGCTGCTCAGCGAGTACGGCAACAAGCTGATGGGCGGCGGCGTGGACACCATCTGA

Protein sequence:

>DPOGS215410-PA
MIMESPVRICIGAFVLNCQKSAALVVHLLLIVTTTVSASTSPPEATSTTIKNDRNPKFARVSGTNWSARNKSKLPDLSINELNPKDSTSEEHSEKNAKYKDRGRVKFNMQSKSTTESTLRRVSLSTKAAGVIIVTPTPEVKKPAQIIESMRAYKKTKIPSLTVVSTTEQDESDASDEDVTEDEDYDDKEPLKQSSLTKIKENSFFTIPSFDIEEEFKSNSNRNRDKNNNQDFSTPSFGGLSSFFPKSVYSSKEEPYKPDTYFDFDLDLTTPKNDFFDKKYHEVASSILDNLDAIKTKTSPNETKTEKIDKKGFGFEKPSNESPNNKSSVYIKNTKEIRYLDNDKAGSSKGLQDVQGTSIYYEMTVLSTETYNITDDQDEDEDCDNDNQADPTTNSNKKLETVSVRVTSPTVDHSTKSSFSDSEPSPVSMSPIMVNFYPSSTARSSINSFGPISLSTYNTKFVTESSGMPFSTQGRNRNYSKKLNLFGNKESSNSVFPTLQSGNNFQSSTNQGYRPQTRRFHFTTPKTKPIWMAPKRNTTRNSIPRVPTTIYYEHFDIKEKANSPREPNRILTTQPSDIDPVLQSDVSDSKKVVHSHAISDNTIPSLWKRGSTKYSSPTVATAENNTGTGEMEIPPTLTAWALASMRIPPSLTSPVVNVTGSPQKNIDENELQKVGEVTDKKESSTSSTTTATTIYLNNDFETKNITEIYQNKLPLKSVSSTISDTDGTETEAKETGSDSNSERFPAESVISLQSSKEILTTEEMIAGTDYTTISEEIKTKNGLEPLINESDQVTPSETKESSKVETIPPPDTTRATGFESITKLPAGITTPKDETIATEGDVDQNPSDSGVVVTDFEITTIRFSYVPTEPTTEQSTQADISDQHVVFPTRTMPSTAKETSITTYRPKFFTTTEDVETSTVIIETTPQLEVSSQVVENVTRRDTTTSTTTEITEESETTTKKPGFVTTEMILETTERTTEPEKENVTEIITETPKDEMPVETTTIVVTVVTEINSERTPVTDANPTTELSSDCEASLTTELLSDSETTSEENSDSNEVIAPQHKKTSIKPETTTTSTSTTTTTTTTTAAPTTSTTTRLTHQEEHIVETTTEDLDLMTKSVNDLEDLTSYAGDVTTESSSRVLSEESGSGAAIAIAVSTIGVVALILLVGLLGPGNIHRYYIACQAPLANTVADFWRMIWEQNSRLIVMLTEYMENGVEKSYEYLPPSEISDNKRTFGDFQIILKKREQRDKYAISSIQLINMVTRTWREVTHLWYFWPAKGVPDDYDSVIDFLSEMRSYMKISQTAKEYDEDGVEVIYEDESRSSYQNLSKLREENGSSNGVNVYSPARAEQLQRRQTTNGTLGRMKTASEVEGVRPCACVCASGAGRSCALIAAEVCSRALAGGRVDVPRTVRALRQQRPHALANRHHYVFLYRLLSEYGNKLMGGGVDTI-