Monarch geneset OGS2.0

DPOGS203439
TranscriptDPOGS203439-TA5055 bp
ProteinDPOGS203439-PA1684 aa
Genomic positionDPSCF300242 - 156990-164255
RNAseq coverage492x (Rank: top 25%)
Annotation
HeliconiusHMEL0150300.072.92% 
BombyxBGIBMGA011162-TA0.070.88% 
DrosophilaPTP-ER-PA3e-8332.37% 
EBI UniRef50UniRef50_Q16Q352e-10039.85%Putative uncharacterized protein n=1 Tax=Aedes aegypti RepID=Q16Q35_AEDAE
NCBI RefSeqXP_001661653.14e-10139.85%hypothetical protein AaeL_AAEL011434 [Aedes aegypti]
NCBI nr blastpgi|1571293598e-10039.85%hypothetical protein AaeL_AAEL011434 [Aedes aegypti]
NCBI nr blastxgi|1571293591e-12631.66%hypothetical protein AaeL_AAEL011434 [Aedes aegypti]
Group
Gene OntologyGO:00064702.6e-79protein dephosphorylation
GO:00047252.6e-79protein tyrosine phosphatase activity
KEGG pathway 
InterPro domain[1334-1678] IPR0002422.6e-79Protein-tyrosine phosphatase, receptor/non-receptor type
[1522-1675] IPR0035957.2e-28Protein-tyrosine phosphatase, catalytic
Orthology groupMCL18369 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203439-TA
ATGTATGTCTCTATAGCCAAGAATCTCTGTTACTTTCTGTTCACTACCGTCAATTCACATGAAGTCATTCTGATCCTCATCACAGCTGAACAGAGGACTTCACCGACAACCTGTCGCTGCAAACCAAAAATTAAGTATCACTCCAATTACTGTTATTCAGTGACTCTATACAGTTTCAATGCTTGGCAGTTCTCCAGATGTGAAGGTGTCGGTATGTGGGGCGCGTGGGGCGCGCAGGCGGGCTGGTGGTGCGGCGCGCTGGCGGCCGCGCTGCTGGCGCTGCTCGCGTTGTGTCGCCGCGCCCAGAAGAGGACACCGCCGCGACCGCCTGGCACCGTCACACAGGTGCAGATCGTGTATGGCGGTCCGGCGCGTGTAGTCGTGGCTGGTCCGCCGCCGGGCGCGTCTTGTGCTACAGCGTCCCTGTCGGCTGCGCCGCCCCCCGACCCTCCGACCCCCGCCGACCCCCCGCAACCCGCTCTCCCACAAATCCTCGTCGTCCAGGACAGCACTCAAAATACTGTCCCCCAACAGAGTGATCCTCAACCCTCGAGCTCCCAAGGCCCCAGACCTCTGGAGTACCACGGTATCCCCAAATTGCCACTTCCTGAGCAGTGGCTCCATAAACGACCTTTCGATTACAAGTACTCTGCTATACCCCGACCTCTAGCTCAAATCGTCAGCTCGGATCTATCAGTAGATCTGCAGAGAGAGTCGCGGGCTAAGAAATTTAGTGCTACGAGGTCGCCATCGTTGGAAAAAGACGAGAAGTCCCCCTTCGTTAGCAGTGCTAAGTCTGAAGATATGGAAGTGTTCGGTTTCGATGCTGAATCCGTGAGAAAATTAAGTGAAGATAGAGAAAAACCCGTGGAAATGGCGGAACCTCCGCCACAATCTCCACAAGAGAGCGATAAAGAAATGTCGCCCATTAACCTCAGGAGGTTCCGGTCAGTGAGCACTCGCCTTAATCTGTGCAGTGTCAGCGAAAGCCCCCATCCCGAAGGTCATCAAGAAAGGTTGGAAATGCAGAACTCCCCTCGCGTCATTGAATGCAGTTCCGCCAAAGAGAAGTCATCGGGTACTAAATTCGACTTCTCTCCAAAAATCCTAGCTGATAGCATGTGCTCCCCGCGTTTCTTCACTCCTCCCGAGATGGTTTCACCGATGTTCTTCGCGGAACCTTCACCAAAATCGGTTTATTACGATAGCGTGCGATCTCCGAAGTTTTTTCCTGAAACTCCACGAGACGTTGAGGTGCCGCAATCGCCGCGTCTTTTGGGAGACCATCGAAACAAAACGAACGGCTTAACCGAGAAACCGACCTCTCCGAAGGTTTTAAGTGCTCGACCCAGTCCTAAGGTTCATAGTCACAATAAAGAGAATTACTTCACCTTCGAACAAGCGAGTGCCACAGAAGAGAGTGCAGCGAGGATGTCGCCCAGACCGAGGAGGTACGGAGGAAGAAATTCCATAGAAAGAGAAAGGTTCACGCCCCCCGCTGATAAAGAAGTTAAGAAATACAGGCGACACAACAGTTGCGACACGAGTTACAAAAGTGTAGAGTTAAATATATCCAAGTGTGATAATATCTTAGAGACTGATGCCACCACCGTGGATGTGATAGAGAAAGACGTAGTGGACTGCTGTGCCAAAGTAGATTCCTTGCATGTCGATGCGAAACTTGAAAGTGATAAACACGAATTGAAGCCAATATCGAATACGGCGCAGGCGAGACGACAACGATTAAAATCGATATCACTCGACTCCGATAATGCCAAAATAATCGAACAGAATTTAGGTTTACCGATAGCGAAACAAATGAAAGACCAAATGAATGAGGCGTATAAGAATCAAGAAGTGAGTACATCGTGCGAGAGTATGGAGAGATACCCAAAAACACCGACAGCTGAACGACATATATTCAAATTTGATGCAGAGCATAACGAAAAACCTGAGCCAGAGGCTAAACCGGAAGTAAAACAGAAAAAGTGCCTTCGTCAAAGTTCTGACACGCAGTCTTTTCTCGATATGCCCAGGTTTTCTCCTAAAGAATTTGAAATAACGGTTACATCAGAGGAAGGCGACACTACAGCCGCTGAAACACAGACCAAAAGGAAAGGCAAGAATTTGAGAAACTTAACAATAGATTTGTCTAAACGAGACAGTGATTTAGAAAAAGAATTGCTCGAATTCGAAAAATTGTATACGGACGCCGAAGAAAAGAAAGTTAAAACGCCAACACTGAAAGTTAAGGCTACTTCATTAGACTCATCAGAGAGTGTAAACCTCTCATTGCCGCCCAAGAAGTCTCTGGAAGTTCCGCAGAACTCCATATCAGTTCCAAACACGCCGAAGCGTCAGTTGAAGAGAATTTTGGCTCAGAAAAGCGGAAAACATGACTTTGTTGTTGGGAAACTGGGTTATAATACCATTCAGAGTGGCGCAGACCAAAGACGGTTATACATGAAGGGACAGGATAGTGGGATATTTTTACGGGAAAACCACGCAAGTCTCATGCTGTACCAACCAGGAACATCTCGTATTGGTTCGAGGACAATGGGTTCATTCGACGAAAATATGGCGGATAATGCACCAGAAATAAATATCGTTGAAAGTAGACCTTTTCACATAGATCCAGCTCAAACACTTGGTTCCAATCTGCTTAACTACAAACAGAATCTAAGCGTATCCAGCACAAACTTGAAGACTCTTCCTGAAGGAGTTCCGTCTGATGATTTCGAACCTAGCGCAGAAGACGAGAAAGTTGTGAAAAAACTACATCGACGGAATTCCAATCAGAGTTTAATGCTAAGTACACATAGCTTACAAGAGTCCAATTGTTCGCTCAGCAGCGCTGGTACCTCCTGCCACAACCTCAATACAGTCAGAACTAGCATATCGAATTTCAGTCTGAATGACCACAGGCAGAAGAAACTGTCTCTGGAAAGGAGGGATTCAAACGTCAGTATAAACCCCATGGATCATATTACACCCACAACACGGGTCATATGTTCCTCGAACACGAATTTAACCGGAGACGTGTCAAAGAACTGCTTACTGCAGCGGCGCGGATCGAATAACAGCTTAACGTTAAACATACATTCTTCAAATAATTTAAGTCGGCATTCTAGTAACAGTTCCTTAAACAAAGATGCTAAAATCGGCCATAAAAAGGGTCTGTTGGAGCGTAGGAGCTCAAACACATCTCTCACCCTGAACATAAACTCTTCAAATCCTCAGCTGTCTACTAATAGAGGATTGAGTATATCAAACTACAACCTGAACGGATCGACCTGCAACCTAAGTAGATACAACAGTAACCACAGCATAGACAACGCGGAACCTCGGAAAGGTATCTTGGAAAGGCGTAGTTCGAACACGTCTCTCACTCTAAACATCCCTCAGGAACCTCGAGATCTGGAGATAGATGAGACGATGTTAGACGCGAACTTGAAAGATATTCCACACAGAGATAAACACAGAAAATCGTTAAGCACGGAAAATTTGATACCGAAATCCTATAAAAACAGGACACGCCTGCGGTCGACTGAAAAGGTTTTCGGTTCACACGATAATCTATGGTCAACGTCCTTCAGTGAGCAGGAGTATGGACAGAATTTGACATACGTCTGCGGCGATCAGGAAAATGAGATTATTTATGCGTTCGGCCGTCAAGAAGACCAGAATTTCCAAGCTGGTTTTGTTAGGAACATTACCACGAAGCCACTTAGTCCTCAGAGCACTTCCGAGGACTTTAGGTTATACTTAGCTAATATGCAACACTTACAGAATGCATCTAGCGTATTAACTCGTCAGCAGCTTAGGGACTTAAACGACGTTTTCCAAAATGGTTACTCCAAAGTTAAATGTCTCAGTACAAACGAAGGCCAACATTGCTGTACTGGCAGAGTTGACGACATCGCCAAGGAAAACCCTCAGATGGTGATCCCAGAGGTAGCACCACCACCATGTTCGGAATACCAAAAGATGCTATTGAGGAATCTCCATCAGGAGTTCTGGGATATGCCGACAAATTTTCAGGAAAAGCCCATAGTTTCTGGATCACATCCCAAGAACAGATACAAGACGATCTTACCAAACGAACATTCCAGGTTCATTCTACGAGCGGACGCTGGCAATACCGAGGGCTACATTAACGCCAACTATATCAAGGGCCACGAATACACTAAAAACAGCTACATCGCGACGCAGGGTCCGCTTCAGAACACCGTCTATGACTTCTGGCTCATGGTGCGCCAGAATAATATGGAACTTCAGGCCAGAGCGGAGACATTACTCAACAGGACAGAGGAAAGGTCAGAAGCTATACAGAAAATAGTGATGCTCACCAACTTTATCGAGAACAATAGACAGAAATGCGAGAAATACTTCCCCTTAGAGAAAGGCGAGGAGATCGCCATATCCAGTCCAATCTCGAGCGAAACTTGTTCAGAAGACTCGCCCAAAAACAGTTTTATCATAAAGAATGTGGGTATGTCTAAAAAATCTGGCTATACCGTCCGGAAGCTGGATGTGAGGTATAGTGGGGAAACCGAATCCTTAACAGTATACCACTACTGGTTCCACAACTGGGCTGATCACAAGTGCCCCAAAGACGTGAACGCTTTACTCAACCTAAGTCTGGACGTTTTACGAGAAGACATCAATGATTTCGAGGCTCGCGATGACGAGAAGGACGAACAGTGCAAGTGCGTCGACAGCCCCAAGGGCTCCAAGTTCGTGTTTCCGCCGATGGAGTCGGCGAGTGTCGCGTGTCCGGTCAAAGTGTGCGTCTCCACTCCCATGCAGTTCACGAACGAGAGCAACTCCCCGCCCACCATAGTCCACTGCTCGGCCGGCATAGGGCGCACCGGCTGCCTCATCGCCATACTGAACGGCATCAAGCAGCTGACGAGCGAGGAGAAGGTGGACGTGCTGGGTATCGTGTGTAACATGAGGCTCAACAGGGGCGGGATGGTGCAGAACTCTGAGCAGTACGAACTGATACATAAAGTGCTCTGCCTCTTTGAGCAGGCCTGCCTGCCACACTTATAG

Protein sequence:

>DPOGS203439-PA
MYVSIAKNLCYFLFTTVNSHEVILILITAEQRTSPTTCRCKPKIKYHSNYCYSVTLYSFNAWQFSRCEGVGMWGAWGAQAGWWCGALAAALLALLALCRRAQKRTPPRPPGTVTQVQIVYGGPARVVVAGPPPGASCATASLSAAPPPDPPTPADPPQPALPQILVVQDSTQNTVPQQSDPQPSSSQGPRPLEYHGIPKLPLPEQWLHKRPFDYKYSAIPRPLAQIVSSDLSVDLQRESRAKKFSATRSPSLEKDEKSPFVSSAKSEDMEVFGFDAESVRKLSEDREKPVEMAEPPPQSPQESDKEMSPINLRRFRSVSTRLNLCSVSESPHPEGHQERLEMQNSPRVIECSSAKEKSSGTKFDFSPKILADSMCSPRFFTPPEMVSPMFFAEPSPKSVYYDSVRSPKFFPETPRDVEVPQSPRLLGDHRNKTNGLTEKPTSPKVLSARPSPKVHSHNKENYFTFEQASATEESAARMSPRPRRYGGRNSIERERFTPPADKEVKKYRRHNSCDTSYKSVELNISKCDNILETDATTVDVIEKDVVDCCAKVDSLHVDAKLESDKHELKPISNTAQARRQRLKSISLDSDNAKIIEQNLGLPIAKQMKDQMNEAYKNQEVSTSCESMERYPKTPTAERHIFKFDAEHNEKPEPEAKPEVKQKKCLRQSSDTQSFLDMPRFSPKEFEITVTSEEGDTTAAETQTKRKGKNLRNLTIDLSKRDSDLEKELLEFEKLYTDAEEKKVKTPTLKVKATSLDSSESVNLSLPPKKSLEVPQNSISVPNTPKRQLKRILAQKSGKHDFVVGKLGYNTIQSGADQRRLYMKGQDSGIFLRENHASLMLYQPGTSRIGSRTMGSFDENMADNAPEINIVESRPFHIDPAQTLGSNLLNYKQNLSVSSTNLKTLPEGVPSDDFEPSAEDEKVVKKLHRRNSNQSLMLSTHSLQESNCSLSSAGTSCHNLNTVRTSISNFSLNDHRQKKLSLERRDSNVSINPMDHITPTTRVICSSNTNLTGDVSKNCLLQRRGSNNSLTLNIHSSNNLSRHSSNSSLNKDAKIGHKKGLLERRSSNTSLTLNINSSNPQLSTNRGLSISNYNLNGSTCNLSRYNSNHSIDNAEPRKGILERRSSNTSLTLNIPQEPRDLEIDETMLDANLKDIPHRDKHRKSLSTENLIPKSYKNRTRLRSTEKVFGSHDNLWSTSFSEQEYGQNLTYVCGDQENEIIYAFGRQEDQNFQAGFVRNITTKPLSPQSTSEDFRLYLANMQHLQNASSVLTRQQLRDLNDVFQNGYSKVKCLSTNEGQHCCTGRVDDIAKENPQMVIPEVAPPPCSEYQKMLLRNLHQEFWDMPTNFQEKPIVSGSHPKNRYKTILPNEHSRFILRADAGNTEGYINANYIKGHEYTKNSYIATQGPLQNTVYDFWLMVRQNNMELQARAETLLNRTEERSEAIQKIVMLTNFIENNRQKCEKYFPLEKGEEIAISSPISSETCSEDSPKNSFIIKNVGMSKKSGYTVRKLDVRYSGETESLTVYHYWFHNWADHKCPKDVNALLNLSLDVLREDINDFEARDDEKDEQCKCVDSPKGSKFVFPPMESASVACPVKVCVSTPMQFTNESNSPPTIVHCSAGIGRTGCLIAILNGIKQLTSEEKVDVLGIVCNMRLNRGGMVQNSEQYELIHKVLCLFEQACLPHL-