Monarch geneset OGS2.0

DPOGS216205
Transcript: DPOGS216205-TA; 4539 bp
Protein: DPOGS216205-PA; 1512 aa
Genomic position: DPSCF300080 + 466468-483538
RNAseq coverage: 233x (Rank: top 44%)
Annotation
Heliconius: HMEL005843; 0.0; 78.42%
Bombyx: BGIBMGA004514-TA; 0.0; 62.78%
Drosophila: Ptp69D-PA; 0.0; 37.15%
EBI UniRef50: UniRef50_E2AHY3; 0.0; 42.14%; Tyrosine-protein phosphatase 69D n=7 Tax=Pancrustacea RepID=E2AHY3_CAMFO
NCBI RefSeq: XP_001121003.1; 0.0; 42.43%; PREDICTED: similar to Protein tyrosine phosphatase 69D CG10975-PB, isoform B [Apis mellifera]
NCBI nr blastp: gi|307178165; 0.0; 42.14%; Tyrosine-protein phosphatase 69D [Camponotus floridanus]
NCBI nr blastx: gi|340727637; 0.0; 42.80%; PREDICTED: tyrosine-protein phosphatase 69D-like [Bombus terrestris]
Group
Gene Ontology:
  GO:0006470; 2.1e-114; protein dephosphorylation
  GO:0004725; 2.1e-114; protein tyrosine phosphatase activity
  GO:0005515; 1.9e-10; protein binding
KEGG pathway 
InterPro domain:
  [931-1228]; IPR000242; 2.1e-114; Protein-tyrosine phosphatase, receptor/non-receptor type
  [1123-1225]; IPR003595; 7e-39; Protein-tyrosine phosphatase, catalytic
  [432-550]; IPR008957; 4.2e-17; Fibronectin type III domain
  [435-549]; IPR013783; 6.5e-12; Immunoglobulin-like fold
  [439-532]; IPR003961; 1.9e-10; Fibronectin, type III
  [44-118]; IPR013151; 7.1e-09; Immunoglobulin
Orthology group: MCL15587; Insect specific

Nucleotide sequence:

>DPOGS216205-TA
ATGTTGCTCTTGGTCAGGAAGTACCCCGTAACACTTGTGCTTTATTTTGGGTGTATCGCATCGTGTTATCTCGTTTTAGGCCAAGATGCATCAGAGATATTGGACTTACAATTGGCTAAAGGGGACATTGGCGATGAGACGAATATCAGTTGCAGTGTAGCACCATCGGAAGTGAATGTGGCATGGCTCTACAATAATAAACCGTTTAAGATTGGTGAAAGGATAAAACAGCGAGATGAAGAAAGACTTCTACAGAAGAAAGATCTTGATGGTAATCCCAGGAAATATAAAATTTACAATCTCACGTTGACCAACACAACGGCGAACGACGATGGTAATTATACGTGCGTGGCGGTGCTAGGGGAACTGAGGGCGGAGAAAACTATCGTCTTAGACCTCAGCTTCCCGGGAAGATTAATAAATAAGACGACTGGACCAATAAAACAAAATGTTACAGACCAACAGAACGTCACGATGTTCTGTGCATTCGAAATTTATCGGCCAAATGAAGTGAGGTGGTGGAAAAGGGGCAAAGACGACGAAATTATAGAACTCGGAACAAAAGCCGCTAAAGTGATCGATCTCAAACGGATGCAGAGTCAATATGATCTCCACATACGTAGCCCGGAAGACAATGGGACTTACATCTGCGAGATATGGGATTCAGTGACGTCATCGAACCTGACAGGGGGTATTGACGTCATAGTCTACGCAGCACCACAAGTAGTCATAGACACGGTCATACCGATCAGTGCCTCCCAACTATTCCTTAATTGGACAATCCGTTCATACAATTCACCAATCAAAAGCTACAATCTGATGTATCGAAAATTACCCTCAACCGACTTCAGTCTGTACACTACTGAGAAGATAAGAGTTAATAATATATCATTTGTCATGGAGGGTTTGGAGAAATCTACTAAATATCAGCTGAAATTGGAGGTGACAACTACCTATGGATCCAGCAAGCCCCATATATATGAGTCTATAGTTCGGACTTTGGATAAAGATCCAATATTTGTTCCACATATCTCCATCAATGGATTCTCAGCTACTTCAGTGACTATCGGATGGGCGCCACCTCCAGAGGATATAGCCGAGTTGATACACTATTATTTATTGGAGGCGAGGAAGATGGATGAGGTCGCACCAAGGAGAGCATATCACTCCAGGGACAGCAGAAATTTGCCATACATGTTCGATAACTTGGAACCTCACAGCACTTACGTATTTCGGGTTTGTGCGTGCTCAGATTTCACAAAGAAATGCGGTAATTGGTCTCTAGAGATGCAGGCTGCGACGTTGGACGGTATACCGGGACGGCCGAGCAATGTCACGGTCACTTGTAGTACTAGCTGGATGAATTTAACCTGGCAGCCCCCGGTCAAACCTAATGCTGAGATCAAAGGATACACTATGGAACTAACAGGGAACGCGACCTATAGAGACAGGTACGGCACATACAAAGAGGAGATGTGGGGACCGCTCACCAAGTTTAAAACAAACGATTCTAGAAGTGTCAGGTTTGAAGACTTGAAGCCGAATACTAACTACACGGTGCGGCTGAGTGCGATGACTCGCACCCGGCGGCGCGGGGACGAGGAGGTCCGTCACTGCGCTACAGCACCCGCTCCCCCCGACTACCCGCCGCGACTGAGGTTTGAAGACTTGAAGCCGAATACTAACTACACGGTGCGGCTGAGTGCGATGACTCGCACCCGGCGCCGCGGTGACGAGGAGGTCCGTCACTGCGCTACAGCACCCGCTCCCCCCGACTACCCGCCGCGACTGAGGTGGAGAAAGGAGCTGGATAATAATAAATACGTATTCACAATGCACTTACAAAGAATAGATGAGCGAAACGGACCTATATGCTGTTATAGGGTGTACATGGTGAGATTGTTGCCACACTCAGATTGGAATAATCTTCCCCCTCCGCGTGATATTAGTATAGTTGACTATGAAGAGGCCCACGGGGTACAACCTGTACTTGGAGCTTA
TATCACTGATGTATTCTCTAATGAAAAATTCCCACCCGAAGCGAATCTTATAATGGGGGATGGTAAATCGTACTTCGATAAAGACGATCCCGGTCTCAATAGGGATTCCTGTAAGCGATGTTTAAGGAAACCGAGGCGCGTCTACGACCTGCCTCGACCACCGACCACCACACCGACCACAATACCTACCACCACTTACCAACCGCCCACTACCAGGGACGATCTATTCGAAGAGGAGATAGATACAACCGCTGAACCTGAAGAGGAGAGGAGGGAGAGACGGTCATACTTAGATATTGATAGGGATTATATGAAAAATCCAATGATGATGGACAAACTGATCGAGGTCGAAGTGAAAGAAGATCTGAATATTAAAGACGGTCTACTGGACCCGTCGGCGAACTACACCGTCTTCATAGAATTAATACCCGGTTCACCATCAGACGACCCTCTGTACAGCGAGTATCTGAACGTGTTAATGGCGGCCGCCACCCCCGTACCAACACAACCACCGTCGGCTATGGAATTAGCTATATTGGCGTCGTGTGTGGCGGCGGGGGCGGCGGTGTTGTCTCTAGCCGCGTGGTGTGTGCTAAGGGCGAGACGATCTCGTAAGCTCCCTCCACACCATCACGTAGAAATGAATCCTATACAGGCTGCTCTAAGGTACGTCGTAGGTCACATCGGTGGACGTCAACAGTTAATAAGTGCCGTGCCCCCGGACATGCCGCCCATAGCCAAGGAAGACCTCGCCGCCGCCTACCACGAGAGACAAGCTGACTCCGACTACGGCTTCCAGAAGGAGTTCGAGATGTTACCGGAGTGCTATCCAGACCGCACCACACACGCTTCGGAGGCAAGAGAGAATCAACCCAAGAATAGGTACCCGGACATCAAAGCATACGATCAGACGAGGGTCAAATTGACCCAGATAGATGGCATCAGTGGCTCTGATTACATAAACGCCAACTATGTCATGGGTTACAAGGAGCGTAAGCAATTCATTTGCGCCCAAGGTCCTACTGATACGACTGTGAACGATTTTTGGAGGATGATTTGGGAGCATGACCTTGAACTGATAGTGATGCTGACCAACCTCGAGGAGTACTCCAAGGTCAAGTGCAGCAAGTACTGGCCGGACGAGGTGAGAGGCGGCCGGGCCTTTGGCAGCATCAGCGTCTATCACGTGGCTGAGAAGAGATATTCGGATTACATCGTGAGAGAGTTGAAGATATCGAAACAGCCTCTGAACTCGGACGGACAGCCGGTAGTTGAAAACAATGGAGTAGCTAAGAGGAATGGAGATTGTGGTATGAGTGACAGCGTGCCGACCTCGCCTCGTGATAACAAGTCTACGGACTGTCGCCTCGTCAGACAGTACCACTTCCTCATGTGGAAGGACTTTGCTGCTCCGGAGCACCCACACTCTATACTCAAATTTATAAAGAGAGTAAACGAAGCATGGTCAAGTATGGTCGGTAGGCCGGTGGTAGTTCATTGTTCTGCCGGCGTAGGCCGCACGGGGACGCTCGTAGCACTGGACTGTCTACTGGAACAGCTACGAGCTACGGGACACGCCTCCGTTTTCAACACCGTAGCCGAGCTACGACGACAGAGGAACTTCCTTGTTCAATCATTGAAACAATACGTTTTCGTATATCGAGCGTTGGTTGAGTACGCGCACTACGGCGACACTGAAATACCGGCGTCAAGACTGAAGAGTTCCATCGACAGGCTCAGGAACACACCAGAGGGCGCTGACAAGTGTCTCATGGAACACGAGTTCGAGAAGATGATGTCACCTCCTATATCCGAGGCTACGAAATCGTGTGCGGCGGGCGGAGCCGGCGGTTCAGATGAACTGCGAGCCAGGAATCGAAGCCCGGACTGTCTGCCTTACGACAGGAACAGAGTCATCCTCCCCCCACTACCAGGACGAGATTACTCCACATACATCAACGCTTCATTCATTGAAGCATATGATAATACAGAAGGCTTCA
TCATCACGCAGGATCCACTTCCGAACACTATTATGGACTTCTGGAGGATGGTAGCGGAACACAATGTGTCCACTATCGTTATGTTAAGTGAGCTTGGTGAAGGTAAATGTCCCCGTTATTGGGACGACGGTACCATACAATACGAACACATCTCAGTGCAGTACGAAGAGAGCGAGTCCTGTCCGTATTACACGAGGAGACAATTCAGAGTTACCAACAACAAGAGCGGAGAGTGGCGTTGCGTGAGACATCTTCAATACCAAGGTTGGCCGACAGCGGCGGGACACGTGCCGGAAGTGACGCGAGGGTTGGCGGAACTGGCGGAACTAGCGGCGCCCTTAGACTCAGCGCCCGCGCCGCCGCTCGTAGTACACTGCCAATTTGGTACAGAGCGTTCACCTTTATTCGTAGCTCTATGTACTCTTATGTGTCAGCTGCGTGTTGAACGTCGTGTAGATGTCGCCACAGTCGCCAGGAAGGTTCGCTCACAACGGGCGAGAACCATTGACACGTTTGTGAGTATTCGACTCTATATATAA

Protein sequence:

>DPOGS216205-PA
MLLLVRKYPVTLVLYFGCIASCYLVLGQDASEILDLQLAKGDIGDETNISCSVAPSEVNVAWLYNNKPFKIGERIKQRDEERLLQKKDLDGNPRKYKIYNLTLTNTTANDDGNYTCVAVLGELRAEKTIVLDLSFPGRLINKTTGPIKQNVTDQQNVTMFCAFEIYRPNEVRWWKRGKDDEIIELGTKAAKVIDLKRMQSQYDLHIRSPEDNGTYICEIWDSVTSSNLTGGIDVIVYAAPQVVIDTVIPISASQLFLNWTIRSYNSPIKSYNLMYRKLPSTDFSLYTTEKIRVNNISFVMEGLEKSTKYQLKLEVTTTYGSSKPHIYESIVRTLDKDPIFVPHISINGFSATSVTIGWAPPPEDIAELIHYYLLEARKMDEVAPRRAYHSRDSRNLPYMFDNLEPHSTYVFRVCACSDFTKKCGNWSLEMQAATLDGIPGRPSNVTVTCSTSWMNLTWQPPVKPNAEIKGYTMELTGNATYRDRYGTYKEEMWGPLTKFKTNDSRSVRFEDLKPNTNYTVRLSAMTRTRRRGDEEVRHCATAPAPPDYPPRLRFEDLKPNTNYTVRLSAMTRTRRRGDEEVRHCATAPAPPDYPPRLRWRKELDNNKYVFTMHLQRIDERNGPICCYRVYMVRLLPHSDWNNLPPPRDISIVDYEEAHGVQPVLGAYITDVFSNEKFPPEANLIMGDGKSYFDKDDPGLNRDSCKRCLRKPRRVYDLPRPPTTTPTTIPTTTYQPPTTRDDLFEEEIDTTAEPEEERRERRSYLDIDRDYMKNPMMMDKLIEVEVKEDLNIKDGLLDPSANYTVFIELIPGSPSDDPLYSEYLNVLMAAATPVPTQPPSAMELAILASCVAAGAAVLSLAAWCVLRARRSRKLPPHHHVEMNPIQAALRYVVGHIGGRQQLISAVPPDMPPIAKEDLAAAYHERQADSDYGFQKEFEMLPECYPDRTTHASEARENQPKNRYPDIKAYDQTRVKLTQIDGISGSDYINANYVMGYKERKQFICAQGPTDTTVNDFWRMIWEHDLELIVMLTNLEEYSKVKCSKYWPDEVRGGRAFGSISVYHVAEKRYSDYIVRELKISKQPLNSDGQPVVENNGVAKRNGDCGMSDSVPTSPRDNKSTDCRLVRQYHFLMWKDFAAPEHPHSILKFIKRVNEAWSSMVGRPVVVHCSAGVGRTGTLVALDCLLEQLRATGHASVFNTVAELRRQRNFLVQSLKQYVFVYRALVEYAHYGDTEIPASRLKSSIDRLRNTPEGADKCLMEHEFEKMMSPPISEATKSCAAGGAGGSDELRARNRSPDCLPYDRNRVILPPLPGRDYSTYINASFIEAYDNTEGFIITQDPLPNTIMDFWRMVAEHNVSTIVMLSELGEGKCPRYWDDGTIQYEHISVQYEESESCPYYTRRQFRVTNNKSGEWRCVRHLQYQGWPTAAGHVPEVTRGLAELAELAAPLDSAPAPPLVVHCQFGTERSPLFVALCTLMCQLRVERRVDVATVARKVRSQRARTIDTFVSIRLYI-
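
The lengths reported in this record can be cross-checked against each other: 1512 residues plus one stop codon correspond to 4539 nucleotides, which matches the listed transcript length. The Python sketch below runs that check and computes the genomic span. The helper names are hypothetical, and it assumes the genomic coordinates are 1-based and inclusive, which this record does not state.

```python
def expected_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_aa residues, optionally plus one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record for DPOGS216205.
transcript_bp = 4539
protein_aa = 1512

# True when the transcript length equals the coding length including the stop codon.
cds_ok = expected_cds_length(protein_aa) == transcript_bp

# Span of the gene model on scaffold DPSCF300080 (introns included).
locus_bp = span_length(466468, 483538)

print(cds_ok, locus_bp)
```

If the check holds, the listed transcript is consistent with a coding sequence carrying no annotated untranslated regions. The difference between the locus span and the transcript length would then be intronic sequence.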