Monarch geneset OGS2.0

DPOGS212771
Transcript: DPOGS212771-TA (4974 bp)
Protein: DPOGS212771-PA (1657 aa)
Genomic position: DPSCF300012 + 909702-927544
RNAseq coverage: 291x (Rank: top 38%)

Annotation
Heliconius: HMEL014157; 1e-124; 46.60%
Bombyx: BGIBMGA013218-TA; 0.0; 54.08%
Drosophila: CG8334-PA; 0.0; 47.68%
EBI UniRef50: UniRef50_D2A608; 0.0; 46.07%; Ubiquitin carboxyl-terminal hydrolase n=2 Tax=Tribolium castaneum RepID=D2A608_TRICA
NCBI RefSeq: XP_001811202.1; 0.0; 46.32%; PREDICTED: similar to ubiquitin specific protease [Tribolium castaneum]
NCBI nr blastp: gi|189238385; 0.0; 46.32%; PREDICTED: similar to ubiquitin specific protease [Tribolium castaneum]
NCBI nr blastx: gi|189238385; 0.0; 46.44%; PREDICTED: similar to ubiquitin specific protease [Tribolium castaneum]

Group
Gene Ontology: GO:0006511; 4.4e-61; ubiquitin-dependent protein catabolic process
    GO:0004221; 4.4e-61; ubiquitin thiolesterase activity
    GO:0005509; 8.3e-17; calcium ion binding
KEGG pathway:
InterPro domain: [746-1621] IPR001394; 4.4e-61; Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2
    [430-580] IPR006615; 3.6e-21; Peptidase C19, ubiquitin-specific peptidase, DUSP domain
    [188-280] IPR011992; 8.3e-17; EF-hand-like domain
Orthology group: MCL16746 Patchy
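
The lengths and coordinates listed above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record: the constant names and the check_record helper are hypothetical, and the genomic span assumes 1-based, inclusive coordinates, which the record does not state. It checks that the transcript length is a whole number of codons, that it equals the protein length plus one stop codon, and that the InterPro domain ranges fall inside the protein.

```python
# Values copied from the record above; all names here are illustrative.
TRANSCRIPT_BP = 4974
PROTEIN_AA = 1657
GENOMIC_START, GENOMIC_END = 909702, 927544
DOMAINS = {
    "IPR001394": (746, 1621),
    "IPR006615": (430, 580),
    "IPR011992": (188, 280),
}


def check_record():
    """Return simple consistency facts derived from the record's numbers."""
    codons, remainder = divmod(TRANSCRIPT_BP, 3)
    return {
        # The transcript should split into complete codons.
        "whole_codons": remainder == 0,
        # 1657 residues plus one stop codon gives 1658 codons.
        "matches_protein_plus_stop": codons == PROTEIN_AA + 1,
        # Assumption: genomic coordinates are 1-based and inclusive.
        "genomic_span_bp": GENOMIC_END - GENOMIC_START + 1,
        # Every domain range should lie within residues 1..PROTEIN_AA.
        "domains_within_protein": all(
            1 <= lo <= hi <= PROTEIN_AA for lo, hi in DOMAINS.values()
        ),
    }


if __name__ == "__main__":
    for key, value in check_record().items():
        print(f"{key}: {value}")
```

Under these assumptions the genomic span comes to 17843 bp, considerably longer than the 4974 bp transcript, which would be consistent with introns inside the gene model.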

Nucleotide sequence:

>DPOGS212771-TA
ATGGGTGCGAAGGATTCAAAGCTTAGTTTTATATCATACGACGACGCTGTGAAACGAGTGTCTGAGAGTGAGTTGAGGCGTATACGTGAAGCGTTTAAGAGATGTGCTGGAGCTAACGGTTCCGTGTTGAGCTTCGAAGCGTTCGTGCAGGAGGTGCTCTGCGATGGAGTCCCCCTCGAAGTGGCGGAGTGGTTATACCAGGCTTGCGGAGGCACCAAGCGAGGGATCACCTTCAAGGACCTTTTACGTGGGGTTGTCGTTCTTACAAAGGGAAATATAGAAGAGAAAATCAAGTTCCTGTGGACATTGTATGTGAATAATCAAAATGACAATGGCACATACATATACAAACGAGAGTTTGCTAGAGCACTTCACCTTGAAAATTCATCCTTACCAGAAAACGAATCTCAGAGAACTCTGGACATATTGACCAGTCTGTTTGGTTCCTGTGAGAGGGTCACCTTTGATCAGTTCAGATCGTGGCTGCTCATACACAAAGATGCTACGGTTCTATCAAAATGGTTGTTATTTGATAGAAATAGTACACCACAGGATTTAGATACACCCACATTTTACCAGAGCTTAGCTGGAGTCACACATCTGGATGAACGGGATATAATAGAATTGGAGAAATGCTTCTGGTGTCTTCGAAACTCCGCTCCCACTGGAGAGTTGGATGTACAGAGTATGAGGGGACTGTTGTCACCTCCTCTGCCGAGAGCAGCCGTCGAGGGTACCTTCCTGGCCTTCGATGAGAACAGAGATGGTCACATAGACTTTAAAGAGCTTTGCTGCGGACTCAGCGCCGCCTGCAGGGGACCTACAACAGAGAGGCTCAAATGTAAGAACAATCACTGTACATATAACAAATTACACGTAATACGGGGCCTTGTTAATGTTCGTATATATATAACTAACCAGGACTCGAGATCGTCCACGCCCTCGGACGGGGAGTCGGAGGGGGACAGGGGCTTCGACCCGGAGCTGGTGCTGAAGAACCTCAAAGAGAAACTGGTCTCCGTGCCCGCGGACGCCAGGAAACCGATGTTCCAACTAGGACCCACTGATGCTGAGAGAACCGTCACAACGACTGAGGGTCAGGACTATGGACCCGGTCTCCGTCTCGAGGAGTTCCTCATCTGGAGCGTGGAGAGTGCCGGGGCGCTGGTGACGCCGTGCCGGGAACTGTTACTGGAGCTATGCCACGTGGTGCTGGGTCTCCGGCCCGAGTGTCGACATCGCGAGAGAGACATCGTCCTGGGCTGGTTGCGTCGCGAGACTTGCCGCGGTCTGTCTGTGGGTCAGTTCTGGTACCTGGTGTCGGCGTCCTGGTGGCGCGCCTGGCTCCAGTACTGCGGCTCGCCCGGCTCGTGCTGCCGCCGGGACGACATCGTACCGGACGACAGTTTCACTACTAATTCAACCGAGTCAATGGGTTCCTTGTTATGGCCAGCGGAAAGCGCGTCTCTTGGCAGCGCTGGTGGCAGCGCGGGTAGCGCCAGCAGCGGCGTGGGCAGCGTCAGGGCCGCGCCTCATCCGGGGCCCGTGGACAACTCCCCGCTCCTGGGGGGCGGAGGACCAGCGGTGCGGGCGCTCACCGGCGAGGGCGGCACCCTCCGCCGTGACGTCACCCTGGCGCAGCATCGCGACTTCGAGCTGGTCCCGGACGCTCTGTGGCACGCGCTCGCCCTGTGGTACGGAGCGGCTGACCCCCTACCGCGACAGGTTATCAGGCCGCTACACGCGGACGTGGAGTTGGAACTGTATCCGCTGCAGATGAGGATCTACAGACACGTGCCAGGACCGCAGATGGATGTGTCTGCGTCAGGAGCGACCACCTTGGGTCTGTTGGGGGCGGGTGCGGCGGGGGCGGGCGTGGGGGCGGGCGCCCTGTACGCCGCGCCACCCGACAGGCAGCTCGCATACACAGCGGCCTTCTCTAGGCTGGCCACCATCAAACAGGTGACCGAATTTCTCTGCGGGGCCCTGGGCCTGGCGCG
GGAGGACGTGCGGCTGTGGGCCCTGGGGACCGGCGCTTTGTTGCTGGATGACGAGCGACCGACGCTACAGGCGCTCAGGTTGGATGAGCGGTCCAAGCTACTGTTGGAGGTGCGCAACCCTGATCTGACCTGGCCCGAGGAAATAGGCGCGCTCGGAGCACAATCAGCGTGCGGCGTGACGGGGGCGGCGCGATGGGCCGAGCGACGGGAAACATTGACGGCGCCCCAGCTGCCAGGGGCTACAGGGCTCCACAATCTCGGCAACACGTGCTACATGAACGCCGCACTGCAGAGCGTTTGGAACACGGGCCCGCTGGCGCGCTACTTCAATTCGGGTCTCCATCTGTACGAAGTGAACTCCGCCAATCCACTGGGTACCGGCGGCTCGCTGGCGTTACGCTTCGGCGAGCTGTGCAAGGAGGTGTGGTCTTCGAGCGCTCGGTCCATAGCGCCGGTGAGGCTGCGGTGGTGCGTGTCTCGTTACGCGCGCGACCTGGCGGGTGGCGGCCAGCACGACGCCCAGGAGCTCCTGGCCTGGCTGCTCGACGCACTGCACGAGGATTTAAATCGTGCTTCGCCGCCCGCCCCCGCCCCCTCTCCCGCCCCGCCCAACCAGCCTGCCGGCCCCCGTGCGGACCGTGAGTCAGCCGCCGAGGCCTGGGCCGCTCACACCGTTCGCAACGACTCCATCATATCGGAGCTGTTCTACGGTCAGCTCAAATCGAAGGTTCGCTGCAGCGTGTGCGCCAGTGAGTCGGTGCGGTTCGACACCTTCAACATGCTCAGTCTTCCGTTGCCGATGGAGTCGTACGTGTGCGCCATCGTGAGAGTGGTCCTGTTGGATGGTTCGGTTCCGACGAAGTATGGCGTGAGAGTGAACTCGGAGGGCACTTACATGGATCTAAAGGAGAAGCTGTCGGAACTGTGCGGCCTGTCGCCGGACTTGATGCTCCTGGTGGCGTTGTCGGGAGCCACGATAGGGCGCGTCCTCGAGTCCGATAACAAAGTGAGTGCGGCCATCGCGAGAGAGCTGGTCGCATATGAGCTGCCCTCGGACAATGGAAACGACGGCAGCGACCAGGACGAGTGGTCTAGTGATGTAGAGGAGAGTGACTCGGGCGTGACGGAGGGCATGCTGTCCGACGAGGACGAGAGGATCCGACCGCCGGCTGACGAGCGTGACGGACGTGACGCACGTGACGCACGTGACGGCCGCGACGACAGCGACGACCGGGACGGACGACTGGCGGGCTCGGTGAGTGTGGCGCGCGGCCGGACATCCTCCTCGCTCTGCATGCCTGCACTCTTCTGCTTTAAGCGTTCCCGCTCCGAACTGCTGATGTCGGCCTCACCGACGGCACTGTACGAGCGGCACACGCTGCCGCGGGCGATGTCCGCGCCCACCTCCCGCACGCACACGCACAACGCTCAACTCGCCCATCACGCCCACGTCTCCAATCAGACAACGTACGAGGAAGGGGACAGCTATCTGATAGCTGTTCACAGGAAGCAGGTGTCCGGCGAGGGTTACCTAGTGGGGGGCGGAGGTCGTGCGGCGTTGTTCGGCTCCCCGTTGGTGGTGTGCACTCGGCCCGGGACCTCCGGCAGAAGAGTGTACGGTCGCGTGTGGACCCAGCTGGCGAGATTGCTATCCGCTCGACCAGCCCCTCGGCCGCACACCAGGCACAACCATGCTACCGACTGTGACGACAGTCTCGGCTACGAGTTCCCCTTCACGCTGCGTTTGGTCGGCGCGAGCGGCTTGTGGTGCGCTCTCTGTCCCTGGCCGGCGCTCTGTAGGGGCTGCGTGCTTCCTGCCACAGACGACGTGCTCATAAGAGACGGAGCTTGTCGTCCTAGAAGGAGGACGGAGCCCCGTGACGAGGGTCCCGACACGGACTCGCCCATAGCAAGAGCGAAACTACAGAGGCAAGCCAGCTCACGACTCGGCAACCATACTGGTTCCCACCAGTCTTCCGAGGGTGTGGTCCGCCGTCTTG
ACTTGTCAGGGCTGCGTCGAGGAGGTGTGAGGGTCATGCTCGCCATTGACTGGGATCCCACAGCACTGCATCTCAGATACCAGTCCACCAGGGAGAAGGTTTTCGTGGAGCACGGTTCGGTGCAGGCGTGCCTGTCAGCCGGTTCCCAGCCCGTGGACCTGGCCAGCTGTCTGCGGGCCTTCACCTCGGAGGAGCGGCTCGAGGCTCGCTACCACTGCGGCCCGTGTTCCGCCCTCCAACCAGCTACAAAGAAACTACAGATGTGGAGACTGCCACCTGTACTGATCATACACCTCAAGCGGTTCCAGTACGTGAACAACAAGTGGATCAAGTCCCACAAGGTGGTCGACTTTCCCTTCGAGGACTTCGACCCGACGCCCTACCTCGCATCAGTTCCGCAGGAGACGATCCTGCGACACGAGGAACTGAACCAAAAACGAAGATCATCGAACTTCATAGATATAGAAGACAGAATATCAGAGAGCGACGCCGAAACCGAGGAGGAAATAGAAATAACAGGCGACGAGGCTGCGAAGAGACGGAGCAAAGAGAGGAGAAGAAGGGAGTCTGTAGAGGTGAAGGGCAGGAGACGGCTGGAGTCCACCAGCCTGATCACGACCCCGGTGGTGGACGACAACCTGATGGACTACCACCAGCACCGCCTGCTGCCGGAGCGAGACGTGTTCGACCTGAAGTACAGGCTGTATGCGGTCGTGTCTCACTCGGGCCAGCTGTCAGGTGGTCACTACGTGTCTTACATCCGTCATTCCTCGGGCTCCTGGCTGTGTTACAACGACAGCTCGTGCCGCGAGCTGGGATCAGCGCCTACGCTGGACGCGGCCGCAGCCTACCTACTGTTCTACGAGCGCGTAGGCCTCCGCTACGACGCCTACCTGCCCTCACCACCGGACCGCCCCCCTCCACCCCCGCCCGCCGACGACCCCGACCTCAAGAACGTCTGCAGCATCGTGTAG

Protein sequence:

>DPOGS212771-PA
MGAKDSKLSFISYDDAVKRVSESELRRIREAFKRCAGANGSVLSFEAFVQEVLCDGVPLEVAEWLYQACGGTKRGITFKDLLRGVVVLTKGNIEEKIKFLWTLYVNNQNDNGTYIYKREFARALHLENSSLPENESQRTLDILTSLFGSCERVTFDQFRSWLLIHKDATVLSKWLLFDRNSTPQDLDTPTFYQSLAGVTHLDERDIIELEKCFWCLRNSAPTGELDVQSMRGLLSPPLPRAAVEGTFLAFDENRDGHIDFKELCCGLSAACRGPTTERLKCKNNHCTYNKLHVIRGLVNVRIYITNQDSRSSTPSDGESEGDRGFDPELVLKNLKEKLVSVPADARKPMFQLGPTDAERTVTTTEGQDYGPGLRLEEFLIWSVESAGALVTPCRELLLELCHVVLGLRPECRHRERDIVLGWLRRETCRGLSVGQFWYLVSASWWRAWLQYCGSPGSCCRRDDIVPDDSFTTNSTESMGSLLWPAESASLGSAGGSAGSASSGVGSVRAAPHPGPVDNSPLLGGGGPAVRALTGEGGTLRRDVTLAQHRDFELVPDALWHALALWYGAADPLPRQVIRPLHADVELELYPLQMRIYRHVPGPQMDVSASGATTLGLLGAGAAGAGVGAGALYAAPPDRQLAYTAAFSRLATIKQVTEFLCGALGLAREDVRLWALGTGALLLDDERPTLQALRLDERSKLLLEVRNPDLTWPEEIGALGAQSACGVTGAARWAERRETLTAPQLPGATGLHNLGNTCYMNAALQSVWNTGPLARYFNSGLHLYEVNSANPLGTGGSLALRFGELCKEVWSSSARSIAPVRLRWCVSRYARDLAGGGQHDAQELLAWLLDALHEDLNRASPPAPAPSPAPPNQPAGPRADRESAAEAWAAHTVRNDSIISELFYGQLKSKVRCSVCASESVRFDTFNMLSLPLPMESYVCAIVRVVLLDGSVPTKYGVRVNSEGTYMDLKEKLSELCGLSPDLMLLVALSGATIGRVLESDNKVSAAIARELVAYELPSDNGNDGSDQDEWSSDVEESDSGVTEGMLSDEDERIRPPADERDGRDARDARDGRDDSDDRDGRLAGSVSVARGRTSSSLCMPALFCFKRSRSELLMSASPTALYERHTLPRAMSAPTSRTHTHNAQLAHHAHVSNQTTYEEGDSYLIAVHRKQVSGEGYLVGGGGRAALFGSPLVVCTRPGTSGRRVYGRVWTQLARLLSARPAPRPHTRHNHATDCDDSLGYEFPFTLRLVGASGLWCALCPWPALCRGCVLPATDDVLIRDGACRPRRRTEPRDEGPDTDSPIARAKLQRQASSRLGNHTGSHQSSEGVVRRLDLSGLRRGGVRVMLAIDWDPTALHLRYQSTREKVFVEHGSVQACLSAGSQPVDLASCLRAFTSEERLEARYHCGPCSALQPATKKLQMWRLPPVLIIHLKRFQYVNNKWIKSHKVVDFPFEDFDPTPYLASVPQETILRHEELNQKRRSSNFIDIEDRISESDAETEEEIEITGDEAAKRRSKERRRRESVEVKGRRRLESTSLITTPVVDDNLMDYHQHRLLPERDVFDLKYRLYAVVSHSGQLSGGHYVSYIRHSSGSWLCYNDSSCRELGSAPTLDAAAAYLLFYERVGLRYDAYLPSPPDRPPPPPPADDPDLKNVCSIV-
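
The protein record appears to be the direct translation of the nucleotide record, with the stop codon written as "-". The following is a minimal Python sketch of that translation, not the pipeline used to build the geneset: it assumes the standard genetic code, and the translate function and its stop_symbol parameter are hypothetical names introduced for illustration.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Whitespace and line breaks are ignored, unknown codons become "X",
    and stop codons are written as stop_symbol (the record uses "-").
    """
    seq = "".join(nucleotides.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


if __name__ == "__main__":
    # First eight and last three codons of DPOGS212771-TA.
    print(translate("ATGGGTGCGAAGGATTCAAAGCTT"))  # MGAKDSKL
    print(translate("ATCGTGTAG"))                 # IV-
```

Passing the full DPOGS212771-TA sequence to translate and comparing the result with DPOGS212771-PA is one way to confirm that the two records agree.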