Monarch geneset OGS2.0

DPOGS205354
TranscriptDPOGS205354-TA3711 bp
ProteinDPOGS205354-PA1236 aa
Genomic positionDPSCF300295 + 134943-148023
RNAseq coverage1537x (Rank: top 8%)
Annotation
HeliconiusHMEL0061530.088.48% 
BombyxBGIBMGA007315-TA0.082.97% 
DrosophilaUsp7-PB0.050.92% 
EBI UniRef50UniRef50_F4W5Y20.067.85%Ubiquitin carboxyl-terminal hydrolase 7 n=8 Tax=Bilateria RepID=F4W5Y2_ACREC
NCBI RefSeqXP_974951.10.067.16%PREDICTED: similar to ubiquitin specific protease 7 [Tribolium castaneum]
NCBI nr blastpgi|910897130.067.16%PREDICTED: similar to ubiquitin specific protease 7 [Tribolium castaneum]
NCBI nr blastxgi|910897130.067.28%PREDICTED: similar to ubiquitin specific protease 7 [Tribolium castaneum]
Group
Gene OntologyGO:00065111.1e-32ubiquitin-dependent protein catabolic process
GO:00042211.1e-32ubiquitin thiolesterase activity
GO:00055153.4e-29protein binding
KEGG pathway 
InterPro domain[341-558] IPR0013941.1e-32Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2
[192-331] IPR0089743.4e-29TRAF-like
[200-305] IPR0020832.3e-21MATH
[191-325] IPR0133229.3e-12TRAF-type
Orthology groupMCL10867 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205354-TA
ATGCTGAACGCCGGGCTGACATTTGATGATTTAAAATCTAAATGTTCAATGTCTAATATGCTTTTGACAGTCAGCTGGAAATCAACTATAGATGCCAAAAATTGCTATCATGGCGCTAACATCGGCCGCGCCGCTAGTAAACACTCTTTAAAAATTAACATCCTATCATCCAAGAAAAGTAAACGCCCGGCTGCGGAGGGTGCTCGAGTCGATGAGAAAGGTCCGTTAAAATTCGTTCAGCGAATTTTTAGCAAGCCCAAAGGGAATTGCTCCCTGTCGTTAAAGGAAATTTCTAAGAGTTGTGGGCGACAGTCCTCGGTGGCGCGCCCGGCTAACATGAATCACACACCTGCACCCGACAGGCCACAGCATGACCCTCACGTGAACCAAGTCGAGGAAATGGAGACGCAGGAAGTGGAAACTGTTGACACGTGCACAGATAAGATTTGGATGGAAGAACTTATCAAGGGACCCAATGGGGGAGTCGTTATGAGTGATGTTATGAAAAACCAAGAAACCTCATCTACAGATATGCCGTTGGCGTGCTTGGACGCTGAGATGGAAGATGATGAAGCGAGGTCGGAGGCGACCTTCAGGTTCACAGTTCATAACTTCCGCAACCTGAAGGATTCTGTCCTCTCACCTCCGTGTTACGTCCGCAACCTGCCGTGGAAGATCATGGTTATGCCTCGCCAGGCACCATCCCCTGACCGACAGCAGCAGAAGTCTTTGGGTTTCTTTCTTCAGTGTAACGGGGAGAGCGAGTCCTCATCGTGGTCCTGTCATGCAATGGCCGAACTGCGACTCCTCTCCCACAAACCGGAAACTGAGCCGTTCTATAGAAAGATTCAGCATTTGTTTTATAGTAAGGAAAACGATTGGGGTTTCTCCCACTTTATGTCATGGAATGATGTGTTGGACCCCGAGAGGGGTTACATTAAGGATGACTCCATCACACTGGAGGTCCACGTCACTGCCGAGGCACCCCACGGGGTCTCCTGGGACTCTAAGAAGCACACCGGCTACGTTGGTCTTAAGAATCAAGGAGCCACCTGCTATATGAATTCCCTTCTTCAAACTTTGTATTTCACCAATCAGTTGAGGAAAGCTGTATACAAAATGCCCACAGAGTCGGATGACAGTACACGGTCTGTAGCTTTAGCCCTACAAAGAGTGTTCTACGAATTGCAATTCTCTGATAAACCAGTCGGCACTAAGAAGCTGACTAAGAGTTTTGGTTGGGAAACTCTGGATTCGTTCATGCAACACGATGTACAGGAGTTTTTGAGGGTGCTGCTGGACAAATTGGAAAGCAAGATGAAAGGTACCTGTGTGGAGGGAACCGTGCCGAGGCTGTTTGAGGGGAAGATGACTTCGTACATTAAGTGCAAGAACGTTAATGTTTCCAGTACGAGAGTCGAAACCTTCTATGACATACAACTCAACATTAAAGGGAAAAAAAACATCGATGAGTCGTTCAAAGATTACATCAGTACTGAGACCCTGGACGGTGAGAACAAGTATGATGCTGGTGAGCATGGGCTACAGGAGGCTGAGAAGGGTGTCATCTTTGCTGCTTTTCCGCCGGTACTGCACTTGCACCTCATGAGGTTCCAATATGACCCCATTACAGACAGCTCGGTGAAGTTCAACGACAGGTTCGAGTTCTACGAGACCCAGGGGCGACGGGAAGGTACGCTACCGGCCTTGGCTATATATATTTTTTGTTACATATTGGTACAAACCTCAAATTGGCCAACGTTTTTCATAGTGGTGCATAGTGGTGGAGTTCTCTATAACTACACCTCGGCCAACTTTCTCGACTGTGGTCCCATATGGTGCAAGTTTGACGACGACGTGGTTTCACGCTGCACCAAGCAGGAGGCCATCGAGTACAACTACGGCGGCCAGGACGAGGACATGGCGCTCACAGTCAGGCACTGTACCAACGCGTACATGTTGGTGTATATTAGGGATTCTCAATTAAAGACTGTGTTACAAGAAGTCACTCAAGCAGACATACCAACGGAACTCAGCGAACGACTGGCGGACGAGAAGAGAATAGAAACCATCCGTCGCAAGGAGCGCAACGAGGCGCATCTCTACATGAATGTGAACGTGGTACTAGAGGAAGCCTTCGATGGACACCAGGGGAATGATTTGTACGAGCCGGAGAGGGCTCACTGTCGCGTGCTGAGGGTCCGGAAGCAGGCCACGCTGGCAGAGCTGCTGGCCGCGCTGTCTGACCACTTCCGGTACCCGCGGGCTCATCTGCGACTGTGGCCCTTCAGCGCTCGCTCCAATCAGACGTGCCGCCCGACCTGCCTGGACGTAGCCAACGACCAGAACAAGACCATAGTGGACATATCAGAGAACATGAACCCTTGGAGCGTGTTCCTGGAGATGTTGCCGCCCGACTCGGGGCTGAACGCGCTGCCGCCCTTCGACAAGGAGAACGATGTGGTGCTGTTCTTCAAGTTCTACGATCCGAAACAGAAGAGGATCCACTACTGCGGACATCACTACTTACCGATAGCGAGCAAACCGGCGGACCTCATACCCATCCTTAATAAGAGAGCGGGCTTCCCTCCAGACACTCCTCTGAACCTCTACGAAGAAATCAAACCGGACTTTGTGGAGAAGATCAACAACTACAACGATCCCCTCGAAAAGGTATTGGACGAGCTGATGGACGGGGACATCATCGTGTTCGAGCGAGCCGAGCGCCACGACGACCTGGAGCTGCCCACCTGCCAGGACTACTTCAAGTACATCTTCTACAAGGTGGAGGTGCAGTTCGTCGACAAGACCGTGCCCAACGACCCCGGATTCACGATGGAGCTGTCGATGCAGATGCGTTACGACCAGATGGCGCGTGCTGTTGGCCAAAGGCTGAACGTGGACCCCTTCCTCATACAGTTCTTCAAGTGCCAGAATTACAAGGACACCCCGGGGATGCCGCTACGTTACTCGTACGACGGCATCCTCAAGGAGCTGTTGGTGTACTGCAAGCCCAAGTGCCCCAAGAAACTCTTCTACCAGATACTGTCCATCAAAGTCAACGAGCTGGACAACAAGAAACAGTTCAAGTGTCTCTGGGTCGGCCCGAATTACAAGGAAGACAAGGAGCTAATTTTATATCCCAATAAAGGCGGCAAGGTGTCGGACATACTAGAAGAGGCGGCCAAAGTAGTGGAAATGTCTCCCGAGGGCTCTGGAAGGTTAAGGATCGTGGAGGTGTCGTGTCACAAGGTGCTGCCGGGGCCGGACCCCGAGCTGACGCTGGACCAGGTCACCATCTCCCCGCCGCGCCTCTACAGGATAGAGGAGATACCGCGCGACGAGATAGACTTACAGGAGGATGAGATCCTGGTGCCGTGCGCTCACTTCCAGAAGCAGGTGTACTCAACGTTTGGGATACCTTTCTACGCTCGCCTCAAACACCACGAGCCGTTCTCGGCGCTCAAAGATCGACTGCAGAGAAAGTTGGATGTTCCGGACAAGGAATGGGAGAAGTACAATTTTGCAATAGTGACGAACGGCCGACCTAATTATATAAGCGAAGGAGCTACGATTAACATATACGACTTCCGGACGACTAGTAATGCGAACGCGACGGGTCGGCCGTGGCTGGGCTTGGAACACATCAACAAAACGCCCAAACGCTCGCGTGTCAACTACCTCGAGAAACCCATCAAGATCTATAACTGA

Protein sequence:

>DPOGS205354-PA
MLNAGLTFDDLKSKCSMSNMLLTVSWKSTIDAKNCYHGANIGRAASKHSLKINILSSKKSKRPAAEGARVDEKGPLKFVQRIFSKPKGNCSLSLKEISKSCGRQSSVARPANMNHTPAPDRPQHDPHVNQVEEMETQEVETVDTCTDKIWMEELIKGPNGGVVMSDVMKNQETSSTDMPLACLDAEMEDDEARSEATFRFTVHNFRNLKDSVLSPPCYVRNLPWKIMVMPRQAPSPDRQQQKSLGFFLQCNGESESSSWSCHAMAELRLLSHKPETEPFYRKIQHLFYSKENDWGFSHFMSWNDVLDPERGYIKDDSITLEVHVTAEAPHGVSWDSKKHTGYVGLKNQGATCYMNSLLQTLYFTNQLRKAVYKMPTESDDSTRSVALALQRVFYELQFSDKPVGTKKLTKSFGWETLDSFMQHDVQEFLRVLLDKLESKMKGTCVEGTVPRLFEGKMTSYIKCKNVNVSSTRVETFYDIQLNIKGKKNIDESFKDYISTETLDGENKYDAGEHGLQEAEKGVIFAAFPPVLHLHLMRFQYDPITDSSVKFNDRFEFYETQGRREGTLPALAIYIFCYILVQTSNWPTFFIVVHSGGVLYNYTSANFLDCGPIWCKFDDDVVSRCTKQEAIEYNYGGQDEDMALTVRHCTNAYMLVYIRDSQLKTVLQEVTQADIPTELSERLADEKRIETIRRKERNEAHLYMNVNVVLEEAFDGHQGNDLYEPERAHCRVLRVRKQATLAELLAALSDHFRYPRAHLRLWPFSARSNQTCRPTCLDVANDQNKTIVDISENMNPWSVFLEMLPPDSGLNALPPFDKENDVVLFFKFYDPKQKRIHYCGHHYLPIASKPADLIPILNKRAGFPPDTPLNLYEEIKPDFVEKINNYNDPLEKVLDELMDGDIIVFERAERHDDLELPTCQDYFKYIFYKVEVQFVDKTVPNDPGFTMELSMQMRYDQMARAVGQRLNVDPFLIQFFKCQNYKDTPGMPLRYSYDGILKELLVYCKPKCPKKLFYQILSIKVNELDNKKQFKCLWVGPNYKEDKELILYPNKGGKVSDILEEAAKVVEMSPEGSGRLRIVEVSCHKVLPGPDPELTLDQVTISPPRLYRIEEIPRDEIDLQEDEILVPCAHFQKQVYSTFGIPFYARLKHHEPFSALKDRLQRKLDVPDKEWEKYNFAIVTNGRPNYISEGATINIYDFRTTSNANATGRPWLGLEHINKTPKRSRVNYLEKPIKIYN-