Monarch geneset OGS2.0

DPOGS203970
TranscriptDPOGS203970-TA2253 bp
ProteinDPOGS203970-PA750 aa
Genomic positionDPSCF300005 + 701572-724269
RNAseq coverage240x (Rank: top 43%)
Annotation
HeliconiusHMEL0103720.073.80% 
BombyxBGIBMGA000731-TA0.061.54% 
DrosophilaCG8830-PA4e-8431.81% 
EBI UniRef50UniRef50_Q16MY52e-8330.05%Ubiquitin specific protease (Fragment) n=1 Tax=Aedes aegypti RepID=Q16MY5_AEDAE
NCBI RefSeqXP_001975909.12e-8632.23%GG22578 [Drosophila erecta]
NCBI nr blastpgi|1948836404e-8532.23%GG22578 [Drosophila erecta]
NCBI nr blastxgi|1954853362e-8532.89%GE13447 [Drosophila yakuba]
Group
Gene OntologyGO:00065112.9e-41ubiquitin-dependent protein catabolic process
GO:00042212.9e-41ubiquitin thiolesterase activity
KEGG pathway 
InterPro domain[327-707] IPR0013942.9e-41Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2
Orthology groupMCL12418 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203970-TA
ATGGCAGTCAAAAAACATGAGTTGGTTGCGGATAATCGCAACCAACTGGATCTGACTGCGTTGGCGCAATACTTCCAGATGATGAATGAACAGGCGAACTATCTGCCACCACCTCAAGAGCTTGTTAAACATTGTCAGGATATTGTTAAATGCTTAGCAAGAACAACAGGAACAGAGGAACAATTATGGCAACTGCTACAGCCAGTCGAGAGGTTTTTGCTTCGCGTAGTGACAACAGTACACGGCGATGTTCGTCATGAAGTCTTTACAGCAATTCTAGACAAATTCTATTCCTACATATCAGATCCACAATCGGACGCATGTCCCGCGACTGCCGCGGTGCTGGTAGTGATCGACGAGTCTGATGACGAGGGTGCTTTGTCAGCGGCCCGTTGGTTGGTCCAGCAGCGTGAGAACGGTCCGGCCGGGGCTGGACTGAGAGCAGCGTTGTCCTGCCTATTTAGATGGTTATTACATTGGCACGGGACGCCTGCATTGGGGAATTGGATAGTCGCATATATAAAAGCTTTGGAGGAAAACGAACAATACGACATATTGATACAGACGAGTTTGGACAACGTGGCCAAACTTATATCATCAATGACCAACCCAGCACCGCTACATCAGAAATCAGCTGATGTTATCTTTCACGTTCTATCATCTTTGCGTGAATCCGCTGAGGCGTTTGATAGGATATCCACTGAAATCCCTGATGTGCTGGTAAGCTTGGCGTCTGACAGTGGTCAGTGGAGTCGCCAATTGCTACAAACGCTGGTAGACATACTCACATCGATGGTCGATGGAATACTTGAAACGCTTAAGGGCGACGCCCAAGACGTATTCAGAGATAAATACGCTGACGTGATACTGTGCCTGGAGAGGCACATGGCGACCCGTGGCTGCGGTCACCTGCAGCTAACACCATGGCGGACCCGGAACGTATGCGTTTCGGACACACCACACGTCCCCATGAGAAAGGTTGGCCTCCTCAACCTCGGCAACACATGCTACATGAACAGCGTGATGCAAGCTTTGCTTGTCACCAGGCAGTTTAGTACGCATGTGATGCTAAAGATGACAGCTGTGCCCTATTGGGCAAAAATGAGCATACTGTTCTCAAAGATGATGCACTCCGTATCAACCAAGCTTAACCCGGACGAGTTCTTCAGTGTTGTGAAGCCACCGTTTTTTACGATGGACAACCAGCACGATAGTTCCGAGTTTCTCGGATACCTTTTCGAATTGCTGCAATCGTATGAACACTGCTCGGACAAGAACTTCGATTATACGCGCCCTGCCGTTCTGAACGGTGCAAGCCGCCTGCGCATGTTACATCCACACGGACAGACACACACGTCACAATCAAGTGATGAACGCCCCGGAACAAGTCAATCTCGTCGCTCTCTATCACCAAGACCTGGTAGCAGCGCTGCTGGGAGTAGTTCTGGATCCTCGCAATCTTTGAAGCGTTCATCTGCTGACAATCAGAATCCGACACAGAAAAAACGTTTGAGATTGAGCGAGGCAACGTTTTTACGACGGGATTCGTTTATAGACAGTATGTTTGGAGGTGTTCTTCTAACACGAGTTGAGTGCTCCGTCTGCCATGCTACATTCCTCTCGCGAGATGTCTTCAGAGACCTCCAACTGGCTTTCCCCGAAAAAACTGAAGGATGTCAGCACAGTGTACAAAGCCTCTTAGAATTCTACTGCTCAAAGGAACGAATGTCAGGAGACAATCAATACGAATGCCGAGACTGCGGTAGACTTCGTGACGCTGAAAGGAGCGTCCTAATTGAAACTACGCCCAAGTACCTCATTTTAGTTTTGAAGAATTTCAAGTTTGACTCCAAGCTACAAACTCAGACCAAGCTCATGCACACCGTGTACCATAACCACACTGTGACATTACCGACGGTCCGTACACAGCCAGTCCACCCTGTATACAACTTGTTCGCTGCGGTCATACATGCGGGAACGACCCTTGACTCCGGCCATTATTTCACTTTAGCCAAGGACAATGATCAGTGGCACAAGTTTAACGATGATGAAGTTAGCTTCGCTGATGAGAACCTCTTAAACGGCCTGAGTCGCTCTAGCACTCCATACATACTTTTTTACCGTAGAACTGATGTTGAAGAGGGTGCTGTACCAACTCTAGAGGAACTACCTCCCAAACTACAGGAATCAGTCCTCTCACATAATAAGAATTATGTAGAGACCGTACGACAGATCCGTCATAATCGTCCCTGA

Protein sequence:

>DPOGS203970-PA
MAVKKHELVADNRNQLDLTALAQYFQMMNEQANYLPPPQELVKHCQDIVKCLARTTGTEEQLWQLLQPVERFLLRVVTTVHGDVRHEVFTAILDKFYSYISDPQSDACPATAAVLVVIDESDDEGALSAARWLVQQRENGPAGAGLRAALSCLFRWLLHWHGTPALGNWIVAYIKALEENEQYDILIQTSLDNVAKLISSMTNPAPLHQKSADVIFHVLSSLRESAEAFDRISTEIPDVLVSLASDSGQWSRQLLQTLVDILTSMVDGILETLKGDAQDVFRDKYADVILCLERHMATRGCGHLQLTPWRTRNVCVSDTPHVPMRKVGLLNLGNTCYMNSVMQALLVTRQFSTHVMLKMTAVPYWAKMSILFSKMMHSVSTKLNPDEFFSVVKPPFFTMDNQHDSSEFLGYLFELLQSYEHCSDKNFDYTRPAVLNGASRLRMLHPHGQTHTSQSSDERPGTSQSRRSLSPRPGSSAAGSSSGSSQSLKRSSADNQNPTQKKRLRLSEATFLRRDSFIDSMFGGVLLTRVECSVCHATFLSRDVFRDLQLAFPEKTEGCQHSVQSLLEFYCSKERMSGDNQYECRDCGRLRDAERSVLIETTPKYLILVLKNFKFDSKLQTQTKLMHTVYHNHTVTLPTVRTQPVHPVYNLFAAVIHAGTTLDSGHYFTLAKDNDQWHKFNDDEVSFADENLLNGLSRSSTPYILFYRRTDVEEGAVPTLEELPPKLQESVLSHNKNYVETVRQIRHNRP-