Monarch geneset OGS2.0

DPOGS210469
Transcript: DPOGS210469-TA, 1497 bp
Protein: DPOGS210469-PA, 498 aa
Genomic position: DPSCF300062 + 380759-386921
RNAseq coverage: 651x (Rank: top 20%)
Annotation
Heliconius: HMEL012771, E-value 3e-141, identity 66.67%
Bombyx: BGIBMGA002759-TA, E-value 0.0, identity 81.70%
Drosophila: CG5384-PA, E-value 2e-158, identity 55.90%
EBI UniRef50: UniRef50_Q9VKZ8, E-value 3e-156, identity 55.90%, Ubiquitin carboxyl-terminal hydrolase n=46 Tax=Bilateria RepID=Q9VKZ8_DROME
NCBI RefSeq: XP_969056.1, E-value 0.0, identity 65.19%, PREDICTED: similar to ubiquitin specific peptidase 14 [Tribolium castaneum]
NCBI nr blastp: gi|91086685, E-value 0.0, identity 65.19%, PREDICTED: similar to ubiquitin specific peptidase 14 [Tribolium castaneum]
NCBI nr blastx: gi|91086685, E-value 2e-179, identity 65.19%, PREDICTED: similar to ubiquitin specific peptidase 14 [Tribolium castaneum]
Group
Gene Ontology:
  GO:0006511, E-value 2e-50, ubiquitin-dependent protein catabolic process
  GO:0004221, E-value 2e-50, ubiquitin thiolesterase activity
  GO:0005515, E-value 3.1e-05, protein binding
KEGG pathway: (none listed)
InterPro domain: [105-466] IPR001394, E-value 2e-50, Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2
Orthology group: MCL14316, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210469-TA
ATGCCCAAAGTTTCAGTGAAAGTAAAATGGGGTAAGGAGATGTATCCCGGCGTTGAAGTTAACACAGACGACGATCCCGTTTTGTTCAAAGCTCAGATCTTTGCCCTCACAGGAGTACAGCCGGAAAGACAGAAAGTTGTATGCAAAGGGGTCACCCTCAGAGATGACGCTTGGGGCAATTTCAAATTAACAAATAATGCCCTAGTTCTGGTAATGGGTAGTAAAGAGGAAGATGTACCGGCTGCTCCTGTTGAACAGACTCGATTTGTGGAAGACATGAATGAATCGGAATTAGCTACGGCTCTCGACCTCCCAGAAGGTCTCATCAACTTGGGCAACACCTGCTATATGAATGCTACAGTACAATGTCTGAAGACTGTTCCCGAATTGAAAAATGCATTACTTAATTATGATCCAACATCAGGTGGAGGCACAGCAGGAGGCCTGACATCGGCCTTGAGCGAAACAATGAGGTCTCTGGAGGGGGGCGGGGCAGGGGCGTGTGCTGCGGCCGCGGCACGGCTTCTACACGCTCTGCACGCTGCAGCGCCACGTCTGGCCGAGCGGGGAGCTGGAGGACAACTGGCCCAACAAGATGCCTCTGAGTGCTGGACCGAAATCATACGAGCTCTGAGTATGAGGCTGCAATCCACACCTGAAAGTCACAGCAAGCCATTGATAGAGCAGTACTTCGGTGGAACTCTGGATGTAGAGTTAGTGTGCAGTGAGGCAGACGAGCCACCAACTCGGTCCACAGAGACCTTCCTGCAGCTCTCCTGCTTCATATCACAGGACGTCAAGTATCTACAGTCCGGACTCAGATCTAAAATGTCTGAACAAATTACAAAGATGTCAGAAACGTTGGGTAGAGATGCTGTTTACACTAAAACTAGCAAAATTAGTCGCCTGCCCGCCTACCTGACGGTCCAGTTCGTGAGGTTCTACTACAAAGAGAAGGAATCCATCAACGCCAAAATTCTCAAAGACGTCAAATTTCCTCTCGAGCTCGATGTTTACGAACTCTGCTCACCAGAACTGCAGGAGCGTCTCACCCCGATGCGGACCAAGTTTAAGGAACTCGAGGAAGCGTCGGTGGAAGCGGCTCTGAGCTCCAAGAATAAAAATCACGGAGACAGTAAAAAGGAGATCAAGAGGAAGGCGACGCTGCCGTACTGGTTCGAGAATGACGTGGGCAGCAACAACAGCGGCTACTACCGTCTGCAGGCGGTGCTGACTCACCGCGGCCGCTCGTCCTCGTCCGGTCACTACGTGGCGTGGGTCGCGCGCGGGGACGGCTGGCTCCGCTGCGACGACGACGCCGTGTCGCCCGTCACCGAGGAGGAGGTGCTCAAACTGAGCGGCGGAGGTGACTGGCACTGCGCGTATCTCTTGCTGTACGGACCAAAGATCCTGGAGCTATCTCAGGAGGGAGACAGTCCTGAGCCGATGATAACCGATGAGGCCTCCGGGCCCGACCCGCCGACGGCGCTCGCCTAA

Protein sequence:

>DPOGS210469-PA
MPKVSVKVKWGKEMYPGVEVNTDDDPVLFKAQIFALTGVQPERQKVVCKGVTLRDDAWGNFKLTNNALVLVMGSKEEDVPAAPVEQTRFVEDMNESELATALDLPEGLINLGNTCYMNATVQCLKTVPELKNALLNYDPTSGGGTAGGLTSALSETMRSLEGGGAGACAAAAARLLHALHAAAPRLAERGAGGQLAQQDASECWTEIIRALSMRLQSTPESHSKPLIEQYFGGTLDVELVCSEADEPPTRSTETFLQLSCFISQDVKYLQSGLRSKMSEQITKMSETLGRDAVYTKTSKISRLPAYLTVQFVRFYYKEKESINAKILKDVKFPLELDVYELCSPELQERLTPMRTKFKELEEASVEAALSSKNKNHGDSKKEIKRKATLPYWFENDVGSNNSGYYRLQAVLTHRGRSSSSGHYVAWVARGDGWLRCDDDAVSPVTEEEVLKLSGGGDWHCAYLLLYGPKILELSQEGDSPEPMITDEASGPDPPTALA-
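
Consistency check:

The lengths and coordinates listed in this record can be cross-checked with a few lines of Python. The sketch below is illustrative only. It assumes the transcript is coding sequence that includes the terminal stop codon (the nucleotide sequence starts with ATG and ends with TAA), and that the genomic coordinates are 1-based and inclusive, which the record does not state. The function names are hypothetical and not part of any database tool.

```python
def coding_length_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """Return True if the transcript is the protein's codons plus one stop codon."""
    return transcript_bp % 3 == 0 and transcript_bp // 3 == protein_aa + 1


def genomic_span(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 1497
protein_aa = 498
start, end = 380759, 386921

print(coding_length_consistent(transcript_bp, protein_aa))  # True
print(genomic_span(start, end))  # 6163
```

Under these assumptions the 1497 bp transcript corresponds to 498 codons plus a stop codon, and the gene spans 6163 bp on the scaffold.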