Monarch geneset OGS2.0

DPOGS210564
TranscriptDPOGS210564-TA1182 bp
ProteinDPOGS210564-PA393 aa
Genomic positionDPSCF300911 - 557-5554
RNAseq coverage23x (Rank: top 78%)
Annotation
HeliconiusHMEL0084191e-15584.23% 
BombyxBGIBMGA009941-TA4e-16488.26% 
DrosophilaCG12082-PA2e-10862.29% 
EBI UniRef50UniRef50_E3X2893e-11064.65%Ubiquitin carboxyl-terminal hydrolase n=9 Tax=Pancrustacea RepID=E3X289_ANODA
NCBI RefSeqXP_001847129.14e-11665.10%ubiquitin carboxyl-terminal hydrolase 5 [Culex quinquefasciatus]
NCBI nr blastpgi|3071925388e-11967.45%Ubiquitin carboxyl-terminal hydrolase 5 [Harpegnathos saltator]
NCBI nr blastxgi|3071925385e-11867.45%Ubiquitin carboxyl-terminal hydrolase 5 [Harpegnathos saltator]
Group
Gene OntologyGO:00082701.4e-23zinc ion binding
GO:00065116.2e-08ubiquitin-dependent protein catabolic process
GO:00042216.2e-08ubiquitin thiolesterase activity
KEGG pathway 
InterPro domain[121-176] IPR0016071.4e-23Zinc finger, UBP-type
[102-215] IPR0130834.2e-21Zinc finger, RING/FYVE/PHD-type
[249-280] IPR0013946.2e-08Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2
Orthology groupMCL10894 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210564-TA
ATGTCGGAAATCCCGGAGCCAGAACAAACAGGAGATGGTCCAGAGAAGAAAATAACTCGCCTGGCGATCGGTGTGGAGGGTGGCTTCGACCCTGACTGTGGCAAACCAAAGTACACTTACACAGAACACTACAGCGTTGTGGTGCTGCCGGGGTTTCACACATTCCCCTGGCCTAATGACGCTTTACCTGACGTGGTAAAGAAATCTGTTCAGGCTGTGCTAGATGCGGATTCTCCATTCAAGCTCGCTGAGGCGGAGGCTTTACACGGCACCTGGGATGGGGAGAAGCGAGAAGTATCCGTCCACTCGGTTAACTTGAAGCAGTTAGATAACGGCGTTAAAATACCACCTTCCGGCTGGAAATGTGCCAAGTGTGATCTGACGAACAACTTGTGGTTGAATCTGACCGACGGGTCCATATTGTGTGGGAGGAGATTCTTCGATGGCTCCGGCGGAAACGATCACGCGGTGGAGCATTTCCGCGCGACCGGATATCCGCTCGCTGTGAAGCTTGGCACGATAACAGCTGACGGTACTGGCGACGTGTACTCGTACGCCGAAGACGATATGGTCGAGGACCCCTACCTGGCGGAACACCTCAAACACTTCGGCATCAACGTCCAGCAGTTACAGAAGACGGAGAAGTCGATGGTGGAGTTGGAGCTGGAACTGAACCGCCGTACGGGCGAGTGGAACACCATCCAGGAGTCTGGAAGTGAGCTGCGACCGCTGCACGGACCAGCACTCACAGGTGTCAACAACCTCGGCAACTCCTGTTACATCAATAGTGTGGTCCAGGTGCTCTTCCGTATGCCGGACTTCATACGTCGCTACGTGGAAGGCGCGCCAGAGATATTCTCGACCTTCCCCGAGGATCCTGCTAACGATTTCAACGTGCAGACAGATCCGTCCGAAGTGGTCCGTCCCCTGATACCGTTTCAAGCGTGTTTAGACGCGTTCATGAAGGAGGAACTCATTGAACAGTTCTTTAGTTCAGCTCTCAATAAGAAAGTTACTGCTCGCAAAATAACCCGGCTGGCGACTTTCCCCGATTACCTTTGGATCCAGTTAAAGAAATTCACTATCAAAGAAGATTGGACACCCGCCAAGCTAGATGTGGCCGTGGACATGCCGTGGGAGGTCGGTGTCATTGTCATCGTCCCAAAACAAACGTTTTTTTAA

Protein sequence:

>DPOGS210564-PA
MSEIPEPEQTGDGPEKKITRLAIGVEGGFDPDCGKPKYTYTEHYSVVVLPGFHTFPWPNDALPDVVKKSVQAVLDADSPFKLAEAEALHGTWDGEKREVSVHSVNLKQLDNGVKIPPSGWKCAKCDLTNNLWLNLTDGSILCGRRFFDGSGGNDHAVEHFRATGYPLAVKLGTITADGTGDVYSYAEDDMVEDPYLAEHLKHFGINVQQLQKTEKSMVELELELNRRTGEWNTIQESGSELRPLHGPALTGVNNLGNSCYINSVVQVLFRMPDFIRRYVEGAPEIFSTFPEDPANDFNVQTDPSEVVRPLIPFQACLDAFMKEELIEQFFSSALNKKVTARKITRLATFPDYLWIQLKKFTIKEDWTPAKLDVAVDMPWEVGVIVIVPKQTFF-