Monarch geneset OGS2.0

DPOGS215177
TranscriptDPOGS215177-TA1065 bp
ProteinDPOGS215177-PA354 aa
Genomic positionDPSCF300143 - 424761-428346
RNAseq coverage440x (Rank: top 28%)
Annotation
HeliconiusHMEL0092535e-15082.84% 
BombyxBGIBMGA008661-TA2e-7380.67% 
DrosophilaUbpy-PA7e-12259.70% 
EBI UniRef50UniRef50_Q8MQX45e-11959.40%Ubiquitin carboxyl-terminal hydrolase n=10 Tax=melanogaster group RepID=Q8MQX4_DROME
NCBI RefSeqXP_002096515.14e-12260.30%GE25711 [Drosophila yakuba]
NCBI nr blastpgi|1954984197e-12160.30%GE25711 [Drosophila yakuba]
NCBI nr blastxgi|3867661989e-11958.50%UBPY ortholog, isoform B [Drosophila melanogaster]
Group
Gene OntologyGO:00065113.8e-66ubiquitin-dependent protein catabolic process
GO:00042213.8e-66ubiquitin thiolesterase activity
KEGG pathwaydya:Dyak_GE257111e-121 
 K11839 (USP8, UBP5)maps-> Endocytosis
InterPro domain[25-348] IPR0013943.8e-66Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2
Orthology groupMCL14845 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215177-TA
ATGGTGTTGAAGGTGAAGGTCAGACTCAAATACTCGCAGAACCACATCGTCTGGCTCTTCAAGCGAGGCCGCGGTCTGACGGGTCTCAAGAACTTGGGCAACACCTGCTACATGAACTCAATAATCCAGTGTCTCAACAACACAGCCATCCTCGTCACGTACTTCTGTAACGGACAGTACCTCGAACACGTCAACAGATCTCATAGCACCAGAGGCGCGATCGCTGAAGAGTTGGCGGCTGTGGTACGGGCGTTGTGGTCGGGGCAATACAGGTTCATAGCGGCAAAGGATCTGAGGAGCGAGGTGGGTAAACACCAGCGCTCTTTCCGCGGCAACGAGCAGCAGGACTCGCACGAGTTCCTAACCATCCTGATGGACTGGCTTCATCTCGACCTTCAGTTCACCATCAAACCACCACACAAGGAAACCCTGGGAGCGTCCGAGCGCGCCTGGCACGAGTACACCAAGTCTAAGGAGAGTCTCGTCCTGCGTCTGTTCTACGGTCAGATACGATCCACAGTACGCTGTACGGTGTGTCGCGCAAGTTCACCGACATACGACTCCTTCTCCAATCTATCGCTGGAACTGCCGCCGGCCGCCGCCAGGTGTACGCTCGCGGATTGTCTGAAGCTCTACCTGAACGGTGAAACGATACCAGGTTGGAACTGTCCCAACTGCAAAGAGAAGAGAGATGCCGTCAAGAAGCTGGACATCTCCCGCCTGCCGCCCGTGCTCGTCATACACTTCAAGAGGTTCTACGTGGACCCCAAGGAATATATGTGCAACGCGTACAGGAAGAAGCAGACCTACATCGACTTCCCCCTCGAAGACCTGGACATGAGGCAGTTCTCGTTGCACTGTCCCGGGAACCCCATATATAATCTGTACGCTGTGTCCAACCACTACGGAACCATGGAGGGCGGACACTATACAGCCTACTGTAAAAGTAGCGTTTACGGCAAATGGTACAAATTCGACGACCACCTCGTGTCGGAGATGTCGTCAGGCGAGGTCCGCTCTTCCGCCGCCTACATCCTGTTCTACTCGGCCTGTAAGCCTTCCTGA

Protein sequence:

>DPOGS215177-PA
MVLKVKVRLKYSQNHIVWLFKRGRGLTGLKNLGNTCYMNSIIQCLNNTAILVTYFCNGQYLEHVNRSHSTRGAIAEELAAVVRALWSGQYRFIAAKDLRSEVGKHQRSFRGNEQQDSHEFLTILMDWLHLDLQFTIKPPHKETLGASERAWHEYTKSKESLVLRLFYGQIRSTVRCTVCRASSPTYDSFSNLSLELPPAAARCTLADCLKLYLNGETIPGWNCPNCKEKRDAVKKLDISRLPPVLVIHFKRFYVDPKEYMCNAYRKKQTYIDFPLEDLDMRQFSLHCPGNPIYNLYAVSNHYGTMEGGHYTAYCKSSVYGKWYKFDDHLVSEMSSGEVRSSAAYILFYSACKPS-