Monarch geneset OGS2.0

DPOGS206464
TranscriptDPOGS206464-TA1428 bp
ProteinDPOGS206464-PA475 aa
Genomic positionDPSCF300070 + 124259-129866
RNAseq coverage116x (Rank: top 58%)
Annotation
HeliconiusHMEL0129400.073.17% 
BombyxBGIBMGA005342-TA0.072.50% 
Drosophilacalypso-PA1e-11171.53% 
EBI UniRef50UniRef50_Q17N729e-13055.45%Ubiquitin carboxyl-terminal hydrolase calypso n=13 Tax=Bilateria RepID=CALYP_AEDAE
NCBI RefSeqXP_001651006.12e-13055.45%ubiquitin c-terminal hydrolase x4 [Aedes aegypti]
NCBI nr blastpgi|3800155991e-12967.16%PREDICTED: ubiquitin carboxyl-terminal hydrolase calypso-like [Apis florea]
NCBI nr blastxgi|1571102193e-12555.68%ubiquitin c-terminal hydrolase x4 [Aedes aegypti]
Group
Gene OntologyGO:00065113.3e-161ubiquitin-dependent protein catabolic process
GO:00056223.3e-161intracellular
GO:00042213.3e-161ubiquitin thiolesterase activity
KEGG pathway 
InterPro domain[5-440] IPR0015783.3e-161Peptidase C12, ubiquitin carboxyl-terminal hydrolase 1
Orthology groupMCL15612 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206464-TA
ATGCCCGTGGAATTAAATAGTCTTACAGAGGGATGGCTAGAATTAGAAAGTGATCCGGGTCTATTCACTTTACTCCTTGAAGATTTCGGCGTAAAGGGAGTCCAAGTGGAGGAAATATATGATTTACACAAACCTCTAGAAAGCCCGGTTTATGGGTTTATTTTTTTATTTCGCTGGATTGAAGAAAGACGATCACGTCGTAAATTTGTCGAACAGATTGAAAGCTTCGTGCGTGATGAAGAAACAATAAATAATATATTTTTCGCCCAACAAATGGTTCCGAACAGTTGTGCTACACATGCACTACTGTCCATATTACTTAATTGTCCAAATCTCCATTTAGGAGAAACACTGAGCCGATTAAAGCATCATACACTAGGTATGAATCCTGAGAACAAAGGTTGGGCCATAGGAAACACGCCTGAATTAGCTTGTGCACACAATTCCCATGCTATACCACAAGCTCGCAAAAAGACTGATAAAAATGCTGGAGTTCCTACTGGTCGCTTTACAGGTGAAGCGTACCATTTTGTAAGTTTTGTGCCCATAAATGGTCATTTATTTGAATTGGATGGCTTAAAGCCCTTTCCAACTGACCATGGTCCATGGGCCTCTGACGAAGATTGGACTGACAAGTTTCGAAGAGTCATGGCTGAGAGACTAGGAAGAGATGCCGGAGAACAAGTACATGATATAAGATTTAATCTCATGGCGGTAGTACCAGATAGACGAATAGCATTGACTCAAAGATTAAGTGCTTTGGAATTAAACCAAAGGAGACTCAAAGAGGCCATCTCTAAAATAGGCAAACATCTAAGACATTTATTGAACAAGAATAGAGATTTTAACGAAGATTCACTGTCCAGCATGAACAATGTTGATGGATCAAATGAAAGTTCCATTCAAATATGCGAGGACGCAATTCTCAATGCCTTAGAGGCTTCAGAGGTGTCATCCTTGAATATAAATATAACAGATGCCATTACAATCGAAATTGGTGCCTCGGACAGTGCGTATGATAGCACTATAACCTTAGCTGATCCTATTGATCAAACTGCTGTTGTCAAATTTGTGACAATTAATGGTGAAAACGAGATATTGTCAGATATATACCCCACATCTACGACCGCACTCATCAAGAGTAACAACATGCCAGTTGTATTGTGTTGTGAAGTGGTCCCGGAACAGCCTTACAGGATGAGGAAGCTTCTGTTCACTCACGCCGAGCTGAACTCACTCATGAACAGCATCATGAGTGAGGTCCAGGCGTGCCAACAGGCGCTCAATGATGAGAATGATAAAAGAGATATGTACAAGGTCATGTCGGCAGTAGCACCAGTATTATCAGCGATTCCTCTGGCTCACAACCAAAACTTGCCGCTATCACTAAAAACCACAGCTCTACTCTGGGTGTCCGTGGACCTTTAG

Protein sequence:

>DPOGS206464-PA
MPVELNSLTEGWLELESDPGLFTLLLEDFGVKGVQVEEIYDLHKPLESPVYGFIFLFRWIEERRSRRKFVEQIESFVRDEETINNIFFAQQMVPNSCATHALLSILLNCPNLHLGETLSRLKHHTLGMNPENKGWAIGNTPELACAHNSHAIPQARKKTDKNAGVPTGRFTGEAYHFVSFVPINGHLFELDGLKPFPTDHGPWASDEDWTDKFRRVMAERLGRDAGEQVHDIRFNLMAVVPDRRIALTQRLSALELNQRRLKEAISKIGKHLRHLLNKNRDFNEDSLSSMNNVDGSNESSIQICEDAILNALEASEVSSLNINITDAITIEIGASDSAYDSTITLADPIDQTAVVKFVTINGENEILSDIYPTSTTALIKSNNMPVVLCCEVVPEQPYRMRKLLFTHAELNSLMNSIMSEVQACQQALNDENDKRDMYKVMSAVAPVLSAIPLAHNQNLPLSLKTTALLWVSVDL-