Monarch geneset OGS2.0

DPOGS213441
TranscriptDPOGS213441-TA1581 bp
ProteinDPOGS213441-PA526 aa
Genomic positionDPSCF300356 + 167856-171170
RNAseq coverage145x (Rank: top 54%)
Annotation
HeliconiusHMEL0061530.075.51% 
BombyxBGIBMGA007317-TA0.086.47% 
DrosophilaUsp7-PB0.065.45% 
EBI UniRef50UniRef50_F4W5Y20.072.87%Ubiquitin carboxyl-terminal hydrolase 7 n=8 Tax=Bilateria RepID=F4W5Y2_ACREC
NCBI RefSeqXP_974951.10.071.92%PREDICTED: similar to ubiquitin specific protease 7 [Tribolium castaneum]
NCBI nr blastpgi|3320307820.072.87%Ubiquitin carboxyl-terminal hydrolase 7 [Acromyrmex echinatior]
NCBI nr blastxgi|3320307820.072.87%Ubiquitin carboxyl-terminal hydrolase 7 [Acromyrmex echinatior]
Group
Gene OntologyGO:00065112.9e-56ubiquitin-dependent protein catabolic process
GO:00042212.9e-56ubiquitin thiolesterase activity
GO:00055155.5e-24protein binding
KEGG pathway 
InterPro domain[176-482] IPR0013942.9e-56Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2
[27-166] IPR0089745.5e-24TRAF-like
[38-159] IPR0020834.3e-14MATH
[29-159] IPR0133222.3e-11TRAF-type
Orthology groupMCL10867 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213441-TA
ATGATTGAAGCCGCTAGCGTGGATAATGCGGGCAATACGCCACAAACCATCGCTAAACCTATCGAAGATGAAGATATCGCACGACCCTCAGCAACTTTCCGATATACACTGCAGAATGTAAGTCAACTCAAAGAACAGGTTTTGTCTCCCGCATACTATGTGCGGTGTTTGCCATGGAAGATACTAGTCCTGATAAGAAACACTACCACACCCGATCGCCAACAGCAAAAGGCACTCGGCATCTTCCTACAGTGCAATGGTGAATGCGATTCACCGGGGTGGTCGTGTTATGGTTTAGGTGAATTAAAATTACTTTCTCACAAATCAGATGGGGAGCATTTATGCCGAAAGCTACACCACATGTACCACAGCAAAGAAGATGATTGGGGATTCGCTCATTTTATATCTTGGAAAGACCTTATAGATCCAGATAATGGATTTGTCAAAGACGATTCTATTACTATCGAGGCACATGTTATTGCAGACGCACCTCATGGTGTTTCTTGGGATTCTAAGAAACATACGGGCTATGTTGGACTTAAGAATCAAGGTGCAACTTGCTACATGAACTCCCTCCTGCAGACTTTATTTTTCACGAATGTTCTTCGCAAAGCTGTCTACAAAATACCAACGGTAGGTGACGACAGTTCTCGATCAGTGGCATTCGCATTGCAACGTGTTTTCTATGACTTGCAATTCCTGGATAAACCTGTTGCTACAAAAAAACTCACTAAAAGCTTTGGTTGGGAAACGCTAGACTCATTTATGCAGCATGACGTTCAGGAATTTCTCAGGGTGCTCCTAGACAAATTAGAAAATAAAATGAAAGGGACAGTCGTTGAAGGTACGGTGCCTAAATTGTTTGAGGGGAAAATGACGTCTTTTATTAAATGTAAAAATGTCAACTGCACAAGTACTCGCGTCGAAACATTCTATGATATACAGCTTAGTGTTAAGGGAAAGAATAATATTTATGAGTCATTTAAAGATTACATAAGTATAGAACTACTTGACGGTGAGAATAAGTACGATGCAGGAGAACATGGACTACAAGAAGCAGAGAAAGGAGTGCGGTTTGACGTGTTTCCGCCTGTGTTGCATTTGCATCTAATGAGATTTCAATATGATCCACAGAGCGACGCTTCAGTTAAATTCAATGATCGATTTGAATTCTACGAGGAAGTGAATCTAGATCAATATTTACAAGAGATTCCTCAAACACCAGCGCACTACACCTTACACGCAGTTTTAGTGCACTCTGGCGATAACCATGGTGGGCACTACGTGGTCTTTATAAACCCTAAGGGAGATGGCAAGTGGTGCAAATTCGACGATGATGTTGTATCTCGATGCAGTAAGCAGGAAGCTATTGAGTATAACTTTGGAGGGAAGGAAGACGCCCCACATCAGGCAAGAAGGGCTACAAGTGCCTATATGCTTATTTATATACAGACATCCCAATTAAAGTATGTGTTGCAAGATGTGACAGAGACCGACATACCAACTGATCTTTGCGACCGAATAACTGAAGAGATGAGATACGAGATGGCCATAGCCGGCTCCAAATATAAAGGGAGATAA

Protein sequence:

>DPOGS213441-PA
MIEAASVDNAGNTPQTIAKPIEDEDIARPSATFRYTLQNVSQLKEQVLSPAYYVRCLPWKILVLIRNTTTPDRQQQKALGIFLQCNGECDSPGWSCYGLGELKLLSHKSDGEHLCRKLHHMYHSKEDDWGFAHFISWKDLIDPDNGFVKDDSITIEAHVIADAPHGVSWDSKKHTGYVGLKNQGATCYMNSLLQTLFFTNVLRKAVYKIPTVGDDSSRSVAFALQRVFYDLQFLDKPVATKKLTKSFGWETLDSFMQHDVQEFLRVLLDKLENKMKGTVVEGTVPKLFEGKMTSFIKCKNVNCTSTRVETFYDIQLSVKGKNNIYESFKDYISIELLDGENKYDAGEHGLQEAEKGVRFDVFPPVLHLHLMRFQYDPQSDASVKFNDRFEFYEEVNLDQYLQEIPQTPAHYTLHAVLVHSGDNHGGHYVVFINPKGDGKWCKFDDDVVSRCSKQEAIEYNFGGKEDAPHQARRATSAYMLIYIQTSQLKYVLQDVTETDIPTDLCDRITEEMRYEMAIAGSKYKGR-