Monarch geneset OGS2.0

DPOGS205709
TranscriptDPOGS205709-TA1416 bp
ProteinDPOGS205709-PA471 aa
Genomic positionDPSCF300250 + 75721-79632
RNAseq coverage224x (Rank: top 44%)
Annotation
HeliconiusHMEL0148081e-7995.95% 
BombyxBGIBMGA009920-TA8e-7258.80% 
DrosophilaCG3016-PA4e-4056.73% 
EBI UniRef50UniRef50_Q16LM87e-8040.41%Ubiquitin-specific protease (Fragment) n=1 Tax=Aedes aegypti RepID=Q16LM8_AEDAE
NCBI RefSeqXP_001662730.11e-8040.41%ubiquitin-specific protease [Aedes aegypti]
NCBI nr blastpgi|1571329773e-7940.41%ubiquitin-specific protease [Aedes aegypti]
NCBI nr blastxgi|1571329776e-8840.07%ubiquitin-specific protease [Aedes aegypti]
Group
Gene OntologyGO:00065119.2e-40ubiquitin-dependent protein catabolic process
GO:00042219.2e-40ubiquitin thiolesterase activity
KEGG pathway 
InterPro domain[39-457] IPR0013949.2e-40Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2
Orthology groupMCL13909 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205709-TA
ATGGATGGTGGTGATAGGATTCTTGTGGCCGCTGGTCTAACTGCGGCAGTGGTGGTTGGAGCATTCGTTTTATGGGGTCCCGGAGGAGCTCCCAAAGTTCGAAAGCGCCGAGGACAGATCGCTGGACTTCAGAATTTAGGAAGAACTTGCTTTCTAAATACATTATTGCAAGCACTGGCTGCCTGTCCTACTTTCATAGACTGGCTAAAAAAATACGCGAAGGCCGATGGACATAACAGTATGATTACTACTCTCTACACTGTAATTGAAGTGGTGAATGGTACCCACGAGTCCGCCCGCGGGACCCCTGTCTGTCCTCTGGGAGTGCTGCAGTCACTGCGTGCGGCCGGCTGGGTGGTGCCAGCCGACCAGCAAGATGCTCATGAGCTGTTACATGTTCTGTTATCTTGCATTGAAGAGGAAACAACCGCTATGTCTAAGAAGCCTGGCTGTCTCTCTGATGCGCTGGGTCTGGGCGGTGGTCGCGCGTGGTCCGCACTGGCGTCCCCGGCGTCCCCGCCTGCGGCGCCCCTGTCCCTGAGGGACGACGGAGCCCCGCCCCGGCCCGCCCGCCCGGCCTCCGCCGCCCCTGGCGGGGAACCCGACCTTCAAGACCCAGACCCGTGCGATCCTCCGGCCCGCCTGTGTAAGGGTGTGTCTCGGTCGTTCTGTCACCTCAGCTCGGTGGGTCGTCGCTGGGCGGCGGCGCCCATATCATGTTACGTGTGTATGTTCCAGAGCCCCGTGAGGTACGACAAGTTCGACAGTATATCTCTGTCGATGGCCAACGCCAGCACCGGCCTGACCGGCGGCTTCAGCTTATCCGGTCTGCTGAGAGCGTTCACGGCCCCGGAGATGGTGTCGGGCGTGAGGTGTGACAAGTGCTGCCCGGGAGGCGAGGAACGGGGAGCCGTGTCGCCCGCCACACAACACGCCTACACCAAGCACATACGGACGGTCGGCTTCGGGAAGCTGCCGCCGTGCCTGGTGCTGCAGGTGGCGCGGGTGGAGTGGCGCTGCGGGGCTCCGGCCAAGCGAGCCGACCACGTGGCCTTCCCGGAGACGCTGCCCATGGCGCCCTACACCACCGCGCCCAAACCGCAACCGGAGCTGTCGTCCCTGATGAGCGAGGGTCGCCTCCGCGGCGGGCTGTCCGTGCTGAACGCGACGAGTCCCCCGGCCGACGACCGAGCCCTGTACCGCCTGTGTGCGGTGGTAGTGCACGTGGGAGGACCTCGCAGCGGACACTTCGCCACGTACAGGAGGGGGAACGGCTTCGAGAGCAAACGGTGGTGGTACACATCTGACACGTTGGTGCACGAGGTGTCTTTGGCGGAGGTGCTCCGCTGCTCGGCCTACATGCTGTTCTACGAGCGACTGGCGCCGCCGCCGCCGCCTCTCACCACGCACTTCTAA

Protein sequence:

>DPOGS205709-PA
MDGGDRILVAAGLTAAVVVGAFVLWGPGGAPKVRKRRGQIAGLQNLGRTCFLNTLLQALAACPTFIDWLKKYAKADGHNSMITTLYTVIEVVNGTHESARGTPVCPLGVLQSLRAAGWVVPADQQDAHELLHVLLSCIEEETTAMSKKPGCLSDALGLGGGRAWSALASPASPPAAPLSLRDDGAPPRPARPASAAPGGEPDLQDPDPCDPPARLCKGVSRSFCHLSSVGRRWAAAPISCYVCMFQSPVRYDKFDSISLSMANASTGLTGGFSLSGLLRAFTAPEMVSGVRCDKCCPGGEERGAVSPATQHAYTKHIRTVGFGKLPPCLVLQVARVEWRCGAPAKRADHVAFPETLPMAPYTTAPKPQPELSSLMSEGRLRGGLSVLNATSPPADDRALYRLCAVVVHVGGPRSGHFATYRRGNGFESKRWWYTSDTLVHEVSLAEVLRCSAYMLFYERLAPPPPPLTTHF-