Monarch geneset OGS2.0

DPOGS211913
Transcript: DPOGS211913-TA (1134 bp)
Protein: DPOGS211913-PA (377 aa)
Genomic position: DPSCF300011 (+) 88957-92141
RNAseq coverage: 282x (Rank: top 39%)
Annotation
Heliconius: HMEL017705; 6e-103; 96.72%
Bombyx: BGIBMGA001054-TA; 0.0; 79.76%
Drosophila: CG7023-PB; 9e-116; 90.95%
EBI UniRef50: UniRef50_G3LY58; 0.0; 79.76%; Ubiquitin carboxyl-terminal hydrolase 46 n=1 Tax=Bombyx mori RepID=G3LY58_BOMMO
NCBI RefSeq: XP_625039.1; 0.0; 90.56%; PREDICTED: similar to CG7023-PA, isoform A [Apis mellifera]
NCBI nr blastp: gi|383856530; 0.0; 90.33%; PREDICTED: ubiquitin carboxyl-terminal hydrolase 46-like [Megachile rotundata]
NCBI nr blastx: gi|158300486; 0.0; 87.80%; AGAP012139-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology: GO:0006511; 6.5e-72; ubiquitin-dependent protein catabolic process
               GO:0004221; 6.5e-72; ubiquitin thiolesterase activity
KEGG pathway:
InterPro domain: [36-370] IPR001394; 6.5e-72; Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2
Orthology group: MCL11321; Multiple-copy universal gene

Nucleotide sequence:

>DPOGS211913-TA
ATGTCAATATGTCAGGATCTCGGCCTTAACTTCCAACAGAGAATGGGCGCAAATATATCGCAGCTAGAAAGAGATATTGGGTCAGAGCAGTTTCCACCGAACGAGCATTATTTTGGTTTAGTTAATTTTGGCAACACCTGCTATAGTAATTCCGTTTTACAAGCCCTGTACTTCTGTAGACCATTTAGAGACAGGGTACTAGAATACAAAGCTAAGAACAAGAGAACCAAGGAGACTCTACTAACATGCTTAGCGGACTTGTTTTATAGCATCGCAACACAGAAGAAAAAAGTAGGGTCCATTGCACCTAAAAAATTTATAGCTAGATTAAGAAAAGAAAAAGAAGAATTTGATAATTATATGCAACAGGATGCTCACGAATTTCTAAATTTTCTTATAAACCACATCAATGAGATAATATTGGCTGAAAGAAATCAAAGCACATTGAAATTACAAAAAACTGATGGTGTGAAAGAGAATGTTACCTGCAACGGTAGTATACCACAGAACACGGAGCCGACGTGGGTTCACGAGATATTCCAGGGAACTCTAACTAGCGAGACGAGGTGTCTGAACTGTGAAACGGTCAGCAGTAAGGACGAGCACTTCTTCGATTTACAGGTGGACGTTGATCAGAACACGAGTATAACACACTGCCTCAAGTGCTTCAGTGACACAGAAACCCTTTGCAACGACAACAAATTCAAATGTGACAACTGCAGCAGTTACCAAGAGGCACAGAAACGTATGAGGGTGAAGAAGTTGCCGCTGATACTAGCGTTACATTTGAAGAGATTCAAATACATGGAACAGTATAATAGGCATATCAAAGTGTCGCATAGAGTTGTGTTCCCGCTCGAGTTGAGGCTCTTTAACACGTCAGATGACGCTGTGAACCCTGATCGTTTGTACGACCTAGTAGCTGTTGTGGTGCACTGTGGCTCCGGACCCAACCGTGGACATTACATCAGCATTGTCAAGAGTCATGGGTTCTGGCTGCTCTTTGATGATGATATGGTTGATAAAATTGATGCATCAGCTATAGAAGACTTCTACGGCCTAACTTCTGATATACAAAAGTCATCGGAGACAGGATATATACTTTTCTACCAATCTAGGGATGCTACTTGTTAA

Protein sequence:

>DPOGS211913-PA
MSICQDLGLNFQQRMGANISQLERDIGSEQFPPNEHYFGLVNFGNTCYSNSVLQALYFCRPFRDRVLEYKAKNKRTKETLLTCLADLFYSIATQKKKVGSIAPKKFIARLRKEKEEFDNYMQQDAHEFLNFLINHINEIILAERNQSTLKLQKTDGVKENVTCNGSIPQNTEPTWVHEIFQGTLTSETRCLNCETVSSKDEHFFDLQVDVDQNTSITHCLKCFSDTETLCNDNKFKCDNCSSYQEAQKRMRVKKLPLILALHLKRFKYMEQYNRHIKVSHRVVFPLELRLFNTSDDAVNPDRLYDLVAVVVHCGSGPNRGHYISIVKSHGFWLLFDDDMVDKIDASAIEDFYGLTSDIQKSSETGYILFYQSRDATC-
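
The record lists a 1134 bp transcript and a 377 aa protein, and the protein sequence ends in "-". The sketch below is one way to check that these are mutually consistent. It assumes the transcript is a complete coding sequence translated with the standard genetic code, and that the trailing "-" marks the stop codon. The function name `translate` and its `stop_symbol` parameter are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the record uses the standard code; this is not stated in the source.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence, writing stop codons as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT characters
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Length check: 377 residues plus one stop codon account for all 1134 bases.
TRANSCRIPT_BP = 1134
PROTEIN_AA = 377
assert TRANSCRIPT_BP == 3 * (PROTEIN_AA + 1)

# Spot checks against the first and last codons of DPOGS211913-TA.
first_30_nt = "ATGTCAATATGTCAGGATCTCGGCCTTAAC"
last_18_nt = "AGGGATGCTACTTGTTAA"
print(translate(first_30_nt))  # start of DPOGS211913-PA
print(translate(last_18_nt))   # end of DPOGS211913-PA, including the stop
```

Passing the full DPOGS211913-TA sequence to `translate` and comparing the result with DPOGS211913-PA would extend the spot checks to the whole record.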