Monarch geneset OGS2.0

DPOGS200262
TranscriptDPOGS200262-TA2775 bp
ProteinDPOGS200262-PA924 aa
Genomic positionDPSCF300026 - 1065126-1071948
RNAseq coverage426x (Rank: top 29%)
Annotation
HeliconiusHMEL0053813e-15367.68% 
BombyxBGIBMGA007219-TA0.068.71% 
DrosophilaCG30421-PA2e-17944.03% 
EBI UniRef50UniRef50_D6WVD60.046.92%Ubiquitin carboxyl-terminal hydrolase n=2 Tax=Tribolium castaneum RepID=D6WVD6_TRICA
NCBI RefSeqXP_396802.20.048.74%PREDICTED: similar to CG30421-PA isoform 1 [Apis mellifera]
NCBI nr blastpgi|3287887930.048.74%PREDICTED: ubiquitin carboxyl-terminal hydrolase 31-like isoform 1 [Apis mellifera]
NCBI nr blastxgi|3287887930.047.52%PREDICTED: ubiquitin carboxyl-terminal hydrolase 31-like isoform 1 [Apis mellifera]
Group
Gene OntologyGO:00065113.3e-66ubiquitin-dependent protein catabolic process
GO:00042213.3e-66ubiquitin thiolesterase activity
KEGG pathway 
InterPro domain[141-701] IPR0013943.3e-66Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2
Orthology groupMCL10725 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200262-TA
ATGTCAGCTAATAATTTATCAACAGTTTTAAAATCATGTTCGGAGAGTGAACTTATAAACGAAGTTGCGGAAGCAAATATGACATCAAAACAGGTAGAAGGATCCCGATTAAAACGTACCTTTACGTTACCTCGTAATCCCTTTGGGACATCGAAGGCCACTTCAAGTAAAAATAAGGCCAATGAGAATGAAAGTAAGCCTACTAGTGGAAAGAATACAATGAGTGTTAATCAAAAAGATGAGGGAGGTATGGAAAGGAAGTTATTTAGAAGGCCTTCATGGAAGATATTTTTGAACAAAATCGCTCAACATATGACTACAGTGAACATTTCAGGCGTAAAATCTCAACCAGTTATACCACATACCGAAAGAGTACCTTGCAGTGGAGAGCCGCCATGGCCGCCTGGAGCCGTCCCAGCAGCCACTGGCATCAAGAATCACGGCAATACTTGCTACATGAATGCTGTATTACAATGCCTCTCTCATACAGATGTTATTGCTGAGTATTTCGTTTTAGAACATTATAAGATTGATTTGCAAAAAAGAAATAAGATAAATTCCAAAAAATATGGTACACGGGGTGAGGTCACCGAGCAACTGGCAGCTTTACTAAAGGCACTATGGTCATGTCGATACACATCAGATATGAGTGTATCTTTTAAGGCAGCGGTGGAACGGCACGGAACACAATACAGAGGACACAGTCAGCATGATGCACAGGAATTCCTTTTTTGGCTATTGGATAAAGTTCATGAGGATTTGAATACAGCTACTAAGAAGAAATATAAGACAATCAAAAATACAGTAGGTAAATCTGATGAGGTGGTGGCAGCAGAGACGTTGGCGAACCATGCTCGTAGGAACAGTTCTTTTGTGCAGGCTGTGTTTCAAGCACAGTACAGATCAGCCCTTACATGTGCGAAGTGCGAGCGTACATCTTGTACATTTGATCCGTTTCATTGTGTGAGTGTTCAACTTCCAACGCGACCAGCCACAGCACAACCCTCACCGTTACCTGTCAATGTCGTGTATGTTAATCAACAACCTCGCCAAGTGCGTATTGGAGTGGAGCTTCCTCCTAACTCGACAATGGATGACCTCCGAACCGCTCTGAACTCGGATACTGGCATTGATAGAGATAATATCATACTGGCTGAGATAAATGAGACCGGTTGGAGCACTGCTCGCGCCGGTTGGGAGGCAGCTGGTATAGACGCAGGCGCCCTATACTGTCTAGAGGCGCCTCCTCTTCTTCAGACACCCACTGCACCATATCTCCTGCTACTATGGGTCAATCTGCTGGAAGGAGATCGTTTCGGTTCTCCATACGCTATGCAAGTTCCAAGAGAGATATCATACGAAGATCTTCAGAAACTCATGTTGAAAGAGATGTGTCAAGTGATAGCTGAGAATCTACCTCATCCCCTGTTTGCTTTGGATGTCGAACAAGCACTGTGCGCACACGACAAATATCCTCATCTTAGACTCGAACTGTTATGGGATCCTGCTCATAGGGACAGTATAATCCGTGAGTCGAGCGAGTCGTGTGAGGTGCACGTGTCCGCGGTGCAACCCGGGCCGCTCACACTACACGCGTGTCTACAACACTACACGAGGGCGGAGCAGCTGGCGCAGGAGGACGCCTGGAGATGTCCACAGTGCCAGAGATATATGCCGGTCGTAAAAACTCTTGGTCTATGGTCACTACCTGATGTTTTAGTCATTCATCTGAAGAGATTTAGACAACAAGCCAAATGCCGCACCAGTACAAAACTGACCACAATGGTAGAGTTTCCTATCAACGACTTTGATATGACACCCCACCTCGTACGTCGGAACCAAGGTAATGTGGAGTCGCCCGGACATTCCCGGTCGCCTCGACGGAGACATTCGAAAACGCCGGCTACTCATGAAAATATATATGATCTTTACGCTATATGCTATCACCACGGAGATGATTTAGAAACCGGTCACTATACAGCAGCGTGTAGAAATCCATACGATAGGCACTGGTACAAATTTGACGATTCCAAGGTCACCCGAGTAGATGATGAGAACAAGTACTCGGAACTGGTGAATAACTCTGCCTACATGCTGTTCTACAGACGGAAGAAACCGCATGTCGTCCATTCGTGTTCGACGGAAGACAATGAACATTGGGCCTTACGTATGCCAAAATACGTGAAGCCGGTCGGAGAACATCTCAACGAAATAGCAGAGATCAAGGAAGAAAATGTCGATGATACTAAAACGGCTTCCAAAGAAACTTTACAGCTCGAATCACCGACCGTAACGCCGTCTAGAAGTATATCTAGTCTATCCGACAAGACGGAGAGTCCCACCAAGCAGCCCATCGTTATACACAGCACCACAATGCAATCTCCAACATTACAGCGACCGTTAATAGTCGAAGTTAACGGTGTTGCGAACGACGACCTAAGCGATTATGAAAACGAATCGAGTATGCCAATTGAGCCGTACATACATAAAGATGTACATGTCAATCCTAAAATGACCCCAGTGGATACAAGGCGGCCCAGGTCAGTAGATTACCCCGCCAAGGCCAGTACCTCGCCGTCGTCCAGGGATTCGAATAGGAACTATGAAAGTTCTCCACTGGTTGCTAGCATCAATGGAGTCGAATATCATCCCACCACCGAGGATCTGATGCTGTCCATGTTCCAGGAGTCGAAATACATAGTGCCGAGACATATGACAGGGGAATCCCACAGGACAGGTGAGAAATCTGTTTGGAAAACTCGAATATCCACGTGA

Protein sequence:

>DPOGS200262-PA
MSANNLSTVLKSCSESELINEVAEANMTSKQVEGSRLKRTFTLPRNPFGTSKATSSKNKANENESKPTSGKNTMSVNQKDEGGMERKLFRRPSWKIFLNKIAQHMTTVNISGVKSQPVIPHTERVPCSGEPPWPPGAVPAATGIKNHGNTCYMNAVLQCLSHTDVIAEYFVLEHYKIDLQKRNKINSKKYGTRGEVTEQLAALLKALWSCRYTSDMSVSFKAAVERHGTQYRGHSQHDAQEFLFWLLDKVHEDLNTATKKKYKTIKNTVGKSDEVVAAETLANHARRNSSFVQAVFQAQYRSALTCAKCERTSCTFDPFHCVSVQLPTRPATAQPSPLPVNVVYVNQQPRQVRIGVELPPNSTMDDLRTALNSDTGIDRDNIILAEINETGWSTARAGWEAAGIDAGALYCLEAPPLLQTPTAPYLLLLWVNLLEGDRFGSPYAMQVPREISYEDLQKLMLKEMCQVIAENLPHPLFALDVEQALCAHDKYPHLRLELLWDPAHRDSIIRESSESCEVHVSAVQPGPLTLHACLQHYTRAEQLAQEDAWRCPQCQRYMPVVKTLGLWSLPDVLVIHLKRFRQQAKCRTSTKLTTMVEFPINDFDMTPHLVRRNQGNVESPGHSRSPRRRHSKTPATHENIYDLYAICYHHGDDLETGHYTAACRNPYDRHWYKFDDSKVTRVDDENKYSELVNNSAYMLFYRRKKPHVVHSCSTEDNEHWALRMPKYVKPVGEHLNEIAEIKEENVDDTKTASKETLQLESPTVTPSRSISSLSDKTESPTKQPIVIHSTTMQSPTLQRPLIVEVNGVANDDLSDYENESSMPIEPYIHKDVHVNPKMTPVDTRRPRSVDYPAKASTSPSSRDSNRNYESSPLVASINGVEYHPTTEDLMLSMFQESKYIVPRHMTGESHRTGEKSVWKTRIST-