Monarch geneset OGS2.0

DPOGS205326
Transcript: DPOGS205326-TA; 2769 bp
Protein: DPOGS205326-PA; 922 aa
Genomic position: DPSCF300322 + 230244-236561
RNAseq coverage: 1141x (Rank: top 11%)
Annotation
Heliconius: HMEL016007; 0.0; 74.69%
Bombyx: BGIBMGA007386-TA; 0.0; 65.72%
Drosophila: CG4165-PC; 2e-64; 60.10%
EBI UniRef50: UniRef50_B3MR17; 6e-125; 37.64%; Ubiquitin carboxyl-terminal hydrolase n=5 Tax=Drosophila RepID=B3MR17_DROAN
NCBI RefSeq: XP_001607172.1; 2e-127; 37.80%; PREDICTED: similar to CG4165-PA [Nasonia vitripennis]
NCBI nr blastp: gi|380025651; 4e-149; 37.61%; PREDICTED: ubiquitin carboxyl-terminal hydrolase 45-like [Apis florea]
NCBI nr blastx: gi|91088981; 8e-178; 42.81%; PREDICTED: similar to CG4165 CG4165-PA [Tribolium castaneum]
Group
Gene Ontology:
GO:0006511; 6.7e-39; ubiquitin-dependent protein catabolic process
GO:0004221; 6.7e-39; ubiquitin thiolesterase activity
GO:0008270; 1.4e-17; zinc ion binding
KEGG pathway 
InterPro domain:
[244-918]; IPR001394; 6.7e-39; Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2
[25-147]; IPR013083; 2.1e-19; Zinc finger, RING/FYVE/PHD-type
[54-129]; IPR001607; 1.4e-17; Zinc finger, UBP-type
Orthology group: MCL10828 (Multiple-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205326-TA
ATGGTCAAGAAAAAGAGACAAAGCGATCCCAGTGAAAACGGAGACGACTCGACAGAATCTTGCGATGAAACCGTCAAATCCGCATGTCCTCATGTAGCCAAAGCTGTTGACCTTACTAGATTAAAGAAGGCTTTAAAAACCGGAGGTTTCGAAAAGGAATGTTCGGAGTGCAAAAAAAGTCCGAAAACCGAAATTGCTGATTCAAATTATGAAGAAGATGTCACTCTCTGGATGTGTCTGCGTTGCGGAACTCAGTTGTGCGGCAGAGCGAGAAACAAGCATGCCCTTAACCATTTCCACACGCCACATTCTGATTGCCATGCATTGACAGCGAACACCACAACCTGGGAGATCTACTGTTATAACTGTAACAATGAAATCACAGCCTCCAGCGCTAAAAAACTACATGAATGCATAGAATATTTGAAAAAGCAATCTTCAAATAACCCTAAGTTGCCTCCTATAGCATTACCCCTCGGCTCGTTAGAATCTAAGTTAGAGTTGCCAATGCCACTTGAACCTATCTCAAGAAATGACAAAGGTAAAGATAAAGCTATGGCACTAAACCTGCCGAGGACCCCTTATTTATTAGAAGTTTTGCAAGAAATGTCATCACCCGGTGAGAAGTTCACTCTACCTGGTGGTAAATTAAAAATTAAAGGTGATCAGAGTGGGGATGAGGGTGTGGAAATGGAACTTCCTCCGATCACAGGACAGTTAGCAGAATGGGGCACATTAACAAAAACTTTAGCTGAGACTTTAGCCGAGTTGCAAGCGGGTGAAGGAGGTGTGTATAATCCTCGGCGTTTGCTGTCGGCATTAGTCACCAAGCTCCCACAATTTGGTGGCGGGGACCAACATGATGCTCATGAACTCTTGAGGCATCTACTTGAAGCTGTCAGATCTGAAGATCTCCGTCGCTACCAGTCAGTGATACTCAGCAGCCTCGGTATGAATTCAAAAGTTGATCCCGCCAAAGTTAATGGGGAAGTTAAGCAAAAAGTCAAATTCTACGGGCAGCAGGCCTCCGACACTATGCTCAGACCTGAACAGGTTTTCCGTGGTTTTCTGGTGTCAACCCTGGAGTGTCAGGAGTGTTATTCCCATTCTGACCGGGCGGAATACTTCTTGGACCTGTCTTTACCCGTGGCCGCATTTCGTCCGCAGCCACCAGCCATCGTCCGTAGGAAGACTAACGAGGAAAACAATACTAATACCCAAGAAGAGAAGCCATCTAAACATCAGTTGAAGAAAGAGAGATATGCTAATCGAAGAGTAGCCAGAAAGAGCCATAAAGGCACTTCCAAAGATAAAGAAACAAATGGACCAAAGGAGGACGAAAAATCATCATCCGAGTCGGATGCTGATGTTGAAGATAACTTGGAAGATCAGCCTCGACAGACGGACGCGTCGACATCCGTTGGCACGCAAGCCGTCGCACACACGGCCGCCAACTTCGCAGCCTACCACATGGAATCTGGTTATAACTCCGAAAAAGTTATCAGCTCAGACTCGATACGCACCAGCCCCGTGGATTTGGATAAAGAAAAGACGGACAACACGCCGGAGTCGACGGAAAAGGATAAGGAATTCGTTGAAAACTCCACCTCCACGAACATCATACCCTCAGAGTATAAACCTTTAATACCATTAGAGAATTTCTCCAACCCGGACTCCGGTGTCGCGAGTCCGGAGGCGACGAAGCATAATTCAACGGAAACCGTGGACAACGTCGACTCGCCTCTGAATGGGAAAGAGCTCGGCAGTCACAGTTCATTGTCCAGCGAGATCAACTTGGACCTTTCGAGCCCCCAGCACAACAAACTGTCACCGGTCAAGAGCGTCTTCGAAAGACCGGTATCACGAATATCTTTCGCGCCAGAATACTCGAACGAGGTTGTGTCGAGGGGTATCAGCGCACAGGGCTGTCGTGAGCTCTTCGACAACAGCTGGGAGGTGAACACCCTGGAGGAGGCCGTGTTCCAGGAAGAAATCGCTCT
TGATAAACTTAAAATTGAACCTGAAGCGAAACCTCCACCTCCGTCGTCCCCGGCGGCAGTGCCTCCACCGCCCGCCCCCAAACCGAAGCTTCCGGAAGCGGAGCCGGAGAGCGTCGTGAACAGAGATCTCATGTCCTTCTCCCGCCAGAGCCCGTCGTCCCCGCGTTACGTATGCGATGAAGACGAATGCAGCGTGCAGTCCTGTCTCAGCCAATTCACAGCGCTCGAGCTGCTCACCGGCAATAATAAAGTCGGCTGCGACACGTGCACCGAACGCATCAACGGCAAGGGCGGCAGGACCGTGTACACGAACGCGACCAAGCGGTTCCTGGTGTCGAAGCCGCCCGCGGTTCTCATCCTACACCTAAAGCGCTTCCAGCTGGGACCCCGCTGCATGTTCCGCAAGATGACCAAGCACGTGGACTTCCCCATACTACTGGACCTGGCGCCTTTCTGCGCCGCGGACAAGTCGAGACGCCGCGGTCGGCTGTTGTATTCTCTGTACGGGGTCGTGGAACATTCTGGTGGTATGCACGGTGGACATTACGTGGCGTACGTGAAGACACGGTCCTCGCCCGCCGGCCGCCGCTTCCTGCCGGGGCGGGTCCGCGACGACGATTCTGAACTATCGGGGTACGAATCGGGCGAGGCGCCGCCGCCGCCCGCCGCTCGCTGGTATTATGTATCCGACAGTATGGTGTCGGAGGTCAGCGAGGAAAAGGTGCTCCGCGCTCAAGCCTACCTTCTGTTCTACGAGCGCGTGCTGTAG

Protein sequence:

>DPOGS205326-PA
MVKKKRQSDPSENGDDSTESCDETVKSACPHVAKAVDLTRLKKALKTGGFEKECSECKKSPKTEIADSNYEEDVTLWMCLRCGTQLCGRARNKHALNHFHTPHSDCHALTANTTTWEIYCYNCNNEITASSAKKLHECIEYLKKQSSNNPKLPPIALPLGSLESKLELPMPLEPISRNDKGKDKAMALNLPRTPYLLEVLQEMSSPGEKFTLPGGKLKIKGDQSGDEGVEMELPPITGQLAEWGTLTKTLAETLAELQAGEGGVYNPRRLLSALVTKLPQFGGGDQHDAHELLRHLLEAVRSEDLRRYQSVILSSLGMNSKVDPAKVNGEVKQKVKFYGQQASDTMLRPEQVFRGFLVSTLECQECYSHSDRAEYFLDLSLPVAAFRPQPPAIVRRKTNEENNTNTQEEKPSKHQLKKERYANRRVARKSHKGTSKDKETNGPKEDEKSSSESDADVEDNLEDQPRQTDASTSVGTQAVAHTAANFAAYHMESGYNSEKVISSDSIRTSPVDLDKEKTDNTPESTEKDKEFVENSTSTNIIPSEYKPLIPLENFSNPDSGVASPEATKHNSTETVDNVDSPLNGKELGSHSSLSSEINLDLSSPQHNKLSPVKSVFERPVSRISFAPEYSNEVVSRGISAQGCRELFDNSWEVNTLEEAVFQEEIALDKLKIEPEAKPPPPSSPAAVPPPPAPKPKLPEAEPESVVNRDLMSFSRQSPSSPRYVCDEDECSVQSCLSQFTALELLTGNNKVGCDTCTERINGKGGRTVYTNATKRFLVSKPPAVLILHLKRFQLGPRCMFRKMTKHVDFPILLDLAPFCAADKSRRRGRLLYSLYGVVEHSGGMHGGHYVAYVKTRSSPAGRRFLPGRVRDDDSELSGYESGEAPPPPAARWYYVSDSMVSEVSEEKVLRAQAYLLFYERVL-
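
The transcript and protein lengths reported in this record can be cross-checked with simple codon arithmetic. The sketch below is illustrative only: it assumes the transcript is coding sequence with no untranslated regions and that the trailing "-" in the protein sequence marks a single stop codon. The helper name `expected_cds_length` is hypothetical and not part of any database tool.

```python
# Lengths as reported in the record (2769 bp transcript, 922 aa protein).
transcript_bp = 2769
protein_aa = 922


def expected_cds_length(n_residues: int, has_stop: bool = True) -> int:
    """Nucleotides needed to encode n_residues, plus one stop codon if present.

    Assumption: the transcript contains only coding sequence (no UTRs).
    """
    codons = n_residues + (1 if has_stop else 0)
    return 3 * codons


# True when the reported transcript length matches the protein length.
consistent = expected_cds_length(protein_aa) == transcript_bp
print(consistent)
```

Under these assumptions the two reported lengths agree: 922 residues plus one stop codon is 923 codons, or 2769 nucleotides.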