Monarch geneset OGS2.0

DPOGS215176
Transcript        DPOGS215176-TA   1986 bp
Protein           DPOGS215176-PA   661 aa
Genomic position  DPSCF300143 - 431207-442025
RNAseq coverage   370x (Rank: top 32%)
Annotation
Heliconius      HMEL004423        8e-139  50.00%
Bombyx          BGIBMGA008660-TA  0.0     64.42%
Drosophila      Ubpy-PA           5e-51   29.18%
EBI UniRef50    UniRef50_Q7QD14   4e-58   30.31%  Ubiquitin carboxyl-terminal hydrolase n=1 Tax=Anopheles gambiae RepID=Q7QD14_ANOGA
NCBI RefSeq     XP_001979329.1    3e-54   29.83%  GG24410 [Drosophila erecta]
NCBI nr blastp  gi|347968815      1e-57   30.31%  AGAP002900-PA [Anopheles gambiae str. PEST]
NCBI nr blastx  gi|347968815      2e-63   30.24%  AGAP002900-PA [Anopheles gambiae str. PEST]
Group
KEGG pathway     der:Dere_GG24410  1e-53
                 K11839 (USP8, UBP5) maps-> Endocytosis
InterPro domain  [8-112]    IPR015063  1.9e-29  Domain of unknown function DUF1873
                 [147-303]  IPR001763  1.5e-19  Rhodanese-like
Orthology group  MCL22191  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215176-TA
ATGACGGAGACGCGAAGATTACAATTACATTTAGGAAAATGTATAGAGGATTTGGACAAATTATACAATGTACCTGACTTGAAATCTAAAAGAGCAACGATGTTATGCAAAACAGCTCAAAAACTCTTCGAGTCTGCAGAGGAGGCTCGCGAGAAAGGTGACGAAGAGTATTCCTATGTTCAGTACATGAAGTACTTACGTATCATCGCTTACATAAGCAAAGACAAAGACTACTTAAAGGACAAAACATACTTCAACAGTATGCTCGGTTCCAAAAACCCTAATAAGGCCTTGGACGCTGCTGAAAAATTAAAGAATAGTTTAATAATAAGATATGAGAAAGAACAACAAGTAAATCGTCTGAACGACATCCAAGAAAACGAGCTGATCAAGCAGAAGATGGAAGACAACAGGAAGAAGGCTATCGAGGCTATGGTGGTCGCAGAACCAACACATCAGGGACTACCAGGTCCGGATGAAGTGTCCATAAAGTCTGAACAGTTATATGTCCTGTTGAAGAGTAGCAAGCTCAAGATTATGATTCTGGACGCTCGGCCCAGTCAGCATTACCAGGAATCGCATATCAACCATCCGGTGTGCATCAATGTCCCCGAGGAGTGCATTTCACCCGGTCAGTCGGCCAACATGCTGGAACAGAAGTTACCGCAGGTGTCCAGGAGCGTGTGGGCCGAGCGCGCTTCCATGGAACTCATCGTGATGATGGACTGGAACAGTATCACCGTCATACCGGGACAGAAGCTACATCTGCTCAAAACCATACTACTAAAGTGGGACGTGAAGGTCCACTACGCTCGCCAACCGGTGTGGCTGGTCGGCGGCTACGAGGACTGGCTTCTCAAGTACCCCGCCTTCACCACCAACCCTCGAGCGATCCCACCCACCAGAGAACAGGACGTGGACGACATGCTGGATGAGATCGAGTACCCGGCCTGGTCGGATCTGAGTCCTCCTCCGCTGGCTGTCAATAGATCCTCCAAGCCGTCGGAGCCTCTCGTAGATAGAAGCAGTAAGCTGGCCGCTGTCCAGCTGTATGAAGAGCGAGCTCGCGGCGTCCAGAACATCCTGGACCAGCAGGAGAGGATCGCCGACACCTCGCTCACACTGGAGATGCAGCCCGACCTTCGACTTGACTGGGAGAAAGTTAGATCTCAGAGGGAGGGGGAGCAGAGGGACGAGATGAGGGCCATGTACAAGCTGCGAGAGCAGGAGATCATATCGCAGCTGATGCAACTCGAGAGCAAACAGCTCCGCGAGCAGCTGGAGGAGTATCAGCGGAGGGAGAGGGAGGAGTCGGACCGACTGGAGGGGGGCGAGGACGAGCACCACGACGGAGACGCTATAGCAGAGCGCGCGCGGCAGGCTGTGAGGGATGTGGCGGCCAAGAGAGCGAGGATAGCGGCCGTCACGGCTCAGAGGGAGCGGCTCGACAGGAGGCGAGAGGTGCTGGAGATGGAGCGGAAGAAGAAACTAGCGGAGGCGCGGGCGGCGAGGAAACCCGGGGACAAAGAGGAAGACGAGGCTCGTCCTGACAGTCCGGCGCTGCCGCGGTCGCAGTCCTCGCCGAACATCGCCAAGGTGTCGTCTGACGAGGAGGAGGTCACCAGCCCCGTGTTTGACCGGAGCACGAAGCCGGCCAAGATGGCGCCCTCCAGTGACATGCATCACAGAGACTTCCTACCCGTGTGGGGTGACGTGTGCGATCCGCGAGATCCGCGAGGGCAGGGTGACAGCTCCCGGGACTATAAAGGTCAGGGGCATTATAAAAGGTTATTCTCTTCACGGTGCAGCATACAACAAGCTAACGGCCTTCCGAAATTGGCAGAAGTCTTGACAAAACTGCAGCCGTGGAAACTAGCTTGGGTTATAACAAAGGATAGTGGATCTCGGAAATCCATCTTCAGTCGAAGAAACATAGATTCGTTCGATTCTCGGTCTGGTTACAATCTTGATGGCTTCATTCGGTAA

Protein sequence:

>DPOGS215176-PA
MTETRRLQLHLGKCIEDLDKLYNVPDLKSKRATMLCKTAQKLFESAEEAREKGDEEYSYVQYMKYLRIIAYISKDKDYLKDKTYFNSMLGSKNPNKALDAAEKLKNSLIIRYEKEQQVNRLNDIQENELIKQKMEDNRKKAIEAMVVAEPTHQGLPGPDEVSIKSEQLYVLLKSSKLKIMILDARPSQHYQESHINHPVCINVPEECISPGQSANMLEQKLPQVSRSVWAERASMELIVMMDWNSITVIPGQKLHLLKTILLKWDVKVHYARQPVWLVGGYEDWLLKYPAFTTNPRAIPPTREQDVDDMLDEIEYPAWSDLSPPPLAVNRSSKPSEPLVDRSSKLAAVQLYEERARGVQNILDQQERIADTSLTLEMQPDLRLDWEKVRSQREGEQRDEMRAMYKLREQEIISQLMQLESKQLREQLEEYQRREREESDRLEGGEDEHHDGDAIAERARQAVRDVAAKRARIAAVTAQRERLDRRREVLEMERKKKLAEARAARKPGDKEEDEARPDSPALPRSQSSPNIAKVSSDEEEVTSPVFDRSTKPAKMAPSSDMHHRDFLPVWGDVCDPRDPRGQGDSSRDYKGQGHYKRLFSSRCSIQQANGLPKLAEVLTKLQPWKLAWVITKDSGSRKSIFSRRNIDSFDSRSGYNLDGFIR-
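
The transcript and protein records above can be cross-checked with a short script. The following is a minimal sketch, not part of the original record: it assumes the transcript is a complete coding sequence that ends in a stop codon (the nucleotide sequence ends in TAA, and the trailing "-" in the protein record is taken to mark that stop), and it uses a deliberately partial codon table covering only the first ten codons. The names translate and expected_cds_length are hypothetical helpers introduced for illustration.

```python
# Lengths as stated in the record above.
TRANSCRIPT_LEN_BP = 1986
PROTEIN_LEN_AA = 661

# First 30 nucleotides of DPOGS215176-TA and first 10 residues of DPOGS215176-PA,
# copied from the sequences above.
CDS_START = "ATGACGGAGACGCGAAGATTACAATTACAT"
PROTEIN_START = "MTETRRLQLH"

# Partial standard genetic code: only the codons that occur in CDS_START.
CODONS = {
    "ATG": "M", "ACG": "T", "GAG": "E", "CGA": "R",
    "AGA": "R", "TTA": "L", "CAA": "Q", "CAT": "H",
}


def translate(seq: str) -> str:
    """Translate complete codons in seq; raises KeyError for codons not in CODONS."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODONS[seq[i:i + 3]] for i in range(0, usable, 3))


def expected_cds_length(n_residues: int) -> int:
    """Length in bp of a CDS encoding n_residues plus one stop codon."""
    return 3 * (n_residues + 1)


if __name__ == "__main__":
    # Assumption: the transcript is CDS only (no UTRs), so the lengths should agree.
    print(expected_cds_length(PROTEIN_LEN_AA) == TRANSCRIPT_LEN_BP)
    print(translate(CDS_START) == PROTEIN_START)
```

Under these assumptions both checks agree with the record: 661 residues plus one stop codon account for all 1986 bp, and the first ten codons translate to the first ten residues of the protein.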