Monarch geneset OGS2.0

DPOGS208930
Transcript: DPOGS208930-TA (1854 bp)
Protein: DPOGS208930-PA (617 aa)
Genomic position: DPSCF300009, + strand, 44118-49251
RNAseq coverage: 856x (Rank: top 15%)
Annotation
Heliconius: HMEL004756 (E-value 0.0, 82.72%)
Bombyx: BGIBMGA002507-TA (E-value 0.0, 76.26%)
Drosophila: CG15817-PD (E-value 3e-38, 44.38%)
EBI UniRef50: UniRef50_D6WB38 (E-value 6e-105, 40.24%) Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WB38_TRICA
NCBI RefSeq: XP_968799.1 (E-value 1e-105, 40.24%) PREDICTED: similar to ubiquitin specific protease [Tribolium castaneum]
NCBI nr blastp: gi|91076294 (E-value 2e-104, 40.24%) PREDICTED: similar to ubiquitin specific protease [Tribolium castaneum]
NCBI nr blastx: gi|307202772 (E-value 5e-112, 42.05%) Ubiquitin carboxyl-terminal hydrolase 1 [Harpegnathos saltator]
Group
Gene Ontology: GO:0006511 (E-value 8.6e-39) ubiquitin-dependent protein catabolic process
Gene Ontology: GO:0004221 (E-value 8.6e-39) ubiquitin thiolesterase activity
KEGG pathway:
InterPro domain: [128-611] IPR001394 (E-value 8.6e-39) Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2
Orthology group: MCL16156, Insect specific

Nucleotide sequence:

>DPOGS208930-TA
ATGCCAGTTAACTCTTTATCAGAAACGACAAAAGGTGTTGCGAGAACTAAATTATCCCTCTCTCTTAGAAGAAATAATGCAACAAATCCGATCAACGCAAATCAGAAACCAGGAGATTTAAAGGACAGCTCTTACAAGGGATCAAATGACAATAAAGAGAATAAACCAGTGAAGAGACCTATAACAGGCACATATCTTAATACATTGAATGCTGCTAAAAAGCTTAAGCCTTTGACTGAAACGTCGAAACCGAAAGAAACAGTTAGTGAGCCACAGCTAGTTGTATTTGAACCATCAATAAGTAACATTGAAATGTTGAATGGCCACCATCCTTCGGACAACCAGTATGGAGGGAGTACACAGTGGAAAGCCCCCATTGCCACTCTGTCTAATCTTGGAAATACTTGTTTCCTCAATAGTGTACTATATACCCTACGATATGCTCCACAATTTGTGCACAACCTCCATCATCTAGTGTCCGATCTCACTAGAGTGGAACAGAAATTGGGCAGCATCAGGTTAAAAAGCTCATCTCTGGGACGAAGTGCAGCTGGGCTTGCATCTTCTGGTACTAGATCATGGAGCAGTAAAGATTTGTTATCTCTGGGACAATCGGATAACACCACAGGAAAAAGTAAAATACAGATAGCCACAGAGAAGCTTCATGAAACATATCTTAGCCTACGGGCCGCGGAAAGCAAGTGTATAAACAGTGGTGCTGCTGATGCCAGCCCGGAACCATATGCTGCTGATGCATTTCTAGCGGCATTGAGAGAAGTCAACTCTACATTTGAAGGTAATCGGCAACAAGATGCACATGAGCTTCTTGTTTGTATCTTGGACAGCATTAGAGAAACATGCAGAGCTCTAAGTGCAAGAGCATCCCGTCTTCAATTACATGAAAATGGCGACAGCAATGGCCTCGGTCGCCAACCCAGCCTTGACGGTGATGGCAGTAAGACGTTAGGACATCTCCGTAAGTCGTGGAAAAAACGCAAGGAAACAAAAACCACTGACAAAAGAAGCTCACCCTCAGAAGAACGTCCGCCTTCACCGGATCCTGAGAAGGATGAACGCTCGAGGCCTGGCTGGGACTTTGTTGCTGATGATTTTGAAGGTACCATGGTTGTTCGAACTATGTGCCTGGAGTGTGAGGCGGTGACGGAGAAAGCTCAAGCTGTGTGTGAGCTCTGTGTGCCAGTAGGTGATGATGATACAAATGAAGAACCATTTAGAGCTGCATGTCTCTCTAGTGAATATTTGAGGGATCAGAATAAGTATTGGTGCGAGCGCTGCCTGCGCTACAACGAGGCTAGACGCAGTGTCGCGTATTCGCGACTGCCACGGCTGTTAGTGTTGCAGCTCAAGCGCTTCAGCGGCGGCATGGAAAAGATCACAAGACACGCGCCCACGCCACTTCTCATGCCTTGCTTCTGTGAGCCATGTGCCAAACGGCCACCTGATCATCCACCCACACACAGATACATCCTATGGGCGGTGATAATGCACCTTGGTCAGGCGTTGACCGGTGGCCACTATGTAGCGTACGCGAGAGATCGTTCCAACGCCAGCAAATGTGAGAGAACTGGCGGTGGTGACGCAGCATCTAACAACAGCGGCTCAAGCTTCATGCGAACTCTATTTAATCGCCCGAGAGCACAACCATCTGGCTGCGCTGCCAATGATTGCTGTGTACCGCGCCCTCGGCTAGACACCTGCTGGCTGGCCTGCGACGACGACCTCGTCAAACCCATATCAAATGAAGAGTTCGAGGATCTATTATCCGCCGAGCCGAAAATGCGCTCCGCAGCAACACCATACTTACTGTTCTATGTGAAGAGCGAAGTCGGTTAA

Protein sequence:

>DPOGS208930-PA
MPVNSLSETTKGVARTKLSLSLRRNNATNPINANQKPGDLKDSSYKGSNDNKENKPVKRPITGTYLNTLNAAKKLKPLTETSKPKETVSEPQLVVFEPSISNIEMLNGHHPSDNQYGGSTQWKAPIATLSNLGNTCFLNSVLYTLRYAPQFVHNLHHLVSDLTRVEQKLGSIRLKSSSLGRSAAGLASSGTRSWSSKDLLSLGQSDNTTGKSKIQIATEKLHETYLSLRAAESKCINSGAADASPEPYAADAFLAALREVNSTFEGNRQQDAHELLVCILDSIRETCRALSARASRLQLHENGDSNGLGRQPSLDGDGSKTLGHLRKSWKKRKETKTTDKRSSPSEERPPSPDPEKDERSRPGWDFVADDFEGTMVVRTMCLECEAVTEKAQAVCELCVPVGDDDTNEEPFRAACLSSEYLRDQNKYWCERCLRYNEARRSVAYSRLPRLLVLQLKRFSGGMEKITRHAPTPLLMPCFCEPCAKRPPDHPPTHRYILWAVIMHLGQALTGGHYVAYARDRSNASKCERTGGGDAASNNSGSSFMRTLFNRPRAQPSGCAANDCCVPRPRLDTCWLACDDDLVKPISNEEFEDLLSAEPKMRSAATPYLLFYVKSEVG-
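
Consistency check (illustrative sketch):

The transcript and protein lengths in this record can be cross-checked against each other. The short Python sketch below is not part of the database. It assumes the transcript is a complete coding sequence read in frame 1 with the standard genetic code, and that the trailing "-" in the protein sequence marks the stop codon. The function names translate and expected_protein_length are hypothetical helpers introduced only for this example. Only the first ten and last six codons of DPOGS208930-TA are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence; stop codons become stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = (CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return "".join(r.replace("*", stop_symbol) for r in residues)


def expected_protein_length(cds_length_bp):
    """Residue count for a complete CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# First ten codons and last six codons of DPOGS208930-TA, copied from the record.
print(translate("ATGCCAGTTAACTCTTTATCAGAAACGACA"))  # MPVNSLSETT
print(translate("AAGAGCGAAGTCGGTTAA"))              # KSEVG-
print(expected_protein_length(1854))                # 617
```

The translated ends agree with the start (MPVNSLSETT) and end (KSEVG-) of DPOGS208930-PA, and 1854 bp corresponds to 617 residues plus one stop codon, as listed in the record.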