Monarch geneset OGS2.0

DPOGS213475
TranscriptDPOGS213475-TA2736 bp
ProteinDPOGS213475-PA911 aa
Genomic positionDPSCF300100 - 55006-68573
RNAseq coverage842x (Rank: top 15%)
Annotation
HeliconiusHMEL0168580.072.25% 
BombyxBGIBMGA004500-TA0.063.48% 
DrosophilaCG8494-PA7e-17544.73% 
EBI UniRef50UniRef50_F4WH960.047.03%Ubiquitin carboxyl-terminal hydrolase n=3 Tax=Formicidae RepID=F4WH96_ACREC
NCBI RefSeqXP_001122214.10.048.89%PREDICTED: similar to CG8494-PA [Apis mellifera]
NCBI nr blastpgi|3800150990.047.62%PREDICTED: ubiquitin carboxyl-terminal hydrolase 20-like [Apis florea]
NCBI nr blastxgi|3228003590.046.59%hypothetical protein SINV_03789 [Solenopsis invicta]
Group
Gene OntologyGO:00065113.8e-64ubiquitin-dependent protein catabolic process
GO:00042213.8e-64ubiquitin thiolesterase activity
GO:00082702e-11zinc ion binding
KEGG pathway 
InterPro domain[144-576] IPR0013943.8e-64Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2
[598-681] IPR0066154.2e-17Peptidase C19, ubiquitin-specific peptidase, DUSP domain
[6-97] IPR0130831.2e-16Zinc finger, RING/FYVE/PHD-type
[30-93] IPR0016072e-11Zinc finger, UBP-type
Orthology groupMCL11399 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213475-TA
ATGGACAAAGGTGTGACGTGTGAACATTTAAATAAACTTGTGGATTTCCTTGGAAAAGAGCTATGGCAGTCGAAAGAGAGTCTGAACTGTTTCGACTGTGGATGTCCCGGCCCCAATCTCTGGATTTGTTTGCAACCTGATTGTCACCATATAGGCTGTTCTGAAGTTAAAAATGATCACAGCACAATACACCAAAAGAACTTTCCATCTCATTGTGTTCATATGAATGTTACAACTGAGAGGATATGGTGTTACTTGTGTGAGAAAGAGGTTCACATCAGAACAGCCATCGCAAAGGCAAAGATGAAGCCAGACTCCACGACTGTGGAGGAGATGCTGGTGTCACGTACGGGGTCTGTGGGGATCAGCGTCAACTCGGACGAGGACCTCGACCCTGATGACATGGAATACGAGGACCAGAGACCCAGAGGTCTAGTTGGTCTGCAGAACATGGGCAACACCTGCTACATGAATGCGGCGCTGCAGGCCCTCAGCAACACGCAGCCGCTGACGTCATACTTCCTGGAGTGTTCCGCAGCTGTGGCCTTGCTGGTGGGCGACAAGAAACCAGGGATTAGTCGAGCATACCAAAAATTAATTAGAGAAATGTGGAGTAGGAAAACTAGAGGCTATGTTGTACCAAATGGTATTCTGTATGGGATAAGGAATGTACATCCGATGTTCCGTGGGTACCAGCAACATGATACACAGGAATTCCTGCGCTGTTTCATGGACCAGTTACACGAGGAACTCAAAGAACCGGTGTGGGATAGCGTGTCTGATGATAAACTGGCGTCAGAGGTCGAAGGTGACCAAGAGACGAGGAACATTCACACAAGAAGACGGGCAGCTTCATCAGGAGAAGTGATATCGCCATCAGCCGTAGCTTATTCAGCAATTGATGAACAAAGAACTAGTCGACTGGACGTGGGTTCCGAGTCGGAGTTGTCCAGCGAGGCGGAGGAGAGGTACGAGACGTGCTCCAGCGGCGCCAGTGAAGCACCCGACGCTCATGAGAACACTTCCCCGTCCTGGAGTGGTGGTGGAGGTGACGGTGGGGGTGGTGCGAGGTACCGCAGCATCATCTCGGACGTGTTCGATGGGAAGCTGCTGTCGTCAGTACAGTGTCTCATCTGTGATAGGGTGTCGACCCGTGTTGAGACATTCCAGGACCTTTCATTGCCCATTCCATCCCGGGAACACCTCGCCATGTTACATCACACACCGCACACCGATCAGGACTCGTGGCTGTGGTGGTTGTTTTCGTGGTTCCGCTCGTGGCTGTACGGCCCGGCCGTGTCGCTGCAGGACTGCCTGGCGGCTTTCTTCAGCGCGGACGAACTCAAAGGCGACAACATGTATAGTTGTTCGAGATGCAACAAACTCCGTAACGGTGTGAAGATGTCGGGCGTGATCAGGCTCCCGGAGGTGCTGTGTGTACATCTGAAGCGGTTCCGTCACGAGCTCATGTTCAGTGCCAAAGTGGCGGCGAGAGTGTCGTTTCCGATCAACGATTTAAATATGGCCCCTTATTGTCATAAAGAGTGCACGTCATCCGTGTCCCGCTACTCTCTCTGTGCTGTTATATGTCACGCGGGCACGGCGGGCGGTGGTCACTACACGTGCGTCGCCCGCGTCGACGACAGGTGGTACTCGTTCGACGACGCGTCCGTGACGCCGCTCACCACCCACCACCTAGCATCCTGCGAGGCCTACGTACTGTTCTACAGAAAAATAAATCCACAAATGGCAACCCTGAGACAAAAGGCGGCCGAAATATTAGAATCGTCCAACTCGGAACCGAACGACATTAAGTTTTACATCTCCAAGCAGTGGATTAATAAATTCAACACGTGGGCGGAGCCTGGGCCCATAGACAATAGTGACTTCGTGTGCGTCCACGGCGGGGTTCGTCCTGAGCGGGCGCCTCATCTGCCAGCCCTGGCTGCCCGTCTCCCGCAACCACTATGGGACTTCCTCTACCATCAGTTCGGCGGTGGACCGGCGGTGTCTCACGCCCACGAGTGTGGAGTGTGTGCGCGAGCTCAGCATAGACTCAGGGCGAGGAGAGCCAGGGAACTCACAGCCTTCGCTGAACTACACGCCATGTTCCAGGACCAGGAGCGCCCTCTAGCGGTGTTCGCTATCAGTATGGCGTGGTTCAGACAGTGGCAGGCGTTCGTCCGGGACAAGGCGAGACACCCGCCGCCACCCGTCGACAACACGTCCATCGTTGTTAAACAGGAAATCGAGGGGATCGTGTCATATGTACTGAAGCCGGGTTCGGATCACGCGCAGCTCAGCGAGGAGTTGTGGAGGTTCTTCACCGATATATACGGCGGAGGTCCCGAGGTCCGGCTGTCAGCACCGCCGCCGCCGCGGGTCACACGATCCTCCAGGAACTACTCCGAATCGGACAGAGAGGAATACTGCACTAAATCCTCGTCCGAGGTCAACCTGTGGCTGCAGAAGAATCGCTCGCTTCAGAACATCAGCAGGCGGTACAAGGCGGACTCCGACGAGGAGATATACAGGAAGTACAAGCGGCATCCCACCAGCTACGACTCCGACGACGGCATGGAGATCAGCCCGACGCACAGCCACAACACTATCAGGATGGAGAACGGCCTGTCGGAGCACGCCGCCCCCGACGACCTGAACCTAGACAGCATATCACTAAAGAATACACCAAAAACATGTAAAGTTAGAAAGACGAAACGCAGGACGGTCAAGTGA

Protein sequence:

>DPOGS213475-PA
MDKGVTCEHLNKLVDFLGKELWQSKESLNCFDCGCPGPNLWICLQPDCHHIGCSEVKNDHSTIHQKNFPSHCVHMNVTTERIWCYLCEKEVHIRTAIAKAKMKPDSTTVEEMLVSRTGSVGISVNSDEDLDPDDMEYEDQRPRGLVGLQNMGNTCYMNAALQALSNTQPLTSYFLECSAAVALLVGDKKPGISRAYQKLIREMWSRKTRGYVVPNGILYGIRNVHPMFRGYQQHDTQEFLRCFMDQLHEELKEPVWDSVSDDKLASEVEGDQETRNIHTRRRAASSGEVISPSAVAYSAIDEQRTSRLDVGSESELSSEAEERYETCSSGASEAPDAHENTSPSWSGGGGDGGGGARYRSIISDVFDGKLLSSVQCLICDRVSTRVETFQDLSLPIPSREHLAMLHHTPHTDQDSWLWWLFSWFRSWLYGPAVSLQDCLAAFFSADELKGDNMYSCSRCNKLRNGVKMSGVIRLPEVLCVHLKRFRHELMFSAKVAARVSFPINDLNMAPYCHKECTSSVSRYSLCAVICHAGTAGGGHYTCVARVDDRWYSFDDASVTPLTTHHLASCEAYVLFYRKINPQMATLRQKAAEILESSNSEPNDIKFYISKQWINKFNTWAEPGPIDNSDFVCVHGGVRPERAPHLPALAARLPQPLWDFLYHQFGGGPAVSHAHECGVCARAQHRLRARRARELTAFAELHAMFQDQERPLAVFAISMAWFRQWQAFVRDKARHPPPPVDNTSIVVKQEIEGIVSYVLKPGSDHAQLSEELWRFFTDIYGGGPEVRLSAPPPPRVTRSSRNYSESDREEYCTKSSSEVNLWLQKNRSLQNISRRYKADSDEEIYRKYKRHPTSYDSDDGMEISPTHSHNTIRMENGLSEHAAPDDLNLDSISLKNTPKTCKVRKTKRRTVK-