Monarch geneset OGS2.0

DPOGS203238
Transcript: DPOGS203238-TA (1464 bp)
Protein: DPOGS203238-PA (487 aa)
Genomic position: DPSCF300210 - 111323-117607
RNAseq coverage: 105x (Rank: top 60%)
Annotation
Heliconius: HMEL005827; 3e-154; 98.46%
Bombyx: BGIBMGA007073-TA; 0.0; 90.69%
Drosophila: not-PB; 1e-171; 56.78%
EBI UniRef50: UniRef50_B0W7A5; 2e-178; 57.51%; Ubiquitin carboxyl-terminal hydrolase n=3 Tax=Culicidae RepID=B0W7A5_CULQU
NCBI RefSeq: XP_395389.2; 0.0; 65.60%; PREDICTED: similar to Ubiquitin carboxyl-terminal hydrolase 22 (Ubiquitin thioesterase 22) (Ubiquitin-specific-processing protease 22) (Deubiquitinating enzyme 22) [Apis mellifera]
NCBI nr blastp: gi|307207001; 0.0; 65.93%; Ubiquitin carboxyl-terminal hydrolase 22 [Harpegnathos saltator]
NCBI nr blastx: gi|156549078; 0.0; 64.71%; PREDICTED: ubiquitin carboxyl-terminal hydrolase 22-like [Nasonia vitripennis]
Group
Gene Ontology: GO:0006511; 3.5e-60; ubiquitin-dependent protein catabolic process
    GO:0004221; 3.5e-60; ubiquitin thiolesterase activity
    GO:0008270; 4.2e-13; zinc ion binding
KEGG pathway 
InterPro domain: [156-474] IPR001394; 3.5e-60; Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2
    [46-105] IPR001607; 4.2e-13; Zinc finger, UBP-type
    [33-116] IPR013083; 2.9e-12; Zinc finger, RING/FYVE/PHD-type
Orthology group: MCL11014; Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203238-TA
ATGAATGACACTGGATGTATACATTTGAACAATTTTAAAGCAGCAAAGGGCACAAATCCCTACAAAATAGTACATGCATTTTTTGTGTCATGCACTTCCTACGAAGCTAGAAGATATAAGGCTACATCATGTCTATGCTTTACTTGTGGAAAGGCAGGTCCCAGAATGCATTCATGCCTGCATTGTATTTTTTTTGCTTGTTACAATGGACACATTCAGGATCATTCCAAGTCTAAAAAACACTTCCTATATGTTGATCTCAGTTATGGTAATGTGTTTTGTGCTCAGTGCCAAGATTACATTTATGACAGAGAATTGACTGAGCTATCAAGGAATTACAGGTTAAAAGAAGCTAAAGCTTTAGGAATCAGTGTGCCATACACACCATGGCTGCCTAACAATAATGAGGTAATGGCTCTGAGAAGATTGCGTAAGAGACGTCTCATAAGTCCTCACACCACCATCGGCCTTCGTGGATTACAGAACCTTGGATCAACATGCTTCATGAATTGCATTGTTCAGACTTTAATCCATACACCTCTGTTACGAGATTATTTCCTGGGCGAGAAGCACAAATGCAAAACCCAGGGCTCCGGTAAATGTCTCGTTTGCGAGGTTTCGAAGCTTTTCCAGGAGTTCTACTCCGGCGCGAAGACCCCTCTGACGTTACACCGTTTACTTCATCTTATCTGGACCCACGCGCGTCACCTCGCCGGCTACGAACAGCAAGACGCTCACGAGTTCTTCATAGCGACGCTTGACGTCTTACATAGACATTGCATGAACGGCGTCGAGGACACTGAGAAGAAGGAGAACGGTCGCTGTAACTGCATCATAGATCAGATATTCACTGGCGGGTTGCAGAGCGACGTCGTGTGTACCTCGTGCTCAGGGGTCTCGACCACCATCGACCCCTTCTGGGACATAAGTCTAGACGTGGCCGGCCCCGGCTCGCTGCAAGCCTGCCTCGAGCGCTTCACCAGAGCCGAACACCTCGGCTCGGCTGCCAAGATAAAATGCTCCAACTGCCGGGCGTACCGCGAATCGACCAAACAGCTGACGCTGGAGACACTGCCGATAGTAGCCAGCTTCCACCTCAAGAGATTCGAACATTCCTCACAGATAGACAGGAAGATATCCGCCTTTGTGTCCTTCCCGGCTGAACTAGATATGACGCCGTTCATGTCCACTCACAGGAGAGCTGTGGAGGCGGCCGATAATAACAACGCCCCCGAGGGCGTGTTTGAGGATAATCGTTACTCGTTGTTCGCGGTCGTCAACCACCTGGGGTCACTGGACGCTGGTCACTACACCGCCTACGTGAGACAGATGAAGGGCAGCTGGTTCAAGTGTGACGATCACATGATAACACGCGCCTCCTTGAGGGAAGTTCTCGATAGTGAAGGGTACCTTCTGTTCTATCACAAGACTGTGTTGGAGTACGAGTGTGAGGTCTCGAGCTAA

Protein sequence:

>DPOGS203238-PA
MNDTGCIHLNNFKAAKGTNPYKIVHAFFVSCTSYEARRYKATSCLCFTCGKAGPRMHSCLHCIFFACYNGHIQDHSKSKKHFLYVDLSYGNVFCAQCQDYIYDRELTELSRNYRLKEAKALGISVPYTPWLPNNNEVMALRRLRKRRLISPHTTIGLRGLQNLGSTCFMNCIVQTLIHTPLLRDYFLGEKHKCKTQGSGKCLVCEVSKLFQEFYSGAKTPLTLHRLLHLIWTHARHLAGYEQQDAHEFFIATLDVLHRHCMNGVEDTEKKENGRCNCIIDQIFTGGLQSDVVCTSCSGVSTTIDPFWDISLDVAGPGSLQACLERFTRAEHLGSAAKIKCSNCRAYRESTKQLTLETLPIVASFHLKRFEHSSQIDRKISAFVSFPAELDMTPFMSTHRRAVEAADNNNAPEGVFEDNRYSLFAVVNHLGSLDAGHYTAYVRQMKGSWFKCDDHMITRASLREVLDSEGYLLFYHKTVLEYECEVSS-
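
The record's lengths and sequences can be cross-checked with a short script. The sketch below is illustrative only and not part of the database record: it assumes the standard genetic code, and the names `translate`, `TRANSCRIPT_BP` and `PROTEIN_AA` are hypothetical helpers introduced here. It confirms that 1464 bp corresponds to 488 codons (487 residues plus one stop codon) and translates the first 18 and last 15 nucleotides of DPOGS203238-TA. Note that it writes the stop codon as `*`, whereas the protein sequence above ends with `-`.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt: str) -> str:
    """Translate a nucleotide string in frame 0; '*' marks a stop codon."""
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


# Lengths as stated in the record.
TRANSCRIPT_BP = 1464
PROTEIN_AA = 487

codons = TRANSCRIPT_BP // 3
# The coding sequence should be a whole number of codons,
# and one of those codons is the stop.
lengths_consistent = TRANSCRIPT_BP % 3 == 0 and codons - 1 == PROTEIN_AA

# First 18 and last 15 nucleotides copied from DPOGS203238-TA above.
n_terminus = translate("ATGAATGACACTGGATGT")
c_terminus = translate("GAGGTCTCGAGCTAA")

print(codons, lengths_consistent)
print(n_terminus, c_terminus)
```

The translated ends should match the start (`MNDTGC`) and end (`EVSS` followed by the stop marker) of DPOGS203238-PA. To check the full sequence, pass the complete nucleotide string to `translate` and compare the result with the protein record.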