Monarch geneset OGS2.0

DPOGS213058
Transcript: DPOGS213058-TA; 1458 bp
Protein: DPOGS213058-PA; 485 aa
Genomic position: DPSCF300016 - 1061324-1064231
RNAseq coverage: 224x (Rank: top 44%)
Annotation
Heliconius: HMEL010336; 0.0; 92.78%
Bombyx: BGIBMGA007676-TA; 0.0; 92.18%
Drosophila: CG7288-PA; 0.0; 72.18%
EBI UniRef50: UniRef50_Q17K27; 0.0; 75.38%; Ubiquitin specific protease 39 and snrnp assembly factor n=6 Tax=Eukaryota RepID=Q17K27_AEDAE
NCBI RefSeq: XP_974413.2; 0.0; 80.53%; PREDICTED: similar to ubiquitin specific protease 39 and snrnp assembly factor [Tribolium castaneum]
NCBI nr blastp: gi|307184634; 0.0; 76.63%; U4/U6.U5 tri-snRNP-associated protein 2 [Camponotus floridanus]
NCBI nr blastx: gi|189235059; 0.0; 80.53%; PREDICTED: similar to ubiquitin specific protease 39 and snrnp assembly factor [Tribolium castaneum]
Group
Gene Ontology: GO:0006511; 2e-42; ubiquitin-dependent protein catabolic process
  GO:0004221; 2e-42; ubiquitin thiolesterase activity
  GO:0008270; 5e-14; zinc ion binding
KEGG pathway: tca:663264; 0.0
  K12847 (USP39, SAD1) maps-> Spliceosome
InterPro domain: [151-480] IPR001394; 2e-42; Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2
  [30-129] IPR013083; 1e-19; Zinc finger, RING/FYVE/PHD-type
  [52-113] IPR001607; 5e-14; Zinc finger, UBP-type
Orthology group: MCL12198; Single-copy universal gene

Nucleotide sequence:

>DPOGS213058-TA
ATGAGTAATGGAGATACAATGAAACGGAAACTTAGCGGTGAAAATGGTATAGAGGCTACAGAACCAGCAATTAAAAGTAGCAAGACTCACATTCAGTGTCCCTATTTGGATACTATTAATAGGCATGTGTTGGATTTTGACTTCGAAAAATTGTGCTCCGTTTCTCTTACAAGAATAAATGTATATGCTTGTCTGGTCTGTGGGAAGTACTTCCAGGGCCGTGGTACTAATACACATGCATACACACATTCTGTAGCAGAGGGCCATCATGTGTTCCTCAATTTGCATACTCTTAAATTTTACTGTCTACCAGATAATTATGAAGTCATTGATTCATCTCTCAATGATATAAAATATGTTTTGAATCCCATCTTCACACCGGACCAAATAAAGCAGTTGGACCAAAATATTAAAATGTCAAGAGCAATAGACGGAACCATGTATATGCCTGGTATAGTCGGACTTAACAATATCAAGGCCAACGATTACTGTAATGTTATATTACAGTGTTTAGCACAAGTGAGACCACTTCGTAATTACTTTCTACGGGAAGAAAACTATGCTGATGTGAAAAGGCCTCCTGGTGATTCTTCATTCCTGCTGGTGCAAAGATTCGGTGAACTTTTGCGTAAGCTCTGGAATCCCAGAGCATTTAAAGCCCATGTGTCTCCTCATGAGATGTTGCAAGCTGTTGTGTTGTGGTCTAAAAAGAGATTTCAATTCATCAAACAGAGTGATCCGATCGATTTTCTATCTTGGTTTCTCAACTCGCTCCACATGGCCTTGAATGGAACCAAAAAGCCAAACAGTTCCATAATATATAAATCTTTTCTTGGTCACATGAGAATATATACTAGGAAACTACCTCCCCCAGATGCTGATGAGGCAGCAAATGTGGACTTAAATAGTGAAGAATATAACGAAATGATAACAGAATCACCATTTCTTTATTTAACATGTGACTTACCTCCAACACCGCTTTTTACTGATGAATTTAGGGAAAATATTATCCCTCAGGTTAATCTCTATCAGCTTTTATCAAAATTTAATGGGCAAGCATCGAAAGAATATAAAACGTACAAGGAGAATTTCCTAAAAAGGTTTGAAATAACCCAACTTCCACCGTACCTGATATTATATATAAAGAGATTCACGAAAAACACATTCTTTGTAGAAAAGAATCCCACTGTTGTTAATTTTCCAGTGAAGAATGTTGATTTTGGAGATATTCTTACGCCTGAAATAAAAGCAAAACACAATGGCAAAACTATATACGAGCTTGTTGGTAATATAGTACATGACGGAACACCCGAGAAAGGCACTTATAGAGCTCATGTGCTGCACACACCTACACAGCAATGGTATGAAATGCAAGATCTTCATGTAACCAGCATTTTGCCGCAAATGATCACTCTCACAGAAGCTTATATACAAGTGTATGAATTGAAACAGGATTAA

Protein sequence:

>DPOGS213058-PA
MSNGDTMKRKLSGENGIEATEPAIKSSKTHIQCPYLDTINRHVLDFDFEKLCSVSLTRINVYACLVCGKYFQGRGTNTHAYTHSVAEGHHVFLNLHTLKFYCLPDNYEVIDSSLNDIKYVLNPIFTPDQIKQLDQNIKMSRAIDGTMYMPGIVGLNNIKANDYCNVILQCLAQVRPLRNYFLREENYADVKRPPGDSSFLLVQRFGELLRKLWNPRAFKAHVSPHEMLQAVVLWSKKRFQFIKQSDPIDFLSWFLNSLHMALNGTKKPNSSIIYKSFLGHMRIYTRKLPPPDADEAANVDLNSEEYNEMITESPFLYLTCDLPPTPLFTDEFRENIIPQVNLYQLLSKFNGQASKEYKTYKENFLKRFEITQLPPYLILYIKRFTKNTFFVEKNPTVVNFPVKNVDFGDILTPEIKAKHNGKTIYELVGNIVHDGTPEKGTYRAHVLHTPTQQWYEMQDLHVTSILPQMITLTEAYIQVYELKQD-