Monarch geneset OGS2.0

DPOGS210139
TranscriptDPOGS210139-TA2079 bp
ProteinDPOGS210139-PA692 aa
Genomic positionDPSCF300261 + 33511-37254
RNAseq coverage496x (Rank: top 25%)
Annotation
HeliconiusHMEL0116098e-17162.65% 
BombyxBGIBMGA003787-TA0.063.35% 
DrosophilaUlp1-PA2e-6253.73% 
EBI UniRef50UniRef50_D6WMX82e-8535.03%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WMX8_TRICA
NCBI RefSeqXP_002433046.12e-8550.00%sentrin/sumo-specific protease, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2700078696e-8535.03%hypothetical protein TcasGA2_TC014610 [Tribolium castaneum]
NCBI nr blastxgi|2700078694e-9135.88%hypothetical protein TcasGA2_TC014610 [Tribolium castaneum]
Group
Gene OntologyGO:00082345.1e-42cysteine-type peptidase activity
GO:00065085.1e-42proteolysis
KEGG pathway 
InterPro domain[512-685] IPR0036535.1e-42Peptidase C48, SUMO/Sentrin/Ubl1
Orthology groupMCL15527 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210139-TA
ATGCAGTCCATAGTGGCTTATGTGCGTACATTACTTGGTTGGTGCGACGATAATCCTATAGTCAGGACAAGCTTCAAAAGAGACAGGAACGACGATGATTTCTCCTCAGATGAGGAATCTCCCGTTTTGAAGAGATTAAAAAAATTGCCTAGAACATCATTCCCCGATGTCATGGATTCCAAAGATATGTGGAGTGAAGAAAAACAAAACAAGAATGTTAGGTATATACCAATTCAAATCGAATCTGACCACCCGAGCACCTCGACGCCTATTGATGTAACATCTAAGTTGGGGAGCCGGTTTCGGACGGTCCCAATGAGGTTAACATCAAATGGACCTTCACCCGTTCAAAGACCCAGGAAACACACCCCAGTGAGACCTGTCACTCAACCTGTGATAGACGATGAGGATGATCAGCAAGAGGTGACATGGGTTGAAGCTAAACCTGAAACTAAGAAGCCCTCAAAGGTTTACATAAATATTGATGATGAAGATGAAGACTATGAAAATCAAGATGATGATGTAATATTTGTTAAAGCGGTGTCATCACCACCACCCATCAGACCGTATAAATTCTTTGTCAGTAAAAATAACAATGAATCAAATTCTAAAGACGATGAATATGTTAAGTTCATAAAAGTAGGTTCCAAGTTAAATGATAGAAGTTCTTTATCTCCTAGAGCATACACTAAGTTTAAAGCCCCGGCTGGAATCACTAAAGTATATAAAAAAGCCCCCACTCCTGTACCGAGATGGATGCAAGCACACAAGACTAACCTTCAACAATCTAGAAGTAAACTCACCAATGGTAACGGAAAATCTGCAATGGCAGAAGTATTCAATTTAGACGAAAAAAGAAATTACCAGGAGCTCATCAGGAGAGTGACTGGAAATATAAAACCATCGTCGATAGGCAAACCTTTGGATATCAAGAACTTGGCAGATGAGTCCGCTGCTTTCAGATCAACTCTCAGGTCACAGAAGAGTGCGTTAAATGAATTGAAACTGGTGGAACAAGGACTCAGTAGCATTAAAGACAATGAATCCACTCAAGAATATGATCCCATCACGGTAGCATCCATCAACTCCTCCGACTCAGAAGTGGAAATCATCACATCGGATGCTTCTACATCATCATCCGTCAAAATCGATCCCATCAATACACTCAAAGATTCGTACAGAGACAGAGCCATAACATCCACGGACTGGCTCTCTAAACTGGACAGTAAATATAGGAAAAAGAGGCAGGAGACACAGGAGAAGCTGAAAGATGCACGGCGGGAGTCAGATATAATATCTAAAGTTAACTATGAACAAAAATTAGCACATCTGGAGCACAAGCTCAAGTATGAGTTGAGCATACCGGAGAGTCTCATAGAGGAGGTCCAGCCCACTGTGGAGTTACCACCTCTGACGCCAGAACAAGAGAAACTAGTCAACAGAGCTCTGGGACCTGGACCACCTGGGCAGTTGCTGGTAGAGAAATTTAACTTAAGGATACACAGGCGGGATCTCCAAACTCTGGCGGGTTTGAATTGGCTCAACGACGAGGTGATCAATTTCTACATGAACCTGCTGATGCAGAGGAGCGAGGAGCGCAAGGAGCTGCCGCGGGTGTACGCCACCAACACCTTCTTCTACCCTAAGCTGATGCAGAGCGGCCAGGCGGGCCTCAGGAGGTGGACGAGGAAGGTGGACATCTTCGGCCACGACTTGATGGTGGTCCCGGTCCACCTTGGCGTCCACTGGTGCCTCAGTCTCATCGACTTCCGGGAGAAGAAGATCTCGTACCTGGACAGCATGGGCGCGAGAAACGAGCCGTGCCTCGCCGCGCTCCTACAGTACCTGAGGGACGAGCACCAGGACAAGAAGGGACAGGCCTTCGACGACGCGGGCTGGAAGACGGAGAACATGAAGGACATCCCCCAGCAGATGAACGGCAGCGACTGCGGCATGTTCGCGTGCACGTTCGCCGAGTTCAGCTCGCGAGGGGCGCGGTACACCTTCAGCCAGGCGCACATGCCCTACCTGCGGAGGAAGGCCGCCCTGGAGATCCTGCAGGCGCGTCTGCTGCTCTAG

Protein sequence:

>DPOGS210139-PA
MQSIVAYVRTLLGWCDDNPIVRTSFKRDRNDDDFSSDEESPVLKRLKKLPRTSFPDVMDSKDMWSEEKQNKNVRYIPIQIESDHPSTSTPIDVTSKLGSRFRTVPMRLTSNGPSPVQRPRKHTPVRPVTQPVIDDEDDQQEVTWVEAKPETKKPSKVYINIDDEDEDYENQDDDVIFVKAVSSPPPIRPYKFFVSKNNNESNSKDDEYVKFIKVGSKLNDRSSLSPRAYTKFKAPAGITKVYKKAPTPVPRWMQAHKTNLQQSRSKLTNGNGKSAMAEVFNLDEKRNYQELIRRVTGNIKPSSIGKPLDIKNLADESAAFRSTLRSQKSALNELKLVEQGLSSIKDNESTQEYDPITVASINSSDSEVEIITSDASTSSSVKIDPINTLKDSYRDRAITSTDWLSKLDSKYRKKRQETQEKLKDARRESDIISKVNYEQKLAHLEHKLKYELSIPESLIEEVQPTVELPPLTPEQEKLVNRALGPGPPGQLLVEKFNLRIHRRDLQTLAGLNWLNDEVINFYMNLLMQRSEERKELPRVYATNTFFYPKLMQSGQAGLRRWTRKVDIFGHDLMVVPVHLGVHWCLSLIDFREKKISYLDSMGARNEPCLAALLQYLRDEHQDKKGQAFDDAGWKTENMKDIPQQMNGSDCGMFACTFAEFSSRGARYTFSQAHMPYLRRKAALEILQARLLL-