Monarch geneset OGS2.0

DPOGS207930
Transcript: DPOGS207930-TA (4977 bp)
Protein: DPOGS207930-PA (1658 aa)
Genomic position: DPSCF300090, 524167-540460
RNAseq coverage: 430x (Rank: top 28%)
Annotation
Heliconius: HMEL007068 (E-value 0.0, identity 72.90%)
Bombyx: BGIBMGA000374-TA (E-value 0.0, identity 72.73%)
Drosophila: CG10107-PA (E-value 3e-156, identity 39.17%)
EBI UniRef50: UniRef50_UPI00022C9F3E (E-value 0.0, identity 41.94%) UPI00022C9F3E related cluster n=1 Tax=unknown RepID=UPI00022C9F3E
NCBI RefSeq: XP_971090.2 (E-value 1e-180, identity 45.23%) PREDICTED: similar to sentrin/sumo-specific protease senp7 [Tribolium castaneum]
NCBI nr blastp: gi|350413769 (E-value 0.0, identity 41.94%) PREDICTED: hypothetical protein LOC100745098 [Bombus impatiens]
NCBI nr blastx: gi|270007155 (E-value 0.0, identity 37.39%) hypothetical protein TcasGA2_TC013691 [Tribolium castaneum]
Group
Gene Ontology: GO:0008234 (E-value 2.7e-36) cysteine-type peptidase activity
Gene Ontology: GO:0006508 (E-value 2.7e-36) proteolysis
KEGG pathway: (none listed)
InterPro domain: [1159-1441] IPR003653 (E-value 2.7e-36) Peptidase C48, SUMO/Sentrin/Ubl1
Orthology group: MCL12936 (Single-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207930-TA
ATGGCTCAGTATTACCAAATAGTTGGACAGGGGCATTCATTGACTATTGAGGATTTACAACGTTATTGTCCCAATATAATATGTTCAGATTCGTTGATAGTCAACCAAGATGATCAACAACAATATGCCCAACAACAGGTAGTGGTCCAACAATCTGAACAGCCAAATGATATAATAGTAAGCGAAAGTGTTCAACCACAACTTGTTGTTGGTGATGGCTTGCAATATCAGCAGCAATACATTATCCGCCATGACCCACCATTGCAACAGCCACCTCGTACAGACTTTCAACAACAGCAACAAGTCGTCGTCGTCGTTGTTGTTGTTGTTGAGCAACAACAACACCAGCAAGCCTTACATAGACACCAAGTTTACTACACATCTAATTTATCTGTGGATGCACAGGTTGTGTTGCAACAAGCTCAACAGATCATTGATGCATCTTTGATGCAACAGTCATCCCCACAACGCCCACAACATTTGGTGAACCCAGCGCACATACTATCACAGCCGACTGTCATCCATCCAGCACAACAGCCGCCGCCGCAGCCACCGCCCACACCGACACATGCGGCCCAAGCTGCACCGCAACAACAACAGCAACAGATATCGAGAGCGTCCCCGCAACTGATGCAGCACTTGCAACAACATCAACAGCAGATACACCTACAACAACTCCAGCAGCAGCAGCAACAGCAACAGCAATTGCAACATCATTTACAACAGCAACAACATTTACAATCTCAGCAGATACAGCCTCAGCAAATACAACAAATTGTTATGCATACTGCATATGTTAATCAACCAGGTACACCGACGCGTCCAGCGCCTCCGAGACTTCTCAATCAAGCACTAAACAACCCATCCGGTGTTACGTTAGTACGTGCACCGGTGCGTACGGTGAGGCCACGGAGACAGGCGATGCGATCACAGATGCCCACTCCGCCCTTACAAGTACCTGCCGCAAGACTGCTCAACTCACAACATCAGCAGGCGAATGCTAGTGCTGGTCGCGGTATTAACGCGAACGGCGCCAGTGTGTCCGGCGGGGTGGGGGCGGGGAGGGGCGCGCGCGTCCCCAGGCCTGGCTCGCCTCGCGTCCTGAACCCACATGCTGGGCACGCACAACACAGACCACCACGGTCAAGGACGCCGTTGATGCAGCAAACCCAACAACAACAGCAACAACACCAATTAACGCAACATGTCATCAATCATTCACAAGTTCAACAGCTTAATCAACTAAAACTCGCCACACAACAGCAACAAATCCTCCTCAATCAACAAAACCAGCAACACGTTCAGCAGCCAGCGCGTGTTATAACACCGCAGGGAACTATCGTAACGCAGACCTTACAAACGCCCAATACCCAACATTTAACTGTACAGACCAACCAAAACGGACCCCTACCTCAAACATTGAACGCTACCCAGCGAGCCTCTCCGCATCCGCAGGTCAAAAAGGTCGTAGTTCAACCGAACAACGCCAATGATATGGATCTCGAGGAGAGCATCACAGCGGCGATAGTACAAAAGAATCCAGTCAATGAGAACGTTACCACGCCACAACAATTCCACACGCAACCATCGATGCGACAAAACCACAATCCAGGCGCTCATAGTGTTCAACACATAAACTTTAGCCCACAGCATGGGTTCCAACAACAAATAGCCTATGAACACCACATGCCCCAAACGCCCCAGTCTCTTAATCAGTTGCTCTTAGAATCTGAAAACCCAGATGAGGGATGTCAATATGTAACATTATCCAATGGCCAGAGCATGTCAGTAGCTGAATATAAAAGAATGCAGCATACATCTCGGACTTTAATACCTCACCAGCAACAAGCAAGGGAAAGTGGTAGACAGACATTAGTAAGAAGTAAAGAACAACCACGACCAGTGCAAAGAGTACATCCCATGCAAACACAACAGGCTAATCACATGAACGAGTGCAATGATTCAGCAAGTGGACCTTCTGAACAATCATCAGAATTACC
TTCAGCTAAAATGTTAATATTCCTGCAAAGTGGAGAACAGAGACTTATTACTTTTACCTTACCTAAAGAGAGTTGTACACTTCAAGAAGTTCTTGAACAGGTCAATGTTCCTTTCACAGAAGATACTCAAATACAGTGTATGCAGAATACCAGTACTGAAATAGATTATTTTGTAACAGTAGGTTCAACTATGAGAATTGAGGAAATGTTAGAAAATCATCCAATGTTAGTTGGAGGAGATGTTAGTAATAGTTCCACAAATATATCATCACAAGCACAGAGCAGTGAAATGTCCACTCCCGAAAAAGGACCATGCCCTGAAAATGCAAATAATAGTCCAGAATCTCCGGAACAAAGATCGCCGCCGCCGCGGTATGTTGACGGTATGCTGGCTGTTTGTAAATCTTGTGGTTACACTGGATTCGACTTCAGCAGGTGTCAAAGATGTAAAAGAGTTTTCACAGAAGAGCCTAAAAGTGTACCAATACCAAATAAAAATGTAGACCTCAAGAAAAAGGAACCAGAAAAATCTTTATTAGACAAACACCTCCCTTGTGGAGAAAGTATTAAATTAAATCTCCTCAAATCAAATGTGAAGACTATAAACAATAAAACTATTATACAAGAGAAAAAACCAACACGTGTGAGAAAACAGCGAGGTAAGCAGCCAGATCCTGAACCAGTTATATTGACCTTAAGTTCGGATGAAGAGGATTCAAACAGTTCAATGTTAAATAACCAGCACGAACAATCAATAAACATGAAGGAGCCTTCGTTGAGTGAAATTGAAGGTGGCACTTCAGATAGCGGAATTGGAATTGATAATATGGATGATGAAAGCAGAGAAGAGATATTTCACAATCTACAAGGTATTGAGGCTTTTCTCAATTGCCGGACAATTAGGATAGGTTCATACAGATACTCTCCCAAAGAAAAGGTTCACATATCATCTAAAGGTATAAAAATTGTTGCTCCCTCATTAAAAAATGAAAACCGAGATGTTGCTCTTCAAATACCATTGAAAGATGTTGTCCGTATGCTTGTACATTTTGGCAAAGGCCTACCTGTAATTTTTTTATACACAATGAACAAGTGTGGTGTATTAATAAGGAAAGCATTGGAAATGATGGAAGAATCAGGTCAGTTTTATAATCCATCATCAAAAGAAGATCCATACAAAAGAATAACTTTATTACCTGATGTTCTCACTGACGAGACGAAAACTGTTTTTAAATCAATATTTGGTAAAAGAATGGAAGAGCTTACGGCTAAAGAAGCAAACGAAATACTTGTTAGAACCTGTCCCATAGAAAATAGTAACACTTCAAAATTAATAACAAGGTCAATGAGCACTTCCATTAATTCAGGAAATAAAAGTACACATCCAGCTGAGATCAGGCAAATATTAATATATCCACCTGGAAAAGGAGGGATTCCCATCAACACAGAAGACTATATGTGCTTAGCTCAAGATCAGTTCTTGAATGACGTTATAATTGATTTTTATTTAAAACATTTAGTTCATGATATCTTAACAGCTAGTCAGAGAGAAAAGACTCATATCTTTAGTACCTTTTTTTATAAAAGGCTGACAACAAAACCCAGCAAGGTTAATAAGAGTTCAAATCCCCATGAATGGGATAGCACTTTAACACCAGCACAGAAACGTCATGCGAGGGTGAAGACATGGACAAAGAATGTTAATATATTTGAAAAAGATTTCATAGTAGTTCCTATAAATGAAAATTGCCATTGGTTTGTGGCTATTATATGTTATCCAAGCTTAGATGGGTGCAGAAGTATGATCGACAATAGAGCTGTTATACCACATGTGACTAAGAGAAGAGAACGTAAATCCTCAATGCAAATAGGTAGTACTACAATTACACCATTGACAAAACAGGAACAATTGACATTGAATTGTGACTCTGATAATCTTAGTGAAAGGGATGAAGCGGAAGCTGAGGAAAGTGACCTTGACATGCAATGTGATTCTGACG
ACGAAGAAAGTGAAAAAGTAGTTGAGAAGAAACCCGAAACCCCTGTAGTTAGAAAAAATGAACCCATTAAGCAGCCCTGTATTTTGATATTTGATTCATTGGCTGGAGCTTCAAGATCCAGAGTTGTGGCTACGTTACGGGATTACCTTACTTGTGAATATCAGGCAAAGATATCTCCACATAAAGTATTCAACAAAGATAATATTAAAGGTAGCTGTCCGAAAATTCCTCAACAAAACAATTTTACAGATTGTGGCTTATATTTATTACAATATGTAGAACATTTTTTTAAGGATCCTATTGTGGATTATTCTCTGCCCATAAAACAGTTAGCCAACTGGTTTGATGAAATTGTAGTAACAAGAAAACGTGAAGAGATTTCAATATTGTTAAAGAATTTAATGCACAAATATAACCCAGACTCGCATTTAGCTTTGCCTGACATTACATTCCCAACATTACATGGTAAATTAATAGAAAATGAAGAACAAGCAGAGGAAAGAATGGATTGTGAAAAGGGTCCTTCAAATAAAGTACACAAAAACGACAAGGAATCTGAGGGACCAACTGTTACATTTCTAAAACAAACTCCATCCGGTGAAATTCTAGTTAAAAGAAATTTTGGTGAAGAGACAATACTAAGAAAGACAATAAGACTCTCTAATGATTCAGAAAATAGATCTGTAGTGCAAATCAAACAGGAATATGGTAAAAAAGGTGACAATGAAAATCAAATAGTTATACCAATTAAACTGCACTCTAGTGATTTAGTTCAAAACTCTTTAATTGTGAATAAAGGTAAAAACTTTAGTGATAAAAAACATGGTGTAAATAATCAGACTGTATCCAGTGTTAACTGTGTCCGACAAAATGTTAGTGAGAGTGACAGCAATTCATTTCTTAAAGGTAGAAGGATAAATAAATTAGAGGATGTGGTTAATGATTCAAATAAGAAGTTTAAAAGGAATGAATGCTAA

Protein sequence:

>DPOGS207930-PA
MAQYYQIVGQGHSLTIEDLQRYCPNIICSDSLIVNQDDQQQYAQQQVVVQQSEQPNDIIVSESVQPQLVVGDGLQYQQQYIIRHDPPLQQPPRTDFQQQQQVVVVVVVVVEQQQHQQALHRHQVYYTSNLSVDAQVVLQQAQQIIDASLMQQSSPQRPQHLVNPAHILSQPTVIHPAQQPPPQPPPTPTHAAQAAPQQQQQQISRASPQLMQHLQQHQQQIHLQQLQQQQQQQQQLQHHLQQQQHLQSQQIQPQQIQQIVMHTAYVNQPGTPTRPAPPRLLNQALNNPSGVTLVRAPVRTVRPRRQAMRSQMPTPPLQVPAARLLNSQHQQANASAGRGINANGASVSGGVGAGRGARVPRPGSPRVLNPHAGHAQHRPPRSRTPLMQQTQQQQQQHQLTQHVINHSQVQQLNQLKLATQQQQILLNQQNQQHVQQPARVITPQGTIVTQTLQTPNTQHLTVQTNQNGPLPQTLNATQRASPHPQVKKVVVQPNNANDMDLEESITAAIVQKNPVNENVTTPQQFHTQPSMRQNHNPGAHSVQHINFSPQHGFQQQIAYEHHMPQTPQSLNQLLLESENPDEGCQYVTLSNGQSMSVAEYKRMQHTSRTLIPHQQQARESGRQTLVRSKEQPRPVQRVHPMQTQQANHMNECNDSASGPSEQSSELPSAKMLIFLQSGEQRLITFTLPKESCTLQEVLEQVNVPFTEDTQIQCMQNTSTEIDYFVTVGSTMRIEEMLENHPMLVGGDVSNSSTNISSQAQSSEMSTPEKGPCPENANNSPESPEQRSPPPRYVDGMLAVCKSCGYTGFDFSRCQRCKRVFTEEPKSVPIPNKNVDLKKKEPEKSLLDKHLPCGESIKLNLLKSNVKTINNKTIIQEKKPTRVRKQRGKQPDPEPVILTLSSDEEDSNSSMLNNQHEQSINMKEPSLSEIEGGTSDSGIGIDNMDDESREEIFHNLQGIEAFLNCRTIRIGSYRYSPKEKVHISSKGIKIVAPSLKNENRDVALQIPLKDVVRMLVHFGKGLPVIFLYTMNKCGVLIRKALEMMEESGQFYNPSSKEDPYKRITLLPDVLTDETKTVFKSIFGKRMEELTAKEANEILVRTCPIENSNTSKLITRSMSTSINSGNKSTHPAEIRQILIYPPGKGGIPINTEDYMCLAQDQFLNDVIIDFYLKHLVHDILTASQREKTHIFSTFFYKRLTTKPSKVNKSSNPHEWDSTLTPAQKRHARVKTWTKNVNIFEKDFIVVPINENCHWFVAIICYPSLDGCRSMIDNRAVIPHVTKRRERKSSMQIGSTTITPLTKQEQLTLNCDSDNLSERDEAEAEESDLDMQCDSDDEESEKVVEKKPETPVVRKNEPIKQPCILIFDSLAGASRSRVVATLRDYLTCEYQAKISPHKVFNKDNIKGSCPKIPQQNNFTDCGLYLLQYVEHFFKDPIVDYSLPIKQLANWFDEIVVTRKREEISILLKNLMHKYNPDSHLALPDITFPTLHGKLIENEEQAEERMDCEKGPSNKVHKNDKESEGPTVTFLKQTPSGEILVKRNFGEETILRKTIRLSNDSENRSVVQIKQEYGKKGDNENQIVIPIKLHSSDLVQNSLIVNKGKNFSDKKHGVNNQTVSSVNCVRQNVSESDSNSFLKGRRINKLEDVVNDSNKKFKRNEC-
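
The lengths listed in this record (a 4977 bp transcript and a 1658 aa protein) can be cross-checked against each other, since a coding sequence with a terminal stop codon should contain three bases per residue plus three for the stop. The following is a minimal sketch of that check, assuming the sequences are available as FASTA text; the function names `read_fasta` and `cds_matches_protein` are hypothetical helpers introduced here for illustration and are not part of any database tooling.

```python
def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped sequence lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is not None:
            records[header].append(line)
    return {h: "".join(parts) for h, parts in records.items()}


def cds_matches_protein(cds_len, protein_len, has_stop=True):
    """Return True if cds_len bases encode protein_len residues.

    Assumption: when has_stop is True, the coding sequence includes
    one terminal stop codon that is not counted in protein_len.
    """
    codons = protein_len + (1 if has_stop else 0)
    return cds_len == 3 * codons


# Figures taken from the record above: 4977 bp transcript, 1658 aa protein.
print(cds_matches_protein(4977, 1658))  # True
```

When applying this to the sequences themselves, note that the protein sequence above ends in `-` to mark the stop, so that character would need to be stripped before counting residues.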