Monarch geneset OGS2.0

DPOGS210043
TranscriptDPOGS210043-TA4023 bp
ProteinDPOGS210043-PA1340 aa
Genomic positionDPSCF300017 - 1228420-1251304
RNAseq coverage219x (Rank: top 45%)
Annotation
HeliconiusHMEL0059070.085.71% 
BombyxBGIBMGA000473-TA0.080.70% 
Drosophilapcm-PB0.052.98% 
EBI UniRef50UniRef50_G6CU560.0100.00%Putative 5-3 exoribonuclease 1 n=4 Tax=Eukaryota RepID=G6CU56_DANPL
NCBI RefSeqXP_001603129.10.063.67%PREDICTED: similar to 5-3 exoribonuclease 1 [Nasonia vitripennis]
NCBI nr blastpgi|1565371190.063.67%PREDICTED: 5'-3' exoribonuclease 1-like isoform 1 [Nasonia vitripennis]
NCBI nr blastxgi|3454791630.064.02%PREDICTED: 5'-3' exoribonuclease 1-like isoform 2 [Nasonia vitripennis]
Group
Gene OntologyGO:00045273.9e-101exonuclease activity
GO:00056223.9e-101intracellular
GO:00036763.9e-101nucleic acid binding
KEGG pathwaynvi:1001193390.0 
 K12618 (XRN1, SEP1, KEM1)maps-> RNA degradation
InterPro domain[1-228] IPR0048593.9e-101Putative 5-3 exonuclease
Orthology groupMCL11199 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210043-TA
ATGGGGGTACCGAAGTTTTTTAGATACACCAGTGAACGGTATCCTTGTCTTAATGAGTTAGTTAAGCAGTATCAGATTCCAGATTTTGACAACATGTACCTGGATATGAATGGAATCATACACAACTGTTCTCATCCTGATGATTCCAATCCACATTTCCGTATCACGGAAGAGAAAATATTCAAAGACATATTCCATTACATCAGCATTTTGTTTCAAATTATTAAGCCCAAGAAGCTATTTTTTATGGCTATCGATGGTGTAGCCCCAAGAGCTAAAATGAATCAGCAGAGGGGAAGGAGGTTCCGATCAGCAAGGGAAGCTGAAAAGTTAGAGGAAACTGCTAAAGAAAAGGGTGAGGCTCTACCGACGGAAAAGAGGTTTGATAGCAACTGTATCACTCCCGGAACAGTGTTCATGGCTCGTCTTCATGAACAGCTCAAATATTTTATCAAAGAGAAGATATCAACGGACCCTCTTTGGTCCAAAGTTAAAGTTATTCTGTCCGGACATGAGACGCCCGGTGAAGGAGAACACAAAATAATGGATTACATTCGCTGGGCTCGCTCGCAGCCCGACTATGATCCCAGCACCAGACACTGCTTGTATGGACTCGATGCAGACCTCATTATGTTAGGAGTTTGCACACACGAACCACACTTTGCCTTGCTGCGAGAAGAGGTTAAATTTGGCAAAACAACTCAAAGGGCAACCAGCCCCGAAGAAACTAATTTCTACTTGCTCCACCTATCACTACTAAGGGAATATTTGGAGCAGGAATTCATATCCATCAAGGATAATCTGCCATTCCCTTACGATATTGAAAATATTATTGACGATTGGGTTCTCATGGGGTTCTTAGTGGGCAATGATTTTATACCAAACTTGCCCAACATGCATATCAGCAATGACGCTCTGCCGCTCCTGTACAAAACATACATGACTGTCCTGCCTACTTTGGACGGCTATATAAATGAATCGGGAGATTTGAATTTAGGGAGGTTTGAAGTATTCATGCAGGAGCTGGCTAAGATTGATAAAGAGAAATTTCAAGACACTTATGCCGACTTGAAATACTTTGAAGCCAAAACTGGCAGACGGCCCAACGCTAACGAGAGGAGAGATTACAAGCCCAACAATGACGACACATTCAATGTCAACTTGGACGATATCAAAGCCAACAAGCCAGATGACGAACTGCAGGCTCTTATTGATGCTACACAGGAAATGTTTATGGATGACATGAAGAGCGATGAAGACTATGAAGAAACTAGTGATGAGGAAGCGAACATGGAGATGGAATTCATTCTACATAAGAAAGACTATTACATGAACAAGTTGGACTATTCAAAGGTTACCGACGAGGTGCTATCAGACCAAGCCGAGTGTTATGTCCGAGCCATCCAGTGGAATCTGTGGTACTACTACCGAGGCTGCCCCTCCTGGTGTTGGTACTACCCTCACCACTACGCTCCATACATCTCGGATATCAAGGACTTCGGGAACATGAATATGGAGTTTGAGCTGGGAGAACCGTTCAAGCCCTTCGAGCAGTTGTTGGCGGTGTTGCCAGGCGCCAGTAAGCACCTCCTGCCGACTCCGTTCCACGACCTGATGACGGACGAGGACTCGCCCATAGTCCACTACTACCCGGTCTCCTTCGAGACCGACCTCAACGGGAAGAAGAACGACTGGGAAGCTGTCGTCCTTCTGCCCTTCATCGACGAGACGAACCTGCTATCGGCCATGTCTCCTTGCTACCAGCGACTCACTGAAGAGGAATTGAAGAGAAATTCTCACGGACCGATGTTAGTTTACAACTGGACAGCTGACAGTTTAGGACCAATAATATCCCCGGAATATTTCCCATCGATAAAAGAAAATCACGCCGTGGAAAAGGCTGTGTGGCGACACGAGCTCGACGTGCCGCTGCATCTGTTGAAGCGGGGGATGCTGCCCGACGCCGACAAGGACGTGCTGTACCCGGGCTTCCCCACCATGCGCCACCTGAAGTACAAGACGAGTATAAAGAAGTGCAAGGTGAAGGTTTTCGACCAGCCGTCGCGCAATGAGAATATGATGCTGCAAATTGTACCCACTGCCACAACTGACCCCGCGCTAGAGGAACTGGCCGCCAAGATACTGGGCCAGGTCGTGTGGGTCGGCTGGCCGCATTTAACACGGGCCAAAATTAATAATATAGGTTTAATTGTTATATGGGTAGGGTATGCAGTGCTTATTTGCGTCTATGGTGTGCACGCCACTGATATCCCGATCCCTGTAGGTCAAGTGCTGAGTTCAGAGTCTATTCGTAGCGGTCGTATAAAGGTGTCCGTCCGCGAATGTACGGAGCCGGTAGTGGCCAGCCAGTGTTTCGTGAGCCCCTACCGCACCACTCACCACGCGGCCGCTTCATGCGGGATATCTAATCAGCTATTGTCCCGTATAACGGGAACAGTCCTCGTCATACCCGGAGAGAGAAACGATCTCCCCACTGAGACGCAGAACAAAATAAACGTCGGACTAAATCTTAAGTTCAATAAGAAGAACCAGGAGGTGTCGGGTTACAGTCGTCGGAGTGCGAACGGCTGGGTGTATTCCCCCCGGTGTGTGGCGCTCGTGCAGGAGTACGCGGCCAAATATCCTGAATTGTTCGACGCCCTCGCCAACGCTCATAGAGACGTGTTCTTTGAAAGCGATTTATGGCCCGGGGATTTGGGCAAGAACAAAGTTCAAGATATAGCGGCCTGGTTGAAGTCCCAGCCTCACAGCAGCGCGCCGAGGCGAGAGTGCGGCTCCGAGGCCTTGGAACCGGAAGAGATGAGGGCTCTCTACAACACGCTGGAGACACAGATACGCGACCTCAAGGACAAGGAGAAGAACGTCACGCTGCACGTCAAATGGAGCCTGCTTTATAAGTGTGAACTCCACGAGGGGAACATCCAGCCGGATGTCAAAGCTGACTACCGGATGTACGACCGCGTGGTGTGCGTGGCCAGCAATATAACTGTGCCTCTGGGGTCAAAGGGCACCATCACCGCCATCTATCAGCCGTCCAACGGAAACACCGTCCGCCTATCTGACAAGTTGAACGCCTCGCCCAGCTACCAGGTCATGTTTGACGAACCCTTCCCTGGCGCCATGAAGGAAGATCTGTTCGAGGAGGCCAGGTTCTATAGGATGCAGCCCGCTCATATATTGAATATATCATACGGGCGGAAATTACGCACCGCGAGCGAGCCCCAGGGCTTCGAGTACAACCAGTCGGCACAACATAACTACACTTGTAACTCACAACCGCCCACCGTACTACGGAGAGATGACGGACACTACTCTGCCTTCGCCAGCTACAGCCCCCCGCGAGAGATCAAGACACCAGTGATTGAACATAAGCCTATCGTTAACAACAACGTTAAAAACGGCCAGACGCCGGACAGCGCCACCAACCTGCTGAGAAGTCTGTTGAGGATCAGCGAGGGAGAGGCGGACGGATCCAGGAGCAATAAGAATGTTCCAGAGACGAACAGCAACTGGCGTTCAAGAAGTGACAAAGCGACCTCGCCGAATAAAACAACTCAAAACAACTGGCGAAGAGAGGCAAATACATACAGTCAGGGAGAGTGGAGCAACACACAGAGACAGAAGCCTATCGGGATGCCATCTATGCCGTACCCGTGTTTCGGCGCCTCGCCCCCTCGCCCGCACCAGCCCCAAAGCTTCCCCAAACACTTACCGGACAACATAAAATCCGTTCCGCAGCCGGCTCAGCAAACAAACAGACAAGTCAACAACGGGGAGAAGTACAGCAATCCATTTGTTCCGCTGCAAGTTCAAACCAGCCGGAGGCGCGTCCAAAACTCAAGTGGTTCATCACAGAGACGTGATCTCGAAGGACTACCGACACCGAAGGTCATTCACCCCACACCAAATAACACCTTGTTTAATGTTCAGCCGCAGCAGAATCGTCCTCAGAGGAAGAAAAAACCAAGAATAGCTGCCAACTTGCCCTTCCAGATGGACTAA

Protein sequence:

>DPOGS210043-PA
MGVPKFFRYTSERYPCLNELVKQYQIPDFDNMYLDMNGIIHNCSHPDDSNPHFRITEEKIFKDIFHYISILFQIIKPKKLFFMAIDGVAPRAKMNQQRGRRFRSAREAEKLEETAKEKGEALPTEKRFDSNCITPGTVFMARLHEQLKYFIKEKISTDPLWSKVKVILSGHETPGEGEHKIMDYIRWARSQPDYDPSTRHCLYGLDADLIMLGVCTHEPHFALLREEVKFGKTTQRATSPEETNFYLLHLSLLREYLEQEFISIKDNLPFPYDIENIIDDWVLMGFLVGNDFIPNLPNMHISNDALPLLYKTYMTVLPTLDGYINESGDLNLGRFEVFMQELAKIDKEKFQDTYADLKYFEAKTGRRPNANERRDYKPNNDDTFNVNLDDIKANKPDDELQALIDATQEMFMDDMKSDEDYEETSDEEANMEMEFILHKKDYYMNKLDYSKVTDEVLSDQAECYVRAIQWNLWYYYRGCPSWCWYYPHHYAPYISDIKDFGNMNMEFELGEPFKPFEQLLAVLPGASKHLLPTPFHDLMTDEDSPIVHYYPVSFETDLNGKKNDWEAVVLLPFIDETNLLSAMSPCYQRLTEEELKRNSHGPMLVYNWTADSLGPIISPEYFPSIKENHAVEKAVWRHELDVPLHLLKRGMLPDADKDVLYPGFPTMRHLKYKTSIKKCKVKVFDQPSRNENMMLQIVPTATTDPALEELAAKILGQVVWVGWPHLTRAKINNIGLIVIWVGYAVLICVYGVHATDIPIPVGQVLSSESIRSGRIKVSVRECTEPVVASQCFVSPYRTTHHAAASCGISNQLLSRITGTVLVIPGERNDLPTETQNKINVGLNLKFNKKNQEVSGYSRRSANGWVYSPRCVALVQEYAAKYPELFDALANAHRDVFFESDLWPGDLGKNKVQDIAAWLKSQPHSSAPRRECGSEALEPEEMRALYNTLETQIRDLKDKEKNVTLHVKWSLLYKCELHEGNIQPDVKADYRMYDRVVCVASNITVPLGSKGTITAIYQPSNGNTVRLSDKLNASPSYQVMFDEPFPGAMKEDLFEEARFYRMQPAHILNISYGRKLRTASEPQGFEYNQSAQHNYTCNSQPPTVLRRDDGHYSAFASYSPPREIKTPVIEHKPIVNNNVKNGQTPDSATNLLRSLLRISEGEADGSRSNKNVPETNSNWRSRSDKATSPNKTTQNNWRREANTYSQGEWSNTQRQKPIGMPSMPYPCFGASPPRPHQPQSFPKHLPDNIKSVPQPAQQTNRQVNNGEKYSNPFVPLQVQTSRRRVQNSSGSSQRRDLEGLPTPKVIHPTPNNTLFNVQPQQNRPQRKKKPRIAANLPFQMD-