Monarch geneset OGS2.0

DPOGS212864
TranscriptDPOGS212864-TA2553 bp
ProteinDPOGS212864-PA850 aa
Genomic positionDPSCF300086 + 388840-395255
RNAseq coverage1340x (Rank: top 9%)
Annotation
HeliconiusHMEL0081884e-17765.93% 
BombyxBGIBMGA000808-TA0.095.24% 
DrosophilaRat1-PA0.064.61% 
EBI UniRef50UniRef50_Q29JY30.065.03%GA10268 n=2 Tax=Arthropoda RepID=Q29JY3_DROPS
NCBI RefSeqXP_392371.20.067.53%PREDICTED: similar to CG10354-PA [Apis mellifera]
NCBI nr blastpgi|3838543460.068.48%PREDICTED: 5'-3' exoribonuclease 2 homolog [Megachile rotundata]
NCBI nr blastxgi|3838543460.068.36%PREDICTED: 5'-3' exoribonuclease 2 homolog [Megachile rotundata]
Group
Gene OntologyGO:00056340nucleus
GO:000453405'-3' exoribonuclease activity
GO:00061390nucleobase, nucleoside, nucleotide and nucleic acid metabolic process
GO:00045273.1e-112exonuclease activity
GO:00056223.1e-112intracellular
GO:00036763.1e-112nucleic acid binding
KEGG pathwayame:4088400.0 
 K12619 (XRN2, RAT1)maps-> RNA degradation
InterPro domain[1-851] IPR01715105'-3' exoribonuclease 2
[1-256] IPR0048593.1e-112Putative 5-3 exonuclease
Orthology groupMCL12602 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212864-TA
ATGGGAGTACCAGCATTCTTTCGATGGCTAAGTCGTAAATACCCTAGCGTTATTGTCGAATGTGTTGAACAGAGGCCAACCGACGTAGATGGCCAGCTCATATATGCAGACTCCTCATTACCAAATCCTAATGGGATCGAATTTGATAACCTATATTTGGATATGAATGGAATCATCCATCCCTGCACACATCCTGAAGATAAGCCGCCTCCCAAGGATGAAGACGAGATGATGGTGGCTATCTTCGAATGTATTGACAGGTTGTTTCGCATCGTAAGGCCGAGAAAACTGTTGTACATGGCTATTGATGGAGTTGCACCTAGAGCTAAAATGAACCAACAAAGGTCTCGGCGTTTTAGAGCGTCTAAAGAGACACAGGAAAAAATAGATGAGATAGCCCGGATCAGAAACGAGCTCCAAGTCAAGGGGGCATACTTACCACCAGAGAGACCCAAAGAGGCACATTTTGACTCCAACTGTATCACTCCAGGAACGCCATTCATGGATCGGTTGAGCAAGTGCCTGCATTACTATATTCATGACAGATTAAACAATGACCCTGGCTGGAAGGGAGTCAAGGTTATACTATCGGATGCAAATGTGCCCGGGGAGGGCGAGCACAAAATAATGGATTACATCAGGAGACAAAGGGCCCAGCCAGATCATGACCCCAACACTCAGCATGTGCTGTGCGGAGCTGATGCGGACCTCATCATGTTAGGTCTGGCGACCCACGAGCCGAACTTCACCATAATCCGTGAGGAGTTCAAACCCAACAAGCCCAGGCCATGTGATGTCTGTGGCCAGCTAGGTCATGAAATGAAGGAGTGTACAGGAACTACACCGGACGCCTCACTGGTGCGATCCGATCCCTCCTTCGGTAACCAGGACAGCTTCATCTTCGTGAGGTTGACGGTACTGCGAGAGTATCTCGAGAAGGAGCTGAGCATGCCGAATCTTCCGTTCAAGTACGACTTCGAGCGCGCCCTGGACGACTGGGTGTTCATGTGCTTCTTCGTGGGGAACGACTTCCTGCCGCACCTGCCCTCGCTGGAGATACGGGAGGGCGCGGTGGACCGGCTCGTCAACTTGTACAAGAAATGCGTCTACAAAACTAGGGGTTGGATCACGGACTCGGGCGACGTGAACCTGGACCGGGTGCAGGTCATCATGGACGAACTGGGGCGCGTCGAGGACGAGATCTTTAGGCGGCGGCATCAGAACGAGCTCAGCTTTAAGGCGAGGGAAAAGAACAAGAAGCAGCAGAAGATCAACTTTGAACTGCTGGAGAAGACGCAGTTCGCACCAGTGAAAGTGGGAGAGGAGTCCAAAACAGTGGAGAACGCTCGCAAGGAGGCGGCCAACATCCGCCTGGCGGGGATGCAGGCTGTGGCGGAGGCGGAAAAGGAGCAGCGCGGCCAGAAGCGGTCGGCGGAGCAGGCGGGGCTGGACGACGACGACGCTCATGATGAAGTGAGGCTCTGGGAGGAAGGCTTCAAAGAGAGATACTACGAGAGCAAGTTCGAGGTGGCCAGGGACAACCTGGAGTTCAGGTACCGCGTGGCGCTGCAGTACGTGCGCGGCCTCTGCTGGGTGCTCCGCTACTACTACCAGGGCTGCGCCAGCTGGAAGTGGTACTTCCCGTATCACTACGCACCGTTCGCCTCCGACTTCGTCAACATCCAGGGCCTGTCCACCAAGTTCGAGAGAGGCACGCAACCGTTCCGTCCCCTGGAGCAGCTGATGGGCGTGTTCCCGGCCGCCAGCTCGCAGCACGTGCCTCGCCCCTGGGCCACGCTCATGTCGGACCCGTTCTCCCCCATCATCGACTTCTACCCCACGGACTTCAAGATAGACCTCAACGGGAAGAAGTTCGCGTGGCAGGGGGTCGCCCTGCTGCCCTTCGTCGACGAGACCAGGCTGTTCAAGGCCCTGGAGCCGTACTACGACGACCTCACACAGGCCGAGAGTCAGTATTGTTGGACTCTACGTGGCGCCGGCCAACAAGAGCTACGAGTTCCTGTCGGCCCTGTACTCGGAGGCCGGGGACGACCAGCACAGGCTCATACACGCGGACCAGAAGTACCCCTTCAGTATCTAGTCGCCGCAGCGGACGGCCCGTGTGCGGTGGACAGGCTTAAGCTAATTATAAAATCCATGGTCCTTAGCCAGCTACCGTCGCCAGTGGTGGGTCTGGAGCCGGTGACGGACAACCGGGTGGTGTGTGTGAGGTACCAGGACCCGCAGTTCCCCGAGGAGTTCGTGTTCCCGGCCAGGAGGCTGCGCGGGGCCGTCGACCCGCCCAGGGTGCTGAAGCCAGGGAACCTCAGCCATCAGGAGAACAGGAACTGGCGTCCACAGATAGGAATGGTCCGCTCGCACACGGTGGCGTCCCTGGAGGTGGCCGGCCACCGCATGCTGGGTCACCAGCTGTCCCGGAACCCGCGCGCCGCCGGGCCGCCGTCGCAACACAGCGGAGGCGGAGGTGACGTCACGCCGCGCGCCGGTCACGTGGCCGGGGACGTGTGTGTAAACAAATTTGACGACTATTAA

Protein sequence:

>DPOGS212864-PA
MGVPAFFRWLSRKYPSVIVECVEQRPTDVDGQLIYADSSLPNPNGIEFDNLYLDMNGIIHPCTHPEDKPPPKDEDEMMVAIFECIDRLFRIVRPRKLLYMAIDGVAPRAKMNQQRSRRFRASKETQEKIDEIARIRNELQVKGAYLPPERPKEAHFDSNCITPGTPFMDRLSKCLHYYIHDRLNNDPGWKGVKVILSDANVPGEGEHKIMDYIRRQRAQPDHDPNTQHVLCGADADLIMLGLATHEPNFTIIREEFKPNKPRPCDVCGQLGHEMKECTGTTPDASLVRSDPSFGNQDSFIFVRLTVLREYLEKELSMPNLPFKYDFERALDDWVFMCFFVGNDFLPHLPSLEIREGAVDRLVNLYKKCVYKTRGWITDSGDVNLDRVQVIMDELGRVEDEIFRRRHQNELSFKAREKNKKQQKINFELLEKTQFAPVKVGEESKTVENARKEAANIRLAGMQAVAEAEKEQRGQKRSAEQAGLDDDDAHDEVRLWEEGFKERYYESKFEVARDNLEFRYRVALQYVRGLCWVLRYYYQGCASWKWYFPYHYAPFASDFVNIQGLSTKFERGTQPFRPLEQLMGVFPAASSQHVPRPWATLMSDPFSPIIDFYPTDFKIDLNGKKFAWQGVALLPFVDETRLFKALEPYYDDLTQAESQYCWTLRGAGQQELRVPVGPVLGGRGRPAQAHTRGPEVPLQYLVAAADGPCAVDRLKLIIKSMVLSQLPSPVVGLEPVTDNRVVCVRYQDPQFPEEFVFPARRLRGAVDPPRVLKPGNLSHQENRNWRPQIGMVRSHTVASLEVAGHRMLGHQLSRNPRAAGPPSQHSGGGGDVTPRAGHVAGDVCVNKFDDY-