Monarch geneset OGS2.0

DPOGS204020
TranscriptDPOGS204020-TA2250 bp
ProteinDPOGS204020-PA749 aa
Genomic positionDPSCF300138 - 163349-170649
RNAseq coverage224x (Rank: top 44%)
Annotation
HeliconiusHMEL0106440.081.36% 
BombyxBGIBMGA004873-TA0.073.23% 
DrosophilaCG11337-PD0.058.72% 
EBI UniRef50UniRef50_A8JRH90.059.58%CG11337, isoform C n=16 Tax=Coelomata RepID=A8JRH9_DROME
NCBI RefSeqXP_974155.10.063.87%PREDICTED: similar to CG11337 CG11337-PA [Tribolium castaneum]
NCBI nr blastpgi|910857630.063.87%PREDICTED: similar to CG11337 CG11337-PA [Tribolium castaneum]
NCBI nr blastxgi|910857630.063.87%PREDICTED: similar to CG11337 CG11337-PA [Tribolium castaneum]
Group
Gene OntologyGO:00037233.6e-294RNA binding
GO:00046543.6e-294polyribonucleotide nucleotidyltransferase activity
GO:00064023.6e-294mRNA catabolic process
GO:00063961.3e-23RNA processing
GO:00001751.6e-193'-5'-exoribonuclease activity
KEGG pathwaytca:6629950.0 
 K00962 (pnp, PNPT1)maps-> Purine metabolism
    RNA degradation
    Pyrimidine metabolism
InterPro domain[21-748] IPR0121620Polyribonucleotide nucleotidyltransferase
[15-169] IPR0205683.7e-42Ribosomal protein S5 domain 2-type fold
[170-264] IPR0158471.3e-23Exoribonuclease, phosphorolytic domain 2
[252-353] IPR0158481.6e-19Polynucleotide phosphorylase, phosphorolytic RNA-binding, bacterial/organelle-type
[350-485] IPR0012472.4e-18Exoribonuclease, phosphorolytic domain 1
[654-739] IPR0160276.2e-10Nucleic acid-binding, OB-fold-like
[660-733] IPR0123403.2e-08Nucleic acid-binding, OB-fold
Orthology groupMCL13498 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204020-TA
ATGTTAAGAAAGAATCGCATATTTTTATCGAAAATGAGATGTAGGAGTCGATATTATTATAGACTTCTATCTTCTCAAGCTCCTATTGGTGAAGTTGATATACCTTTTTCTAACGGGATTTCATTAAAGTTATCTACTGGTAAATACGCAAGATTTGCGGATGGTGCTTGTGTAGCAACAATAGGGAACACGAGTGTACTATCAACAGTTGTTTCTAAAGCGAAACAAAGCGCTTCCAATTTTCTACCTTTAGTTGTAGACTACAGACAAAAAGCTGCTGCCGCCGGAAGAATACCAACCAACTTTTTAAGAAAAGAATTGGGACCAACGGAAAGAGAAATCCTTACATCACGTCTAATCGACAGATCACTACGACCATTGTTTCCATCGAATTATAATTTCGACACGCAAATAGTATGTAATATGTTAGCAGTGGACGGGACGAACCCTCCGGAGACGGTCGCCATTAATGCTGCTAGCGCTGCACTTGCTTTATCCGACGTTCCTTGGAACGGACCTGTAGGGGCTGTCAGATTGGGCCTAATTGACAATGAGCTGATTATAAATCCAACGAGAAGAGATCTTGAGAGGTCAATCCTTAACCTAGTTGTAGCGGCCACTGCTGGTAATTTGGTGGTCATGATGGAGGGGAGCGCCAAGGTCATTCTCCAACAAGACCTCTTAAAGGCTATCAAACTAGGTGCTAAGGAAGCTCAGAATGTTGTACGTGGTATAGAAAAATTACAGAAGAGTCACGGGAAAATGAAAAGGGAATATGAAATGCCAGCCGTCCTCGACCCAAGTGTTGTGGACTCCATCAAGACATTGTCTTCTATGAAGATCAGAGAAATACTCAGTGATTACAGTCATGACAAGACCAGTCGCGACCTGGCCATATCGGATCTGCGGCAGACGGTCCTCAACCAGCTGAGAGACACGGACGTCGACGTGACCCTACTACAGGACGGCTTTAACAACCACCTCAAGGAAATCTTCAGGGATATGATATTCGAGAACGACGTGAGATGCGACGGTAGGGGACTGGACGAGCTCAGGAAGATATCCTGTGAGGTAGGTTTGTACGAGCCTCTCCACGGCAGTGCTCTGTTCCAACGAGGTCAGACCCAAGTCCTGTGCACCGTGGCCTTCGACTCGCCGGAGAGCGCGCTCAAAATGGACCCTCTCACTATGATCACAAGCGGTGTAAAGGAGAAGAACTTCTTCCTCCACTACGAGTTCCCGTCCTACGCAACGGGCGAGGTTGGGCGCGTGAGCGGGGGCGGGGGCGGGCGGCGGGAGGCCGGACACGCTGCGCTCGCTGAGAGGGGGCTCCTGCCGGTCGTGCCACAACACCAGTGTACCGTGAGACTCACCGCTGAAGTGCTCGAGAGTAACGGTTCAAGTTCCATGGCGTCCGTGTGCGGGGGTTCCCTGGCGCTCCTGGACGCGGGCCTGGCGCTCTCCGGGGCCGCCTCGGGCGTCGCCGTGGGCCTGGTCACACGATACAAGGACGGGAAGATAGAGGACTACAGGATACTCACAGATTTGTTAGGTATAGAGGACTACATGGGTGACATGGACTTCAAGATAGCCGGCACCAAGAAGGGTGTGACGGCTCTCCAGGCGGATGTTAAGATCCCTGGCCTTCCGCTGAAGGTTGTCATGGAGGCGGTGCAGAGAGCCAGCGACGCTAAGGCCAAGATCATTGATATTATGAACGCTTGTATAGACAAGCCGAGAGACGGTCGCAAAGAGAACATGCCGGTCATAGAGGAGATGGAGGTGGAGGTACACAAGAGAGCGAAGCTGCTGGGAGTGGGCGGAGCCAACGTTAAGAGACTGTATCTAGAGACGGGGGTTCAGATAACTCCGATCGACGAAACGCATTACCGAGTGTTCGCCCCTTCCCCGGCCGCGCTGGAGGACGCTCGCTCCAGACTCGCCGCCATACTGAACGCCACCAGGACACCGGAAATGGAGTTCGGGGCGATATATACGGCCAAGGTGGTTGAAGTGAAGGATATCGGGGTGTTGGTGACGCTGTACCCGGACATGTCCCCGGCGCTGGTCCACAACACCCAGCTAGACCACCGGAAGATAATGCACCCATCAGCGCTCGGCCTGACCGTGGGCTCGGAGATACAAGTCAAATACTTTGGTCGTGATCCCGTTTCCGGTCAAATGAGGCTCTCGAGGAAAGTTCTGACGTCACCTCCGCCGGGCATAGTCAGGAACCACGACAAGAGCTAG

Protein sequence:

>DPOGS204020-PA
MLRKNRIFLSKMRCRSRYYYRLLSSQAPIGEVDIPFSNGISLKLSTGKYARFADGACVATIGNTSVLSTVVSKAKQSASNFLPLVVDYRQKAAAAGRIPTNFLRKELGPTEREILTSRLIDRSLRPLFPSNYNFDTQIVCNMLAVDGTNPPETVAINAASAALALSDVPWNGPVGAVRLGLIDNELIINPTRRDLERSILNLVVAATAGNLVVMMEGSAKVILQQDLLKAIKLGAKEAQNVVRGIEKLQKSHGKMKREYEMPAVLDPSVVDSIKTLSSMKIREILSDYSHDKTSRDLAISDLRQTVLNQLRDTDVDVTLLQDGFNNHLKEIFRDMIFENDVRCDGRGLDELRKISCEVGLYEPLHGSALFQRGQTQVLCTVAFDSPESALKMDPLTMITSGVKEKNFFLHYEFPSYATGEVGRVSGGGGGRREAGHAALAERGLLPVVPQHQCTVRLTAEVLESNGSSSMASVCGGSLALLDAGLALSGAASGVAVGLVTRYKDGKIEDYRILTDLLGIEDYMGDMDFKIAGTKKGVTALQADVKIPGLPLKVVMEAVQRASDAKAKIIDIMNACIDKPRDGRKENMPVIEEMEVEVHKRAKLLGVGGANVKRLYLETGVQITPIDETHYRVFAPSPAALEDARSRLAAILNATRTPEMEFGAIYTAKVVEVKDIGVLVTLYPDMSPALVHNTQLDHRKIMHPSALGLTVGSEIQVKYFGRDPVSGQMRLSRKVLTSPPPGIVRNHDKS-