Monarch geneset OGS2.0

DPOGS212149
Transcript: DPOGS212149-TA (1590 bp)
Protein: DPOGS212149-PA (529 aa)
Genomic position: DPSCF300038 + 509475-521219
RNAseq coverage: 93x (Rank: top 62%)
Annotation
Heliconius: HMEL012530; 2e-89; 57.97%
Bombyx: BGIBMGA006602-TA; 0.0; 56.78%
Drosophila: (no entry)
EBI UniRef50: UniRef50_F4WZ85; 3e-90; 32.51%; Poly(A)-specific ribonuclease PARN-like domain-containing protein 1 n=1 Tax=Acromyrmex echinatior RepID=F4WZ85_ACREC
NCBI RefSeq: XP_971302.1; 2e-66; 28.76%; PREDICTED: similar to Poly(A)-specific ribonuclease PARN-like domain-containing protein 1 [Tribolium castaneum]
NCBI nr blastp: gi|332020023; 1e-89; 32.51%; Poly(A)-specific ribonuclease PARN-like domain-containing protein 1 [Acromyrmex echinatior]
NCBI nr blastx: gi|332020023; 1e-88; 32.51%; Poly(A)-specific ribonuclease PARN-like domain-containing protein 1 [Acromyrmex echinatior]
Group: (no entry)
Gene Ontology: GO:0003676; 4e-48; nucleic acid binding
    GO:0005634; 1.4e-47; nucleus
KEGG pathway: ame:408626; 4e-42
    K01148 (PARN) maps-> RNA degradation
InterPro domain: [1-405] IPR012337; 4e-48; Ribonuclease H-like
    [1-374] IPR006941; 1.4e-47; Ribonuclease CAF1
Orthology group: MCL17368 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212149-TA
ATGGATATAACGAACAAAAACTTTAATCAAGAATTTGATAGCATAATCAAATGTTTGAAAAAGTCTTGTTTCATTGGCTTTGACGCAGAATTTACAGCTATATTATCGGGAGAATGTTTTAAATATAGATTGTTCGATACTAATAAAGAAAGATATGACCGAATGAAGGATGAAGTCTGTAAGATGATAATGAATCAAGTTGGTCTAACAATGTTTCAGTATGATCGGGATAATGACAGATATATCTCCCATACCTACACATTCCACCTCTGTCCCCAAGTATTTGGTGACATCAATCAGTCCTTTACATTCCAAGCATCAGCCATTAGATTCCTTTGTCTGCACAATTTTGATTATAACAAGTTTACATATGATGGTCTCCCCTATATGAGTCGATCCGAAGAAGCTCATATAAAGAAACAGCTGAAGGAAAAAACACTTCTTAATAATCTGATACAAGCAATGGAAATGAATGATGAGAGGAAACTTCAAAGCTACTGTTCCGAAGTCGCTAAATGGCTCGTAAATAGTACTGATGACACAATATACCTGGATATTGACAATCCCATAATGAGATACATAGTGCACTGTGAATTGAGGAATCGCTTCCCAGAGGTCCTGACAACTGATAGCTTAGGTAATAGTCATAAGGTCCTGATATATAGAAATGGTGACGTAGATGGCGCCACCAGCGCTTCTATGTCAACACTAGAAGGAAACCTAATAACTTACGTCCTAGGTTTCTCACAAATCATAGAGCTTCTAGCGGAATACAAGAAACCAATCATCGGCCACAACATGTACCTAGACACATTATTACTACACAGCCAATTCATAGGTCCGTTGCCTAAAAGATATTCTGCATTTAAGAAGAATATTAACAACCTATTTCCTAATATGTATGACACCAAGTACATCTCACACGAAATGGGCAGGAAGCTGACACTCGATGAAGTATGGAAGTCTAATGCACTGCAAGATTTATATGAATTCTTCTATGAGGGTAAATGTAAGAAGTTGCAGAAGGGTTTAAGTCAAATAAAACTGAATACTCCATTTGATGTGAAACAATCATATCACGAGGCCGGTTGGGATTCATATTGCTCAGGTTACTGCTTCATCCGCCTGGGCGAGTGGTCGTCCACAGATCACCGGGGTACAGCTCGCCCAGCATCCCCCAATGAAATAATCTCATCACTCTCATCATACTGCAACAAGGTCAACGTCATACGATCCGCTGTGGCTTATATGAATCTCATTGACGAGGACCCACCAAGCCATAGACCAGAGCTACTTCACATTAGATCTTTGAGGGAACCAGCAATACGTATGAGCCAGGTGTCGTCTCTACTGTCTGGTTTCGGTGCTTTGGATATAAAGCCTTACGGAGCAAGAACCGCTTTGATAGCGGCTGGAACACATCACACTGCCAACAAATTATTACGACACTTCAATCACCACGAGGACTATAGGATCACTCCGTTCAAGCCCTTCCGACACTCACCAACGGGTCGCCTCGCAATTTGGGGAGGTGCATTAATCACGGGTACCATTGTCATATACTTCATACATAAAAAAGTCAACAAGTAA

Protein sequence:

>DPOGS212149-PA
MDITNKNFNQEFDSIIKCLKKSCFIGFDAEFTAILSGECFKYRLFDTNKERYDRMKDEVCKMIMNQVGLTMFQYDRDNDRYISHTYTFHLCPQVFGDINQSFTFQASAIRFLCLHNFDYNKFTYDGLPYMSRSEEAHIKKQLKEKTLLNNLIQAMEMNDERKLQSYCSEVAKWLVNSTDDTIYLDIDNPIMRYIVHCELRNRFPEVLTTDSLGNSHKVLIYRNGDVDGATSASMSTLEGNLITYVLGFSQIIELLAEYKKPIIGHNMYLDTLLLHSQFIGPLPKRYSAFKKNINNLFPNMYDTKYISHEMGRKLTLDEVWKSNALQDLYEFFYEGKCKKLQKGLSQIKLNTPFDVKQSYHEAGWDSYCSGYCFIRLGEWSSTDHRGTARPASPNEIISSLSSYCNKVNVIRSAVAYMNLIDEDPPSHRPELLHIRSLREPAIRMSQVSSLLSGFGALDIKPYGARTALIAAGTHHTANKLLRHFNHHEDYRITPFKPFRHSPTGRLAIWGGALITGTIVIYFIHKKVNK-
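
The transcript and protein entries above can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record, and it assumes the standard nuclear genetic code applies to this gene. The function names `translate` and `lengths_consistent` are hypothetical helpers introduced here for illustration. The sketch translates a coding sequence codon by codon and checks that the reported lengths agree: 1590 bp is 530 codons, which is 529 amino acids plus one stop codon (shown as the trailing `-` in the protein record).

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# the standard nuclear code applies to this gene).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence; stop codons are returned as '*'.

    Raises ValueError if the length is not a multiple of 3, and
    KeyError if a codon contains a character other than A, C, G, T.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def lengths_consistent(nt_len: int, aa_len: int) -> bool:
    """True if nt_len covers aa_len codons plus one stop codon."""
    return nt_len == 3 * (aa_len + 1)


# First 30 nt and last 12 nt of DPOGS212149-TA, copied from the record.
print(translate("ATGGATATAACGAACAAAAACTTTAATCAA"))  # start of the protein
print(translate("GTCAACAAGTAA"))                    # end, including the stop
print(lengths_consistent(1590, 529))
```

The two translated fragments correspond to the first ten residues (MDITNKNFNQ) and the last three residues plus stop (VNK) of DPOGS212149-PA. Passing the full nucleotide sequence to `translate` would allow a residue-by-residue comparison with the protein record.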