Monarch geneset OGS2.0

DPOGS215091
TranscriptDPOGS215091-TA1113 bp
ProteinDPOGS215091-PA370 aa
Genomic positionDPSCF300187 + 249055-250980
RNAseq coverage413x (Rank: top 29%)
Annotation
HeliconiusHMEL0105391e-10780.77% 
BombyxBGIBMGA007193-TA3e-17375.38% 
DrosophilaDcp2-PE6e-8851.71% 
EBI UniRef50UniRef50_E2AD157e-10764.81%mRNA-decapping enzyme 2 n=11 Tax=Neoptera RepID=E2AD15_CAMFO
NCBI RefSeqXP_968428.12e-10653.28%PREDICTED: similar to Decapping protein 2 CG6169-PA [Tribolium castaneum]
NCBI nr blastpgi|3407171042e-11159.01%PREDICTED: mRNA-decapping enzyme 2-like isoform 1 [Bombus terrestris]
NCBI nr blastxgi|3838559901e-10959.20%PREDICTED: mRNA-decapping enzyme 2-like [Megachile rotundata]
Group
Gene OntologyGO:00037235.9e-30RNA binding
GO:00167875.9e-30hydrolase activity
GO:00301455.9e-30manganese ion binding
KEGG pathwaytca:6568346e-106 
 K12613 (DCP2)maps-> RNA degradation
InterPro domain[18-102] IPR0077225.9e-30mRNA decapping protein 2, Box A
[97-216] IPR0000861.1e-29NUDIX hydrolase domain
[104-268] IPR0157973.6e-26NUDIX hydrolase domain-like
Orthology groupMCL11925 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215091-TA
ATGGCTTCATCAGAGGGAAATATTCGCCAAAACAAACATTCAATTCCAATTGATATTTTAGATGATCTTTGCAGTCGTTTTATAATAAATCTGCCGCCTGAAGACAAAGCGAATTTGGTTCGAATCTGTTTCCAAATAGAATTGGCACATTGGTTTTATCTGGATTACTATTGCACCGACGAAAGCACAAGACTCAACCCTTGTGGCATCAGGGAATTTGCAGCACATATATTTCAGCATGTCCCTACACTTCGTGAGCATATTAGGAATCTTGATGAGGTTTTGGATAACTGGAGGGAGTACAAGCAAACTGTTCCAACATATGGTGCCATCCTCCTCGACACTGACTTGACACATGTATTGTTAGTGCAGTCATATTGGGCGAAGGCGTCATGGGGTTTCCCTAAAGGCAAGGTCAATGAAGATGAGGAACCATGGAAATGTGCTAGTAGAGAGGTGCTAGAAGAAACTGGTTTTGATATAAGCAATCTTATTAATAAACAAGATTATATTGAAGCTACAATTCACGATCAAATCGCTAGATTGTATATTATAGGGAATATATCACGAGATACAAAGTTTCAACCGCGAACAAGGAATGAGATTAAAGCCTGTGAATGGTTTCCAATAGCGGATCTACCAGCGAACAGGAAGGATATGACGCCCAAAGTTAAAATGGGGGTCAGTCCGAACGCATTTTTTATGGTGCTACCATTCGTTAAACGTATACGACGCTGGGTGGCCGAAAGGCATCAAAAAGTCTTCAGGCGCACCCGCCATAAGTCAATGGGTGATATAGATATGACTCAAAATAAAAATAAAACTATCTCACAAGGCCTGCAAAACGAAATTCAGGAGTACCAGCAGAATACCAACCAGAAAGTATACAAGAATCACAGCGTCACCAAAACGAACGGCAACAGAAAGGCGGGAAAAATGTCCAAACGGCAACTATTCACGCCGCAGAACGTTCAAACGTTCAATTTCTCGCCTGTGCACAAAGATAATAGTCCGATAGATGATGTAGAAGATAACTCGTATTATAACTTCGTGGCGCCATCTTGGGCTAATTTTAAATTTGACAGACGCGCCATCTTAGACTGTTTGACTTGA

Protein sequence:

>DPOGS215091-PA
MASSEGNIRQNKHSIPIDILDDLCSRFIINLPPEDKANLVRICFQIELAHWFYLDYYCTDESTRLNPCGIREFAAHIFQHVPTLREHIRNLDEVLDNWREYKQTVPTYGAILLDTDLTHVLLVQSYWAKASWGFPKGKVNEDEEPWKCASREVLEETGFDISNLINKQDYIEATIHDQIARLYIIGNISRDTKFQPRTRNEIKACEWFPIADLPANRKDMTPKVKMGVSPNAFFMVLPFVKRIRRWVAERHQKVFRRTRHKSMGDIDMTQNKNKTISQGLQNEIQEYQQNTNQKVYKNHSVTKTNGNRKAGKMSKRQLFTPQNVQTFNFSPVHKDNSPIDDVEDNSYYNFVAPSWANFKFDRRAILDCLT-