Monarch geneset OGS2.0

DPOGS212538
TranscriptDPOGS212538-TA1014 bp
ProteinDPOGS212538-PA337 aa
Genomic positionDPSCF300315 - 60912-62826
RNAseq coverage977x (Rank: top 13%)
Annotation
Heliconius% 
BombyxBGIBMGA008192-TA1e-14774.48% 
DrosophilaCG2091-PA3e-8449.66% 
EBI UniRef50UniRef50_Q16XY11e-8445.45%Histidine triad (Hit) protein member n=5 Tax=Diptera RepID=Q16XY1_AEDAE
NCBI RefSeqXP_969966.19e-9154.90%PREDICTED: similar to histidine triad protein member [Tribolium castaneum]
NCBI nr blastpgi|910792142e-8954.90%PREDICTED: similar to histidine triad protein member [Tribolium castaneum]
NCBI nr blastxgi|910792141e-8954.90%PREDICTED: similar to histidine triad protein member [Tribolium castaneum]
Group
Gene OntologyGO:00167873e-129hydrolase activity
GO:00002903e-129deadenylation-dependent decapping of nuclear-transcribed mRNA
GO:00038243.5e-62catalytic activity
KEGG pathwaytca:6584903e-90 
 K12584 (DCPS, DCS)maps-> RNA degradation
InterPro domain[43-317] IPR0085943e-129Scavenger mRNA decapping enzyme
[135-326] IPR0111463.5e-62Histidine triad-like motif
[32-135] IPR0111451.4e-26Scavenger mRNA decapping enzyme, N-terminal
Orthology groupMCL15093 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212538-TA
ATGTCCGACGGTAAATGCCAAAGCGAACTTATTGAGCCGCCTTGCGCAAAGAAAATTAAAAAAGACGAACAAAATGATAACAGTGTAACAGAAACTGATTTGAAATTGAAAGACTTTATTCCAAGCAAGATTTTAAATAATAATACAAATAGAAAATCTGTTTGTGTGCTTGGAAATTTTAGAAACAAAAGTGGTGTGGCGTTAATAATACTCGAGAAAAATGCTTTCAAGGAAGACCACTTAGACAGTAAGGGTTACTTTTCCGAAGATTGTGAGCTTGCGACATTCTTTCAAAACGATATATACGGAAATTTTGAGTGTTTCCCGAAGCCTGAAATTAACGGTGTTAAGACGACAATTATTTACCCGGCCAGTGACAAACACATAGCAAAATTCAGCAAACAACAGGTCCACATTATATTGGAAACTCCGGAATGTTATAATAAATTAACATTACCACATATTGAAAAGGAACAATTTAGATTACAGTGGGTATACAACATATTGGAAGGAAAAAGCGAGCAAGACAGAATAATACACAACAATAAATGTGAGAAGGAGGGTTTTGTTTTGGTTCCCGATCTTAAGTGGGACGGTATCACTAAGGAGACACTATATTTGCTAGCTATTGTGAGACAGAGAAATATTAAATCACTGAGAGATCTGAATGAAAATCATTTACCGTTGCTGAAGAGGATCAGGGACGAGGGGAAGAAAGCAATTTTCGATAAATACAAAGTTATCGGCAGTCAATTAAGGATCTATCTACACTACCAACCCTCATTTTACCATCTACACATACATTTCACTTACCTCCGTCACGAAGCGCCCGGGATATATGCTGAGAAGTCACATTTACTCGACACTGTTATCGATAATATTGAAATAATGGGTGATTATTATCAAAAAGCTACTTTACCGTTCTGTAAAGGTGAAATTGATTCACTATTTAATGTATATGAAACAAATGGTTACGTTACTAAGATTCAAACGGACGAACTTATTGACAAATAG

Protein sequence:

>DPOGS212538-PA
MSDGKCQSELIEPPCAKKIKKDEQNDNSVTETDLKLKDFIPSKILNNNTNRKSVCVLGNFRNKSGVALIILEKNAFKEDHLDSKGYFSEDCELATFFQNDIYGNFECFPKPEINGVKTTIIYPASDKHIAKFSKQQVHIILETPECYNKLTLPHIEKEQFRLQWVYNILEGKSEQDRIIHNNKCEKEGFVLVPDLKWDGITKETLYLLAIVRQRNIKSLRDLNENHLPLLKRIRDEGKKAIFDKYKVIGSQLRIYLHYQPSFYHLHIHFTYLRHEAPGIYAEKSHLLDTVIDNIEIMGDYYQKATLPFCKGEIDSLFNVYETNGYVTKIQTDELIDK-