Monarch geneset OGS2.0

DPOGS207791
Transcript: DPOGS207791-TA (1002 bp)
Protein: DPOGS207791-PA (333 aa)
Genomic position: DPSCF300042 + 199007-200957
RNAseq coverage: 66x (Rank: top 67%)
Annotation
Heliconius      HMEL017572        1e-115  72.00%
Bombyx          BGIBMGA005485-TA  2e-85   64.10%
Drosophila      CG4098-PA         1e-77   53.70%
EBI UniRef50    UniRef50_F4WIM3   3e-84   57.36%  ADP-ribose pyrophosphatase, mitochondrial n=2 Tax=Endopterygota RepID=F4WIM3_ACREC
NCBI RefSeq     XP_001603303.1    5e-85   59.00%  PREDICTED: similar to SJCHGC05997 protein [Nasonia vitripennis]
NCBI nr blastp  gi|383861683      3e-85   59.46%  PREDICTED: ADP-ribose pyrophosphatase, mitochondrial-like [Megachile rotundata]
NCBI nr blastx  gi|383861683      8e-87   59.46%  PREDICTED: ADP-ribose pyrophosphatase, mitochondrial-like [Megachile rotundata]
Group
Gene Ontology: GO:0016787  1.3e-42  hydrolase activity
KEGG pathway: nvi:100119547  2e-84
  K13988 (NUDT9) maps-> Purine metabolism
InterPro domain: [1-249]    IPR015797  1.3e-42  NUDIX hydrolase domain-like
                 [109-248]  IPR000086  3.1e-13  NUDIX hydrolase domain
Orthology group: MCL15006  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207791-TA
ATGTCTGTTCATATCAAGTGTCGCAGTCGATTTTACCCTCGTTCTAGTATTGAAAGATTCATTGTGCCAGATAAGAAAGTTCCCTGGTCTGTGGAATTTAAAGAATATTGTCCAAAAACTTATAATGCACCTTCAATTCATGGCAAACCCTGGGCAGATCCAGATATTAGAAATCCTAATTTCACACCAAAATGGAATGATATAGATGGACAGGTGAATAGGAAAAGCTACACAGGAATTTATAAAATTTCTGATGGAATGCCACTGAACCCTTTTGGTAGGACTGGTATATCAGGAAGAGGAGTTTTAGGCCGATGGGGACCAAACCATGCAGCAGATCCAGTTGTTACTAGATGGAAGGATTCAAATCATTCCATCCTACAATTTGTGGCAATAAAACGTGGGGATACAGGAGAGTGGGCTCTTCCTGGTGGCATGGTGGATCCCGGGGAAAAATTTGCAACAACAGCTATAAGGGAATTTCAAGAAGAAGCTATGAATTCCCTTGAAGCATCACAAGATGAGAAAAACAAATGGGTGGAAAAATTCAAAGATTTCTTCAGTAGTGGCATTGAAATTTACAGCGGATATGTAGATGATCCTCGTAATACTGACAATGCTTGGATGGAAACAACAGCCTACAATTACCACGATGAAACTGGCACAACAGTCGGGGCTTTAAACTTGAAAGCTGGCGATGACGCTGTCGGCGTCCAATGGGTGGATATTACGCCTATTTTAAATCTTAGTGGCATTGAAATATACAGCGGCTATGTAGATGATCCTCGTAATACCGACAATGCTTGGATGGAAACAACAGCCTACAATTACCATGATGAAACTGGCACAACAGTCGGGGCTTTAAACTTGAAAGCTGGCGATGACGCTGTCGGCGTCCAATGGGTGGATATTACGCCTACTTTAAATCTCTATGCCAGTCACAAAGACATAGTTAATAAAGTGTACAAAACCATAGTGCCTGATTCAAGAGAAAACAAATAA

Protein sequence:

>DPOGS207791-PA
MSVHIKCRSRFYPRSSIERFIVPDKKVPWSVEFKEYCPKTYNAPSIHGKPWADPDIRNPNFTPKWNDIDGQVNRKSYTGIYKISDGMPLNPFGRTGISGRGVLGRWGPNHAADPVVTRWKDSNHSILQFVAIKRGDTGEWALPGGMVDPGEKFATTAIREFQEEAMNSLEASQDEKNKWVEKFKDFFSSGIEIYSGYVDDPRNTDNAWMETTAYNYHDETGTTVGALNLKAGDDAVGVQWVDITPILNLSGIEIYSGYVDDPRNTDNAWMETTAYNYHDETGTTVGALNLKAGDDAVGVQWVDITPTLNLYASHKDIVNKVYKTIVPDSRENK-
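
The transcript and protein lengths listed in the record (1002 bp and 333 aa) fit a coding sequence of 333 codons plus one stop codon, since 3 x (333 + 1) = 1002. The following is a minimal Python sketch for cross-checking a record of this kind. It assumes the standard genetic code and assumes that the trailing "-" in the protein sequence marks the stop codon. The function names `translate` and `lengths_consistent` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol (assumed here to be "-",
    matching the trailing character of the protein sequence above).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def lengths_consistent(cds, protein, stop_symbol="-"):
    """Check that the CDS length equals 3 x (residues + one stop codon)."""
    n_residues = len(protein.rstrip(stop_symbol))
    return len(cds) == 3 * (n_residues + 1)


# Toy example: the first six codons of DPOGS207791-TA plus a TAA stop codon.
print(translate("ATGTCTGTTCATATCAAGTAA"))  # MSVHIK-
```

To check the full record, pass the complete nucleotide and protein strings from the two FASTA entries above to `translate` and `lengths_consistent`.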