Monarch geneset OGS2.0

DPOGS203154
Transcript: DPOGS203154-TA (1002 bp)
Protein: DPOGS203154-PA (333 aa)
Genomic position: DPSCF300035 - 971567-972568
RNAseq coverage: 314x (Rank: top 36%)
Annotation
Heliconius: HMEL006503; 9e-176; 87.99%
Bombyx: BGIBMGA011509-TA; 7e-162; 79.88%
Drosophila: CG10898-PA; 7e-108; 54.95%
EBI UniRef50: UniRef50_Q7QC17; 8e-114; 57.86%; AGAP002403-PA n=8 Tax=Pancrustacea RepID=Q7QC17_ANOGA
NCBI RefSeq: XP_966613.1; 1e-120; 61.01%; PREDICTED: similar to 7,8-dihydro-8-oxoguanine-triphosphatase, putative isoform 1 [Tribolium castaneum]
NCBI nr blastp: gi|91094449; 2e-119; 61.01%; PREDICTED: similar to 7,8-dihydro-8-oxoguanine-triphosphatase, putative isoform 1 [Tribolium castaneum]
NCBI nr blastx: gi|91094449; 5e-117; 61.01%; PREDICTED: similar to 7,8-dihydro-8-oxoguanine-triphosphatase, putative isoform 1 [Tribolium castaneum]
Group
Gene Ontology: GO:0016787; 7.7e-24; hydrolase activity
KEGG pathway:
InterPro domain: [49-161] IPR015797; 7.7e-24; NUDIX hydrolase domain-like
InterPro domain: [60-183] IPR000086; 1.1e-22; NUDIX hydrolase domain
Orthology group: MCL14039 (Single-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203154-TA
ATGTCGCGCCAAGTTGACTCAAATATAAGCAAGCTTCTTGATGGTAGCGGCTTAAATGGGGAAGAAAATGACTTTTGTGACTTCACAATAGCAGATCAAAATTCTGTTGCTGAATCTCAGGGTATTACCCCAACTACTCCATCCAATTTTAAACCCATACTTGGAAGTAATGTCACGTACGTTGTCGCTTCAGTAATATTAAATGAAAAAAATGAATTGCTTATGATGCAAGAAGCAAAAGAAAGTTGTGCTGGAAAATGGTACTTACCTGCTGGACGAATGGAAAAAGGTGAAACAATAATTCAGGCTGCAACCAGAGAGGTACTCGAGGAAACAGGGTTGCATTGTAAGCTGGATACTTTACTGATGGTGGAAACAGCTGGTGGTACCTGGTTTAGATTTGTTTTAACAGGAAATATTGTTGGAGGGGACCTTAAAACACCCGCTAATGCTGACAAAGAATCCTTACAAGCCAAATGGATAGCCAATCTACAAGAAATATCACTGAGATCAAATGATATCTTACACCTCATTGAGAAAGCTAAAATGTACAAACAGAAACCACCTGGTGTAAATTGGCACCAGCCTATTTTGCCAGCACCCATCCCACATATTAAAGATCTGTTGAGGCTTATTGTATTAATAAAAAAAAGAAACACCAACAGGTTGCATGTGCTTCTGAGTGAGAAAACAACATTACATTTTCCGACATGTGAAATAAATCCTGCAAAAAGCGTACATTCAACGCTGAGAAGGTTCATGGTTGAAATGTTTGGTGCTGATGTCGCTCAACACAGACCATTAGGGCTTCTGAATGTAGAGGCCGATCCCAGCGCTGACGGATGTTGCCTAACATTACTAGTAGCATTTCGGCCACCGCTTGAAGAAGTACCTTTAATCGGAAAATGTGCCTGGCACGAATTGTCACAAGATGTGGAAAAGCAGCTTATTCCCATAGTCACATCAAAGAATTCTACAATTGAGTTGCATGTGGTACGTTGA

Protein sequence:

>DPOGS203154-PA
MSRQVDSNISKLLDGSGLNGEENDFCDFTIADQNSVAESQGITPTTPSNFKPILGSNVTYVVASVILNEKNELLMMQEAKESCAGKWYLPAGRMEKGETIIQAATREVLEETGLHCKLDTLLMVETAGGTWFRFVLTGNIVGGDLKTPANADKESLQAKWIANLQEISLRSNDILHLIEKAKMYKQKPPGVNWHQPILPAPIPHIKDLLRLIVLIKKRNTNRLHVLLSEKTTLHFPTCEINPAKSVHSTLRRFMVEMFGADVAQHRPLGLLNVEADPSADGCCLTLLVAFRPPLEEVPLIGKCAWHELSQDVEKQLIPIVTSKNSTIELHVVR-
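
The lengths and coordinates in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the genomic coordinates are 1-based and inclusive and that the final codon (TGA, shown as "-" at the end of the protein sequence) is a stop codon not counted in the 333 aa.

```python
# Values copied from the record; variable names are illustrative.
transcript_bp = 1002
protein_aa = 333
start, end = 971567, 972568  # assumption: 1-based, inclusive coordinates

# Length of the genomic interval covered by the gene model.
genomic_span = end - start + 1

# Number of codons in the transcript, including the terminal stop codon.
codons = transcript_bp // 3

print("genomic span (bp):", genomic_span)
print("transcript length divisible by 3:", transcript_bp % 3 == 0)
print("codons minus stop equals protein length:", codons - 1 == protein_aa)
print("genomic span equals transcript length:", genomic_span == transcript_bp)
```

Under these assumptions the genomic span equals the transcript length, which would be consistent with a gene model without introns, although the record itself does not state the exon structure.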