Monarch geneset OGS2.0

DPOGS204354
TranscriptDPOGS204354-TA1917 bp
ProteinDPOGS204354-PA638 aa
Genomic positionDPSCF300040 - 951124-954542
RNAseq coverage159x (Rank: top 52%)
Annotation
HeliconiusHMEL0118180.057.06% 
BombyxBGIBMGA005885-TA1e-15145.10% 
DrosophilaParg-PC8e-12847.07% 
EBI UniRef50UniRef50_B0WST71e-12843.08%Poly(Adp-ribose) glycohydrolase n=2 Tax=Culicinae RepID=B0WST7_CULQU
NCBI RefSeqXP_001853435.13e-12943.08%poly(adp-ribose) glycohydrolase [Culex quinquefasciatus]
NCBI nr blastpgi|1700486185e-12843.08%poly(adp-ribose) glycohydrolase [Culex quinquefasciatus]
NCBI nr blastxgi|1571190288e-12846.89%poly(adp-ribose) glycohydrolase [Aedes aegypti]
Group
Gene OntologyGO:00046498.3e-144poly(ADP-ribose) glycohydrolase activity
GO:00059758.3e-144carbohydrate metabolic process
KEGG pathway 
InterPro domain[48-493] IPR0077248.3e-144Poly(ADP-ribose) glycohydrolase
Orthology groupMCL11552 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204354-TA
ATGTGTAGTAGTTGGAAGGGTGTACCTATTTCCTACATTGTTGGCTCGCAATCGCCGTGGGGTGCACCGGAGTTTCCCTTAGTACAGCCGTCGTATAACCACACAGTGTTGTACCACATACCAGATGATGCTCAACTAGACAGACCTCCAAAACCACAGATTGGTCATGAAAAATGGGATCAGGAACATGTGAGGTTGCCATTCTCTACACAAAGCCTGTATCCTGTTGAAAACAGTGCAGGTGAAACTAAACTTAAAAACCGATGGGACATGGTTCAGAATGCTTTAAACAGGCCAATACGTAACAGTAAGGAGCTCGCTAAAGCTATATTGAGTTATAACACTCAGTTCAAAAATAGATGGAAATTTACGGCTTTACATTACTTGTTTGAGGAGTATTTAGAAGAAGAGGAGTCTCAGTACTTTTTTGATGTCACATTACAAGAAATTGCTAAGCTCGCTTTGTCAATAACAAAATTAATACAAGCTCCAATACCTTTGTTGAAACAAAACAAGAACCGATCTATATCTTTGTCGCAGCAGCAAATATCATGTTTATTAGCGAATGCATTCTTCTGTACATTTCCACGACGGAACACTACTAAGAAAAATTCTGAATATGCCTCATACCCCTATATTAACTTTAATGTTTTGTATGAATGTGAGCCATCTAACCATGTGGTGGAGAAATTGAAATGCATCTGTCACTACTTCAGGAGAGTTTGCACAAAAGTTCCAGTTGGAGTGGTTACAGTGTCTCGTCGTTCTGTTCCTGTAAAGGAGTTACCGGATTGGAAGAGCTCCGAGAGAATCATCTCCGAACTGCCTGTTCATTGCGACTCGGAGAACACTATAGAAGAAGCACATGGCTTGATACAAGTGGATTTTGCTAATAAGTACTTAGGCGGCGGTGTATTGAGTTACGGCTCGGTCCAAGAGGAGATAAGATTCATGATATGCCCCGAGCTGATGATATCAATGTTGTTTACCGAGGAACTGAAGCCCAATGAAGCTTTGATGGTTATAGGTTGTGAACAGTACAGCACATACTCTGGCTATGGTCACAGTTTCTCGTGGGGCTCCAACTATAATGACATAACACCGAGGGACTCCTCTGGCAGGAAACGGACCGCAGTCCTGGCTATAGACGCCCTGCCTGTGAGGAGTCGTCTACACGAGATGAATGCTAACACCGTCACTAGGGATATCAATAAGACTATGGCCTTGACTGAGGCTGGCCGGCCGTTGGCCTACTACACCTTCGACGATAAAGAGTTGAGAGACGACATTATCGGATGCTACGAGTTGCTCGTCAGACATCAGGTTACCGTCGGTCAATTGTATAATATTATAATGAACTACTGTGACTCGAATCAACACAGCGGCGGTATTTACACATATCTGGAACACGCTCTGGATAATAGAAAACCGGTTAATAATAAGAATGACACGGGAAAAAACCTAAAATCCGATACAAATGACAGCGGTAATGTCTGTGATGATCTGATTCTTGCTAGAGCGTTGGATTTTTCACCGGACATATTCTTACAAGACGAAGATATGAGTGAATATTCGATGGACTTGAAAGTTAACACGGAAGATACGGCCGTTATAGATCTAGACACGAGTACGAGCGACGGCAATAAATGTGTTGATAGTAGAGTGACCGCCGAAGAACAGGAAGTTCAGGAAAATACTAAAACAAACCAAACGTCCAGATTATTCGACGAGATGGAGAAGTTAGATCAAGACAGCGGGAAATTGAATCTCAAGAGTCAGCAGAAAACATTCTTTGGACAGAAAAATAATGATTTGTCCATGGACGCTGGGGAGAAGTTACATACAGATATTTCGCCGGATGTCAAAAAGAAATTAACCAAAAAAATTACAGATTATTTCTCCAAGAGACCTATATGA

Protein sequence:

>DPOGS204354-PA
MCSSWKGVPISYIVGSQSPWGAPEFPLVQPSYNHTVLYHIPDDAQLDRPPKPQIGHEKWDQEHVRLPFSTQSLYPVENSAGETKLKNRWDMVQNALNRPIRNSKELAKAILSYNTQFKNRWKFTALHYLFEEYLEEEESQYFFDVTLQEIAKLALSITKLIQAPIPLLKQNKNRSISLSQQQISCLLANAFFCTFPRRNTTKKNSEYASYPYINFNVLYECEPSNHVVEKLKCICHYFRRVCTKVPVGVVTVSRRSVPVKELPDWKSSERIISELPVHCDSENTIEEAHGLIQVDFANKYLGGGVLSYGSVQEEIRFMICPELMISMLFTEELKPNEALMVIGCEQYSTYSGYGHSFSWGSNYNDITPRDSSGRKRTAVLAIDALPVRSRLHEMNANTVTRDINKTMALTEAGRPLAYYTFDDKELRDDIIGCYELLVRHQVTVGQLYNIIMNYCDSNQHSGGIYTYLEHALDNRKPVNNKNDTGKNLKSDTNDSGNVCDDLILARALDFSPDIFLQDEDMSEYSMDLKVNTEDTAVIDLDTSTSDGNKCVDSRVTAEEQEVQENTKTNQTSRLFDEMEKLDQDSGKLNLKSQQKTFFGQKNNDLSMDAGEKLHTDISPDVKKKLTKKITDYFSKRPI-