Monarch geneset OGS2.0

DPOGS204355
TranscriptDPOGS204355-TA1488 bp
ProteinDPOGS204355-PA495 aa
Genomic positionDPSCF300040 - 940223-943807
RNAseq coverage140x (Rank: top 55%)
Annotation
HeliconiusHMEL0118190.063.47% 
BombyxBGIBMGA005885-TA3e-12348.86% 
DrosophilaParg-PC7e-12545.89% 
EBI UniRef50UniRef50_B0WST73e-12748.31%Poly(Adp-ribose) glycohydrolase n=2 Tax=Culicinae RepID=B0WST7_CULQU
NCBI RefSeqXP_001853435.16e-12848.31%poly(adp-ribose) glycohydrolase [Culex quinquefasciatus]
NCBI nr blastpgi|1700486181e-12648.31%poly(adp-ribose) glycohydrolase [Culex quinquefasciatus]
NCBI nr blastxgi|1700486185e-12448.31%poly(adp-ribose) glycohydrolase [Culex quinquefasciatus]
Group
Gene OntologyGO:00046497e-152poly(ADP-ribose) glycohydrolase activity
GO:00059757e-152carbohydrate metabolic process
KEGG pathway 
InterPro domain[39-470] IPR0077247e-152Poly(ADP-ribose) glycohydrolase
Orthology groupMCL11552 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204355-TA
ATGCATTACACTGACTGTTGGCTGGCTAAAGATTTCCCTCCAGTTGAACCAAGCGAAGATCATATAGTGTTATTCAGAATATGCAATGGGCAATATATTCCAAATATTGGAGAAGATAAATGGGACAAGGACCATGTGAAAATGCCATGTTCTGATAGAAATATCAACAAAGAGATTAATGAAACTAAACAATGGGACGTCATAGTAAAAGCTTTGTCTCAAAAAATTAAAAATTCCGAGGCTTTAGCATCCGCGATACTCACATATCAGACACAGTTTAAGGATATTTGGAAGTTCAAAGCTATGCATAGGTTTTTTAACGAGTACTGGGATAAAAATGACAGTGAATACTTCTTCGAAAACACTTTGCCTAAAGTTGCGCGTTTAGCTTTGGATTTACCGGAGTTAATCAAATCTCCTATACCGTTACTGAAACAGGGATGTAATATATCGCTATCATTTACACAGTTGCAACTCGCTAGTTTGCTAGCAAACGCATTCTTTTGTACATTCCCAGAGAGGAATAATAAGAGAAAGGATTCAGAATACAAAACATATCCGCCGGTAAATTTCAACGTACTATACGACGGCGGCGGACCAAAAGTAATGGAGAAATTTAAATTTATCTGTCATTACTTCAACAGAGTCTGTGAAGTAAATCCAACCGGAGTCGTCACGTTTTCTCGTCGCCATATCCCTGTAGACAAATGTCCTGACTGGGCTCGCGTCACCCTTCCCATGTCAACGGTGCCCCTTGGTGTAGATGATAGCAAACTCATTGAAGACGCAAAGTACTGGATACAAATGGATTTCGCCAATAAATACATAGGTGGCGGCGTCCTTCGTCGCGGAGCCGTGCAAGAGGAGATCAGGTTCGTGTCGAACCCTGAACTCATGGTGTCGTTACTGTTCACCGAGGTCCTGAGTCCCACCGAAGCCGTCATGATTATAGGTACAGAACGTTATAGCACGCACACTGGTTATAGTTCCACTGTCAAATGGTCTGGGAATTATATAGACGAGACCTCCACAGACTCGTCTGCCAGGCGGCAGTGTGCGATACTAGCGTTAGACGCTAGAAGATTCCCTAAGCCTGACGAGCAGTACTGTAAGGAAATGATAGACAGAGAGTTGAATAAGGCATACGTTGGATTTTCATTTTATTCGAAGGCCGGTGGATTGAGTTACCCGGGGATAGCGACGGGTAACTGGGGCTGTGGGGCCTTTGGAGGTTCGGCCAGATTGAAGTCCTTATTGCAGATTATGGCCTGCGTCAGAGCCGGCAGACCGATTTCTTACTTCACATTTAATGATGTAACATTAAAAAATAATATAGAACATATGTACGAGTTCCTCAGAACAAATAATGTTACAGTCGGCGATCTGTATCAATGTCTGATGGATTTCTGTGAATCTGAGGATCATATTAGTGTGTTTGTTAACTTATATGAAAACATTGAGAATTATTTCAACAAACAAATGATGTGA

Protein sequence:

>DPOGS204355-PA
MHYTDCWLAKDFPPVEPSEDHIVLFRICNGQYIPNIGEDKWDKDHVKMPCSDRNINKEINETKQWDVIVKALSQKIKNSEALASAILTYQTQFKDIWKFKAMHRFFNEYWDKNDSEYFFENTLPKVARLALDLPELIKSPIPLLKQGCNISLSFTQLQLASLLANAFFCTFPERNNKRKDSEYKTYPPVNFNVLYDGGGPKVMEKFKFICHYFNRVCEVNPTGVVTFSRRHIPVDKCPDWARVTLPMSTVPLGVDDSKLIEDAKYWIQMDFANKYIGGGVLRRGAVQEEIRFVSNPELMVSLLFTEVLSPTEAVMIIGTERYSTHTGYSSTVKWSGNYIDETSTDSSARRQCAILALDARRFPKPDEQYCKEMIDRELNKAYVGFSFYSKAGGLSYPGIATGNWGCGAFGGSARLKSLLQIMACVRAGRPISYFTFNDVTLKNNIEHMYEFLRTNNVTVGDLYQCLMDFCESEDHISVFVNLYENIENYFNKQMM-