Monarch geneset OGS2.0

DPOGS206884
Transcript: DPOGS206884-TA, 591 bp
Protein: DPOGS206884-PA, 196 aa
Genomic position: DPSCF300001, 2041133-2041723
RNAseq coverage: 39x (Rank: top 73%)
Annotation
Heliconius: HMEL006878, 5e-80, 77.27%
Bombyx: BGIBMGA012841-TA, 2e-81, 72.34%
Drosophila: (none)
EBI UniRef50: UniRef50_UPI000203A21D, 1e-22, 38.89%, UPI000203A21D related cluster n=1 Tax=unknown RepID=UPI000203A21D
NCBI RefSeq: XP_002738372.1, 2e-22, 37.32%, PREDICTED: methyl-CpG binding domain protein 4-like [Saccoglossus kowalevskii]
NCBI nr blastp: gi|327280386, 5e-22, 38.89%, PREDICTED: methyl-CpG-binding domain protein 4-like [Anolis carolinensis]
NCBI nr blastx: gi|328712829, 4e-22, 43.85%, PREDICTED: methyl-CpG-binding domain protein 4-like [Acyrthosiphon pisum]
Group
Gene Ontology: GO:0006281, 1.2e-19, DNA repair
 GO:0003824, 1.2e-19, catalytic activity
KEGG pathway: xtr:733528, 3e-20
 K10801 (MBD4), maps-> Base excision repair
InterPro domain: [22-171] IPR011257, 1.2e-19, DNA glycosylase
Orthology group: MCL19609, Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206884-TA
ATGAATTTGGAGAACGACTTGAGTTTATCACAATTAACAATAGAAGAGCCGGACCCATTAAACATACCGCCTTTCTTCAATCTCACACCTCGCTTAATGCCCGAATCACCTCACTATATAATAGAGGAGGAATTCTCTCTCAACCCCTGGGCAATGTTAGTAGCAACTATATTCCTAACAAAGACGTCCGGCAAAACAGCTAGGCCGTATATAAAGAGTTTTTTTACGGACTACCCGACACCTTATCAAGTTTTGGATGAAACTCCATCGTCATTAGAGAGGTTCTTCGAGAACTTAGGATTAAAAAAACGTGGTAATATGATTTGGAAGCTGAGTTACCAGTTCGTGTCTGGTAAATGGCGGCGAGCTAGCGATCTCTGTGGAATCGGGAAGTATGGGGAGGACGCTTATAGGATCTTTTGCTTAGGTCACACGGATGTAAACCCTGACGATAGATATTTAAAGCTCTATTTAGATTGGCTGCAGTGTCACACTGAGTTCATAAAAGACAGGAGCGTAACTGACAGCGAAAACCTATTACAAGATCCGGTTCTGAAATATTATAGAATTACTTTGAAAAGTAATGTATAA

Protein sequence:

>DPOGS206884-PA
MNLENDLSLSQLTIEEPDPLNIPPFFNLTPRLMPESPHYIIEEEFSLNPWAMLVATIFLTKTSGKTARPYIKSFFTDYPTPYQVLDETPSSLERFFENLGLKKRGNMIWKLSYQFVSGKWRRASDLCGIGKYGEDAYRIFCLGHTDVNPDDRYLKLYLDWLQCHTEFIKDRSVTDSENLLQDPVLKYYRITLKSNV-
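
The lengths given in this record can be cross-checked with simple arithmetic. The sketch below assumes that the genomic coordinates are 1-based and inclusive, and that the transcript is a single uninterrupted coding sequence ending in one stop codon. The record states neither assumption, and the function names are illustrative only.

```python
# Figures copied from the record above.
START, END = 2041133, 2041723   # genomic position on DPSCF300001
TRANSCRIPT_BP = 591             # length of DPOGS206884-TA
PROTEIN_AA = 196                # length of DPOGS206884-PA


def span_length(start: int, end: int) -> int:
    """Length of a closed interval (assumes 1-based inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS that ends in a single stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    # Does the genomic span match the transcript length?
    print(span_length(START, END) == TRANSCRIPT_BP)
    # Does the transcript length match the protein length?
    print(expected_protein_length(TRANSCRIPT_BP) == PROTEIN_AA)
```

Under these assumptions both checks print True: the genomic span equals the transcript length, which fits a gene model without introns, and 591 bp corresponds to 196 residues plus a stop codon (shown as the trailing "-" in the protein sequence).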