Monarch geneset OGS2.0

DPOGS213693
TranscriptDPOGS213693-TA1398 bp
ProteinDPOGS213693-PA465 aa
Genomic positionDPSCF300219 + 362543-365129
RNAseq coverage211x (Rank: top 46%)
Annotation
HeliconiusHMEL0166060.090.75% 
BombyxBGIBMGA010350-TA0.080.43% 
DrosophilaDMAP1-PA9e-13554.20% 
EBI UniRef50UniRef50_B0XFR81e-13656.82%DNA methyltransferase 1-associated protein 1 n=13 Tax=Coelomata RepID=B0XFR8_CULQU
NCBI RefSeqXP_001815879.11e-14658.58%PREDICTED: similar to DMAP1 CG11132-PA [Tribolium castaneum]
NCBI nr blastpgi|2700105751e-14659.31%hypothetical protein TcasGA2_TC009994 [Tribolium castaneum]
NCBI nr blastxgi|2700105754e-15159.44%hypothetical protein TcasGA2_TC009994 [Tribolium castaneum]
Group
Gene OntologyGO:00458923.4e-67negative regulation of transcription, DNA-dependent
GO:00056343.4e-67nucleus
KEGG pathway 
InterPro domain[243-433] IPR0084683.4e-67DNA methyltransferase 1-associated 1
Orthology groupMCL13952 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213693-TA
ATGGCAGATATTTTGGATATATTAGATATAGAACAACCTGGAGCTTCAGAAATAACAAGAGACAGTATCATCCATGGAGACAAAGCAAAAAAAAAATATGTTACCGCCAAGACGGTGCGTCGTCCGGAGGGAATGCACCGTGAAGTATTTGCTCTACTATACAATGATAACAAGGATCTGCCTCCTCTCTTACCTACTGACACTGGTAAAGCGTATAAACAAACTAAAGCGAAACTCGGTATGCGAAAAGTGAGGAAATGGGTTTGGGCACCATTCACTAACCCAGCACGCAAAGACAATGCTGTGTTTCATCATTGGAAGAGGGCATCTGATGAGGCCAAAGAGTATCCATTTGCACAATTCAATAAGCAAGTATCAATCCCATCCTATTCAGAATCAGAATATAATCAGTATTTGAAATCTGAAGACTGGAGTCAGGCCGAGACGGACCACTTGATGGATCTGTGTCAAAGGTTTGATCTGCGGTTCATTGTGATACATGATAGATGGGACCGAGCTGCCTTCCGAGACAGAAGTGTTGAGGACTTGAAGGAGAGATATTATAATATTTGTGCCATTTTAAGTAAGGTGAAAACAAATCCTTGGTCGAATTCCGTAACAATGGTCAATGGTGAGAAAAGAGTTTACCATTATGATGCTGAACATGAAAGAAAAAGAAAGGAACAATTGAAAAGGCTCTTTGATAGGACTCAAGAACAGATTGATGAAGAACAAATGTTATTGGCGGAGTTGAAAAAAATTGAAGCAAGGAAACGTGAAAGGGAAAGGAAGACCCAAGATTTACAGAAACTCATTTCTAGAGCCGACAGTGGGAATGGTATTGTTAGTAACCAAACCAGTGTTGTGAACGAAGGTGCCAACACTCCGACTGGCTCAACATCAACAATTGCTAGACGACATGATAGAAAATTGCATAAGAAAAAATTAACAGCACAACAACGACCAGTCCGAACTGTTGAAACTGTGACTGTGGAATGGTCAGGTATAAAGTTCCCTGAGGCTCGGGGTGCAGGCGTTTGGTTGCGATCTCAACGTATGAAGCTGCCACCTGGCGTAGGACAGCGCAAGACTAAAGCAATAGAGCAGGAATTAAGACTTTTGAATATTGATATTGCACCAACACCGACGGAAGCAATTTGTAAACATTTTAATGAATTACGTTCTGATCTAGCTTTGGCATTAGATCTGAAAAATGCATTGGCGTCTTGTGAGTTTGAATTGCAGGCTTTAAGACACCAATATGAAGCTCTGAACCCTGGAAAGACATTAACAATACCAGCATCGATTTGCAACACCAATGTGGATGCGGAAATAAAACCCCTCGGAGAGATTATTGATGTGGTTGGATCACCGAGTGCCCTGAACACTACTATATAG

Protein sequence:

>DPOGS213693-PA
MADILDILDIEQPGASEITRDSIIHGDKAKKKYVTAKTVRRPEGMHREVFALLYNDNKDLPPLLPTDTGKAYKQTKAKLGMRKVRKWVWAPFTNPARKDNAVFHHWKRASDEAKEYPFAQFNKQVSIPSYSESEYNQYLKSEDWSQAETDHLMDLCQRFDLRFIVIHDRWDRAAFRDRSVEDLKERYYNICAILSKVKTNPWSNSVTMVNGEKRVYHYDAEHERKRKEQLKRLFDRTQEQIDEEQMLLAELKKIEARKRERERKTQDLQKLISRADSGNGIVSNQTSVVNEGANTPTGSTSTIARRHDRKLHKKKLTAQQRPVRTVETVTVEWSGIKFPEARGAGVWLRSQRMKLPPGVGQRKTKAIEQELRLLNIDIAPTPTEAICKHFNELRSDLALALDLKNALASCEFELQALRHQYEALNPGKTLTIPASICNTNVDAEIKPLGEIIDVVGSPSALNTTI-