Monarch geneset OGS2.0

DPOGS208467
Transcript: DPOGS208467-TA (996 bp)
Protein: DPOGS208467-PA (331 aa)
Genomic position: DPSCF300064 - 1569556-1571060
RNAseq coverage: 67x (Rank: top 67%)
Annotation
Heliconius: HMEL004292  4e-76  62.34%
Bombyx: (no entry)
Drosophila: (no entry)
EBI UniRef50: UniRef50_P06746  2e-110  57.86%  DNA polymerase beta n=98 Tax=Eumetazoa RepID=DPOLB_HUMAN
NCBI RefSeq: XP_001950059.1  3e-118  61.54%  PREDICTED: similar to predicted protein [Acyrthosiphon pisum]
NCBI nr blastp: gi|336390929  7e-121  63.86%  DNA polymerase beta [Tribolium castaneum]
NCBI nr blastx: gi|336390929  5e-118  63.86%  DNA polymerase beta [Tribolium castaneum]
Group
Gene Ontology:
GO:0003887  4.1e-110  DNA-directed DNA polymerase activity
GO:0003677  4.1e-110  DNA binding
GO:0006281  2.5e-43  DNA repair
GO:0003824  1.9e-26  catalytic activity
GO:0016779  4.7e-25  nucleotidyltransferase activity
KEGG pathway:
tca:656359  7e-113
K02330 (POLB) maps-> Base excision repair
InterPro domain:
[12-330]  IPR002054  4.1e-110  DNA-directed DNA polymerase X
[66-83]  IPR002008  2.5e-43  DNA polymerase, family X, beta-like
[6-92]  IPR010996  1.9e-26  DNA-directed DNA polymerase, family X, beta-like, N-terminal
[107-124]  IPR022312  4.7e-25  DNA polymerase family X
[93-150]  IPR018944  1.5e-17  DNA polymerase lambda, fingers domain
Orthology group: MCL17512 (Patchy)

Nucleotide sequence:

>DPOGS208467-TA
ATGAGCAAGCGAAAGAATCCATCTACCAATAATAATGTGAATGCAGATTTTTGCGACTTTTTAATGGAATTAGCTGATTATGAAAAAAATGTCAGTCGTAATATTCATAAATACAATGCCTATCGTAAAGCTGCTAGCGTTTTAGCTGCCCATCATAAACGTATTGAATCTGGGCAGCAAGCAAAGAAGTTAAATGGTGTAGGAGAAAAAATATCAAAAAAAATTGACGAGTTCTTACAAACGGGAAAATTGAGAAAATTAGAAAATATACATAATGACGAAAAGGCGCAAGCTATCAGTTTACTAACACGGGTTTCGGGTATCGGTCCTGTGAAAGCTGCTGATTTAATTGATAAGGGTATAAAAACTTTAGAAGATTTGAACAAAAACCAAGATTTGTTAAACCACCATCAACTGATTGGTCTGAAGTATTTCGAAAATTTCGAGGAGAAGATTCCTCGTGAAGAAATTCAAAAAATAGAGGCAATAATAAAAAAAAATATTTTAGATTTAGATTGTGATTATACTATTACCATTTGTGGCAGTTACAGGCGCGGCGCATCACAAAGTGGCGACATTGACGTGCTAGTTACACACCCAACGATGAAGTTGGATAAGGAAAAAATCGGTGAAAAGCTTTTGAAAGAAATAAAAGAGGCTTTAGGAGAACTCATTGTTGACGTGATATCAATGGGTGCGACGAAGTTTATGGGCGTCTGTCGTCTGTCAGACGGACATTTAAACCGGCGTTTGGACATTCGGCTCATACCAAATGAACAGTACCACTGCGCCGTCCTTTACTTCACCGGAAGTGACGTTTTCAATAAAAATATGAGAGCACATGCTTTGGAAAAACGATTTACACTGAATGAATACTCCCTGCGACCGATCGGTGTAACCGGTGTCCACGGACAGCCGGTGCCGATCACATCAGAAGAAGACATTTTCGACTACATCGACTATCCTTACAAGAAACCGGAAGAACGAAATCTGTAG

Protein sequence:

>DPOGS208467-PA
MSKRKNPSTNNNVNADFCDFLMELADYEKNVSRNIHKYNAYRKAASVLAAHHKRIESGQQAKKLNGVGEKISKKIDEFLQTGKLRKLENIHNDEKAQAISLLTRVSGIGPVKAADLIDKGIKTLEDLNKNQDLLNHHQLIGLKYFENFEEKIPREEIQKIEAIIKKNILDLDCDYTITICGSYRRGASQSGDIDVLVTHPTMKLDKEKIGEKLLKEIKEALGELIVDVISMGATKFMGVCRLSDGHLNRRLDIRLIPNEQYHCAVLYFTGSDVFNKNMRAHALEKRFTLNEYSLRPIGVTGVHGQPVPITSEEDIFDYIDYPYKKPEERNL-
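
The transcript and protein lengths listed above are consistent with each other: 996 bp is 332 codons, that is, 331 amino acids plus one stop codon, which the protein record renders as a trailing "-". As a minimal sketch of how the two sequences relate, the Python below translates a coding sequence with the standard genetic code. The function name `translate` and the `stop_symbol` parameter are illustrative choices, not part of this database, and the sketch assumes an in-frame sequence containing only A, C, G and T.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in
# T, C, A, G order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence into a protein string.

    Stop codons are written as `stop_symbol` (an assumption chosen to match
    the trailing "-" in the protein record above). Ambiguous bases such as N
    are not handled and raise KeyError.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 27 bases and last 15 bases of DPOGS208467-TA, copied from the record.
print(translate("ATGAGCAAGCGAAAGAATCCATCTACC"))  # MSKRKNPST
print(translate("GAACGAAATCTGTAG"))              # ERNL-
```

Passing the full DPOGS208467-TA sequence to `translate` and comparing the result with DPOGS208467-PA is a quick way to check that a copied record has not been truncated or shifted out of frame.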