Monarch geneset OGS2.0

DPOGS212911
TranscriptDPOGS212911-TA1578 bp
ProteinDPOGS212911-PA525 aa
Genomic positionDPSCF300285 - 62776-64353
RNAseq coverage1x (Rank: top 94%)
Annotation
HeliconiusHMEL0042450.081.30% 
BombyxBGIBMGA013987-TA0.075.43% 
DrosophilaPole2-PA8e-11541.44% 
EBI UniRef50UniRef50_Q5ZKQ66e-13745.77%DNA polymerase epsilon subunit 2 n=38 Tax=Chordata RepID=DPOE2_CHICK
NCBI RefSeqXP_002732910.12e-14047.21%PREDICTED: DNA-directed DNA polymerase epsilon 2-like [Saccoglossus kowalevskii]
NCBI nr blastpgi|1479071641e-14248.08%DNA-directed DNA polymerase epsilon 2 [Xenopus laevis]
NCBI nr blastxgi|1479071642e-13948.08%DNA-directed DNA polymerase epsilon 2 [Xenopus laevis]
Group
Gene OntologyGO:00056349.2e-162nucleus
GO:00038879.2e-162DNA-directed DNA polymerase activity
GO:00062619.2e-162DNA-dependent DNA replication
GO:00036773e-42DNA binding
GO:00062603e-42DNA replication
KEGG pathwayxla:3991162e-143 
 K02325 (POLE2)maps-> Purine metabolism
    Base excision repair
    DNA replication
    Nucleotide excision repair
    Pyrimidine metabolism
InterPro domain[1-523] IPR0162669.2e-162DNA polymerase epsilon, subunit B
[283-484] IPR0071853e-42DNA polymerase alpha/epsilon, subunit B
Orthology groupMCL12695 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212911-TA
ATGGCAGACTTGCAAATTGTACGTTCTGAAGTGAATAGCGCTTTCAAACTCAACGGGTTTACAATTCGAAAAGAAGCTAGCACTTTTGTTGCTGAGCAAGTGGCTGCGGTATCAAAAGAGGAACGTAAAAAAATACTAGATAAATTAATAGAGCACCTTTTACACCAATGTTTGTCTCAACCAGTATTAGAGAAGCAGCATCTAGAAGTGGCCTACAAAGAATGTTTGTCTTCAGGTCTCGAAGAAAGCGAAACCATATTAAATGTCATCGATGCATTGAATGTCCCCAAGCTCCGTTATGATTGTGATAGAAAGAAGTTTACTAAAGAAACGAATGTCAAGAATAATTTATACCCGGAACCTAAGTGGCAGGCACAGCTTTTTATCGAACGCTATACCATCATTCAGCAGAGAACAACACGTAATAAACTATTTGCGAGAGAGGCTTTGCCGTCCATGGAAAATGAAAATCGTTTTCAGTTACGAACTATTGAGGTTTTACAGAGTTCTTCAAGTCGAGTGGATGAGGTCATTGTTCTGGGTCTTATAACACAATTGACGGAGGGTAAATATTATTTGGAGGACCCAACCGGCAGCGTTCCTTTAGATATGAGTCAAACTCGCTATCATTCCGGTTTGTTTACTGAGAGTAGTTTTGTCTTAGCTGAGGGTTATTACGATGATAAAGTTTTACATGTAATGGGTCTTGTGCTACCACCTTCTGAGACACGAGAAACGTCATTACCATATTTTGGTAATTTGAATACATTTGGAGGTATATCAAAAACATTATTGAAGCATTCAAAAAATTTGTCAAAAATTGAACAGGAGAATGAAGATGGTATGATAATATTTCTATCGGAGGTTTGGTTTGACAATATAAAAGTGATATCGAAGTTAAAAACATTATTTTCTGGCTATAATGATTTTCCTCCTATAGCCATAGTGTTTATGGGTGAATTTTTATCTTGTCCATACGGATACGAGCATAGCACACAGTTAAAAGCTGCTCTTGGTAACTTGTGTGATATGATTCTTCCTTTCAAAAAGCTTAGGGAATCATGCAAATTTATATTTGTGCCAGGCAGAGGTGACCCAGCGGCTCCGAACATACTGCCCCGTCCAGCTATACCGAGTTTCATCACACAAGACATTAAAAGCAAATTGGGTGATTCGGTTATATTCACAACTAATCCATGTAGAATTCAATACTGTACACAGGAAATTGTTGTTATAAGACAGGATTTGGTGACAAAGATGTGTAGGAATTCGGTTCACTTTCCTGATGCTGGTGACATACCAGATCATTTGACGAAGACATTGTTAAGCCAATGTACATTATCACCGTTATCATTAGCCGTCCAACCTATATATTGGAAGCATGCGGACTCTTTGAGTTTATATCCAATGCCGGATTTAGTGGTTGTGGCTGATAGCTTCCAACCTTATACCAGATCATACCAGAATTGCCAAATAATTAACCCTGGATCATTCCCACATACAGAATACTCGTTTAAGGTTTATGTACCGGCCTCGAGACTCGTTGAAGATTCACAAATACCTAATGAAGATACTTGA

Protein sequence:

>DPOGS212911-PA
MADLQIVRSEVNSAFKLNGFTIRKEASTFVAEQVAAVSKEERKKILDKLIEHLLHQCLSQPVLEKQHLEVAYKECLSSGLEESETILNVIDALNVPKLRYDCDRKKFTKETNVKNNLYPEPKWQAQLFIERYTIIQQRTTRNKLFAREALPSMENENRFQLRTIEVLQSSSSRVDEVIVLGLITQLTEGKYYLEDPTGSVPLDMSQTRYHSGLFTESSFVLAEGYYDDKVLHVMGLVLPPSETRETSLPYFGNLNTFGGISKTLLKHSKNLSKIEQENEDGMIIFLSEVWFDNIKVISKLKTLFSGYNDFPPIAIVFMGEFLSCPYGYEHSTQLKAALGNLCDMILPFKKLRESCKFIFVPGRGDPAAPNILPRPAIPSFITQDIKSKLGDSVIFTTNPCRIQYCTQEIVVIRQDLVTKMCRNSVHFPDAGDIPDHLTKTLLSQCTLSPLSLAVQPIYWKHADSLSLYPMPDLVVVADSFQPYTRSYQNCQIINPGSFPHTEYSFKVYVPASRLVEDSQIPNEDT-