Monarch geneset OGS2.0

DPOGS213019
TranscriptDPOGS213019-TA2919 bp
ProteinDPOGS213019-PA972 aa
Genomic positionDPSCF300024 + 257581-260835
RNAseq coverage105x (Rank: top 60%)
Annotation
HeliconiusHMEL0122960.092.51% 
BombyxBGIBMGA006939-TA0.091.89% 
DrosophilaDNApol-delta-PA0.076.30% 
EBI UniRef50UniRef50_A6R8570.049.59%DNA polymerase n=3 Tax=root RepID=A6R857_AJECN
NCBI RefSeqXP_001868453.10.078.61%DNA polymerase delta catalytic subunit [Culex quinquefasciatus]
NCBI nr blastpgi|3241207620.091.59%DNA polymerase delta catalytic subunit [Papilio polytes]
NCBI nr blastxgi|3241207620.091.59%DNA polymerase delta catalytic subunit [Papilio polytes]
Group
Gene OntologyGO:00038872.9e-146DNA-directed DNA polymerase activity
GO:00036772.9e-146DNA binding
GO:00062602.9e-146DNA replication
GO:00001662.9e-146nucleotide binding
GO:00061396e-95nucleobase, nucleoside, nucleotide and nucleic acid metabolic process
GO:00036766e-95nucleic acid binding
KEGG pathwaycqu:CpipJ_CPIJ0182870.0 
 K02327 (POLD1)maps-> Purine metabolism
    Base excision repair
    DNA replication
    Homologous recombination
    Mismatch repair
    Nucleotide excision repair
    Pyrimidine metabolism
InterPro domain[419-846] IPR0061342.9e-146DNA-directed DNA polymerase, family B, multifunctional domain
[289-635] IPR0061726e-95DNA-directed DNA polymerase, family B
[446-838] IPR0045786.4e-84DNA-directed DNA polymerase, family B, pol2
[92-402] IPR0123371.3e-74Ribonuclease H-like
[110-403] IPR0061332.6e-72DNA-directed DNA polymerase, family B, exonuclease domain
[582-713] IPR0232114e-19DNA polymerase, palm domain
Orthology groupMCL12834 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213019-TA
ATGGACAAGAAAAAACAATCAAAAGGTCCTCCAGTAAAGAGGTTTAAATCTAATGATGACGACGATGACGTTGAATTACCTTGTTCGTTTGAAGACCAACTTGCTGGCATGGAATGTGAATTTGATTCGCCCCAAGCATTTGGTGAAGGACCTGAAAATCAAAATACAAACATGAAATGGTCACGACCTCAACCGCCGGATCTAGATCCTAAAGTTAATAAATTGGTTTTTCAACAATTGGACATAGACCATTACAATGGGCAACCATTGAAAGATATGCCTGGATCTCAAATAGGTCCTGTACCAATAATGAGAATGTATGGTGTCACAATGGAAGGTAATTCAGTTTGCTGCCATGTACATGGTTTTACACCATACTTTTATGTCACAGTACCTTTGAATTTTAAGGAATCCAATTGTCATGAATTAAAATCAAACTTAAACAAAGCTATTCTCGAAGATCTTCGTTCTAATAAAGATAATATTAGAGAAACAGTGTTAGAAGTGAAGTTAGTAAAAGCTAAATCTATCATGTATTACAAAAATGATGACGATACAACATTTGCTCGAGTCTCTGTTGCTCTGCCAAAACTAATTGCCGCTGCAAAGAGATTAATTGAAAGACAGCCAACATCATTTGGCCTTATGAATCCTTCATTTTATGAAACAAACATTGATTTTGATATTCGATTTATGGTCGACACATCTGTTGTTGGGTGCAGCTGGATAGAACTCCCTCCAGGTAAATGGTCTTTAAGAACAAAAGATAATTCAGTAAAGCCAGAATCGAGGTGCCAAATTGAAGTTGATGTTGCATGGAATCAGTTCATTTCTCATCAACCAGAAGGAGAATGGCTAAAACTTGCACCTTTCCGAATTTTAAGTTTTGATATTGAATGTGCTGGTAGGAAAGGTGTATTTCCAGAACCAAAACATGATCCTGTTATTCAAATTGCTTCAATGGTCATAAGACAAGGAGAAAGCGAACCATATCTTAGAAATGTGTTTACTCTAAATACATGTGCACCTATTGTAGGATCTCAAGTATTCAGTTTTCAATCTGAAGCAGAGATGTTATCAAAATGGTCAGACTTCTTCCGTGAGCTTGATGCAGATATCATTACAGGATATAATATATGTAACTTTGACTGGCCCTATCTAATTAACCGGGCCAAACATCTAAAAGTTGATTGTTTTGACTTTTTAGAAATGGCCAGGGTCACTGGTGTTCCTTTATTATGTTTATTAACTCGTGGTCAGCAAATAAAAGTTGTAAGCCAGTTATTGCGCAAATCGAAGGGGGCAGGGTATTTAATGCCAGCTTACCATAGTCAAGGTTCCGAAGATCAATTTGAGGGGGCTACAGTTATTGAACCCAAAAAGGGATATTATGCAGATCCAATCTCAACATTGGATTTTGCATCTCTATACCCAAGTATAATGATGGCCCATAACTTATGCTATACTACATTAGTGCCACCCAACATTCAGAAAGAAATTAATCTGAACTCTGATGATGTAACTGTTACACCTTCCAATAATATGTTCGTTAAAGCACACAAACGGAAAGGATTACTACCTGAAATTTTAGAATCCCTACTTGCTGCTCGAAAAAAAGCTAAGGCTGATTTAAAAGAAGAGAAGGACCCATTAAAAAGATCTGTGCTCGATGGCAGACAATTAGCTTTAAAAATAAGTGCAAACTCTGTTTATGGTTTCACTGGTGCACAAGTTGGCAAATTACCTTGCTTAGAAATATCAGGTAGTGTAACAGCATATGGAAGGACTATGATAGAATTTACTAGAGCAGAGGTTGAAAAAAAATATACCAAGTCCAACGGATACAAGGAAGACGCTGTTGTCATATATGGCGACACTGACTCAGTCATGGTAAAATTTGGTGTGAAAACTTTAGAAGAAAGTATGGAGCTTGGTAGAGAAGCAGCTGAATTTGTAACTTCAAAATTTGTTAAGCCCATTAAATTGGAGTTTGAAAAAGTATATTACCCTTACCTTCTTATTAATAAAAAGAGGTATGCAGGACTCTACTTTACTAAACCAGAAAAATATGATAAGATGGATTGTAAAGGCATAGAAACGGTCAGAAGAGATAACTGCCCCCTAATTTCGAATATGATGAGTACTTGTCTTCAGAAACTATTAATAGATAGAGATCCTGATGGTGCTATTAATTATGCCAAACAAATGATATCCGATCTTTTGTGTAACCGCATTGATATTTCCCAACTTGTTATAACTAAGGAACTCACTAAAAATGATTATGCTGCTAAGCAGGCTCATGTTGAATTGGCTAATAAAATGAAAAAACGAGATGCTGGTACTGCACCTAAGCTTGGTGACCGAGTTCCATATGTTTTATGTAGTGCTGCAAAAAATACTCCTGCATATATGAAGGCAGAAGATCCCATATATGTTTTAGAAAATAGTGTGCCCATTGATTTTAATTATTATTTAGAAAATCAATTATCCAAACCCTTACTCAGAATTTTTGAACCAATACTAGGTGAAAAAGCTGAATCTTTACTACTTAAAGGTGAACATACAAGGACAAAGGCAATGGTCACATCTAAAGTTGGTGCACTGGCAGCATTTACTAAAAAAAAAGAAAAATGTATTGGCTGTAAAACCGTTATGCCCAATGATACTAAAAAAGCCTTATGTGATCACTGTTTAAAAAATGAAGGACAGCTATATATAACTGAAGTATTTAAATTAAGGCAATTGCAAGAAAGATTCTCACGGCTTTGGACAGAATGTCAACGATGTCAAGGGAGTCTTCATGAAGAAGTATTGTGCACAAACAGAGACTGCACTATATTTTATATGAGAAAAAAAATGGGAATGGAACTTGACACTCAAGAAAAAACAGTCCTCAGGTTTGGGGAACCTATTTGGTAA

Protein sequence:

>DPOGS213019-PA
MDKKKQSKGPPVKRFKSNDDDDDVELPCSFEDQLAGMECEFDSPQAFGEGPENQNTNMKWSRPQPPDLDPKVNKLVFQQLDIDHYNGQPLKDMPGSQIGPVPIMRMYGVTMEGNSVCCHVHGFTPYFYVTVPLNFKESNCHELKSNLNKAILEDLRSNKDNIRETVLEVKLVKAKSIMYYKNDDDTTFARVSVALPKLIAAAKRLIERQPTSFGLMNPSFYETNIDFDIRFMVDTSVVGCSWIELPPGKWSLRTKDNSVKPESRCQIEVDVAWNQFISHQPEGEWLKLAPFRILSFDIECAGRKGVFPEPKHDPVIQIASMVIRQGESEPYLRNVFTLNTCAPIVGSQVFSFQSEAEMLSKWSDFFRELDADIITGYNICNFDWPYLINRAKHLKVDCFDFLEMARVTGVPLLCLLTRGQQIKVVSQLLRKSKGAGYLMPAYHSQGSEDQFEGATVIEPKKGYYADPISTLDFASLYPSIMMAHNLCYTTLVPPNIQKEINLNSDDVTVTPSNNMFVKAHKRKGLLPEILESLLAARKKAKADLKEEKDPLKRSVLDGRQLALKISANSVYGFTGAQVGKLPCLEISGSVTAYGRTMIEFTRAEVEKKYTKSNGYKEDAVVIYGDTDSVMVKFGVKTLEESMELGREAAEFVTSKFVKPIKLEFEKVYYPYLLINKKRYAGLYFTKPEKYDKMDCKGIETVRRDNCPLISNMMSTCLQKLLIDRDPDGAINYAKQMISDLLCNRIDISQLVITKELTKNDYAAKQAHVELANKMKKRDAGTAPKLGDRVPYVLCSAAKNTPAYMKAEDPIYVLENSVPIDFNYYLENQLSKPLLRIFEPILGEKAESLLLKGEHTRTKAMVTSKVGALAAFTKKKEKCIGCKTVMPNDTKKALCDHCLKNEGQLYITEVFKLRQLQERFSRLWTECQRCQGSLHEEVLCTNRDCTIFYMRKKMGMELDTQEKTVLRFGEPIW-