Monarch geneset OGS2.0

DPOGS204841
TranscriptDPOGS204841-TA1320 bp
ProteinDPOGS204841-PA439 aa
Genomic positionDPSCF300227 - 186842-189063
RNAseq coverage185x (Rank: top 49%)
Annotation
HeliconiusHMEL0138942e-12077.74% 
BombyxBGIBMGA011736-TA2e-14073.33% 
DrosophilaCG12018-PA7e-9438.43% 
EBI UniRef50UniRef50_E9FRK33e-9642.60%Putative uncharacterized protein n=1 Tax=Daphnia pulex RepID=E9FRK3_DAPPU
NCBI RefSeqXP_001658442.13e-9842.96%DNA polymerase delta small subunit [Aedes aegypti]
NCBI nr blastpgi|1571163665e-9742.96%DNA polymerase delta small subunit [Aedes aegypti]
NCBI nr blastxgi|1571163661e-9343.16%DNA polymerase delta small subunit [Aedes aegypti]
Group
Gene OntologyGO:00038871.7e-34DNA-directed DNA polymerase activity
GO:00036771.7e-34DNA binding
GO:00062601.7e-34DNA replication
KEGG pathwayaag:AaeL_AAEL0126691e-97 
 K02328 (POLD2)maps-> Purine metabolism
    Base excision repair
    DNA replication
    Homologous recombination
    Mismatch repair
    Nucleotide excision repair
    Pyrimidine metabolism
InterPro domain[192-397] IPR0071851.7e-34DNA polymerase alpha/epsilon, subunit B
Orthology groupMCL11779 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204841-TA
ATGCTCTTTAAAGTTAATGTTGAGAATTCGAAAAAACAATTTGAGGTGGTAGATGTCGACAGACATTCAGTGCAATACGCTGATGGATCCAGTCGGTTTTATAAAGTTTCGAGAGATTTTTCTAAACAGTATGCTCATATTTATTCAGCAAGGCTAAATATTTTTAGAAATATATTGGCACCTGTAGTTAATAAAAAGTATTCAAATAAATATAAAATTTTAAAATTGTGTGATCTTCGCGATAAAGGTACACCTTGTATAATCATTGGTACATTATTTAAACTTCAAGAATTAAAACCAAGTATTTTAAAAGAGCTCTCAGATCAACTTGAAATATTACCGCAACCAACAAGAACCCATTTTGTTCATGAAACTGACAGTTTAGTGTTGGAAGATGAGCTGCAAAGAATAAAATTATCTGGAGATTGTATTGATGTTAATAAAGTGGTAACAGGTGTTGTTGTTGCTATACTTGGTTCTGAAGATGAAGATGGAATATTCACTGTTAAAGATGTCTGCTGGGCGGGATGTAATGTACAAAAACCTCTACCTACATTAAATAACGACCGATACATTGTTCTGATGTCAGGATTAAGTCTAGCATCCAAATTTGGAGACCATTTGTTCTCCTTAAACTTATTTATAGAATGGTTATCAGGGTTTTGTGGTACTACACAATATCAAGAAGAAGTTTCAAAGATTGTCAGAGTTATAATCGCAGGTGGAATCTATGCAAACCAGTCTAGTGATTACCAACTTAGTGAATCGGATGTTATAAGCTCATCTGATGTTGTCGATTCATTCTCTGCTGCTGTCAGTGCGGTTACACCTTTAGATCTAATGCCAGGTTGTAAGGATCCTACTGGCATTATGTTACCACAGAAACCGTTTCATTACTGTTTATTCCCAAAAGCAATTGAATATAAATCATTCAATAGAGTATCAAACCCTTATGAATGTGATATTGGAGGTTTTGTATGCTTGGGCACATCAGGAGAACCAATAAGAGATATAATGCAGAACAGTGAGATAGACAAACATCTGGATGTTATGAAGAAGACTTTGGAGTGGAGGCACATGGCACCTACCTGTCCTGATAATGTTCCCTGCACACCCTGTATAGATACAGATCCATTCACTATTTATAATTGCCCTGCCATATACTTTAGTGGCAACTGTACTGAATTTGAAACAGAAATTTTTAATGGTGACGATGGACAGAAAGTCAGAATTGTATGCATACCAGATTTTTGTGAAACGAAAACTGTAGTGTTGGTGAATCTTGCGAATTTGGAATGTTATAGTATGACTTTTGGCTGA

Protein sequence:

>DPOGS204841-PA
MLFKVNVENSKKQFEVVDVDRHSVQYADGSSRFYKVSRDFSKQYAHIYSARLNIFRNILAPVVNKKYSNKYKILKLCDLRDKGTPCIIIGTLFKLQELKPSILKELSDQLEILPQPTRTHFVHETDSLVLEDELQRIKLSGDCIDVNKVVTGVVVAILGSEDEDGIFTVKDVCWAGCNVQKPLPTLNNDRYIVLMSGLSLASKFGDHLFSLNLFIEWLSGFCGTTQYQEEVSKIVRVIIAGGIYANQSSDYQLSESDVISSSDVVDSFSAAVSAVTPLDLMPGCKDPTGIMLPQKPFHYCLFPKAIEYKSFNRVSNPYECDIGGFVCLGTSGEPIRDIMQNSEIDKHLDVMKKTLEWRHMAPTCPDNVPCTPCIDTDPFTIYNCPAIYFSGNCTEFETEIFNGDDGQKVRIVCIPDFCETKTVVLVNLANLECYSMTFG-