Monarch geneset OGS2.0

DPOGS205849
TranscriptDPOGS205849-TA783 bp
ProteinDPOGS205849-PA260 aa
Genomic positionDPSCF300081 + 191901-193612
RNAseq coverage1075x (Rank: top 12%)
Annotation
HeliconiusHMEL0173882e-11581.15% 
BombyxBGIBMGA010906-TA8e-15197.31% 
Drosophilamus209-PA3e-12780.77% 
EBI UniRef50UniRef50_P179175e-12580.77%Proliferating cell nuclear antigen n=205 Tax=Eukaryota RepID=PCNA_DROME
NCBI RefSeqNP_001036825.11e-14696.15%proliferating cell nuclear antigen [Bombyx mori]
NCBI nr blastpgi|217173941e-14897.69%proliferating cell nuclear antigen [Spodoptera frugiperda]
NCBI nr blastxgi|217173943e-14197.69%proliferating cell nuclear antigen [Spodoptera frugiperda]
Group
Gene OntologyGO:00036774.8e-192DNA binding
GO:00436264.8e-192PCNA complex
GO:00062754.8e-192regulation of DNA replication
GO:00303374.8e-192DNA polymerase processivity factor activity
KEGG pathwayaag:AaeL_AAEL0125451e-129 
 K04802 (PCNA)maps-> Base excision repair
    DNA replication
    Mismatch repair
    Nucleotide excision repair
    Cell cycle
InterPro domain[1-259] IPR0007304.8e-192Proliferating cell nuclear antigen, PCNA
[127-254] IPR0226492e-63Proliferating cell nuclear antigen, PCNA, C-terminal
[1-125] IPR0226486.1e-60Proliferating cell nuclear antigen, PCNA, N-terminal
Orthology groupMCL13126 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205849-TA
ATGTTCGAAGCACGACTTCTCCGTAGCTCTATCCTGAAGAAAGTACTGGAGGCAATCAAGGATCTCCTTACTCAGGCAACTTTTGACTGTGATGATAATGGTATACAGTTGCAGGCTATGGACAATTCACATGTGTCTCTTGTATCACTCACCCTGAGAGCCGATGGATTTGATAAATACAGATGTGATAGAAACATCTCAATGGGAATGAATCTCGGCAGTATGTCTAAGATCCTAAAATGTGCTGGTGACAAGGACACTGTAACAATGAAAGCGCAAGATAATGCCGACACAGTCACCTTTGTGTTTGAGAGCCCTAACCAGGAAAAGGTTTCCGATTACGAAATGAAGCTCATGAATTTGGATTTAGAACATTTGGGCATCCCAGAGACAGAATACAGTTGCACAATTCGAATGCCAAGCGGTGAATTCGCAAGGATTTGTCGTGATCTATCACAGTTCGGCGAATCTATGGTGATATCTTGTACTAAGGAAGGAGTAAAGTTTTCAGCGAGTGGTGACATCGGCTCGGCAAATATAAAGCTGGCTCAAACAGCATCTATTGATAAGGAAGAAGAAGCCGTTGTTATTGAGATGGATGAACCAGTCACACTCACCTTCGCCTGTCAGTATCTTAACTACTTCACTAAAGCCACTTCATTGAGTCCTCAGGTACAGTTGTCGATGTCAGCGGATGTCCCCCTGGTGGTGGAGTACCGCATCCCGGACATAGGTCACATACGTTACTACCTCGCACCCAAGATAGAAGAAGACGACAGCTAG

Protein sequence:

>DPOGS205849-PA
MFEARLLRSSILKKVLEAIKDLLTQATFDCDDNGIQLQAMDNSHVSLVSLTLRADGFDKYRCDRNISMGMNLGSMSKILKCAGDKDTVTMKAQDNADTVTFVFESPNQEKVSDYEMKLMNLDLEHLGIPETEYSCTIRMPSGEFARICRDLSQFGESMVISCTKEGVKFSASGDIGSANIKLAQTASIDKEEEAVVIEMDEPVTLTFACQYLNYFTKATSLSPQVQLSMSADVPLVVEYRIPDIGHIRYYLAPKIEEDDS-