Monarch geneset OGS2.0

DPOGS209232
TranscriptDPOGS209232-TA1068 bp
ProteinDPOGS209232-PA355 aa
Genomic positionDPSCF300204 + 160349-161416
RNAseq coverage144x (Rank: top 54%)
Annotation
HeliconiusHMEL0157439e-18081.97% 
BombyxBGIBMGA011461-TA0.088.73% 
DrosophilaRfC38-PA1e-14966.76% 
EBI UniRef50UniRef50_Q171N95e-15068.57%Rfc5p, putative n=15 Tax=Coelomata RepID=Q171N9_AEDAE
NCBI RefSeqXP_624376.15e-15269.52%PREDICTED: similar to Replication factor C subunit 3 (Replication factor C 38 kDa subunit) (RFC38) (Activator 1 38 kDa subunit) (A1 38 kDa subunit) (RF-C 38 kDa subunit) [Apis mellifera]
NCBI nr blastpgi|665236931e-15069.52%PREDICTED: replication factor C subunit 3 [Apis mellifera]
NCBI nr blastxgi|3800219064e-14469.52%PREDICTED: LOW QUALITY PROTEIN: replication factor C subunit 3-like [Apis florea]
Group
Gene OntologyGO:00036778.3e-35DNA binding
GO:00062608.3e-35DNA replication
GO:00056341e-33nucleus
GO:00055241.2e-05ATP binding
GO:00001666e-05nucleotide binding
GO:00171116e-05nucleoside-triphosphatase activity
KEGG pathwayame:5519902e-151 
 K10756 (RFC3_5)maps-> DNA replication
    Mismatch repair
    Nucleotide excision repair
InterPro domain[248-344] IPR0089218.3e-35DNA polymerase III, clamp loader complex, gamma/delta/delta subunit, C-terminal
[250-344] IPR0194831e-33DNA polymerase III, clamp-loader complex, subunit E, C-terminal
Orthology groupMCL14035 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209232-TA
ATGAGTCTTTGGGTTGATAAACACCGCCCTAAAGATTTAATGAAGCTAGATTATCACAAAGATCAAGCTGTAAGATTGAAAAGTCTTGTTCAACAAAGTGATTTTCCTCATCTTCTTGTATACGGTCCATCAGGAGCTGGTAAAAAAACTAGAATTATGTGCCTACTAAGAGAACTATATGGATCCGGATCTGAACGTTTGCGTCAAGAAACAATGCATTTTACAACGCCTTCTAATAAGAAAATAGAAATAATGACTGTCAGTAGCAATTATCACATTGAGGTCAATCCTACTGATGTTGGTATACACGATAGAGTTGTTATTATGGATCTTGTAAAAAACGTGGCACAAACGCATCAGATCGACTCTGCCGGTCAGCGTGAGTTTAAAGTTGTTATATTAAATGAAGTAGATGACTTAACAAAAGATGCACAACATGCTCTTCGTCGTACTATGGAGAAATATGTAAGTACTTGCAGACTAATATTAATTGCCAATTCAATTTCAAGAGTTATTACAGCCATAAGATCGCGTTGTTTAACAATAAGGGTGCCAGCCCCTACCGAAACAGAAATAGCATCGGTTTTGCATGCTGTTTGTAAAAAAGAAGGCCTAAGTTTGCCATCGGAACTAGCTATGCGGATAGCAAAAAGTGCAGATCGTAATCTACGTCGTGCTTTGTTAATGTGTGAGGCATGTAAAGTACAGCATTATCCATTTACCTCTGATCAGAAAGTTCCCGAGCCAGATTGGCAAATATTTATAAGAGATACAGCAGCTATGATTTTATCTGAACAATCACCCAAAAAACTTGCTGAAGTTCGTCAAAAGTTATATGAATTAATAATACACGGAGTACCACCAGATGTAATATTTGCAGGACTTTTAAAGGAATTAGTTTGTAACTGTGATATGTCTATGAAATGTAAGATTGCAAGCTATGCAGCCCAGTATGAACATAGAATGAGACTGGGTAATAAATCCATATTTCACATAGAAGCATTTGTTGCAAAATTTATGGCTATTTATAAAAAGTTTTTGGAAGAAGCACTTGGAGATGTATTTTGA

Protein sequence:

>DPOGS209232-PA
MSLWVDKHRPKDLMKLDYHKDQAVRLKSLVQQSDFPHLLVYGPSGAGKKTRIMCLLRELYGSGSERLRQETMHFTTPSNKKIEIMTVSSNYHIEVNPTDVGIHDRVVIMDLVKNVAQTHQIDSAGQREFKVVILNEVDDLTKDAQHALRRTMEKYVSTCRLILIANSISRVITAIRSRCLTIRVPAPTETEIASVLHAVCKKEGLSLPSELAMRIAKSADRNLRRALLMCEACKVQHYPFTSDQKVPEPDWQIFIRDTAAMILSEQSPKKLAEVRQKLYELIIHGVPPDVIFAGLLKELVCNCDMSMKCKIASYAAQYEHRMRLGNKSIFHIEAFVAKFMAIYKKFLEEALGDVF-