Monarch geneset OGS2.0

DPOGS204627
TranscriptDPOGS204627-TA1050 bp
ProteinDPOGS204627-PA349 aa
Genomic positionDPSCF300277 - 211961-213766
RNAseq coverage247x (Rank: top 42%)
Annotation
HeliconiusHMEL0126604e-10389.90% 
BombyxBGIBMGA009495-TA4e-17182.99% 
DrosophilaRfC4-PA1e-13872.12% 
EBI UniRef50UniRef50_D7ELV84e-13665.23%Replication factor C 40kD subunit n=7 Tax=Eumetazoa RepID=D7ELV8_TRICA
NCBI RefSeqNP_001036917.11e-16683.78%replication factor C subunit 2 [Bombyx mori]
NCBI nr blastpgi|1129828533e-16583.78%replication factor C subunit 2 [Bombyx mori]
NCBI nr blastxgi|1129828535e-15684.19%replication factor C subunit 2 [Bombyx mori]
Group
Gene OntologyGO:00056631.1e-23DNA replication factor C complex
GO:00055241.1e-23ATP binding
GO:00036891.1e-23DNA clamp loader activity
GO:00062601.1e-23DNA replication
GO:00036776.3e-22DNA binding
GO:00001661.2e-13nucleotide binding
GO:00171111.2e-13nucleoside-triphosphatase activity
KEGG pathwaynvi:1001239621e-142 
 K10755 (RFC2_4)maps-> DNA replication
    Mismatch repair
    Nucleotide excision repair
InterPro domain[252-340] IPR0137481.1e-23Replication factor C
[253-345] IPR0089216.3e-22DNA polymerase III, clamp loader complex, gamma/delta/delta subunit, C-terminal
[64-200] IPR0035931.2e-13ATPase, AAA+ type, core
[68-186] IPR0039592.5e-12ATPase, AAA-type, core
Orthology groupMCL13885 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204627-TA
ATGAGTGACGACGACGTTACTATAATGGAAGTTGACGAGGTAGATCCTAAGCCCAGCAAGTCGGTAACAGTTCAAAGTAAAGTTACTTCACATTTACCCTGGATAGAAAAATACAGACCGCAGACATTTACTGATATAGTGGGCAACGAGGACACGGTGTCACGTCTGGCGGTGTTTGCTCGTACTGGCAACGCACCTAATATAATAATAGCGGGACCGCCCGGTGTAGGTAAAACCACAACAATCCTGTGTTTAGCTCGCGTACTTCTCGGTCCCGCTTTCAAAGATGCTGTTCTTGAATTAAACGCATCAAATGATCGAGGTATCGATGTTGTCCGTAATAAAATCAAAATGTTTGCTCAACAGAAGGTGACATTACCTCCTGGAAGACATAAAATAGTTATTTTGGATGAGGCTGACAGCATGACGGATGGCGCCCAGCAGGCTCTGAGGCGTACCATGGAACTTTATTCCAGCACCACGCGCTTTGCTCTAGCAGCTAACAACAGTGAGAAGATTATTGAACCAATCCAGTCTCGTTGTGCTGTTTTACGCTATACCAAACTGAGCGATGCCCAGATACTAGCTAAGGTTATTGAAGTCTGTGAAAAAGAAGACTTATCCTACACAGAGGAAGGAGTTAATGCTGTTGTTTTCACCGCCCAGGGAGACTTGAGAGCCGCACTAAATAACCTGCAGTCCACAGCACAGGGCTTTGCCCATGTCAGTCCTGATAATGTGTTCAAGGTGTGCGATGAACCGCACCCTATGCTCGTAAAGACGATGTTAGAGGCCTGCACAAGAAAGGATATTTATGAAGCTTATAAGGTTATAGCAAAGCTTTGTCGTCTCGGTTATGCTACAGAGGATATATTGAGTAACATGTTCCGTGTGAGCAAGACTCTGTCTGTGTCGGAAGATGTACGTCTAGGGCTGATCAAACAGATCGGACTCACACAGATGAGGGCGGCAGACGGGCTCACTTCACAGCTACAACTGGCAGCCCTACTGGCAAGGATGTGTGGAGGGAAGGGGGTGGATGAAGAGTAA

Protein sequence:

>DPOGS204627-PA
MSDDDVTIMEVDEVDPKPSKSVTVQSKVTSHLPWIEKYRPQTFTDIVGNEDTVSRLAVFARTGNAPNIIIAGPPGVGKTTTILCLARVLLGPAFKDAVLELNASNDRGIDVVRNKIKMFAQQKVTLPPGRHKIVILDEADSMTDGAQQALRRTMELYSSTTRFALAANNSEKIIEPIQSRCAVLRYTKLSDAQILAKVIEVCEKEDLSYTEEGVNAVVFTAQGDLRAALNNLQSTAQGFAHVSPDNVFKVCDEPHPMLVKTMLEACTRKDIYEAYKVIAKLCRLGYATEDILSNMFRVSKTLSVSEDVRLGLIKQIGLTQMRAADGLTSQLQLAALLARMCGGKGVDEE-