Monarch geneset OGS2.0

DPOGS213934
Transcript: DPOGS213934-TA, 1053 bp
Protein: DPOGS213934-PA, 350 aa
Genomic position: DPSCF300226 - 185226-189493
RNAseq coverage: 222x (Rank: top 45%)
Annotation
Heliconius      HMEL002844        3e-179  86.57%
Bombyx          BGIBMGA003377-TA  0.0     86.61%
Drosophila      CG8142-PA         9e-118  56.25%
EBI UniRef50    UniRef50_Q9VX15   1e-115  56.25%  CG8142 n=16 Tax=Coelomata RepID=Q9VX15_DROME
NCBI RefSeq     NP_001040483.1    1e-180  86.61%  replication factor C4 [Bombyx mori]
NCBI nr blastp  gi|114052591      3e-179  86.61%  replication factor C4 [Bombyx mori]
NCBI nr blastx  gi|114052591      2e-172  86.61%  replication factor C4 [Bombyx mori]
Group
Gene Ontology:
  GO:0005524  2.6e-15  ATP binding
  GO:0003677  1.9e-14  DNA binding
  GO:0006260  1.9e-14  DNA replication
  GO:0000166  5.1e-12  nucleotide binding
  GO:0017111  5.1e-12  nucleoside-triphosphatase activity
  GO:0005663  1.4e-10  DNA replication factor C complex
  GO:0003689  1.4e-10  DNA clamp loader activity
KEGG pathway:
  tca:663583  1e-122
  K10755 (RFC2_4) maps to: DNA replication; Mismatch repair; Nucleotide excision repair
InterPro domain:
  [66-191]   IPR003959  2.6e-15  ATPase, AAA-type, core
  [255-346]  IPR008921  1.9e-14  DNA polymerase III, clamp loader complex, gamma/delta/delta subunit, C-terminal
  [62-196]   IPR003593  5.1e-12  ATPase, AAA+ type, core
  [261-342]  IPR013748  1.4e-10  Replication factor C
Orthology group: MCL12903 Single-copy universal gene

Nucleotide sequence:

>DPOGS213934-TA
ATGCAGGCATTTTTAAAAACCGGCAAGATATCAAGTACTGATAAACCATCTACTTCGGGAGTTAAATCCACAAAGAAAAAGGCTCCAGCTCCATGGGTTGAAAAATACCGTCCAAAAACTATAGATGATATCGTTGATCAAGGAGAAGTGGTTCAAGTTTTAAGAGAATGTCTGGCTGGAGGTGATTTACCACATCTGTTGTTTTATGGTCCACCAGGAACTGGTAAAACAAGTGCTATCTTGGCTGCTGCTAGACAGCTCTTTGGAGACATTACTAGAGAGCGAGTTCTTGAACTGAATGCTTCAGATGAAAGAGGAATACAAGTCATAAGAGATAAAGTAAAAACTTTTGCCCAGTTAACAGTCAGCAATACAAGACCAGATGGCAGACCGTGCCCGCCATACAAACTGGTTATCTTGGACGAAGCAGATTCAATGACAACGGCAGCGCAGGCAGCCTTACGTCGAACTATGGAGCGAGAGACGAGGACTACACGTTTTTGTCTCATATGTAATTATGTATCAAGAATCATTCCACCAATTACCAGCAGATGTTCGAAGTTTCGATTCAAACCGCTGGCGAGGGAGAATGTTATCAAGAGATTACAAGAAGTATGTAAATCAGAGGCTGTGGAGGTTGGTGATGGTGAAGTACTCCATCAAGCTGTGGACACATGTGGGGGAGATCTTAGGCGAGCACTCACAGCACTGCAGTGCTGTCAGCGCTTACTCGGCAAAATTACAGCTGATGGATTAATTGAGGTGACGGGACTCGTACCTGAAAATCTAGTGGATGAATTTCTAAACGTGAAAAACTACAATGAGTTGGAGAGATTCGTTGAGAATTTTCTCATGGACGCGTATTCAGCATCTCAATTATTGGAACAGCTGTCAGAGAGAGTGGTGAATGCTGGTCATTTGACTAACAAGCAGAAGTGTGTGATTAGTGAGAAGCTGGCTGTGTGTTCTCACCGACTACTAGAGGGTGGAGCTGAGGTGATGCAGCTGACAGACCTCGGCTGTACCGTGATCATGGCTAATAATAACCCGTGA

Protein sequence:

>DPOGS213934-PA
MQAFLKTGKISSTDKPSTSGVKSTKKKAPAPWVEKYRPKTIDDIVDQGEVVQVLRECLAGGDLPHLLFYGPPGTGKTSAILAAARQLFGDITRERVLELNASDERGIQVIRDKVKTFAQLTVSNTRPDGRPCPPYKLVILDEADSMTTAAQAALRRTMERETRTTRFCLICNYVSRIIPPITSRCSKFRFKPLARENVIKRLQEVCKSEAVEVGDGEVLHQAVDTCGGDLRRALTALQCCQRLLGKITADGLIEVTGLVPENLVDEFLNVKNYNELERFVENFLMDAYSASQLLEQLSERVVNAGHLTNKQKCVISEKLAVCSHRLLEGGAEVMQLTDLGCTVIMANNNP-
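
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the standard nuclear genetic code and using "-" for the stop codon as in the protein record. The names `translate` and `lengths_consistent` are hypothetical helpers written for this illustration, not part of any database tool.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering
# (assumption: the coding sequence uses the standard nuclear code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol, matching the trailing "-"
    in the protein record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript encodes protein_aa residues plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First 24 bases and last 18 bases of DPOGS213934-TA, copied from the record.
print(translate("ATGCAGGCATTTTTAAAAACCGGC"))
print(translate("GCTAATAATAACCCGTGA"))
print(lengths_consistent(1053, 350))
```

Passing the full DPOGS213934-TA sequence to `translate` should give a string directly comparable with the DPOGS213934-PA entry, including the final "-".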