Monarch geneset OGS2.0

DPOGS202177
Transcript        DPOGS202177-TA  3009 bp
Protein           DPOGS202177-PA  1002 aa
Genomic position  DPSCF300162 + 267516-274415
RNAseq coverage   271x (Rank: top 40%)
Annotation
Heliconius        HMEL010889        0.0  61.86%
Bombyx            BGIBMGA003318-TA  0.0  58.05%
Drosophila        Gnf1-PB           0.0  48.74%
EBI UniRef50      UniRef50_Q295Z3   0.0  47.25%  GA10826 n=3 Tax=pseudoobscura subgroup RepID=Q295Z3_DROPS
NCBI RefSeq       XP_001653061.1    0.0  51.81%  replication factor C large subunit, putative [Aedes aegypti]
NCBI nr blastp    gi|157117839      0.0  51.81%  replication factor C large subunit, putative [Aedes aegypti]
NCBI nr blastx    gi|157117839      0.0  47.50%  replication factor C large subunit, putative [Aedes aegypti]
Group
Gene Ontology     GO:0005663  3.6e-263  DNA replication factor C complex
                  GO:0003677  3.6e-263  DNA binding
                  GO:0005524  3.6e-263  ATP binding
                  GO:0003689  3.6e-263  DNA clamp loader activity
                  GO:0006260  3.6e-263  DNA replication
                  GO:0005622  3.2e-19   intracellular
KEGG pathway      aag:AaeL_AAEL001324  0.0
                  K10754 (RFC1) maps-> DNA replication
                                       Mismatch repair
                                       Nucleotide excision repair
InterPro domain   [3-991]    IPR012178  3.6e-263  DNA replication factor C, large subunit
                  [770-923]  IPR013725  3.8e-53   DNA replication factor RFC1, C-terminal
                  [704-841]  IPR008921  1.6e-50   DNA polymerase III, clamp loader complex, gamma/delta/delta subunit, C-terminal
                  [259-339]  IPR001357  3.2e-19   BRCT
                  [521-622]  IPR003959  2.6e-08   ATPase, AAA-type, core
Orthology group   MCL13236  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202177-TA
ATGTCTAGGGATATCAGATCATTCTTTACAGTAAAAAAAGAGAAAACGAAGAAAGATGAAGACAGTGATGTTATACCAGAATCACCGAATGTACAAGTTACAAACAAGAAAAAACAGTCTCGCAAAAAAAGACAAATTCAAGAGGACTCCGATGAGGAAATATTCTCCGCATCAAATAAAAAAAAGAATTCTCCTATAAAAATACTAAAAGAAGTTAAAGCAGCTAACTTATTCGGTTCAGCACCAATCAAAAGAACGGAGCCGATTGTGAAGAGAATAAAAAAAGAAACGGAACTTACCATACACTCCGACGAAGAATTCGAACAGAGTCTCATACAATTAGATGAGAAAATTAATCAAGAGATACAAGCAACTAAGGAAATACCAGATGAAACGTCAATGAAAAAGGAAGATTTAGTCAAAGATAAGAAAGACGATCGTTCAGAAAAATTGATTGAAGATGTTACAAACAACAAAAAGAGAAAGTTGAATAAAAGTTTGAATGAAGGACACGGGGATAATAACAAAGCTGAGGTCAACAAGAAGATGAAGAAGGATTTTAGTGAGTTCATTGAGAACGGAGAAGATTTAAACAAAAGTGAGCCGGCACAGGAATCACCAGAGTCCAAACAGAAGAAGCGCAAACTGGACAAGAGTCTCAATGAATCAGTCCTATCAGATGAGGAGAGGTATGAAAGAAAGAGACAATCAGCTGCCTTATATCAGAGGTACCTGAACAGGTCTGGACCAAAACATTTAGGAACTAAGGAAATGCCTGAGGGTTCACCCGATTGTTTTAAAGATTGTTCGTTTTTATTGACCGGCGTGTTGGATTCCTTTGAGAGGGATGACGTCATCGCAGCCATTACGAAGTATGGTGGCGTCATCAAGACGGGCATCAGTAAGAAGGTGACACACGTCATAGCTGGGGACGACGCCGGTCCGGCGAAATTGGCTAAAGCACGGGAGTTTGGAATAAAAATCATGAATGAAGACGAATTCTTACAGTTCATAAGAGATTCGTCTAACAAAAAGACCCCGCCCAAAGATGTGAAAAAGGAAAGTGGGAAGAAAAAAGATAAATCAAGCGAAAAGAAAAGAGAAAAGATAAAAAAGTCACCACAAGATAAAGAAATCAACGTGAACAAAGCTAAGGTGGAGGAATCTCCGAGAGATATCAAAAAATCTGAAGTAAAAACGAATAAAGTCGAATCCAAAGAAGAAAGTAAAGAACATCCTGTAAAGCCTGTGGGAAGGGAAATCAGTCATGACGGAGAATTGAAGAAATCTTGCAGTACGGAGGTATCCAACTCCCTGATGTGGGTCGACAAATACAAGCCGAAGAATCTTAAACAGATCATCGGCCAGCACGGAGAAGCCAGCAATGTCAACAAATTACTGAACTGGTTGAAGAAGTGGTACGCGAACCGTAAGGCCAAGCTGCCGAAGCCGAGTCCTTGGGCCAAGAACGACGACGGCGGCTACTATAGGGCTGCTCTCTTATCCGGACCACCTGGCGTGGTTTGCTCGTGTGGTGACGATAAACACTTCTCATTAGGCGGTAAAACGACAACGGTGTCGTTAGTGTGCAAGGAGCTCGGTTTTGACACTGTGGAGTTGAACGCCTCGGACACGCGCAGTAAGACGTTGCTCAAGGAACAGCTGGGGGAGTTGCTCTCCACCAACACGCTGCAGGCGTATGCTACAGGCTGTGCGGGCAAGGGAGCGGTGTCAAAGAAACACGTCCTTGTGATGGATGAAGTGGATGGAATGGCCGGCAACGAAGACAGAGGAGGTCTACAGGAGCTAATATCACTGATCAAGACGACTTCTGTACCCGTCATATGTATGTGCAATGATAGGAACAGTGAGAAGATGAGGTCTCTGGTCAACTACTGCTATGACCTCAAGTTCGCCAGGCCGCGGCTGGAACAGATTAAGTCGGCCATGATGTCAATCTGCTTCAAAGAGGGCATCAAGATATCTCCTGAAGCACTCTC
TCAGCTGATAGTGTCATCTGGCCAGGATATAAGACAGACAGTTCATTTGCTAAGTGTATGTGCCTCAGGACTTACCAGCGATGAGGCAAAGGCTGTGAGGAAAGACATCAAAATGGGTCCATGGGAGGCGATCCGCAAAGTATTCAGTGCCGAGGAACACAAAACAATGTCCATCATTGACAAAAGCGATCTGTTCTTCTGTGACTACTCCATCATGCCACTATTTGTTCAAGAAAACTTCCTCAATGTGACACCGCATTGTCCAAAGAACGAGATTTTAGATCGTTTCAGCAAAGCTGCGGATAGCTTAAGTCTAGGGGACTTGGTGGAGGCGCGGATAAGAGGGAGTCAGGCGTGGAACCTGTTACCAACACAGGCTATGTTCAGCAGTGTGATCCCCGGACATCAATTATCTGGTCATGTGTCAGGGCAGATGCAGTTTCCTTCGTGGTTGGGTAAAAACTCGCGAGCAAACAAAATGAACCGCCTGTGTCAGGAAATACACGCTCACACCAGACTCAGTACATCTGGATCGAAATCTTCAATATTCCTCGACTACTCCACTCACTTACGAGATGCTATTACAAATCCACTTATTCAAGACAAAACAGACGGGATTGAACATTCCCTTAATGTTTTAGAATCGTATAACCTGTTACGAGAAGATTTGGACTCTCTTGTGGAGTTATCATTGTGGCCGGGCCAAAGAAATCCCACAGTTCTGATTGATTCTAAGGTAAAAGCTGCGATGACTCGCACATATAATAAGAAAGCTAGTGCGTTGCCTTATGCCGCTGCCAGTATTAAGAAAGTTAAAGCGACCGAAGATGGAGAGTTGTCACATGAGGAAGATGACACTAGTGATGTAGAACTTGATGCTATGATAAAGAAAAAGAAAGAACCCACCAAAACCTCTACAAGTAAGACAAAGGTTAAACAGGAGGAATCGGCAAGCTCGAGTAAAGCGGCTGCAAAAAAGAAATCAGCGCCAAAGCAAAAGAAGAAATAG

Protein sequence:

>DPOGS202177-PA
MSRDIRSFFTVKKEKTKKDEDSDVIPESPNVQVTNKKKQSRKKRQIQEDSDEEIFSASNKKKNSPIKILKEVKAANLFGSAPIKRTEPIVKRIKKETELTIHSDEEFEQSLIQLDEKINQEIQATKEIPDETSMKKEDLVKDKKDDRSEKLIEDVTNNKKRKLNKSLNEGHGDNNKAEVNKKMKKDFSEFIENGEDLNKSEPAQESPESKQKKRKLDKSLNESVLSDEERYERKRQSAALYQRYLNRSGPKHLGTKEMPEGSPDCFKDCSFLLTGVLDSFERDDVIAAITKYGGVIKTGISKKVTHVIAGDDAGPAKLAKAREFGIKIMNEDEFLQFIRDSSNKKTPPKDVKKESGKKKDKSSEKKREKIKKSPQDKEINVNKAKVEESPRDIKKSEVKTNKVESKEESKEHPVKPVGREISHDGELKKSCSTEVSNSLMWVDKYKPKNLKQIIGQHGEASNVNKLLNWLKKWYANRKAKLPKPSPWAKNDDGGYYRAALLSGPPGVVCSCGDDKHFSLGGKTTTVSLVCKELGFDTVELNASDTRSKTLLKEQLGELLSTNTLQAYATGCAGKGAVSKKHVLVMDEVDGMAGNEDRGGLQELISLIKTTSVPVICMCNDRNSEKMRSLVNYCYDLKFARPRLEQIKSAMMSICFKEGIKISPEALSQLIVSSGQDIRQTVHLLSVCASGLTSDEAKAVRKDIKMGPWEAIRKVFSAEEHKTMSIIDKSDLFFCDYSIMPLFVQENFLNVTPHCPKNEILDRFSKAADSLSLGDLVEARIRGSQAWNLLPTQAMFSSVIPGHQLSGHVSGQMQFPSWLGKNSRANKMNRLCQEIHAHTRLSTSGSKSSIFLDYSTHLRDAITNPLIQDKTDGIEHSLNVLESYNLLREDLDSLVELSLWPGQRNPTVLIDSKVKAAMTRTYNKKASALPYAAASIKKVKATEDGELSHEEDDTSDVELDAMIKKKKEPTKTSTSKTKVKQEESASSSKAAAKKKSAPKQKKK-
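
Consistency check (illustrative sketch, not part of the original record):

The transcript length (3009 bp) and protein length (1002 aa) listed above agree if the coding sequence ends in a single stop codon, since 3 x (1002 + 1) = 3009. The Python sketch below shows one way such a check could be done, assuming the standard genetic code and assuming the trailing "-" in the protein sequence marks the stop codon. The function names `translate` and `expected_cds_length` are hypothetical helpers introduced here for illustration only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`; "-" is used by default
    on the assumption that this matches the record's convention.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_cds_length(protein_length, has_stop=True):
    """Nucleotides needed to encode a protein, plus one stop codon if present."""
    return 3 * (protein_length + (1 if has_stop else 0))


# First six codons and last three codons of DPOGS202177-TA, copied from above.
print(translate("ATGTCTAGGGATATCAGA"))   # MSRDIR
print(translate("AAGAAATAG"))            # KK-
print(expected_cds_length(1002))         # 3009
```

The two translated fragments match the start (MSRDIR...) and end (...KK-) of DPOGS202177-PA. Translating the full transcript requires joining the nucleotide sequence lines into one string first.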