New model in OGS2.0 | DPOGS202177  |
---|---|
Genomic Position | scaffold669:- 28913-35812 |
See gene structure | |
CDS Length | 2970 |
Paired RNAseq reads   | 1249 |
Single RNAseq reads   | 2836 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003318 (0.0) |
Best Drosophila hit   | germ line transcription factor 1, isoform A (0.0) |
Best Human hit | replication factor C subunit 1 (2e-166) |
Best NR hit (blastp)   | replication factor C large subunit, putative [Aedes aegypti] (0.0) |
Best NR hit (blastx)   | replication factor C large subunit [Culex quinquefasciatus] (0.0) |
GeneOntology terms    | GO:0003677 DNA binding GO:0005634 nucleus GO:0005663 DNA replication factor C complex GO:0006260 DNA replication GO:0005524 ATP binding GO:0003689 DNA clamp loader activity GO:0005875 microtubule associated complex |
InterPro families    | IPR008921 DNA polymerase III, clamp loader complex, gamma/delta/delta subunit, C-terminal IPR001357 BRCT IPR003593 ATPase, AAA+ type, core IPR013725 DNA replication factor RFC1, C-terminal IPR003959 ATPase, AAA-type, core IPR012178 DNA replication factor C, large subunit |
Orthology group | MCL13486 |
Nucleotide sequence:
ATGTCTAGGGATATCAGATCATTCTTTACAGTAAAAAAAGAGAAAACGAAGAAAGATGAA
GACAGTGATGTTATACCAGAATCACCGAATGTACAAGTTACAAACAAGAAAAAACAGTCT
CGCAAAAAAAGACAAATTCAAGAGGACTCCGATGAGGAAATATTCTCCGCATCAAATAAA
AAAAAGAATTCTCCTATAAAAATACTAAAAGAAGTTAAAGCAGCTAACTTATTCGGTTCA
GCACCAATCAAAAGAACGGAGCCGATTGTGAAGAGAATAAAAAAAGAAACGGAACTTACC
ATACACTCCGACGAAGAATTCGAACAGAGTCTCATACAATTAGATGAGAAAATTAATCAA
GAGATACAAGCAACTAAGGAAATACCAGATGAAACGTCAATGAAAAAGGAAGATTTAGTC
AAAGATAAGAAAGACGATCGTTCAGAAAAATTGATTGAAGATGTTACAAACAACAAAAAG
AGAAAGTTGAATAAAAGTTTGAATGAAGGACACGGGGATAATAACAAAGCTGAGGTCAAC
AAGAAGATGAAGAAGGATTTTAGTGAGTTCATTGAGAACGGAGAAGATTTAAACAAAAGT
GAGCCGGCACAGGAATCACCAGAGTCCAAACAGAAGAAGCGCAAACTGGACAAGAGTCTC
AATGAATCAGTCCTATCAGATGAGGAGAGGTATGAAAGAAAGAGACAATCAGCTGCCTTA
TATCAGAGGTACCTGAACAGGTCTGGACCAAAACATTTAGGAACTAAGGAAATGCCTGAG
GGTTCACCCGATTGTTTTAAAGATTGTTCGTTTTTATTGACCGGCGTGTTGGATTCCTTT
GAGAGGGATGACGTCATCGCAGCCATTACGAAGTATGGTGGCGTCATCAAGACGGGCATC
AGTAAGAAGGTGACACACGTCATAGCTGGGGACGACGCCGGTCCGGCGAAATTGGCTAAA
GCACGGGAGTTTGGAATAAAAATCATGAATGAAGACGAATTCTTACAGTTCATAAGAGAT
TCGTCTAACAAAAAGACCCCGCCCAAAGATGTGAAAAAGGAAAGTGGGAAGAAAAAAGAT
AAATCAAGCGAAAAGAAAAGAGAAAAGATAAAAAAGTCACCACAAGATAAAGAAATCAAC
GTGAACAAAGCTAAGGTGGAGGAATCTCCGAGAGATATCAAAAAATCTGAAGTAAAAACG
AATAAAGTCGAATCCAAAGAAGAAAGTAAAGAACATCCTGTAAAGCCTGTGGGAAGGGAA
ATCAGTCATGACGGAGAATTGAAGAAATCTTGCAGTACGGAGGTATCCAACTCCCTGATG
TGGGTCGACAAATACAAGCCGAAGAATCTTAAACAGATCATCGGCCAGCACGGAGAAGCC
AGCAATGTCAACAAATTACTGAACTGGTTGAAGAAGTGGTACGCGAACCGTAAGGCCAAG
CTGCCGAAGCCGAGTCCTTGGGCCAAGAACGACGACGGCGGCTACTATAGGGCTGCTCTC
TTATCCGGACCACCTGGCGTGGGTAAAACGACAACGGTGTCGTTAGTGTGCAAGGAGCTC
GGTTTTGACACTGTGGAGTTGAACGCCTCGGACACGCGCAGTAAGACGTTGCTCAAGGAA
CAGCTGGGGGAGTTGCTCTCCACCAACACGCTGCAGGCGTATGCTACAGGCTGTGCGGGC
AAGGGAGCGGTGTCAAAGAAACACGTCCTTGTGATGGATGAAGTGGATGGAATGGCCGGC
AACGAAGACAGAGGAGGTCTACAGGAGCTAATATCACTGATCAAGACGACTTCTGTACCC
GTCATATGTATGTGCAATGATAGGAACAGTGAGAAGATGAGGTCTCTGGTCAACTACTGC
TATGACCTCAAGTTCGCCAGGCCGCGGCTGGAACAGATTAAGTCGGCCATGATGTCAATC
TGCTTCAAAGAGGGCATCAAGATATCTCCTGAAGCACTCTCTCAGCTGATAGTGTCATCT
GGCCAGGATATAAGACAGACAGTTCATTTGCTAAGTGTATGTGCCTCAGGACTTACCAGC
GATGAGGCAAAGGCTGTGAGGAAAGACATCAAAATGGGTCCATGGGAGGCGATCCGCAAA
GTATTCAGTGCCGAGGAACACAAAACAATGTCCATCATTGACAAAAGCGATCTGTTCTTC
TGTGACTACTCCATCATGCCACTATTTGTTCAAGAAAACTTCCTCAATGTGACACCGCAT
TGTCCAAAGAACGAGATTTTAGATCGTTTCAGCAAAGCTGCGGATAGCTTAAGTCTAGGG
GACTTGGTGGAGGCGCGGATAAGAGGGAGTCAGGCGTGGAACCTGTTACCAACACAGGCT
ATGTTCAGCAGTGTGATCCCCGGACATCAATTATCTGGTCATGTGTCAGGGCAGATGCAG
TTTCCTTCGTGGTTGGGTAAAAACTCGCGAGCAAACAAAATGAACCGCCTGTGTCAGGAA
ATACACGCTCACACCAGACTCAGTACATCTGGATCGAAATCTTCAATATTCCTCGACTAC
TCCACTCACTTACGAGATGCTATTACAAATCCACTTATTCAAGACAAAACAGACGGGATT
GAACATTCCCTTAATGTTTTAGAATCGTATAACCTGTTACGAGAAGATTTGGACTCTCTT
GTGGAGTTATCATTGTGGCCGGGCCAAAGAAATCCCACAGTTCTGATTGATTCTAAGGTA
AAAGCTGCGATGACTCGCACATATAATAAGAAAGCTAGTGCGTTGCCTTATGCCGCTGCC
AGTATTAAGAAAGTTAAAGCGACCGAAGATGGAGAGTTGTCACATGAGGAAGATGACACT
AGTGATGTAGAACTTGATGCTATGATAAAGAAAAAGAAAGAACCCACCAAAACCTCTACA
AGTAAGACAAAGGTTAAACAGGAGGAATCGGCAAGCTCGAGTAAAGCGGCTGCAAAAAAG
AAATCAGCGCCAAAGCAAAAGAAGAAATAG
Protein sequence:
MSRDIRSFFTVKKEKTKKDEDSDVIPESPNVQVTNKKKQSRKKRQIQEDSDEEIFSASNK
KKNSPIKILKEVKAANLFGSAPIKRTEPIVKRIKKETELTIHSDEEFEQSLIQLDEKINQ
EIQATKEIPDETSMKKEDLVKDKKDDRSEKLIEDVTNNKKRKLNKSLNEGHGDNNKAEVN
KKMKKDFSEFIENGEDLNKSEPAQESPESKQKKRKLDKSLNESVLSDEERYERKRQSAAL
YQRYLNRSGPKHLGTKEMPEGSPDCFKDCSFLLTGVLDSFERDDVIAAITKYGGVIKTGI
SKKVTHVIAGDDAGPAKLAKAREFGIKIMNEDEFLQFIRDSSNKKTPPKDVKKESGKKKD
KSSEKKREKIKKSPQDKEINVNKAKVEESPRDIKKSEVKTNKVESKEESKEHPVKPVGRE
ISHDGELKKSCSTEVSNSLMWVDKYKPKNLKQIIGQHGEASNVNKLLNWLKKWYANRKAK
LPKPSPWAKNDDGGYYRAALLSGPPGVGKTTTVSLVCKELGFDTVELNASDTRSKTLLKE
QLGELLSTNTLQAYATGCAGKGAVSKKHVLVMDEVDGMAGNEDRGGLQELISLIKTTSVP
VICMCNDRNSEKMRSLVNYCYDLKFARPRLEQIKSAMMSICFKEGIKISPEALSQLIVSS
GQDIRQTVHLLSVCASGLTSDEAKAVRKDIKMGPWEAIRKVFSAEEHKTMSIIDKSDLFF
CDYSIMPLFVQENFLNVTPHCPKNEILDRFSKAADSLSLGDLVEARIRGSQAWNLLPTQA
MFSSVIPGHQLSGHVSGQMQFPSWLGKNSRANKMNRLCQEIHAHTRLSTSGSKSSIFLDY
STHLRDAITNPLIQDKTDGIEHSLNVLESYNLLREDLDSLVELSLWPGQRNPTVLIDSKV
KAAMTRTYNKKASALPYAAASIKKVKATEDGELSHEEDDTSDVELDAMIKKKKEPTKTST
SKTKVKQEESASSSKAAAKKKSAPKQKKK