DPGLEAN10824 in OGS1.0

New model in OGS2.0DPOGS202177 
Genomic Positionscaffold669:- 28913-35812
See gene structure
CDS Length2970
Paired RNAseq reads  1249
Single RNAseq reads  2836
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003318 (0.0)
Best Drosophila hit  germ line transcription factor 1, isoform A (0.0)
Best Human hitreplication factor C subunit 1 (2e-166)
Best NR hit (blastp)  replication factor C large subunit, putative [Aedes aegypti] (0.0)
Best NR hit (blastx)  replication factor C large subunit [Culex quinquefasciatus] (0.0)
GeneOntology terms





  
GO:0003677 DNA binding
GO:0005634 nucleus
GO:0005663 DNA replication factor C complex
GO:0006260 DNA replication
GO:0005524 ATP binding
GO:0003689 DNA clamp loader activity
GO:0005875 microtubule associated complex
InterPro families




  
IPR008921 DNA polymerase III, clamp loader complex, gamma/delta/delta subunit, C-terminal
IPR001357 BRCT
IPR003593 ATPase, AAA+ type, core
IPR013725 DNA replication factor RFC1, C-terminal
IPR003959 ATPase, AAA-type, core
IPR012178 DNA replication factor C, large subunit
Orthology groupMCL13486

Nucleotide sequence:

ATGTCTAGGGATATCAGATCATTCTTTACAGTAAAAAAAGAGAAAACGAAGAAAGATGAA
GACAGTGATGTTATACCAGAATCACCGAATGTACAAGTTACAAACAAGAAAAAACAGTCT
CGCAAAAAAAGACAAATTCAAGAGGACTCCGATGAGGAAATATTCTCCGCATCAAATAAA
AAAAAGAATTCTCCTATAAAAATACTAAAAGAAGTTAAAGCAGCTAACTTATTCGGTTCA
GCACCAATCAAAAGAACGGAGCCGATTGTGAAGAGAATAAAAAAAGAAACGGAACTTACC
ATACACTCCGACGAAGAATTCGAACAGAGTCTCATACAATTAGATGAGAAAATTAATCAA
GAGATACAAGCAACTAAGGAAATACCAGATGAAACGTCAATGAAAAAGGAAGATTTAGTC
AAAGATAAGAAAGACGATCGTTCAGAAAAATTGATTGAAGATGTTACAAACAACAAAAAG
AGAAAGTTGAATAAAAGTTTGAATGAAGGACACGGGGATAATAACAAAGCTGAGGTCAAC
AAGAAGATGAAGAAGGATTTTAGTGAGTTCATTGAGAACGGAGAAGATTTAAACAAAAGT
GAGCCGGCACAGGAATCACCAGAGTCCAAACAGAAGAAGCGCAAACTGGACAAGAGTCTC
AATGAATCAGTCCTATCAGATGAGGAGAGGTATGAAAGAAAGAGACAATCAGCTGCCTTA
TATCAGAGGTACCTGAACAGGTCTGGACCAAAACATTTAGGAACTAAGGAAATGCCTGAG
GGTTCACCCGATTGTTTTAAAGATTGTTCGTTTTTATTGACCGGCGTGTTGGATTCCTTT
GAGAGGGATGACGTCATCGCAGCCATTACGAAGTATGGTGGCGTCATCAAGACGGGCATC
AGTAAGAAGGTGACACACGTCATAGCTGGGGACGACGCCGGTCCGGCGAAATTGGCTAAA
GCACGGGAGTTTGGAATAAAAATCATGAATGAAGACGAATTCTTACAGTTCATAAGAGAT
TCGTCTAACAAAAAGACCCCGCCCAAAGATGTGAAAAAGGAAAGTGGGAAGAAAAAAGAT
AAATCAAGCGAAAAGAAAAGAGAAAAGATAAAAAAGTCACCACAAGATAAAGAAATCAAC
GTGAACAAAGCTAAGGTGGAGGAATCTCCGAGAGATATCAAAAAATCTGAAGTAAAAACG
AATAAAGTCGAATCCAAAGAAGAAAGTAAAGAACATCCTGTAAAGCCTGTGGGAAGGGAA
ATCAGTCATGACGGAGAATTGAAGAAATCTTGCAGTACGGAGGTATCCAACTCCCTGATG
TGGGTCGACAAATACAAGCCGAAGAATCTTAAACAGATCATCGGCCAGCACGGAGAAGCC
AGCAATGTCAACAAATTACTGAACTGGTTGAAGAAGTGGTACGCGAACCGTAAGGCCAAG
CTGCCGAAGCCGAGTCCTTGGGCCAAGAACGACGACGGCGGCTACTATAGGGCTGCTCTC
TTATCCGGACCACCTGGCGTGGGTAAAACGACAACGGTGTCGTTAGTGTGCAAGGAGCTC
GGTTTTGACACTGTGGAGTTGAACGCCTCGGACACGCGCAGTAAGACGTTGCTCAAGGAA
CAGCTGGGGGAGTTGCTCTCCACCAACACGCTGCAGGCGTATGCTACAGGCTGTGCGGGC
AAGGGAGCGGTGTCAAAGAAACACGTCCTTGTGATGGATGAAGTGGATGGAATGGCCGGC
AACGAAGACAGAGGAGGTCTACAGGAGCTAATATCACTGATCAAGACGACTTCTGTACCC
GTCATATGTATGTGCAATGATAGGAACAGTGAGAAGATGAGGTCTCTGGTCAACTACTGC
TATGACCTCAAGTTCGCCAGGCCGCGGCTGGAACAGATTAAGTCGGCCATGATGTCAATC
TGCTTCAAAGAGGGCATCAAGATATCTCCTGAAGCACTCTCTCAGCTGATAGTGTCATCT
GGCCAGGATATAAGACAGACAGTTCATTTGCTAAGTGTATGTGCCTCAGGACTTACCAGC
GATGAGGCAAAGGCTGTGAGGAAAGACATCAAAATGGGTCCATGGGAGGCGATCCGCAAA
GTATTCAGTGCCGAGGAACACAAAACAATGTCCATCATTGACAAAAGCGATCTGTTCTTC
TGTGACTACTCCATCATGCCACTATTTGTTCAAGAAAACTTCCTCAATGTGACACCGCAT
TGTCCAAAGAACGAGATTTTAGATCGTTTCAGCAAAGCTGCGGATAGCTTAAGTCTAGGG
GACTTGGTGGAGGCGCGGATAAGAGGGAGTCAGGCGTGGAACCTGTTACCAACACAGGCT
ATGTTCAGCAGTGTGATCCCCGGACATCAATTATCTGGTCATGTGTCAGGGCAGATGCAG
TTTCCTTCGTGGTTGGGTAAAAACTCGCGAGCAAACAAAATGAACCGCCTGTGTCAGGAA
ATACACGCTCACACCAGACTCAGTACATCTGGATCGAAATCTTCAATATTCCTCGACTAC
TCCACTCACTTACGAGATGCTATTACAAATCCACTTATTCAAGACAAAACAGACGGGATT
GAACATTCCCTTAATGTTTTAGAATCGTATAACCTGTTACGAGAAGATTTGGACTCTCTT
GTGGAGTTATCATTGTGGCCGGGCCAAAGAAATCCCACAGTTCTGATTGATTCTAAGGTA
AAAGCTGCGATGACTCGCACATATAATAAGAAAGCTAGTGCGTTGCCTTATGCCGCTGCC
AGTATTAAGAAAGTTAAAGCGACCGAAGATGGAGAGTTGTCACATGAGGAAGATGACACT
AGTGATGTAGAACTTGATGCTATGATAAAGAAAAAGAAAGAACCCACCAAAACCTCTACA
AGTAAGACAAAGGTTAAACAGGAGGAATCGGCAAGCTCGAGTAAAGCGGCTGCAAAAAAG
AAATCAGCGCCAAAGCAAAAGAAGAAATAG

Protein sequence:

MSRDIRSFFTVKKEKTKKDEDSDVIPESPNVQVTNKKKQSRKKRQIQEDSDEEIFSASNK
KKNSPIKILKEVKAANLFGSAPIKRTEPIVKRIKKETELTIHSDEEFEQSLIQLDEKINQ
EIQATKEIPDETSMKKEDLVKDKKDDRSEKLIEDVTNNKKRKLNKSLNEGHGDNNKAEVN
KKMKKDFSEFIENGEDLNKSEPAQESPESKQKKRKLDKSLNESVLSDEERYERKRQSAAL
YQRYLNRSGPKHLGTKEMPEGSPDCFKDCSFLLTGVLDSFERDDVIAAITKYGGVIKTGI
SKKVTHVIAGDDAGPAKLAKAREFGIKIMNEDEFLQFIRDSSNKKTPPKDVKKESGKKKD
KSSEKKREKIKKSPQDKEINVNKAKVEESPRDIKKSEVKTNKVESKEESKEHPVKPVGRE
ISHDGELKKSCSTEVSNSLMWVDKYKPKNLKQIIGQHGEASNVNKLLNWLKKWYANRKAK
LPKPSPWAKNDDGGYYRAALLSGPPGVGKTTTVSLVCKELGFDTVELNASDTRSKTLLKE
QLGELLSTNTLQAYATGCAGKGAVSKKHVLVMDEVDGMAGNEDRGGLQELISLIKTTSVP
VICMCNDRNSEKMRSLVNYCYDLKFARPRLEQIKSAMMSICFKEGIKISPEALSQLIVSS
GQDIRQTVHLLSVCASGLTSDEAKAVRKDIKMGPWEAIRKVFSAEEHKTMSIIDKSDLFF
CDYSIMPLFVQENFLNVTPHCPKNEILDRFSKAADSLSLGDLVEARIRGSQAWNLLPTQA
MFSSVIPGHQLSGHVSGQMQFPSWLGKNSRANKMNRLCQEIHAHTRLSTSGSKSSIFLDY
STHLRDAITNPLIQDKTDGIEHSLNVLESYNLLREDLDSLVELSLWPGQRNPTVLIDSKV
KAAMTRTYNKKASALPYAAASIKKVKATEDGELSHEEDDTSDVELDAMIKKKKEPTKTST
SKTKVKQEESASSSKAAAKKKSAPKQKKK