DPGLEAN03974 in OGS1.0

New model in OGS2.0  DPOGS206318
Genomic Position  scaffold1708:- 7610-9053
CDS Length  1110
Paired RNAseq reads  769
Single RNAseq reads  1926
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA014134 (6e-132)
Best Drosophila hit  RNA polymerase I subunit (3e-56)
Best Human hit  DNA-directed RNA polymerase I subunit RPA1 (3e-51)
Best NR hit (blastp)  PREDICTED: similar to DNA-directed RNA polymerase I largest subunit [Tribolium castaneum] (2e-94)
Best NR hit (blastx)  PREDICTED: similar to DNA-directed RNA polymerase I largest subunit [Tribolium castaneum] (1e-87)
GeneOntology terms
GO:0006360 transcription from RNA polymerase I promoter
GO:0005736 DNA-directed RNA polymerase I complex
GO:0003899 DNA-directed RNA polymerase activity
GO:0005634 nucleus
GO:0003677 DNA binding
GO:0008270 zinc ion binding
InterPro families
IPR007081 RNA polymerase Rpb1, domain 5
IPR015699 DNA-directed RNA pol I, largest subunit
Orthology group  MCL40945

Nucleotide sequence:

ATGCGTTTCGTTTTTCTACCGTTTTCTCAATACAAAACGCAGTATACCGTGAAGCCACCG
CAAATAATAAAACACATGCAGAATAAATTCTTTAATGAGATGTTTGCGGTTATCCGAAAG
CAGGCGAAGAGTACTTGTGGTGTTTTGTGGGCTGCGGAGAAAGAAAAGAAACGTCGGGTC
GCTGATGATGAGGAGGACGAAGACAATTCCCCTGACCTTGAGGAACGTCAAGGTCAAGAC
GTGGACAGTTCGGATGATGAGGGGCCTCACGACGATGAAGACAATACAGACGTAAAAATA
CGTAAAAAACGTTCCGAAGAACAAGAATACGAAGATCCAGAGTCAGAAGAAGAAGAGAAA
TCTGATGACGATTTGGAAATAGATGAAAATAATACAAAAGAAAAAGATGATGATCTTGAA
ACGGTTGAAGAGGTAAACGCTGAAGATGCTAAAGCCATGGAAAAAGTAGTTGGGAAAATA
ACCAACGCGTCAAACTATACCTTTGACACCAAGAACCATAAGTGGTGTGAATTGACCGTG
TTCTTCCCGATAGCATTCCTGCGGGTAGACCTGTCCCAGGCCTTGCGCGACGCAGCCAAG
AATTCAGTCATATACGAAATCAAGAACATCAAACGGGCGATCACTAACAAGGAAAAGGAC
GTCCTATACCTCAAAACTGAAGGGATCAATATAGTACAAATGTCCAAGTACAGTCACCTC
TTAGATTTGAACAAGTTATACACGAACGACATTCACGCCATCGCAAACACGTACGGCATT
GAAGCGGCAAACAAAGTTATCATAAAAGAAATCCAGAACGTATTCAACGTGTACGGCATC
ACCGTGGACCCGCGCCACTTGACGCTGGTGGCGGATTACATGACGTACAACGGAATATTT
GAGCCCATGAGCAGGAAGGGGATGGAGGCGTCAACTTCACCTTTACAGCAAATGTCCTTC
GAATCGTCTTTGATATTCCTGAAGGAGGCTGTTCTGAACTCCAAGAAAGATTTCATCAGG
TCGGCGTCCAGCTGTCTCATGCTCGGCCAGCCGTGCCGGGCCGGCACGGGTTCCTTCAGT
TTGCAGCATTTCAGTAAAGTTGTCAGCTAA

Protein sequence:

MRFVFLPFSQYKTQYTVKPPQIIKHMQNKFFNEMFAVIRKQAKSTCGVLWAAEKEKKRRV
ADDEEDEDNSPDLEERQGQDVDSSDDEGPHDDEDNTDVKIRKKRSEEQEYEDPESEEEEK
SDDDLEIDENNTKEKDDDLETVEEVNAEDAKAMEKVVGKITNASNYTFDTKNHKWCELTV
FFPIAFLRVDLSQALRDAAKNSVIYEIKNIKRAITNKEKDVLYLKTEGINIVQMSKYSHL
LDLNKLYTNDIHAIANTYGIEAANKVIIKEIQNVFNVYGITVDPRHLTLVADYMTYNGIF
EPMSRKGMEASTSPLQQMSFESSLIFLKEAVLNSKKDFIRSASSCLMLGQPCRAGTGSFS
LQHFSKVVS
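
The nucleotide and protein sequences above can be cross-checked against each other: a CDS of 1110 nt corresponds to 370 codons, which is 369 residues plus a stop codon. The following is a minimal sketch of such a check, assuming the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are illustrative and not part of the database; only the first and last lines of the nucleotide sequence are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in T, C, A, G order so they line up with the amino acid string.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record. Every
# full line holds 60 nt (a multiple of 3), so the last line stays in frame.
first_line = "ATGCGTTTCGTTTTTCTACCGTTTTCTCAATACAAAACGCAGTATACCGTGAAGCCACCG"
last_line = "TTGCAGCATTTCAGTAAAGTTGTCAGCTAA"

print(translate(first_line))  # start of the protein sequence
print(translate(last_line))   # end of the protein sequence, then '*'
```

Passing the full nucleotide sequence to `translate` in the same way should reproduce the complete protein sequence followed by a trailing `*`.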