DPGLEAN22531 in OGS1.0

New model in OGS2.0DPOGS203050 
Genomic Positionscaffold558:+ 25605-30863
See gene structure
CDS Length3258
Paired RNAseq reads  940
Single RNAseq reads  2426
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA014045 (0.0)
Best Drosophila hit  RNA polymerase III 128kD subunit (0.0)
Best Human hitDNA-directed RNA polymerase III subunit RPC2 isoform 1 (0.0)
Best NR hit (blastp)  GF12287 [Drosophila ananassae] (0.0)
Best NR hit (blastx)  PREDICTED: similar to DNA-directed RNA polymerase III subunit 127.6 kDa polypeptide (RNA polymerase III subunit 2) (RPC2) isoform 1 [Apis mellifera] (0.0)
GeneOntology terms




  
GO:0006383 transcription from RNA polymerase III promoter
GO:0005666 DNA-directed RNA polymerase III complex
GO:0003899 DNA-directed RNA polymerase activity
GO:0005634 nucleus
GO:0003677 DNA binding
GO:0032549 ribonucleoside binding
InterPro families








  
IPR007120 DNA-directed RNA polymerase, subunit 2, domain 6
IPR014724 RNA polymerase Rpb2, OB-fold
IPR015712 DNA-directed RNA polymerase, subunit 2
IPR007644 RNA polymerase, beta subunit, protrusion
IPR007641 RNA polymerase Rpb2, domain 7
IPR007642 RNA polymerase Rpb2, domain 2
IPR007645 RNA polymerase Rpb2, domain 3
IPR007646 RNA polymerase Rpb2, domain 4
IPR007647 RNA polymerase Rpb2, domain 5
IPR007121 RNA polymerase, beta subunit, conserved site
Orthology groupMCL10220

Nucleotide sequence:

ATGGGTGAATTAAATAAAAATCACGGTAAAGAGTGGGATTCAGGCAAAAGCCTCAGAGAA
CCTATAAAAACCTTAGAGAAAAAAAAAATTGTTCAAGCAAATGAAAAAGTATTCTGTGAT
TCCGATCCATTATTCTACATAAAGTATTTAAACGCCTATGTCGGAACACCAGATCTGGAA
GAAGGATTTAATGTGACCAAGCCAACAACACCTCATGAATGTCGACTGAGAGACATGACT
TATTCTGCACCAATCACTGTTGACATAGAATATATAAGAGGCAATCAAAGAGTCATTTTT
AAAAATAAGCAATTAATTGGAAGAATGCCTTTAATGCTGAGATCCTCGAATTGTGTTTTA
ACGAATAAATCAGATTTCGAGTTAGCTCAATTAAATGAATGTCCACATGATCCCGGTGGC
TATTTTATTATAAGAGGACAAGAAAAAGTAATTTTAATTCAAGAACAACTCTCTAGGAAT
AGAATGATTGTTGATGAATTTAAAGGTGCCATCCAGTGTCAAGTAACGAGTTCGACCCAT
GAGAAGAAAACTAGGACTATAGTTATTGTTAAAAATGGGAAATATGTTTTGAGGCACAAT
GCTTTATCTGATGATATACCAATATGTGTAGTTTTTAAAGCCATGGGTATATGTAGTGAC
CAAGAAATAATGCAGTTAGTGGGTACAGATGATGCTATTGCAAAAAAAATGGCACCCTGC
ATCATGGACTGCCATAATTTAAAAGTTTTCACACAGAATCAGGCGCTTGCTTATATTGGT
AGCAAATTAAAAGTTAAGAGGTTCCAATCAGCAACATCAAAATTTAGAACTCCGGTGGAC
GAAGCAAGAGATCTGATAGCAACAACCATTTTAGCACATGTGGTAGTAGAAAATTATAAT
TTCTATGTGAAAGCTATTTACTTGGCGATAATGGTGAAAAGAGTCATAGAGGCAGAGACT
AACAAAGCGGCTATAGATGACCCAGACTATTATGGAAATAAGAGGCTGGAACTGGCAGGG
TCACTACTGGCGCTCATGTTTGAAGATCTGTTCAAAAGATTTAATTGGGAATTGAAATCT
ATAGCCGATAAAATAATACCGAGAGTAAAAGCAGCGCCGTTTGATGGAAATTGGACTATC
AAGAGATTCAAAATGGAACGTCATGGCGTCACCCAAGTATTGAGTCGTCTCAGCTATATT
TCGGCTCTCGGTATGATGACGAGAGTTAATTCACAGTTCGAAAAGACCAGGAAGGTTTCT
GGGCCTAGATCGCTGCAACCGTCACAGTGGGGAATGCTCTGTCCTTCTGACACCCCTGAG
GGAGAAGCTTGTGGCTTAGTGAAAAATCTGGCTTTAATGACGCATATTACAACAGAGTGC
TCAGAAGGTCCGATATCAAGGCTAGCTTGTAACGCCGGGGTTGAGGACGTTAGACTACTA
GGAGGTGAAGAAATAAACCATCCAGCTCTCTATATGGTCTTTTTAAACGGTAACATCTTA
GGCGTCACTAGAGAATATAAGAAACTAATAAAAATCTTCAGGATGTTTAGAAGACGAGGT
CTGATCTCTGCGTTTGTGTCTATATATCCTAATCACAATCAAAGAACGGTGTATATATGC
AGTGACGGTGGTAGACTCTGCCGTCCGTATATAATAGTGGAGAAAGGTTTGCCTTTGGTT
CAACAGCATCATATCAATGAGCTCAACAGGGGTATACGAAAGTTTCAAGATTTTCTCAAC
GATGGTCTCATAGAATATTTGGACGTGAATGAAGAAAATGACAGTCATATAGCGACTGTT
GAGACCGAGATAGATCCGTACGTGACAACGCATCTCGAGATAGAGCCGTTCACTATACTT
GGAGTTTGTGCGGGTCTTGTGCCTTATCCCCACCACAATCAGAGCCCCAGGAACACATAC
CAATGTGCTATGGGCAAACAGGCCATGGGTACAATAGGCTATAATCAGAAGAATAGAATA
GACACGCTTATGTATAATTTAGTGTACCCGCAGTGTCCGATGGTTAAAACGAGGACTATA
GAATTGACAAATTTTGATAAGCTGCCGGCCGGGCAGAACGCTACTGTCGCTGTTATGAGT
TACAGTGGCTATGATATAGAAGATGCTTTAATTTTGAACAGAGCATCGATAGACCGTGGT
TACGGACGTTGTTTGGTATACAAAAGTGCCAAAACAATTATGAAGCGATACAGCAATCAG
ACGTCGGACAGAATCCTAGGACCATCAAGAGACGCCAACACTGGAAAAGTTATTAAGGCT
CACGAAGTTCTGGATTCCGATGGCATAGCAGCTCCGGGTGAAATGGTCGAGAACCGTCAA
GTGTTGATAAACAAACAAATGCCTCCCGCCACGCTCAATCCTATAAATCAAGGTCAACCG
CAACAAATAGATTATAAAGACGTGCCAATAACGTACAAAGGGCCAGTCGAATCTTATATA
GAAAAGGTCATGGTGTCCTCCAACTCTGAAGATGCGTTTCTAATAAAAATATTACTGAGG
CAAACTAGAATACCAGAGATCGGTGATAAATTTAGTTCCAGGCATGGACAGAAAGGAGTG
ACAGGTTTGATAGTTCAACAGGAAGACATGCCGTTCAATGATAGAGGGATATGTCCCGAT
ATGATCATGAATCCACATGGTTTCCCCTCGAGAATGACGGTGGGAAAAACTATAGAGCTA
TTGGCGGGAAAAGCTGGACTGATGGAGGGAAAATTTCATTACGGCACAGCATTTGGTGGG
TCGAAAGTTCGCGATGTATGTCAGGAGTTGGAAAAGCATGGCTATAACTACCACGGAAAG
GATATATTCTATTCTGGTATAACAGGAGAACCTTTGGAAGCGTACATTTATTCCGGGCCA
GTTTACTATCAGAAGTTGAAACATATGGTGCAAGACAAAATGCACGCTCGTGCTAGAGGA
CCGAGAGCTGTACTGACGAGACAACCCACTGAAGGAAGGTCGAGGGATGGAGGACTACGA
CTTGGGGAGATGGAACGAGATTGCCTCATTGGTTATGGTGCAAGTATGCTGTTGATGGAA
CGTCTGATGTTAGCGTCTGACGCGTTCAGTGCGGACATTTGCAGTAGTTGCGGCAGGCTG
GCCTCCCGTGCTTGGTGCCACGCCTGCCGGTCCTCGGCCGTCTCGGCGGTCGATATGCCT
TACGCCTGTAAGCTGTTGTTCCAGGAGCTGGCCTCCATGAATATAGTCCCGAGGCTGAAA
TTGAAGAAGTACTCGTGA

Protein sequence:

MGELNKNHGKEWDSGKSLREPIKTLEKKKIVQANEKVFCDSDPLFYIKYLNAYVGTPDLE
EGFNVTKPTTPHECRLRDMTYSAPITVDIEYIRGNQRVIFKNKQLIGRMPLMLRSSNCVL
TNKSDFELAQLNECPHDPGGYFIIRGQEKVILIQEQLSRNRMIVDEFKGAIQCQVTSSTH
EKKTRTIVIVKNGKYVLRHNALSDDIPICVVFKAMGICSDQEIMQLVGTDDAIAKKMAPC
IMDCHNLKVFTQNQALAYIGSKLKVKRFQSATSKFRTPVDEARDLIATTILAHVVVENYN
FYVKAIYLAIMVKRVIEAETNKAAIDDPDYYGNKRLELAGSLLALMFEDLFKRFNWELKS
IADKIIPRVKAAPFDGNWTIKRFKMERHGVTQVLSRLSYISALGMMTRVNSQFEKTRKVS
GPRSLQPSQWGMLCPSDTPEGEACGLVKNLALMTHITTECSEGPISRLACNAGVEDVRLL
GGEEINHPALYMVFLNGNILGVTREYKKLIKIFRMFRRRGLISAFVSIYPNHNQRTVYIC
SDGGRLCRPYIIVEKGLPLVQQHHINELNRGIRKFQDFLNDGLIEYLDVNEENDSHIATV
ETEIDPYVTTHLEIEPFTILGVCAGLVPYPHHNQSPRNTYQCAMGKQAMGTIGYNQKNRI
DTLMYNLVYPQCPMVKTRTIELTNFDKLPAGQNATVAVMSYSGYDIEDALILNRASIDRG
YGRCLVYKSAKTIMKRYSNQTSDRILGPSRDANTGKVIKAHEVLDSDGIAAPGEMVENRQ
VLINKQMPPATLNPINQGQPQQIDYKDVPITYKGPVESYIEKVMVSSNSEDAFLIKILLR
QTRIPEIGDKFSSRHGQKGVTGLIVQQEDMPFNDRGICPDMIMNPHGFPSRMTVGKTIEL
LAGKAGLMEGKFHYGTAFGGSKVRDVCQELEKHGYNYHGKDIFYSGITGEPLEAYIYSGP
VYYQKLKHMVQDKMHARARGPRAVLTRQPTEGRSRDGGLRLGEMERDCLIGYGASMLLME
RLMLASDAFSADICSSCGRLASRAWCHACRSSAVSAVDMPYACKLLFQELASMNIVPRLK
LKKYS