DPGLEAN04632 in OGS1.0

New model in OGS2.0DPOGS203690 
Genomic Positionscaffold120:- 90191-98196
See gene structure
CDS Length5688
Paired RNAseq reads  7547
Single RNAseq reads  18139
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003484 (5e-06)
Best Drosophila hit  RNA polymerase II 215kD subunit (0.0)
Best Human hitDNA-directed RNA polymerase II subunit RPB1 (0.0)
Best NR hit (blastp)  largest subunit of the RNA polymerase II complex [Drosophila guanche] (0.0)
Best NR hit (blastx)  RNA polymerase II 215kD subunit [Drosophila erecta] (0.0)
GeneOntology terms



  
GO:0005703 polytene chromosome puff
GO:0006366 transcription from RNA polymerase II promoter
GO:0003899 DNA-directed RNA polymerase activity
GO:0005665 DNA-directed RNA polymerase II, core complex
GO:0003677 DNA binding
InterPro families







  
IPR000684 RNA polymerase II, heptapeptide repeat, eukaryotic
IPR007080 RNA polymerase Rpb1, domain 1
IPR007081 RNA polymerase Rpb1, domain 5
IPR000722 RNA polymerase, alpha subunit
IPR007075 RNA polymerase Rpb1, domain 6
IPR007073 RNA polymerase Rpb1, domain 7
IPR007066 RNA polymerase Rpb1, domain 3
IPR007083 RNA polymerase Rpb1, domain 4
IPR006592 RNA polymerase, N-terminal
Orthology groupMCL13877

Nucleotide sequence:

ATGGCGACCACGAATGATTCGAAGGCGCCTTTGCGCCAAGTTAAAAGAGTACAATTTGGC
ATTTTATCTCCAGATGAAATCCGTCGCATGTCAGTCACAGAAGGGGGAATTCGTTTCCCA
GAAACAATGGAAGGGGGAAGGCCCAAACTTGGTGGGCTTATGGATCCTCGACAAGGGGTG
ATAGACAGAAGTTCTCGATGCCAAACCTGCGCTGGAAATATGACAGAATGCCCTGGACAC
TTTGGCCACATTGATTTAGCCAAACCAGTATTTCATATTGGTTTTATTACCAAAACAATT
AAAGTATTAAGATGTGTTTGTTTTTATTGCTCAAAATTACTTGTCAGTCCTACAAATCCA
AAAATCAAAGAAGTGGTAATGAAATCTAAAGGTCAACCACGTAAAAGGTTGACTTATGTA
TATGACCTTTGCAAGGGTAAAAATATTTGTGAGGGTGGAGAAGATATGGATATTGGAAAA
GAAGGGGAAGAAGGCAAAAGGGGCACAGGACATGGAGGTTGTGGTCATTACCAACCTTCT
ATCAGGCGGCAAGGATTAGATCTGACAGCAGAATGGAAACATGCTAATGAAGACACACAA
GATAAAAAGATAATAATAACCGCAGAACGGGTTTATGAAATATTAAAACACATAACAGAT
GAAGATTCCTTTATTTTGGGTATGGACCCCAAATTTGCCAGACCCGATTGGATGATTGTC
ACAGTCCTTCCTGTACCACCTCTTGCAGTCAGACCCGCTGTAGTTATGTTTGGATCTGCT
AAAAACCAGGATGATTTGACCCATAAGCTTGCTGATATTATAAAAGCTAATAATGAGTTG
ATGAGAAATGAACAATCAGGAGCTGCGGCTCATGTTCTAACTGACAATATCAGAATGTTA
CAGTTCCATGTTGCGACATTTGTTGATAATGACATGCCAGGAATGCCTAAGGCTATGCAA
AAATCTGGTAAACCCTTGAAAGCCATAAAAGCAAGACTAAAAGGCAAAGAAGGTAGAATT
CGTGGAAATCTTATGGGAAAACGTGTTGATTTCTCAGCTAGAACAGTAATTACACCTGAT
CCTAATTTGCGCATTGACCAAGTAGGCGTCCCAAGATCTATTGCACAAAATTTGACATTC
CCCGAGCTTGTAACGCCCTTCAACATTGATCGGATGCAAGAACTCGTGCGAAGAGGAAAT
GCACAGTACCCAGGTGCAAAATACATTGTTCGGGATAATGGTGAAAGAATAGATTTAAGA
TTCCACCCCAAACCATCAGATTTGCATCTACAATATGGCTACAAAGTTGAGCGTCACTTG
AGAGATGATGATTTGGTTATCTTCAACCGACAACCAACACTACATAAGATGAGTATGATG
GGTCATAGGGTCAAAGTATTGCCATGGTCAACATTTCGTATGAACTTGAGTTGTACTTCG
CCGTACAATGCTGATTTCGACGGCGATGAAATGAATTTACATGTACCCCAGTCTATGGAA
ACACGAGCGGAAGTAGAAAACATACACATAACGCCTCGTCAAATTATAACTCCACAAGCT
AATAAACCAGTCATGGGTATTGTGCAAGATACACTGACTGCTGTCAGAAAAATGACAAAA
CGAGACGTATTTTTAACGAAAGAGCAAGTAATGAACTTGCTAATGTTTTTACCAACATGG
GATGGAAAAATTCCACAACCTTGCATCCTGAAGCCACAACCGCTTTGGACAGGAAAACAA
ATATTTACTCTGATCATTCCTGGAAATGTCAATATGGTGCGTACTCATTCCACACATCCT
GATGATGAGGACGATGGTGTTAATAGATGGATATCACCTGGAGACACTAAAGTAATTGTG
GAACACGGGGAACTTCTTATGGGTATTCTGTGTAAGAAATCTCTTGGTGCATCTGCTGGT
TCTTTACTGCATATATGTATGTTGGAGTTAGGACATGAAATAGCTGGTCGTTTTTACGGT
AACATTCAAACTGTCATCAATAATTGGCTACTATTGGAAGGTCACTCCATTGGTATTGGG
GATACAATTGCTGATCCTCAAACATATCAAGAAATCCAAAGGGCTATTGTGAAGGCTAAA
GATGATGTCATAGAAGTTATACAGAAAGCTCACAATATGGAGCTTGAGCCAACTCCTGGT
AATACTCTGAGGCAAACTTTCGAAAATCAGGTCAATCGTATTCTTAACGACGCTCGTGAC
AAAACTGGTGGTTCAGCCAAAAAGTCTCTAACAGAGTACAATAACCTTAAAGCTATGGTA
GTCGCTGGTTCCAAAGGATCAAACATCAATATTTCACAAGTCATTGCTTGCGTGGGTCAG
CAAAACGTCGAAGGAAAGCGTATTCCGTTTGGCTTCCGTAAGAGAACATTGCCGCATTTT
ATCAAAGACGATTATGGTCCGGAATCAAGAGGTTTCGTAGAGAACTCTTACCTGGCCGGT
TTAACACCATCTGAGTTTTATTTCCACGCTATGGGAGGTCGTGAAGGTCTTATCGATACA
GCTGTCAAAACTGCCGAGACTGGGTATATTCAGCGGCGTTTGATAAAGGCTATGGAATCT
GTTATGGTGCATTATGATGGCACAGTCCGAAATTCGGTTGGACAACTGATTCAACTAAGA
TATGGTGAGGATGGTTTAGCTGGAGAAACAGTAGAGTTCCAAAACATGCCCACTGTAAAG
TTATCCAACAAGGCATTTGAAAAGAAATTTAAGTTCGACCCAACCAATGAAAGGTATTTG
AAGAGAATTTTCCATGAAGATATTATAAAAGAACTAACGGAGTCGGGTTACGTGATTGCC
GACTTGGAAAGCGAATGGGAACAGCTTTGCAAAGATCGTGAAATATTGCGACAAATTTTC
CCTAGCGGTGAATCTAAAGTTGTATTGCCGTGCAACTTCAGAAGAATGATTTGGAATGTT
CAAAAGATTTTCCACATCAATAAGAGAATGTCAACAGATTTAAGTCCGATAAAAGTGATA
CAAGGCGTGAAAGATCTTTTGAAGAAATGTGTCATTGTCGCTGGTGAAGATCGTTTGTCT
AAACAAGCGAATGAAAATGCAACCTTACTCTTCCAATGTTTAGTAAGATCTACTTTTTGC
ACGAAGTATGTATCTGAAGATTACAGACTATCAAGTGAAGCTTTCGAATGGTTGATTGGA
GAAATTGAAACAAGATTCCAACAAGCACAAGTCAACCCAGGTGAAATGGTTGGGGCCCTG
GCAGCCCAGTCTCTTGGAGAGCCGGCTACTCAGATGACATTGAATACCTTCCACTTTGCC
GGTGTGTCATCTAAAAACGTAACTCTTGGTGTACCGCGTCTAAAAGAAATCATTAACATA
TCAAAGAAACCAAAAGCACCATCTCTAACAGTATTCCTTACTGGAGGCGCGGCCAGAGAT
GCAGAGAAAGCTAAGAATGTTCTGTGTCGATTGGAACACACGACATTGCGTAAAGTCACA
GCCAATACCGCTATCTACTACGATCCAGACCCTCAGAACACAGTTATTGCTGAAGATCAA
GAGTTTGTTAATGTTTATTATGAAATGCCTGATTTTGATCCAACAAAGATTTCACCTTGG
CTATTGCGTATTGAACTGGACCGCAAGAGAATGACAGACAAGAAGCTGACGATGGAACAG
ATCGCTGAGAAGATTAACGCTGGGTTCGGGGATGATCTCAATTGTATTTTCAATGACGAT
AATGCTGAAAAATTGGTTTTGCGAATAAGAATTATGAACAACGAGGAGAGCAAATTCCAA
GACAACGACGAAGAAACGGTCGATAAAATGGAAGACGATATGTTCCTTAGATGTATTGAA
GCGAACATGTTATCGGACATGACTTTACAGGGTATTGAGGCTATAGCAAAAGTGTACATG
CACTTGCCGCAGACTGAAGCGAAGAAACGCATTATTATAACAGATCAAGGCGAATTTAAA
GCGATCGCAGAGTGGCTTTTGGAAACAGATGGTACTTCACTTATGAAAGTACTGTCAGAA
CGAGACGTAGATCCAGTGCGGACATTCAGTAACGATATTTGTGAGATATTCCAAGTGCTA
GGTATAGAGGCTGTGCGGAAGTCAGTCGAGAAGGAAATGAATGCTGTGTTGCAATTCTAT
GGTCTTTATGTAAACTACAGACATCTCGCTTTGCTTTGTGACGTGATGACTGCCAAAGGT
CATCTTATGGCTATAACACGTCACGGTATTAACAGACAAGATACCGGAGCACTCATGAGG
TGTTCTTTTGAAGAGACTGTAGATGTTTTACTTGATGCGGCTAGTCACGCTGAAGTTGAT
CCTATGAGAGGCGTTTCTGAAAATATTATCATGGGGCAATTGCCGCGAATGGGAACCGGT
TGTTTCGATTTATTACTGGATGCTGAGAAATGTAAACATGGAATGGAAATGGGCGGTCTA
GGTGTCGGAATGGGAGTCGCAGGTGGGATGTATTTCGGTGTCGGCACACCTTCCATGACA
CCACTGATGACGCCCTGGTCAAACCAAAACACTCCCGGATATGGCAGCAGTGTTTGGTCG
CCTGGTCAAGTTGGAAGCAGCATGACACCAGGAGGGCCATCATTCTCGCCGTCGGGAGCA
TCAGACGCGTCAGGGTTGTCACCTGCTTATAGCAGTTCATGGTCTCCACAACCAGGGTCA
CCAGGTTCCCCGGGCGCTCCGTTATCACCTTACGCCTCGCCAGCGGGAGCGTCTCCCTCG
TATTCTCCCACCAGTCCAGTATATGCTGCTCCCTCACCCAGTGTCACGCCGTCCTCGCCA
GTCTATTCCCCCACCGGACCTTCTTACTCTCTAACATCACACAATTACTCACCAACGTCA
CCGATCTATTCGCCGAACTCGCCGAGATACTCACCAAGGTATTCGCCAACGTCCCCCGGC
TATTCCCCCCCGTCACCAAGATACTCCCCAACATCTCCAAGCTATTCGCCAACCAGCCCG
GCGTATTCACCAAACTCTCCGAGCTATTCGGTGAAATCACAAGACTATTCACCGACTAGT
CCCAACTATTCTCCCGCAAGTACATCTTATTCACCGAGCGGACCCGTGTATTCTGTCAAC
TCGCAAGGATATTCGCCATCGTCTCTAAATTACTCCCCCAGTAATCTTGTTTATAGTCCG
ACCTCACAAAACTACTCTCCCTCATCACCGAATTACACCCCTCCGGCACCACCGTACTCG
CCGACGAGCCATTCATATTCATTGCCCTATTCACCGGCATCTCCAAGTATTTCTCCGTCA
TCGCCGAAATATTCACCATCACCGAATTATTCGCCGACATCACCATCGGTTTCCGGAGGA
AGTCCCACGTATTCGCCGACAAGCCATCAATACGGGCCGAAGAGTCCTCAATATTCCCCG
TCGAGCACAGCGTACTCGCCTTCGTCACCTCAACATCCGGGCAGTGCGAGATATTCACCA
TCATCGCGGAACTACTTGTCATCTTCACCACAATATTCCCCATCATCTCCCAGATATTCA
CCGTCTTCTAATAAATATTCTCCGACTAATATAACCTACACTCCGACATCTTCTAATTAT
TCACCATCTTCACCGGCGTATTCATCTTCGTCCGGTCCATCCAAATATTCGCCGACATCA
CCAAATTATTCGCCGACCTCCCCGTCTCATGACGATCTTGACGATTAG

Protein sequence:

MATTNDSKAPLRQVKRVQFGILSPDEIRRMSVTEGGIRFPETMEGGRPKLGGLMDPRQGV
IDRSSRCQTCAGNMTECPGHFGHIDLAKPVFHIGFITKTIKVLRCVCFYCSKLLVSPTNP
KIKEVVMKSKGQPRKRLTYVYDLCKGKNICEGGEDMDIGKEGEEGKRGTGHGGCGHYQPS
IRRQGLDLTAEWKHANEDTQDKKIIITAERVYEILKHITDEDSFILGMDPKFARPDWMIV
TVLPVPPLAVRPAVVMFGSAKNQDDLTHKLADIIKANNELMRNEQSGAAAHVLTDNIRML
QFHVATFVDNDMPGMPKAMQKSGKPLKAIKARLKGKEGRIRGNLMGKRVDFSARTVITPD
PNLRIDQVGVPRSIAQNLTFPELVTPFNIDRMQELVRRGNAQYPGAKYIVRDNGERIDLR
FHPKPSDLHLQYGYKVERHLRDDDLVIFNRQPTLHKMSMMGHRVKVLPWSTFRMNLSCTS
PYNADFDGDEMNLHVPQSMETRAEVENIHITPRQIITPQANKPVMGIVQDTLTAVRKMTK
RDVFLTKEQVMNLLMFLPTWDGKIPQPCILKPQPLWTGKQIFTLIIPGNVNMVRTHSTHP
DDEDDGVNRWISPGDTKVIVEHGELLMGILCKKSLGASAGSLLHICMLELGHEIAGRFYG
NIQTVINNWLLLEGHSIGIGDTIADPQTYQEIQRAIVKAKDDVIEVIQKAHNMELEPTPG
NTLRQTFENQVNRILNDARDKTGGSAKKSLTEYNNLKAMVVAGSKGSNINISQVIACVGQ
QNVEGKRIPFGFRKRTLPHFIKDDYGPESRGFVENSYLAGLTPSEFYFHAMGGREGLIDT
AVKTAETGYIQRRLIKAMESVMVHYDGTVRNSVGQLIQLRYGEDGLAGETVEFQNMPTVK
LSNKAFEKKFKFDPTNERYLKRIFHEDIIKELTESGYVIADLESEWEQLCKDREILRQIF
PSGESKVVLPCNFRRMIWNVQKIFHINKRMSTDLSPIKVIQGVKDLLKKCVIVAGEDRLS
KQANENATLLFQCLVRSTFCTKYVSEDYRLSSEAFEWLIGEIETRFQQAQVNPGEMVGAL
AAQSLGEPATQMTLNTFHFAGVSSKNVTLGVPRLKEIINISKKPKAPSLTVFLTGGAARD
AEKAKNVLCRLEHTTLRKVTANTAIYYDPDPQNTVIAEDQEFVNVYYEMPDFDPTKISPW
LLRIELDRKRMTDKKLTMEQIAEKINAGFGDDLNCIFNDDNAEKLVLRIRIMNNEESKFQ
DNDEETVDKMEDDMFLRCIEANMLSDMTLQGIEAIAKVYMHLPQTEAKKRIIITDQGEFK
AIAEWLLETDGTSLMKVLSERDVDPVRTFSNDICEIFQVLGIEAVRKSVEKEMNAVLQFY
GLYVNYRHLALLCDVMTAKGHLMAITRHGINRQDTGALMRCSFEETVDVLLDAASHAEVD
PMRGVSENIIMGQLPRMGTGCFDLLLDAEKCKHGMEMGGLGVGMGVAGGMYFGVGTPSMT
PLMTPWSNQNTPGYGSSVWSPGQVGSSMTPGGPSFSPSGASDASGLSPAYSSSWSPQPGS
PGSPGAPLSPYASPAGASPSYSPTSPVYAAPSPSVTPSSPVYSPTGPSYSLTSHNYSPTS
PIYSPNSPRYSPRYSPTSPGYSPPSPRYSPTSPSYSPTSPAYSPNSPSYSVKSQDYSPTS
PNYSPASTSYSPSGPVYSVNSQGYSPSSLNYSPSNLVYSPTSQNYSPSSPNYTPPAPPYS
PTSHSYSLPYSPASPSISPSSPKYSPSPNYSPTSPSVSGGSPTYSPTSHQYGPKSPQYSP
SSTAYSPSSPQHPGSARYSPSSRNYLSSSPQYSPSSPRYSPSSNKYSPTNITYTPTSSNY
SPSSPAYSSSSGPSKYSPTSPNYSPTSPSHDDLDD