New model in OGS2.0 | DPOGS203690  |
---|---|
Genomic Position | scaffold120:90191-98196 (- strand) |
CDS Length | 5688 |
Paired RNAseq reads   | 7547 |
Single RNAseq reads   | 18139 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA003484 (5e-06) |
Best Drosophila hit   | RNA polymerase II 215kD subunit (0.0) |
Best Human hit | DNA-directed RNA polymerase II subunit RPB1 (0.0) |
Best NR hit (blastp)   | largest subunit of the RNA polymerase II complex [Drosophila guanche] (0.0) |
Best NR hit (blastx)   | RNA polymerase II 215kD subunit [Drosophila erecta] (0.0) |
GeneOntology terms    | GO:0005703 polytene chromosome puff; GO:0006366 transcription from RNA polymerase II promoter; GO:0003899 DNA-directed RNA polymerase activity; GO:0005665 DNA-directed RNA polymerase II, core complex; GO:0003677 DNA binding |
InterPro families    | IPR000684 RNA polymerase II, heptapeptide repeat, eukaryotic; IPR007080 RNA polymerase Rpb1, domain 1; IPR007081 RNA polymerase Rpb1, domain 5; IPR000722 RNA polymerase, alpha subunit; IPR007075 RNA polymerase Rpb1, domain 6; IPR007073 RNA polymerase Rpb1, domain 7; IPR007066 RNA polymerase Rpb1, domain 3; IPR007083 RNA polymerase Rpb1, domain 4; IPR006592 RNA polymerase, N-terminal |
Orthology group | MCL13877 |
Nucleotide sequence:
ATGGCGACCACGAATGATTCGAAGGCGCCTTTGCGCCAAGTTAAAAGAGTACAATTTGGC
ATTTTATCTCCAGATGAAATCCGTCGCATGTCAGTCACAGAAGGGGGAATTCGTTTCCCA
GAAACAATGGAAGGGGGAAGGCCCAAACTTGGTGGGCTTATGGATCCTCGACAAGGGGTG
ATAGACAGAAGTTCTCGATGCCAAACCTGCGCTGGAAATATGACAGAATGCCCTGGACAC
TTTGGCCACATTGATTTAGCCAAACCAGTATTTCATATTGGTTTTATTACCAAAACAATT
AAAGTATTAAGATGTGTTTGTTTTTATTGCTCAAAATTACTTGTCAGTCCTACAAATCCA
AAAATCAAAGAAGTGGTAATGAAATCTAAAGGTCAACCACGTAAAAGGTTGACTTATGTA
TATGACCTTTGCAAGGGTAAAAATATTTGTGAGGGTGGAGAAGATATGGATATTGGAAAA
GAAGGGGAAGAAGGCAAAAGGGGCACAGGACATGGAGGTTGTGGTCATTACCAACCTTCT
ATCAGGCGGCAAGGATTAGATCTGACAGCAGAATGGAAACATGCTAATGAAGACACACAA
GATAAAAAGATAATAATAACCGCAGAACGGGTTTATGAAATATTAAAACACATAACAGAT
GAAGATTCCTTTATTTTGGGTATGGACCCCAAATTTGCCAGACCCGATTGGATGATTGTC
ACAGTCCTTCCTGTACCACCTCTTGCAGTCAGACCCGCTGTAGTTATGTTTGGATCTGCT
AAAAACCAGGATGATTTGACCCATAAGCTTGCTGATATTATAAAAGCTAATAATGAGTTG
ATGAGAAATGAACAATCAGGAGCTGCGGCTCATGTTCTAACTGACAATATCAGAATGTTA
CAGTTCCATGTTGCGACATTTGTTGATAATGACATGCCAGGAATGCCTAAGGCTATGCAA
AAATCTGGTAAACCCTTGAAAGCCATAAAAGCAAGACTAAAAGGCAAAGAAGGTAGAATT
CGTGGAAATCTTATGGGAAAACGTGTTGATTTCTCAGCTAGAACAGTAATTACACCTGAT
CCTAATTTGCGCATTGACCAAGTAGGCGTCCCAAGATCTATTGCACAAAATTTGACATTC
CCCGAGCTTGTAACGCCCTTCAACATTGATCGGATGCAAGAACTCGTGCGAAGAGGAAAT
GCACAGTACCCAGGTGCAAAATACATTGTTCGGGATAATGGTGAAAGAATAGATTTAAGA
TTCCACCCCAAACCATCAGATTTGCATCTACAATATGGCTACAAAGTTGAGCGTCACTTG
AGAGATGATGATTTGGTTATCTTCAACCGACAACCAACACTACATAAGATGAGTATGATG
GGTCATAGGGTCAAAGTATTGCCATGGTCAACATTTCGTATGAACTTGAGTTGTACTTCG
CCGTACAATGCTGATTTCGACGGCGATGAAATGAATTTACATGTACCCCAGTCTATGGAA
ACACGAGCGGAAGTAGAAAACATACACATAACGCCTCGTCAAATTATAACTCCACAAGCT
AATAAACCAGTCATGGGTATTGTGCAAGATACACTGACTGCTGTCAGAAAAATGACAAAA
CGAGACGTATTTTTAACGAAAGAGCAAGTAATGAACTTGCTAATGTTTTTACCAACATGG
GATGGAAAAATTCCACAACCTTGCATCCTGAAGCCACAACCGCTTTGGACAGGAAAACAA
ATATTTACTCTGATCATTCCTGGAAATGTCAATATGGTGCGTACTCATTCCACACATCCT
GATGATGAGGACGATGGTGTTAATAGATGGATATCACCTGGAGACACTAAAGTAATTGTG
GAACACGGGGAACTTCTTATGGGTATTCTGTGTAAGAAATCTCTTGGTGCATCTGCTGGT
TCTTTACTGCATATATGTATGTTGGAGTTAGGACATGAAATAGCTGGTCGTTTTTACGGT
AACATTCAAACTGTCATCAATAATTGGCTACTATTGGAAGGTCACTCCATTGGTATTGGG
GATACAATTGCTGATCCTCAAACATATCAAGAAATCCAAAGGGCTATTGTGAAGGCTAAA
GATGATGTCATAGAAGTTATACAGAAAGCTCACAATATGGAGCTTGAGCCAACTCCTGGT
AATACTCTGAGGCAAACTTTCGAAAATCAGGTCAATCGTATTCTTAACGACGCTCGTGAC
AAAACTGGTGGTTCAGCCAAAAAGTCTCTAACAGAGTACAATAACCTTAAAGCTATGGTA
GTCGCTGGTTCCAAAGGATCAAACATCAATATTTCACAAGTCATTGCTTGCGTGGGTCAG
CAAAACGTCGAAGGAAAGCGTATTCCGTTTGGCTTCCGTAAGAGAACATTGCCGCATTTT
ATCAAAGACGATTATGGTCCGGAATCAAGAGGTTTCGTAGAGAACTCTTACCTGGCCGGT
TTAACACCATCTGAGTTTTATTTCCACGCTATGGGAGGTCGTGAAGGTCTTATCGATACA
GCTGTCAAAACTGCCGAGACTGGGTATATTCAGCGGCGTTTGATAAAGGCTATGGAATCT
GTTATGGTGCATTATGATGGCACAGTCCGAAATTCGGTTGGACAACTGATTCAACTAAGA
TATGGTGAGGATGGTTTAGCTGGAGAAACAGTAGAGTTCCAAAACATGCCCACTGTAAAG
TTATCCAACAAGGCATTTGAAAAGAAATTTAAGTTCGACCCAACCAATGAAAGGTATTTG
AAGAGAATTTTCCATGAAGATATTATAAAAGAACTAACGGAGTCGGGTTACGTGATTGCC
GACTTGGAAAGCGAATGGGAACAGCTTTGCAAAGATCGTGAAATATTGCGACAAATTTTC
CCTAGCGGTGAATCTAAAGTTGTATTGCCGTGCAACTTCAGAAGAATGATTTGGAATGTT
CAAAAGATTTTCCACATCAATAAGAGAATGTCAACAGATTTAAGTCCGATAAAAGTGATA
CAAGGCGTGAAAGATCTTTTGAAGAAATGTGTCATTGTCGCTGGTGAAGATCGTTTGTCT
AAACAAGCGAATGAAAATGCAACCTTACTCTTCCAATGTTTAGTAAGATCTACTTTTTGC
ACGAAGTATGTATCTGAAGATTACAGACTATCAAGTGAAGCTTTCGAATGGTTGATTGGA
GAAATTGAAACAAGATTCCAACAAGCACAAGTCAACCCAGGTGAAATGGTTGGGGCCCTG
GCAGCCCAGTCTCTTGGAGAGCCGGCTACTCAGATGACATTGAATACCTTCCACTTTGCC
GGTGTGTCATCTAAAAACGTAACTCTTGGTGTACCGCGTCTAAAAGAAATCATTAACATA
TCAAAGAAACCAAAAGCACCATCTCTAACAGTATTCCTTACTGGAGGCGCGGCCAGAGAT
GCAGAGAAAGCTAAGAATGTTCTGTGTCGATTGGAACACACGACATTGCGTAAAGTCACA
GCCAATACCGCTATCTACTACGATCCAGACCCTCAGAACACAGTTATTGCTGAAGATCAA
GAGTTTGTTAATGTTTATTATGAAATGCCTGATTTTGATCCAACAAAGATTTCACCTTGG
CTATTGCGTATTGAACTGGACCGCAAGAGAATGACAGACAAGAAGCTGACGATGGAACAG
ATCGCTGAGAAGATTAACGCTGGGTTCGGGGATGATCTCAATTGTATTTTCAATGACGAT
AATGCTGAAAAATTGGTTTTGCGAATAAGAATTATGAACAACGAGGAGAGCAAATTCCAA
GACAACGACGAAGAAACGGTCGATAAAATGGAAGACGATATGTTCCTTAGATGTATTGAA
GCGAACATGTTATCGGACATGACTTTACAGGGTATTGAGGCTATAGCAAAAGTGTACATG
CACTTGCCGCAGACTGAAGCGAAGAAACGCATTATTATAACAGATCAAGGCGAATTTAAA
GCGATCGCAGAGTGGCTTTTGGAAACAGATGGTACTTCACTTATGAAAGTACTGTCAGAA
CGAGACGTAGATCCAGTGCGGACATTCAGTAACGATATTTGTGAGATATTCCAAGTGCTA
GGTATAGAGGCTGTGCGGAAGTCAGTCGAGAAGGAAATGAATGCTGTGTTGCAATTCTAT
GGTCTTTATGTAAACTACAGACATCTCGCTTTGCTTTGTGACGTGATGACTGCCAAAGGT
CATCTTATGGCTATAACACGTCACGGTATTAACAGACAAGATACCGGAGCACTCATGAGG
TGTTCTTTTGAAGAGACTGTAGATGTTTTACTTGATGCGGCTAGTCACGCTGAAGTTGAT
CCTATGAGAGGCGTTTCTGAAAATATTATCATGGGGCAATTGCCGCGAATGGGAACCGGT
TGTTTCGATTTATTACTGGATGCTGAGAAATGTAAACATGGAATGGAAATGGGCGGTCTA
GGTGTCGGAATGGGAGTCGCAGGTGGGATGTATTTCGGTGTCGGCACACCTTCCATGACA
CCACTGATGACGCCCTGGTCAAACCAAAACACTCCCGGATATGGCAGCAGTGTTTGGTCG
CCTGGTCAAGTTGGAAGCAGCATGACACCAGGAGGGCCATCATTCTCGCCGTCGGGAGCA
TCAGACGCGTCAGGGTTGTCACCTGCTTATAGCAGTTCATGGTCTCCACAACCAGGGTCA
CCAGGTTCCCCGGGCGCTCCGTTATCACCTTACGCCTCGCCAGCGGGAGCGTCTCCCTCG
TATTCTCCCACCAGTCCAGTATATGCTGCTCCCTCACCCAGTGTCACGCCGTCCTCGCCA
GTCTATTCCCCCACCGGACCTTCTTACTCTCTAACATCACACAATTACTCACCAACGTCA
CCGATCTATTCGCCGAACTCGCCGAGATACTCACCAAGGTATTCGCCAACGTCCCCCGGC
TATTCCCCCCCGTCACCAAGATACTCCCCAACATCTCCAAGCTATTCGCCAACCAGCCCG
GCGTATTCACCAAACTCTCCGAGCTATTCGGTGAAATCACAAGACTATTCACCGACTAGT
CCCAACTATTCTCCCGCAAGTACATCTTATTCACCGAGCGGACCCGTGTATTCTGTCAAC
TCGCAAGGATATTCGCCATCGTCTCTAAATTACTCCCCCAGTAATCTTGTTTATAGTCCG
ACCTCACAAAACTACTCTCCCTCATCACCGAATTACACCCCTCCGGCACCACCGTACTCG
CCGACGAGCCATTCATATTCATTGCCCTATTCACCGGCATCTCCAAGTATTTCTCCGTCA
TCGCCGAAATATTCACCATCACCGAATTATTCGCCGACATCACCATCGGTTTCCGGAGGA
AGTCCCACGTATTCGCCGACAAGCCATCAATACGGGCCGAAGAGTCCTCAATATTCCCCG
TCGAGCACAGCGTACTCGCCTTCGTCACCTCAACATCCGGGCAGTGCGAGATATTCACCA
TCATCGCGGAACTACTTGTCATCTTCACCACAATATTCCCCATCATCTCCCAGATATTCA
CCGTCTTCTAATAAATATTCTCCGACTAATATAACCTACACTCCGACATCTTCTAATTAT
TCACCATCTTCACCGGCGTATTCATCTTCGTCCGGTCCATCCAAATATTCGCCGACATCA
CCAAATTATTCGCCGACCTCCCCGTCTCATGACGATCTTGACGATTAG
Protein sequence:
MATTNDSKAPLRQVKRVQFGILSPDEIRRMSVTEGGIRFPETMEGGRPKLGGLMDPRQGV
IDRSSRCQTCAGNMTECPGHFGHIDLAKPVFHIGFITKTIKVLRCVCFYCSKLLVSPTNP
KIKEVVMKSKGQPRKRLTYVYDLCKGKNICEGGEDMDIGKEGEEGKRGTGHGGCGHYQPS
IRRQGLDLTAEWKHANEDTQDKKIIITAERVYEILKHITDEDSFILGMDPKFARPDWMIV
TVLPVPPLAVRPAVVMFGSAKNQDDLTHKLADIIKANNELMRNEQSGAAAHVLTDNIRML
QFHVATFVDNDMPGMPKAMQKSGKPLKAIKARLKGKEGRIRGNLMGKRVDFSARTVITPD
PNLRIDQVGVPRSIAQNLTFPELVTPFNIDRMQELVRRGNAQYPGAKYIVRDNGERIDLR
FHPKPSDLHLQYGYKVERHLRDDDLVIFNRQPTLHKMSMMGHRVKVLPWSTFRMNLSCTS
PYNADFDGDEMNLHVPQSMETRAEVENIHITPRQIITPQANKPVMGIVQDTLTAVRKMTK
RDVFLTKEQVMNLLMFLPTWDGKIPQPCILKPQPLWTGKQIFTLIIPGNVNMVRTHSTHP
DDEDDGVNRWISPGDTKVIVEHGELLMGILCKKSLGASAGSLLHICMLELGHEIAGRFYG
NIQTVINNWLLLEGHSIGIGDTIADPQTYQEIQRAIVKAKDDVIEVIQKAHNMELEPTPG
NTLRQTFENQVNRILNDARDKTGGSAKKSLTEYNNLKAMVVAGSKGSNINISQVIACVGQ
QNVEGKRIPFGFRKRTLPHFIKDDYGPESRGFVENSYLAGLTPSEFYFHAMGGREGLIDT
AVKTAETGYIQRRLIKAMESVMVHYDGTVRNSVGQLIQLRYGEDGLAGETVEFQNMPTVK
LSNKAFEKKFKFDPTNERYLKRIFHEDIIKELTESGYVIADLESEWEQLCKDREILRQIF
PSGESKVVLPCNFRRMIWNVQKIFHINKRMSTDLSPIKVIQGVKDLLKKCVIVAGEDRLS
KQANENATLLFQCLVRSTFCTKYVSEDYRLSSEAFEWLIGEIETRFQQAQVNPGEMVGAL
AAQSLGEPATQMTLNTFHFAGVSSKNVTLGVPRLKEIINISKKPKAPSLTVFLTGGAARD
AEKAKNVLCRLEHTTLRKVTANTAIYYDPDPQNTVIAEDQEFVNVYYEMPDFDPTKISPW
LLRIELDRKRMTDKKLTMEQIAEKINAGFGDDLNCIFNDDNAEKLVLRIRIMNNEESKFQ
DNDEETVDKMEDDMFLRCIEANMLSDMTLQGIEAIAKVYMHLPQTEAKKRIIITDQGEFK
AIAEWLLETDGTSLMKVLSERDVDPVRTFSNDICEIFQVLGIEAVRKSVEKEMNAVLQFY
GLYVNYRHLALLCDVMTAKGHLMAITRHGINRQDTGALMRCSFEETVDVLLDAASHAEVD
PMRGVSENIIMGQLPRMGTGCFDLLLDAEKCKHGMEMGGLGVGMGVAGGMYFGVGTPSMT
PLMTPWSNQNTPGYGSSVWSPGQVGSSMTPGGPSFSPSGASDASGLSPAYSSSWSPQPGS
PGSPGAPLSPYASPAGASPSYSPTSPVYAAPSPSVTPSSPVYSPTGPSYSLTSHNYSPTS
PIYSPNSPRYSPRYSPTSPGYSPPSPRYSPTSPSYSPTSPAYSPNSPSYSVKSQDYSPTS
PNYSPASTSYSPSGPVYSVNSQGYSPSSLNYSPSNLVYSPTSQNYSPSSPNYTPPAPPYS
PTSHSYSLPYSPASPSISPSSPKYSPSPNYSPTSPSVSGGSPTYSPTSHQYGPKSPQYSP
SSTAYSPSSPQHPGSARYSPSSRNYLSSSPQYSPSSPRYSPSSNKYSPTNITYTPTSSNY
SPSSPAYSSSSGPSKYSPTSPNYSPTSPSHDDLDD
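
The nucleotide and protein listings above can be cross-checked against each other. The following is a minimal sketch, assuming the standard nuclear genetic code and that the nucleotide listing starts in frame at the ATG; the helper name `translate` is illustrative and not part of the record. It translates the first and last lines of the nucleotide sequence so the result can be compared with the start and end of the protein sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the standard nuclear code applies to this gene model).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide listing, copied from the record.
first_line = "ATGGCGACCACGAATGATTCGAAGGCGCCTTTGCGCCAAGTTAAAAGAGTACAATTTGGC"
last_line = "CCAAATTATTCGCCGACCTCCCCGTCTCATGACGATCTTGACGATTAG"

print(translate(first_line))  # compare with the start of the protein listing
print(translate(last_line))   # compare with the end; '*' is the stop codon

# Protein length implied by the listed CDS length (stop codon excluded).
print(5688 // 3 - 1)
```

Passing the full nucleotide listing to `translate` and removing the trailing `*` should give a string that can be compared with the complete protein listing in the same way.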