Monarch geneset OGS2.0

DPOGS203690
Transcript: DPOGS203690-TA (5688 bp)
Protein: DPOGS203690-PA (1895 aa)
Genomic position: DPSCF300010 - 2019204-2027209
RNAseq coverage: 854x (Rank: top 15%)
Annotation
Heliconius: HMEL013314  0.0  87.28%
Bombyx: BGIBMGA003484-TA  0.0  89.44%
Drosophila: RpII215-PA  0.0  78.65%
EBI UniRef50: UniRef50_P04052  0.0  78.65%  DNA-directed RNA polymerase II subunit RPB1 n=3048 Tax=root RepID=RPB1_DROME
NCBI RefSeq: XP_001355562.1  0.0  79.01%  RpII215 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastp: gi|324120588  0.0  87.72%  RNA polymerase II largest subunit [Stenopsyche marmorata]
NCBI nr blastx: gi|383857287  0.0  85.90%  PREDICTED: DNA-directed RNA polymerase II subunit RPB1-like [Megachile rotundata]
Group
Gene Ontology:
    GO:0003899  3.5e-196  DNA-directed RNA polymerase activity
    GO:0003677  3.5e-196  DNA binding
    GO:0006351  3.5e-196  transcription, DNA-dependent
KEGG pathway: dpo:Dpse_GA13798  0.0
    K03006 (RPB1) maps ->
        Huntington's disease
        Purine metabolism
        Pyrimidine metabolism
        RNA polymerase
InterPro domain:
    [236-539]  IPR006592  3.5e-196  RNA polymerase, N-terminal
    [12-344]  IPR007080  4.1e-113  RNA polymerase Rpb1, domain 1
    [820-1417]  IPR007081  2.6e-102  RNA polymerase Rpb1, domain 5
    [346-509]  IPR000722  9.5e-69  RNA polymerase, alpha subunit
    [886-1069]  IPR007075  3.4e-68  RNA polymerase Rpb1, domain 6
    [1154-1290]  IPR007073  3.3e-53  RNA polymerase Rpb1, domain 7
    [514-682]  IPR007066  5.6e-48  RNA polymerase Rpb1, domain 3
    [712-813]  IPR007083  1.2e-37  RNA polymerase Rpb1, domain 4
Orthology group: MCL13412 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203690-TA
ATGGCGACCACGAATGATTCGAAGGCGCCTTTGCGCCAAGTTAAAAGAGTACAATTTGGCATTTTATCTCCAGATGAAATCCGTCGCATGTCAGTCACAGAAGGGGGAATTCGTTTCCCAGAAACAATGGAAGGGGGAAGGCCCAAACTTGGTGGGCTTATGGATCCTCGACAAGGGGTGATAGACAGAAGTTCTCGATGCCAAACCTGCGCTGGAAATATGACAGAATGCCCTGGACACTTTGGCCACATTGATTTAGCCAAACCAGTATTTCATATTGGTTTTATTACCAAAACAATTAAAGTATTAAGATGTGTTTGTTTTTATTGCTCAAAATTACTTGTCAGTCCTACAAATCCAAAAATCAAAGAAGTGGTAATGAAATCTAAAGGTCAACCACGTAAAAGGTTGACTTATGTATATGACCTTTGCAAGGGTAAAAATATTTGTGAGGGTGGAGAAGATATGGATATTGGAAAAGAAGGGGAAGAAGGCAAAAGGGGCACAGGACATGGAGGTTGTGGTCATTACCAACCTTCTATCAGGCGGCAAGGATTAGATCTGACAGCAGAATGGAAACATGCTAATGAAGACACACAAGATAAAAAGATAATAATAACCGCAGAACGGGTTTATGAAATATTAAAACACATAACAGATGAAGATTCCTTTATTTTGGGTATGGACCCCAAATTTGCCAGACCCGATTGGATGATTGTCACAGTCCTTCCTGTACCACCTCTTGCAGTCAGACCCGCTGTAGTTATGTTTGGATCTGCTAAAAACCAGGATGATTTGACCCATAAGCTTGCTGATATTATAAAAGCTAATAATGAGTTGATGAGAAATGAACAATCAGGAGCTGCGGCTCATGTTCTAACTGACAATATCAGAATGTTACAGTTCCATGTTGCGACATTTGTTGATAATGACATGCCAGGAATGCCTAAGGCTATGCAAAAATCTGGTAAACCCTTGAAAGCCATAAAAGCAAGACTAAAAGGCAAAGAAGGTAGAATTCGTGGAAATCTTATGGGAAAACGTGTTGATTTCTCAGCTAGAACAGTAATTACACCTGATCCTAATTTGCGCATTGACCAAGTAGGCGTCCCAAGATCTATTGCACAAAATTTGACATTCCCCGAGCTTGTAACGCCCTTCAACATTGATCGGATGCAAGAACTCGTGCGAAGAGGAAATGCACAGTACCCAGGTGCAAAATACATTGTTCGGGATAATGGTGAAAGAATAGATTTAAGATTCCACCCCAAACCATCAGATTTGCATCTACAATATGGCTACAAAGTTGAGCGTCACTTGAGAGATGATGATTTGGTTATCTTCAACCGACAACCAACACTACATAAGATGAGTATGATGGGTCATAGGGTCAAAGTATTGCCATGGTCAACATTTCGTATGAACTTGAGTTGTACTTCGCCGTACAATGCTGATTTCGACGGCGATGAAATGAATTTACATGTACCCCAGTCTATGGAAACACGAGCGGAAGTAGAAAACATACACATAACGCCTCGTCAAATTATAACTCCACAAGCTAATAAACCAGTCATGGGTATTGTGCAAGATACACTGACTGCTGTCAGAAAAATGACAAAACGAGACGTATTTTTAACGAAAGAGCAAGTAATGAACTTGCTAATGTTTTTACCAACATGGGATGGAAAAATTCCACAACCTTGCATCCTGAAGCCACAACCGCTTTGGACAGGAAAACAAATATTTACTCTGATCATTCCTGGAAATGTCAATATGGTGCGTACTCATTCCACACATCCTGATGATGAGGACGATGGTGTTAATAGATGGATATCACCTGGAGACACTAAAGTAATTGTGGAACACGGGGAACTTCTTATGGGTATTCTGTGTAAGAAATCTCTTGGTGCATCTGCTGGTTCTTTACTGCATATATGTATGTTGGAGTTAGGACATGAAATAGCTGGTCGTTTTTACGGTAACATTCAAACTGTCATCAA
TAATTGGCTACTATTGGAAGGTCACTCCATTGGTATTGGGGATACAATTGCTGATCCTCAAACATATCAAGAAATCCAAAGGGCTATTGTGAAGGCTAAAGATGATGTCATAGAAGTTATACAGAAAGCTCACAATATGGAGCTTGAGCCAACTCCTGGTAATACTCTGAGGCAAACTTTCGAAAATCAGGTCAATCGTATTCTTAACGACGCTCGTGACAAAACTGGTGGTTCAGCCAAAAAGTCTCTAACAGAGTACAATAACCTTAAAGCTATGGTAGTCGCTGGTTCCAAAGGATCAAACATCAATATTTCACAAGTCATTGCTTGCGTGGGTCAGCAAAACGTCGAAGGAAAGCGTATTCCGTTTGGCTTCCGTAAGAGAACATTGCCGCATTTTATCAAAGACGATTATGGTCCGGAATCAAGAGGTTTCGTAGAGAACTCTTACCTGGCCGGTTTAACACCATCTGAGTTTTATTTCCACGCTATGGGAGGTCGTGAAGGTCTTATCGATACAGCTGTCAAAACTGCCGAGACTGGGTATATTCAGCGGCGTTTGATAAAGGCTATGGAATCTGTTATGGTGCATTATGATGGCACAGTCCGAAATTCGGTTGGACAACTGATTCAACTAAGATATGGTGAGGATGGTTTAGCTGGAGAAACAGTAGAGTTCCAAAACATGCCCACTGTAAAGTTATCCAACAAGGCATTTGAAAAGAAATTTAAGTTCGACCCAACCAATGAAAGGTATTTGAAGAGAATTTTCCATGAAGATATTATAAAAGAACTAACGGAGTCGGGTTACGTGATTGCCGACTTGGAAAGCGAATGGGAACAGCTTTGCAAAGATCGTGAAATATTGCGACAAATTTTCCCTAGCGGTGAATCTAAAGTTGTATTGCCGTGCAACTTCAGAAGAATGATTTGGAATGTTCAAAAGATTTTCCACATCAATAAGAGAATGTCAACAGATTTAAGTCCGATAAAAGTGATACAAGGCGTGAAAGATCTTTTGAAGAAATGTGTCATTGTCGCTGGTGAAGATCGTTTGTCTAAACAAGCGAATGAAAATGCAACCTTACTCTTCCAATGTTTAGTAAGATCTACTTTTTGCACGAAGTATGTATCTGAAGATTACAGACTATCAAGTGAAGCTTTCGAATGGTTGATTGGAGAAATTGAAACAAGATTCCAACAAGCACAAGTCAACCCAGGTGAAATGGTTGGGGCCCTGGCAGCCCAGTCTCTTGGAGAGCCGGCTACTCAGATGACATTGAATACCTTCCACTTTGCCGGTGTGTCATCTAAAAACGTAACTCTTGGTGTACCGCGTCTAAAAGAAATCATTAACATATCAAAGAAACCAAAAGCACCATCTCTAACAGTATTCCTTACTGGAGGCGCGGCCAGAGATGCAGAGAAAGCTAAGAATGTTCTGTGTCGATTGGAACACACGACATTGCGTAAAGTCACAGCCAATACCGCTATCTACTACGATCCAGACCCTCAGAACACAGTTATTGCTGAAGATCAAGAGTTTGTTAATGTTTATTATGAAATGCCTGATTTTGATCCAACAAAGATTTCACCTTGGCTATTGCGTATTGAACTGGACCGCAAGAGAATGACAGACAAGAAGCTGACGATGGAACAGATCGCTGAGAAGATTAACGCTGGGTTCGGGGATGATCTCAATTGTATTTTCAATGACGATAATGCTGAAAAATTGGTTTTGCGAATAAGAATTATGAACAACGAGGAGAGCAAATTCCAAGACAACGACGAAGAAACGGTCGATAAAATGGAAGACGATATGTTCCTTAGATGTATTGAAGCGAACATGTTATCGGACATGACTTTACAGGGTATTGAGGCTATAGCAAAAGTGTACATGCACTTGCCGCAGACTGAAGCGAAGAAACGCATTATTATAACAGATCAAGGCGAATTTAAAGCGATCGCAGAGTGGCTTTTGGAAACAGATGGTACTTCAC
TTATGAAAGTACTGTCAGAACGAGACGTAGATCCAGTGCGGACATTCAGTAACGATATTTGTGAGATATTCCAAGTGCTAGGTATAGAGGCTGTGCGGAAGTCAGTCGAGAAGGAAATGAATGCTGTGTTGCAATTCTATGGTCTTTATGTAAACTACAGACATCTCGCTTTGCTTTGTGACGTGATGACTGCCAAAGGTCATCTTATGGCTATAACACGTCACGGTATTAACAGACAAGATACCGGAGCACTCATGAGGTGTTCTTTTGAAGAGACTGTAGATGTTTTACTTGATGCGGCTAGTCACGCTGAAGTTGATCCTATGAGAGGCGTTTCTGAAAATATTATCATGGGGCAATTGCCGCGAATGGGAACCGGTTGTTTCGATTTATTACTGGATGCTGAGAAATGTAAACATGGAATGGAAATGGGCGGTCTAGGTGTCGGAATGGGAGTCGCAGGTGGGATGTATTTCGGTGTCGGCACACCTTCCATGACACCACTGATGACGCCCTGGTCAAACCAAAACACTCCCGGATATGGCAGCAGTGTTTGGTCGCCTGGTCAAGTTGGAAGCAGCATGACACCAGGAGGGCCATCATTCTCGCCGTCGGGAGCATCAGACGCGTCAGGGTTGTCACCTGCTTATAGCAGTTCATGGTCTCCACAACCAGGGTCACCAGGTTCCCCGGGCGCTCCGTTATCACCTTACGCCTCGCCAGCGGGAGCGTCTCCCTCGTATTCTCCCACCAGTCCAGTATATGCTGCTCCCTCACCCAGTGTCACGCCGTCCTCGCCAGTCTATTCCCCCACCGGACCTTCTTACTCTCTAACATCACACAATTACTCACCAACGTCACCGATCTATTCGCCGAACTCGCCGAGATACTCACCAAGGTATTCGCCAACGTCCCCCGGCTATTCCCCCCCGTCACCAAGATACTCCCCAACATCTCCAAGCTATTCGCCAACCAGCCCGGCGTATTCACCAAACTCTCCGAGCTATTCGGTGAAATCACAAGACTATTCACCGACTAGTCCCAACTATTCTCCCGCAAGTACATCTTATTCACCGAGCGGACCCGTGTATTCTGTCAACTCGCAAGGATATTCGCCATCGTCTCTAAATTACTCCCCCAGTAATCTTGTTTATAGTCCGACCTCACAAAACTACTCTCCCTCATCACCGAATTACACCCCTCCGGCACCACCGTACTCGCCGACGAGCCATTCATATTCATTGCCCTATTCACCGGCATCTCCAAGTATTTCTCCGTCATCGCCGAAATATTCACCATCACCGAATTATTCGCCGACATCACCATCGGTTTCCGGAGGAAGTCCCACGTATTCGCCGACAAGCCATCAATACGGGCCGAAGAGTCCTCAATATTCCCCGTCGAGCACAGCGTACTCGCCTTCGTCACCTCAACATCCGGGCAGTGCGAGATATTCACCATCATCGCGGAACTACTTGTCATCTTCACCACAATATTCCCCATCATCTCCCAGATATTCACCGTCTTCTAATAAATATTCTCCGACTAATATAACCTACACTCCGACATCTTCTAATTATTCACCATCTTCACCGGCGTATTCATCTTCGTCCGGTCCATCCAAATATTCGCCGACATCACCAAATTATTCGCCGACCTCCCCGTCTCATGACGATCTTGACGATTAG

Protein sequence:

>DPOGS203690-PA
MATTNDSKAPLRQVKRVQFGILSPDEIRRMSVTEGGIRFPETMEGGRPKLGGLMDPRQGVIDRSSRCQTCAGNMTECPGHFGHIDLAKPVFHIGFITKTIKVLRCVCFYCSKLLVSPTNPKIKEVVMKSKGQPRKRLTYVYDLCKGKNICEGGEDMDIGKEGEEGKRGTGHGGCGHYQPSIRRQGLDLTAEWKHANEDTQDKKIIITAERVYEILKHITDEDSFILGMDPKFARPDWMIVTVLPVPPLAVRPAVVMFGSAKNQDDLTHKLADIIKANNELMRNEQSGAAAHVLTDNIRMLQFHVATFVDNDMPGMPKAMQKSGKPLKAIKARLKGKEGRIRGNLMGKRVDFSARTVITPDPNLRIDQVGVPRSIAQNLTFPELVTPFNIDRMQELVRRGNAQYPGAKYIVRDNGERIDLRFHPKPSDLHLQYGYKVERHLRDDDLVIFNRQPTLHKMSMMGHRVKVLPWSTFRMNLSCTSPYNADFDGDEMNLHVPQSMETRAEVENIHITPRQIITPQANKPVMGIVQDTLTAVRKMTKRDVFLTKEQVMNLLMFLPTWDGKIPQPCILKPQPLWTGKQIFTLIIPGNVNMVRTHSTHPDDEDDGVNRWISPGDTKVIVEHGELLMGILCKKSLGASAGSLLHICMLELGHEIAGRFYGNIQTVINNWLLLEGHSIGIGDTIADPQTYQEIQRAIVKAKDDVIEVIQKAHNMELEPTPGNTLRQTFENQVNRILNDARDKTGGSAKKSLTEYNNLKAMVVAGSKGSNINISQVIACVGQQNVEGKRIPFGFRKRTLPHFIKDDYGPESRGFVENSYLAGLTPSEFYFHAMGGREGLIDTAVKTAETGYIQRRLIKAMESVMVHYDGTVRNSVGQLIQLRYGEDGLAGETVEFQNMPTVKLSNKAFEKKFKFDPTNERYLKRIFHEDIIKELTESGYVIADLESEWEQLCKDREILRQIFPSGESKVVLPCNFRRMIWNVQKIFHINKRMSTDLSPIKVIQGVKDLLKKCVIVAGEDRLSKQANENATLLFQCLVRSTFCTKYVSEDYRLSSEAFEWLIGEIETRFQQAQVNPGEMVGALAAQSLGEPATQMTLNTFHFAGVSSKNVTLGVPRLKEIINISKKPKAPSLTVFLTGGAARDAEKAKNVLCRLEHTTLRKVTANTAIYYDPDPQNTVIAEDQEFVNVYYEMPDFDPTKISPWLLRIELDRKRMTDKKLTMEQIAEKINAGFGDDLNCIFNDDNAEKLVLRIRIMNNEESKFQDNDEETVDKMEDDMFLRCIEANMLSDMTLQGIEAIAKVYMHLPQTEAKKRIIITDQGEFKAIAEWLLETDGTSLMKVLSERDVDPVRTFSNDICEIFQVLGIEAVRKSVEKEMNAVLQFYGLYVNYRHLALLCDVMTAKGHLMAITRHGINRQDTGALMRCSFEETVDVLLDAASHAEVDPMRGVSENIIMGQLPRMGTGCFDLLLDAEKCKHGMEMGGLGVGMGVAGGMYFGVGTPSMTPLMTPWSNQNTPGYGSSVWSPGQVGSSMTPGGPSFSPSGASDASGLSPAYSSSWSPQPGSPGSPGAPLSPYASPAGASPSYSPTSPVYAAPSPSVTPSSPVYSPTGPSYSLTSHNYSPTSPIYSPNSPRYSPRYSPTSPGYSPPSPRYSPTSPSYSPTSPAYSPNSPSYSVKSQDYSPTSPNYSPASTSYSPSGPVYSVNSQGYSPSSLNYSPSNLVYSPTSQNYSPSSPNYTPPAPPYSPTSHSYSLPYSPASPSISPSSPKYSPSPNYSPTSPSVSGGSPTYSPTSHQYGPKSPQYSPSSTAYSPSSPQHPGSARYSPSSRNYLSSSPQYSPSSPRYSPSSNKYSPTNITYTPTSSNYSPSSPAYSSSSGPSKYSPTSPNYSPTSPSHDDLDD-
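
The transcript and protein lengths listed in this record (5688 bp and 1895 aa) are consistent with a complete coding sequence: 1895 codons plus one stop codon is 1896 codons, or 5688 bases. A minimal Python sketch of that check follows, assuming the standard genetic code. The helper names `translate` and `lengths_consistent` are hypothetical and not part of any database tooling. The sketch writes the stop codon as `*`, whereas the record above writes it as `-`.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence codon by codon; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript length equals the protein codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First eight codons and last three codons of DPOGS203690-TA, copied from the record.
print(translate("ATGGCGACCACGAATGATTCGAAG"))
print(translate("GACGATTAG"))
print(lengths_consistent(5688, 1895))
```

Running `translate` on the full DPOGS203690-TA sequence (with the line breaks removed) and comparing the result with DPOGS203690-PA would extend the same check to every residue.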