Monarch geneset OGS2.0

DPOGS206318
TranscriptDPOGS206318-TA4962 bp
ProteinDPOGS206318-PA1653 aa
Genomic positionDPSCF300082 - 576558-581853
RNAseq coverage335x (Rank: top 34%)
Annotation
HeliconiusHMEL0126020.084.77% 
BombyxBGIBMGA014133-TA0.078.74% 
DrosophilaRpI1-PA0.050.97% 
EBI UniRef50UniRef50_D2A5Z90.052.93%DNA-directed RNA polymerase n=1 Tax=Tribolium castaneum RepID=D2A5Z9_TRICA
NCBI RefSeqXP_970918.10.052.93%PREDICTED: similar to DNA-directed RNA polymerase I largest subunit [Tribolium castaneum]
NCBI nr blastpgi|910849330.052.93%PREDICTED: similar to DNA-directed RNA polymerase I largest subunit [Tribolium castaneum]
NCBI nr blastxgi|910849330.052.87%PREDICTED: similar to DNA-directed RNA polymerase I largest subunit [Tribolium castaneum]
Group
Gene OntologyGO:00038990DNA-directed RNA polymerase activity
GO:00056340nucleus
GO:00082700zinc ion binding
GO:00063510transcription, DNA-dependent
GO:00036775.6e-130DNA binding
KEGG pathwaytca:6595260.0 
 K02999 (RPA1)maps-> Purine metabolism
    Pyrimidine metabolism
    RNA polymerase
InterPro domain[5-1645] IPR0156990DNA-directed RNA pol I, largest subunit
[282-614] IPR0065925.6e-130RNA polymerase, N-terminal
[933-1602] IPR0070816.5e-96RNA polymerase Rpb1, domain 5
[408-586] IPR0007227.3e-64RNA polymerase, alpha subunit
[589-778] IPR0070661.4e-32RNA polymerase Rpb1, domain 3
[21-322] IPR0070803.8e-25RNA polymerase Rpb1, domain 1
[816-926] IPR0070837.4e-25RNA polymerase Rpb1, domain 4
Orthology groupMCL14842 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206318-TA
ATGGACTTCGATTCAAGATTCACATCTGCCTTAACAGGCCTAACATCGGTGGATCCTGATTCTGTACAATTTCTGATGTTTACGGATGACGACATTAAAAAACTTAGTGTTGCTAAAATAATTAATACTATATCATTTGATGCTATGGGCAATCCCAACAAAGGAGGATTATATGACCCAGCCCTTGGACCTCTCAGGGAAAGAAATGATTTCTGTGCCACTTGCTCTAATTCTTTATTACACTGTCCAGGACATTTTGGACATATAGAATTGCCGCTCACTGTAGTAAATCCAGTATTTATAAAAAATATATATACAATATTTCGTATATCATGTCTTACTTGTTTTAAAATACAAATGGATGATAGAATGAAGTTTATTCTCAAATTACAACTGGAATTGATTGATGCAGGTCATGTTACCGCTGCACTTGACCTGGAAAATTATGTAGGTGATGTTAAAGAGTACAGAAATGAATCATCACCTGAAAAGCCAAATAAAGTTATAAGAAAATACCAAAAACTTCTTAAACACAAGGATCCGGTGTTTGAAACATTTCAAAATAAGAATACTGACAAACTAAGAACAAAAATCATCAATCATTACTTTAAAGCATTGAGTGTATCAAAATTATGCATGTATTGTAAAAGTAAATTAATTAAAGTGACAACATCCGATGGAAAAATAATGTACAACATCAGCGCTGATAAAGGTGCTAAAGGTGTAAAAATATTAATGCCGGATGATCTGAAAAGATATTGTAAGGCCATAGAACAGAACGATTCGGATATCTTAGTACACTGTGTTCCTATATTGAAATATATTAACATTAAACCAGTTACTGATATCTTTTTTATGCAAGTTGTTCCCGTACTGCCGCCCATTGTTAGGCCTTGTAATGTATTGCATGGAGAATTGATGGAACATCCACAAACCAATGTGTATAAAGGTATCCTACAATCTGCGTATAGTGCGAGAGCCGTTCTACAGGTACTAATGACGCCTCAACATCAGAAAGCTATAGATGGCTTAGAACAGCAGCCAAGGCAAGCCTATGAAAGTGTAATGGGTGAAACCCCAGCAGAAAAGCTGCACACTGTGTGGCAAGACCTTCAAAAACACATCAACTATCTATTAACCTGTGAAGGACAAGGTACTACAAGTCAAGGATTAAAACATATACTCGAAAAGAAAACCGGGGTCATAAGAATGCATATGATGGGTAAAAGAGTGAACTTTGCTGCTAGGTCTGTGATCACGCCTGATCCCAATGTAGATATTGATGAAATAGGCATCCCTGATGCATTTGCCACTAAACTGACATATCCGGTTCCCGTCACTGAATGGAATGTGGATGAATTAAGGAAAATGGTAATTAATGGACCAAATGTCCATCCTGGAGCTGAAAAGCTAGAAGTCAAAAACGGAAGAATAATTAGAATACCACCGGACTCGATAGAGAAAAGAAAAAGTCTTGCCAAGAAACTGTTAACACCAGATGAATATAAATCGTCCGGATTGAAAATTGTTCATAGACATTTAGTGAATGGGGATGTTTTAATACTCAATCGCCAACCATCACTCCACAAGCCGAGTATGATGGCCCATAGAGCGAGGATACTCAAAGGAGAAAAAACTCTAAGGCTTCACTATGCTAACTGCAAATCATACAATGCTGATTTTGATGGTGACGAGATGAACGCACATTTCCCACAAAACGAAATTGCGAGAAGCGAAGCATACAATATCATGTCTGTGAGGAAACAATACCTTGTTCCTAAAGACGGAACTCCTTTAAGTGGTTTGATACAGGATCATGTTATTGCTGGAGTGAGGATGTCACTTAGAGGCACGTTTTTCAATAAAGCTGATTATCAACAGCTGGTGTTTCAAGCACTATCAAGTCATAAAGGGGAAATAAAATTGCTACCACCAACAATTTTAAAACCCGCAGTGCTGTGGTCTGGGAAACAAGTTATGTCAACAATAATTATTAACACTATACCTAAAGGTAAACCTTATCTATCATTAGAAGGGAAAGCCAAAATAAGTGCTAAAGCATGGCAAAAAGAAGCTCCTAGGCCTTGGAAGGCTGGTGGTACACCATTCACGAATCCAAACACCATGACAGAAGCAGAAGTCATTATCAGAAAAGGCGAACTTGTAAGTGGTGTATTGGATAAGACTCATTATGGGGCCACACCTTATGGCTTAGTTCATTGCATGTATGAACTGTATGGGGGAGACAGTTCAAGTGCTCTCCTTAGTTCATTTTCTAAAGTCTTTACATTTTATCTTCAATGGATAGGTTTTACGCTAGGCGTAAAAGATATTTTGGTGGTTGATGAAGCTAACAAAAAACGTGACGAAATTATAAGTTTGGTTCGACAAGCCGGCAGAGTAGCAGCTGTGAAAGCAACAGAAGTTCCCGCGGACGTAGAGGAACAAAAATTGAAAGATGTAATATCTGAAATGTTAAGTAAAGATCCTAAATTCCGAGCAAATCTCGACAGGCAGTATAAAAATATGTTGGATTCGTATACAAATAACATAAACACCGTTTGTTTATCTGAAGGTTTATTGGAAAAGTTCCCATCAAATAATTTACAACTGATGGTGCAATCGGGAGCAAAAGGCTCAACTGTCAACACTATGCAAATTTCATGTTTACTTGGGCAGATAGAATTAGAAGGCAAGAGACCGCCGTTAATGATATCTGGTAGATCACTGCCAAGCTTTCCTCCTTATGATATCTCTCCGAGAGCGGGCGGTTTCATCGACGGTCGTTTTATGACTGGAATACAACCACAAGAGTTCTTCTTCCATTGTATGGCCGGTAGAGAAGGTCTTATTGATACTGCTGTTAAAACAAGTCGTTCCGGTTACTTGCAAAGATGTTTGATTAAACATTTAGAGGGTTTAAGTGTAGCTTATGACCATACCGTCAGGGACGCTGACAGTAGCATCATCCAGTTCGCTTATGGTGAAGACGGCCTTGATATTCTTAAATGCCAGTTCTTAAAGGATGGACAATTCAAGTTCTTGGATGAAAATTCTAGTGCTGTAATTGGAAAGTCCGTGATCAAAAAACTAAAAGATGAGACCGATACGAAGATTATAGCAAAAGCACAAAAATCATTGAAAAAGTGGAAAAAGAAAAATGGAAGTCCGTTTGAAAAGATACGAACAAGTGGTTTTGCGAAATTTTCTTCGTTAGTTAGAAAGGACATAGTTCTTGATGATTTACCTACTGATCAAACTAGAGATCCTTATTATTGGGAGTTAGAAAAAATGTGGCGGGAATTGGACGAGGAAGAAAGACAACAGTACTACAGGAAACCTTGTCCCGACCCAATCCCCAGCAAACTATCACCTGAATACAAATTTGGCACGATAAATGAACAGTTAGACGGTATCATTCAAAACTACTTGAAAAATCGGACGGAATCTAGTTACAATGAGTACACAGAAAAGGATAAATTTTTAGAAGTCATCAGTGCTAAATATTTGGAATCTATGGCCGCTCCCGGAGAACCAGTCGGCTTGTTAGCGGCCCAATCCATTGGAGAGCCCTCCACGCAAATGACATTGAACACATTCCATTTCGCTGGTAGAGGAGACATGAACGTGACATTAGGTATACCACGACTGAGAGAAATTTTAATGACAGCATCAGCTCAATTAAAAACTCCAAACATGGACATACCCTTCTTACCCAATATAGCAGATTTAAACAAAAAAGCCGAGAAGTTGAGACAAAAAATGAACAGAGTTACGGTGTCAGACGTTTTGGAAAAGATTGAGGTTCAATGCGAAATTGTTACAAAGCCCGATCGGCAGATGAAGACGACGATGCGTTTCGTTTTTCTACCGTTTTCTCAATACAAAACGCAGTATACCGTGAAGCCACCGCAAATAATAAAACACATGCAGAATAAATTCTTTAATGAGATGTTTGCGGTTATCCGAAAGCAGGCGAAGAGTACTTGTGGTGTTTTGTGGGCTGCGGAGAAAGAAAAGAAACGTCGGGTCGCTGATGATGAGGAGGACGAAGACAATTCCCCTGACCTTGAGGAACGTCAAGGTCAAGACGTGGACAGTTCGGATGATGAGGGGCCTCACGACGATGAAGACAATACAGACGTAAAAATACGTAAAAAACGTTCCGAAGAACAAGAATACGAAGATCCAGAGTCAGAAGAAGAAGAGAAATCTGATGACGATTTGGAAATAGATGAAAATAATACAAAAGAAAAAGATGATGATCTTGAAACGGTTGAAGAGGTAAACGCTGAAGATGCTAAAGCCATGGAAAAAGTAGTTGGGAAAATAACCAACGCGTCAAACTATACCTTTGACACCAAGAACCATAAGTGGTGTGAATTGACCGTGTTCTTCCCGATAGCATTCCTGCGGGTAGACCTGTCCCAGGCCTTGCGCGACGCAGCCAAGAATTCAGTCATATACGAAATCAAGAACATCAAACGGGCGATCACTAACAAGGAAAAGGACGTCCTATACCTCAAAACTGAAGGGATCAATATAGTACAAATGTCCAAGTACAGTCACCTCTTAGATTTGAACAAGTTATACACGAACGACATTCACGCCATCGCAAACACGTACGGCATTGAAGCGGCAAACAAAGTTATCATAAAAGAAATCCAGAACGTATTCAACGTGTACGGCATCACCGTGGACCCGCGCCACTTGACGCTGGTGGCGGATTACATGACGTACAACGGAATATTTGAGCCCATGAGCAGGAAGGGGATGGAGGCGTCAACTTCACCTTTACAGCAAATGTCCTTCGAATCGTCTTTGATATTCCTGAAGGAGGCTGTTCTGAACTCCAAGAAAGATTTCATCAGGTCGGCGTCCAGCTGTCTCATGCTCGGCCAGCCGTGCCGGGCCGGCACGGGTTCCTTCAGTTTGCAGCATTTCAGTAAAGTTGTCAGCTAA

Protein sequence:

>DPOGS206318-PA
MDFDSRFTSALTGLTSVDPDSVQFLMFTDDDIKKLSVAKIINTISFDAMGNPNKGGLYDPALGPLRERNDFCATCSNSLLHCPGHFGHIELPLTVVNPVFIKNIYTIFRISCLTCFKIQMDDRMKFILKLQLELIDAGHVTAALDLENYVGDVKEYRNESSPEKPNKVIRKYQKLLKHKDPVFETFQNKNTDKLRTKIINHYFKALSVSKLCMYCKSKLIKVTTSDGKIMYNISADKGAKGVKILMPDDLKRYCKAIEQNDSDILVHCVPILKYINIKPVTDIFFMQVVPVLPPIVRPCNVLHGELMEHPQTNVYKGILQSAYSARAVLQVLMTPQHQKAIDGLEQQPRQAYESVMGETPAEKLHTVWQDLQKHINYLLTCEGQGTTSQGLKHILEKKTGVIRMHMMGKRVNFAARSVITPDPNVDIDEIGIPDAFATKLTYPVPVTEWNVDELRKMVINGPNVHPGAEKLEVKNGRIIRIPPDSIEKRKSLAKKLLTPDEYKSSGLKIVHRHLVNGDVLILNRQPSLHKPSMMAHRARILKGEKTLRLHYANCKSYNADFDGDEMNAHFPQNEIARSEAYNIMSVRKQYLVPKDGTPLSGLIQDHVIAGVRMSLRGTFFNKADYQQLVFQALSSHKGEIKLLPPTILKPAVLWSGKQVMSTIIINTIPKGKPYLSLEGKAKISAKAWQKEAPRPWKAGGTPFTNPNTMTEAEVIIRKGELVSGVLDKTHYGATPYGLVHCMYELYGGDSSSALLSSFSKVFTFYLQWIGFTLGVKDILVVDEANKKRDEIISLVRQAGRVAAVKATEVPADVEEQKLKDVISEMLSKDPKFRANLDRQYKNMLDSYTNNINTVCLSEGLLEKFPSNNLQLMVQSGAKGSTVNTMQISCLLGQIELEGKRPPLMISGRSLPSFPPYDISPRAGGFIDGRFMTGIQPQEFFFHCMAGREGLIDTAVKTSRSGYLQRCLIKHLEGLSVAYDHTVRDADSSIIQFAYGEDGLDILKCQFLKDGQFKFLDENSSAVIGKSVIKKLKDETDTKIIAKAQKSLKKWKKKNGSPFEKIRTSGFAKFSSLVRKDIVLDDLPTDQTRDPYYWELEKMWRELDEEERQQYYRKPCPDPIPSKLSPEYKFGTINEQLDGIIQNYLKNRTESSYNEYTEKDKFLEVISAKYLESMAAPGEPVGLLAAQSIGEPSTQMTLNTFHFAGRGDMNVTLGIPRLREILMTASAQLKTPNMDIPFLPNIADLNKKAEKLRQKMNRVTVSDVLEKIEVQCEIVTKPDRQMKTTMRFVFLPFSQYKTQYTVKPPQIIKHMQNKFFNEMFAVIRKQAKSTCGVLWAAEKEKKRRVADDEEDEDNSPDLEERQGQDVDSSDDEGPHDDEDNTDVKIRKKRSEEQEYEDPESEEEEKSDDDLEIDENNTKEKDDDLETVEEVNAEDAKAMEKVVGKITNASNYTFDTKNHKWCELTVFFPIAFLRVDLSQALRDAAKNSVIYEIKNIKRAITNKEKDVLYLKTEGINIVQMSKYSHLLDLNKLYTNDIHAIANTYGIEAANKVIIKEIQNVFNVYGITVDPRHLTLVADYMTYNGIFEPMSRKGMEASTSPLQQMSFESSLIFLKEAVLNSKKDFIRSASSCLMLGQPCRAGTGSFSLQHFSKVVS-