Monarch geneset OGS2.0

DPOGS206294
Transcript: DPOGS206294-TA, 3393 bp
Protein: DPOGS206294-PA, 1130 aa
Genomic position: DPSCF300290 + 400758-411261
RNAseq coverage: 256x (Rank: top 41%)
Annotation
Heliconius        HMEL013125        0.0  74.79%
Bombyx            BGIBMGA010804-TA  0.0  75.03%
Drosophila        RpI135-PA         0.0  53.88%
EBI UniRef50      UniRef50_E2AY84   0.0  54.61%  DNA-directed RNA polymerase n=2 Tax=Formicidae RepID=E2AY84_CAMFO
NCBI RefSeq       XP_001606574.1    0.0  60.87%  PREDICTED: similar to CG4033-PA, partial [Nasonia vitripennis]
NCBI nr blastp    gi|345484997      0.0  60.48%  PREDICTED: DNA-directed RNA polymerase I subunit RPA2 [Nasonia vitripennis]
NCBI nr blastx    gi|345484997      0.0  60.39%  PREDICTED: DNA-directed RNA polymerase I subunit RPA2 [Nasonia vitripennis]
Group
Gene Ontology
GO:0003899  0         DNA-directed RNA polymerase activity
GO:0032549  0         ribonucleoside binding
GO:0006351  0         transcription, DNA-dependent
GO:0003677  3.1e-101  DNA binding
GO:0005634  3.6e-13   nucleus
KEGG pathway
nvi:100121840  0.0
K03002 (RPA2) maps to:
    Purine metabolism
    Pyrimidine metabolism
    RNA polymerase
InterPro domain
[6-1127]     IPR015712  0         DNA-directed RNA polymerase, subunit 2
[670-1026]   IPR007120  3.1e-101  DNA-directed RNA polymerase, subunit 2, domain 6
[34-434]     IPR007644  7.4e-33   RNA polymerase, beta subunit, protrusion
[452-516]    IPR007645  4.4e-23   RNA polymerase Rpb2, domain 3
[1028-1120]  IPR007641  1.3e-17   RNA polymerase Rpb2, domain 7
[564-621]    IPR009674  3.6e-13   RNA polymerase I, Rpa2 specific
[209-371]    IPR007642  1.6e-11   RNA polymerase Rpb2, domain 2
Orthology group: MCL11850, single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206294-TA
ATGGATCCCAAAAAGATCCTACACGAACCGTCTTTGAAGTATACTAGTAATCCTGATTATAGAAGACCACCAAAAACCGCTAACCCGTACTTACAATGTCTGGGAACTCCGCATATAGATTCATTCAATTACATGATCAAAGATGGATTAAAAGCTGCCATAGATGATTTAATTCCCGTCGAATTTGATGTGCCAAGCGGAGAAAGAATTAAAATAACTATAGATGAAGCTGCTTTCGCGAAACCAAGTGTTCCTATGGAGGCTGTAGGAGTTAAAAATCAAATAGTCTTGCCGACAGAATGCAGACAAAGAGCAGCTACATATAAAGGAGATTTCAAAGTTAGATTATCTTTTACCGTTGATGGGAAGACCATATCAATGGACAGATCCCTCGGCAGTTTGCCAATCATGTTAAAGTCCAAAGTTTGCCACTTGGCTGACCTGTCTCCTGAGGAACTGGTGAAGAAGAATGAACATGCGGACGAGTGGGGCGGATACTTTATTATTAAGGGTCATGAACGTCTGGCTCGCATGTTGTTAGTCACCAGACGGAATTACCCCGTCGCTATCAAGAGATCCGGTTGGAGGATGAGAGGGAATCTGTTCACGGATTACGGTGTGCTCATGAGATGTGTGAAACCAGATCAAACTAGTACTAACAACGTACTGCATTTTCTCCAAAATGGAACTTGCAAATTAATGTTCTCTCATCGTAAAGTGATGTACTACGCTCCGCTGGTGCTGATACTAAAGTGTCTCGTGGACTGGCCCGACCATTACATATACAGATTACTCCTACACGGAAAGAAGAATGACTTGTATTATGTTAACTGTGTGCAGAACATGCTCCGGGAACTTCACGAGCAGGATCTCCATACTTCTGTTGAATGTCGTTCCTACATGGGTCGTATGTTCAGAGCACGGCTGGATCTTCCGCCGGATGCTACAGATCTGGATGCTGCGAACTTCCTGTTAGTGAGGTGCATCATGATACACCTGAACGATTACAAGGATAAGTTCTACGGATTGGTGTTCATGAATCAGAAGCTGTTCGATCTGGTGCAGAATAAATGCAAGGTGGAGGGAGCTGATGCTGTGATGGTGCAGGAGTTGCAGGTGGGAGGTCACCTGTACTTGCAGGTGTTGAAGGAACGCCTCCAGACCTTGCTTTACGTCCTCAAAGCCAATATCATCAAGAAGTCTAAAACCAGCAGATTGTCGCTGACTTCGAAAGAACTGCAGCAAATAATACGTTCAGCCGGCGGCCTGGAACAGAAGATGGAGACGTTCCTAGCGACTGGTAACGCTCCGTCCAACAACGTCAACCTGGCGCAGTACAAAGGACTGACCATAGTCGCTGAAAACCTCAACAGAATGAGATACATGTCGCATTTCAAGGCGATACATCGCGGTTCGTTCTTCATGGAGATGCGTACGACGGAGGCACGTCAACTGTTGCCAGATGCTTGGGGCTTCGTGTGTCCCGTACATACGCCCGATGGAGCACCCTGTGGCTTACTCAACCATCTCACCGCCTCCGCACAGGTCACCCAACAACCCGATCCCAAGCAAGTGTCATCTCTACCGGCCGTTCTCGAGAAATGCGGGATGGACCCTATAAGCTCTGTGGCCCACACTCCGTTGACCCACGATGTCTACAAATATCCGGTATTCATAGATGGTAGGCTGGTGGGCTATTTCAATGAGGACACTGCCCTGAAATCGGTGTCATACCTCCGAACGTTGAAGGTCAAAGGTGAAGACGTGCCTATATCCACTGAGATTGTTATGGTGCCTAAAAAACAGATACCGGCCCAATATGCCGGCGTGTTTCTATTCACGAGCGAAGCGCGCATGATGCGGCCGGTCATTAATCTGTCGACGGGTCAACTTGAACTGGTCGGCACTATGGAACAACTCTACTTGGACATAGCCGTAGCACAGACAGAGATCATCAAAGGCAAAACCACCCACTTGGAGTTATCGAAATCAGCGTTTAT
GAGTAATTTGGCCCAACTAGTTCCCATGCCGGACTGCAACCAATCACCGCGTAACATGTACCAGTGTCAGATGGGTAAACAAACGATGGGAACCCCTATCCATACTTGGTCTACGAACGCTGAGACCAAGTTATACCGGTTGCAGACGGGTGCTACGCCGCTCTTCCGGCCGGTACACCACGACAACCTCAGTCTAGACGACTATCCTTCCGGCACAAACGCAATACTCGCCGTTATATCCTACACAGGCTACGATATGGAAGACGCCATGATAATAAACAAATCGTCGTACGAGCGAGGTTTCGCAGCGGGTTCCGTATACAAATCCAACTTCGCGGAACTGAAGAGCTCGTCTTCATACTTCTGCCGCGACCCCACGAGAACTGACCTCGCGGCTTACATGGACGAGGACGGACTCCCGGCCGTGGGGGCGAGGATACAACCCGAGGATCCCTTCTACTGCCACTACGACAGCGACAGTTCAAAGTTCGTGGTAACGAAATACCACGGCAAGGAGGAGGTTGTGGTGGACAGTGTGAGGCTCTGCGGGGAGTTCAGCAGCAAGGCTCCCAAAAAAGCTTGCATCATGGTCAGAGTGCAGCGCAATCCGACGGTTGGTGACAAATTCGCTTCACGAGCTGGTCAAAAGGGAATCTGTTCCCAGAAATGGCCGGCCGAGGACTTACCCTTCACTGAATCGGGCCTCATACCGGACGTGCTGTTCAATCCGCACGGCTTCCCCTCGCGGATGACCATCGCCATGATGATAGAGTGTATGGCGGGGAAAGCCGCCTGTGCCTGCACGGACACGTCGGTCCACGACGCTACGCCGTTCCGCTTCAACGAACAGGACACGGCCATCAATTACTTCGGGCGCCTGCTGGAGGCCGGCGGCTACAACTACTACGGCACGGAGAGGATCTACAGCGGCGTCGACGGCCGCGAGATGCAGGCCGACATATTCTGCGGACTAGTGCACTACCAGCGGCTGCGGCACATGGTGTCCGACAAGTGGCAGGTCCGCACGACCGGGGCCGTGGACGCTCTCACCCGTCAGCCCGTGAAGGGGCGGCGGCGCGGAGGAGGAGTAAGGCTCGGAGAAATGGAAAGGGACGCGCTCTTAGCACACGGAGCCACTTTTCTACTACAAGACAGACTCTTCCACTGTTCAGACAAGAGCGAGGCTATTATTTGCTCCAAATGCGGCACACTCCTCGGTCCGATATCTGGTAACACCGAGGGTAGCAAGGACACGTGCCGCTTGTGCGGCGAGGGGAACTTGTTGCTCATATCGATACCCTATATATTCAAGTTCTTTGTGACCCAGTTGGCCTCCGTCAATATTAATATTAAAATCAACTGTAACAGCAACTTGGCGATAGGAAGCTGCTGA

Protein sequence:

>DPOGS206294-PA
MDPKKILHEPSLKYTSNPDYRRPPKTANPYLQCLGTPHIDSFNYMIKDGLKAAIDDLIPVEFDVPSGERIKITIDEAAFAKPSVPMEAVGVKNQIVLPTECRQRAATYKGDFKVRLSFTVDGKTISMDRSLGSLPIMLKSKVCHLADLSPEELVKKNEHADEWGGYFIIKGHERLARMLLVTRRNYPVAIKRSGWRMRGNLFTDYGVLMRCVKPDQTSTNNVLHFLQNGTCKLMFSHRKVMYYAPLVLILKCLVDWPDHYIYRLLLHGKKNDLYYVNCVQNMLRELHEQDLHTSVECRSYMGRMFRARLDLPPDATDLDAANFLLVRCIMIHLNDYKDKFYGLVFMNQKLFDLVQNKCKVEGADAVMVQELQVGGHLYLQVLKERLQTLLYVLKANIIKKSKTSRLSLTSKELQQIIRSAGGLEQKMETFLATGNAPSNNVNLAQYKGLTIVAENLNRMRYMSHFKAIHRGSFFMEMRTTEARQLLPDAWGFVCPVHTPDGAPCGLLNHLTASAQVTQQPDPKQVSSLPAVLEKCGMDPISSVAHTPLTHDVYKYPVFIDGRLVGYFNEDTALKSVSYLRTLKVKGEDVPISTEIVMVPKKQIPAQYAGVFLFTSEARMMRPVINLSTGQLELVGTMEQLYLDIAVAQTEIIKGKTTHLELSKSAFMSNLAQLVPMPDCNQSPRNMYQCQMGKQTMGTPIHTWSTNAETKLYRLQTGATPLFRPVHHDNLSLDDYPSGTNAILAVISYTGYDMEDAMIINKSSYERGFAAGSVYKSNFAELKSSSSYFCRDPTRTDLAAYMDEDGLPAVGARIQPEDPFYCHYDSDSSKFVVTKYHGKEEVVVDSVRLCGEFSSKAPKKACIMVRVQRNPTVGDKFASRAGQKGICSQKWPAEDLPFTESGLIPDVLFNPHGFPSRMTIAMMIECMAGKAACACTDTSVHDATPFRFNEQDTAINYFGRLLEAGGYNYYGTERIYSGVDGREMQADIFCGLVHYQRLRHMVSDKWQVRTTGAVDALTRQPVKGRRRGGGVRLGEMERDALLAHGATFLLQDRLFHCSDKSEAIICSKCGTLLGPISGNTEGSKDTCRLCGEGNLLLISIPYIFKFFVTQLASVNINIKINCNSNLAIGSC-
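
Consistency check (illustrative sketch):

The record lists a 3393 bp transcript and a 1130 aa protein. Assuming the transcript is coding sequence only and includes the terminal stop codon (the nucleotide sequence above starts with ATG and ends with TGA, and the protein ends with "-"), the two lengths agree: 3393 / 3 = 1131 codons, one of which is the stop. The Python sketch below shows that arithmetic, along with a small reader that joins FASTA sequences wrapped over several lines. The function names `read_fasta` and `expected_protein_length` are hypothetical helpers written for this example and are not part of any MonarchBase tooling.

```python
def read_fasta(text):
    """Parse FASTA text into {record_id: sequence}, joining wrapped lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            # The record ID is the first token after '>'.
            name = line[1:].split()[0]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {key: "".join(parts) for key, parts in records.items()}


def expected_protein_length(cds_bp, includes_stop=True):
    """Number of amino acids encoded by a coding sequence of cds_bp bases.

    Assumes the sequence is in frame from its first base; if includes_stop
    is True, the final codon is a stop and encodes no residue.
    """
    if cds_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    # Lengths as listed in the record above.
    print(expected_protein_length(3393))  # 1130
```

To check the sequences themselves, the lengths of the strings returned by `read_fasta` for DPOGS206294-TA and DPOGS206294-PA (minus the trailing "-") can be compared against these figures.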