Monarch geneset OGS2.0

DPOGS206293
TranscriptDPOGS206293-TA2022 bp
ProteinDPOGS206293-PA673 aa
Genomic positionDPSCF300290 + 393733-398856
RNAseq coverage1x (Rank: top 93%)
Annotation
HeliconiusHMEL0131259e-10274.47% 
BombyxBGIBMGA010804-TA0.072.71% 
DrosophilaRpI135-PA2e-15944.70% 
EBI UniRef50UniRef50_E0V8Y65e-16643.95%DNA-directed RNA polymerase n=1 Tax=Pediculus humanus corporis RepID=E0V8Y6_PEDHC
NCBI RefSeqXP_001606574.10.051.00%PREDICTED: similar to CG4033-PA, partial [Nasonia vitripennis]
NCBI nr blastpgi|3454849970.050.74%PREDICTED: DNA-directed RNA polymerase I subunit RPA2 [Nasonia vitripennis]
NCBI nr blastxgi|3838524080.051.52%PREDICTED: DNA-directed RNA polymerase I subunit RPA2-like [Megachile rotundata]
Group
Gene OntologyGO:00038991.3e-227DNA-directed RNA polymerase activity
GO:00325491.3e-227ribonucleoside binding
GO:00063511.3e-227transcription, DNA-dependent
GO:00036772.9e-26DNA binding
GO:00056341.9e-13nucleus
KEGG pathwaynvi:1001218400.0 
 K03002 (RPA2)maps-> Purine metabolism
    Pyrimidine metabolism
    RNA polymerase
InterPro domain[6-659] IPR0157121.3e-227DNA-directed RNA polymerase, subunit 2
[34-170] IPR0076442.9e-26RNA polymerase, beta subunit, protrusion
[451-515] IPR0076452.3e-23RNA polymerase Rpb2, domain 3
[563-620] IPR0096741.9e-13RNA polymerase I, Rpa2 specific
[209-370] IPR0076421.1e-11RNA polymerase Rpb2, domain 2
Orthology groupMCL11850 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206293-TA
ATGGATCCCAAAAAGATCCTACACGAACCGTCTTTGAAGTATACTAGTAATCCTGATTATAGAAGACCACCAAAAACCGCTAACCCGTACTTACAATGTCTGGGAACTCCGCATATAGATTCATTCAATTACATGATCAAAGATGGATTAAAAGCTGCCATAGATGATTTAATTCCCGTCGAATTTGATGTGCCAAGCGGAGAAAGAATTAAAATAACTATAGATGAAGCTGCTTTCGCGAAACCAAGTGTTCCTATGGAGGCTGTAGGAGTTAAAAATCAAATAGTCTTGCCAACAGAATGCAGACAAAGAGCAGCTACATATAAAGGAGATTTCAAAGTTAGATTATCTTTTACCGTTGATGGGAAGACCATATCAATGGACAGATCCCTCGGCAGTTTGCCAATCATGTTAAAGTCCAAAGTTTGCCACTTGGCTGACCTGTCTCCTGAGGAACTGGTGAAGAAGAATGAACATGCGGACGAGTGGGGCGGATACTTTATTATTAAGTATTATTTCTTGCTATCCCAACCTGGCCCCGCCAAAGGAAGCAGTGAACTCGCTATCAAGAGATCCGGTTGGAGGATGAGAGGGAATCTGTTCACAGATTACGGTGTGCTCATGAGATGTGTGAAACCAGATCAAACTAGTACTAACAACGTACTGCATTTTCTCCAAAATGGAACTTGCAAATTAATGTTCTCTCATCGTAAAGTGATGTACTACGCTCCGCTGGTGCTGATACTAAAGTGTCTCGTGGACTGGCCGGACCATTACATATACAGATTACTCTTACACGGAAAGAAGAATGACTTGTATTATGTTAACTGTGTACAGAACATGCTCCGGGAACTTCACGAGCAGGATCTCCATACTTCTGTCGAATGTCGTTCCTACATGGGTCGTATGTTCAGAGCACGGCTGGATCTTCCGCCGGATGCTACAGATCTGGATGCTGCGAACTTCCTGTTAGTGAGGTGCATCATGATACACCTGAACGATTACAAGGATAAGTTCTACGGATTGGTGTTCATGAATCAGAAGCTGTTCGATCTGGTGCAGAATAAATGCAAGGTGGAGGGAGCTGATGCTGTGATGGTGCAGGAGTTGCAGGTGGGAGGTCACCTGTACTTGCAGGTGTTGAAGGAACGCCTCCAGACCTTGCTTTACGTCCTCAAAGCCAATATCATCAAGAAGTCTAAAACCAGCAGATTGTCGCTGACTTCGAAAGAACTGCAGCAAATAATACGTTCAGCCGGCGGCCTGGAACAGAAGATGGAGACGTTCCTAGCGACTGGTAACGCTCCGTCCAACAACGTCAACCTGGCGCAGTACAAAGGACTGACCATAGTCGCTGAAAACCTCAACAGAATGAGATACATGTCGCATTTCAAGGCGATACATCGCGGTTCGTTCTTCATGGAGATGCGTACGACGGAGGCACGTCAACTGTTGCCAGATGCTTGGGGCTTCGTGTGTCCCGTACATACGCCCGATGGAGCACCCTGTGGCTTACTCAACCATCTCACCGCCTCCGCACAGGTCACCCAACAACCCGATCCCAAGCAAGTGTCATCTCTACCGGCCGTTCTCGAGAAATGCGGCATGGACCCTATAAGCTCTGTGGCCCACACTCCGCTGACCCACGATGTCTACAAATATCCGGTATTCATAGATGGTAGGCTGGTGGGCTATTTCAATGAGGACACTGCCCTCAAATCGGTGTCATACCTCCGAACGTTGAAGGTCAAAGGTGAAGACGTGCCTATATCCACTGAGATTGTTATGGTGCCTAAAAAACAGATACCGGCCCAATATGCCGGCGTGTTTCTATTCACGAGCGAAGCCCGCATGATGCGGCCGGTTATTAATCTGTCCACGGGTCAACTTGAACTGGTCGGCACTATGGAACAACTCTACTTGGACATAGCTGTGGCACAGACAGAGATCATCAAAGGTATTATTGCGTACGCTACTAGTGTCATCCGCGGCTTTGTCTGCGTTTTGGTCGATATTGATTGA

Protein sequence:

>DPOGS206293-PA
MDPKKILHEPSLKYTSNPDYRRPPKTANPYLQCLGTPHIDSFNYMIKDGLKAAIDDLIPVEFDVPSGERIKITIDEAAFAKPSVPMEAVGVKNQIVLPTECRQRAATYKGDFKVRLSFTVDGKTISMDRSLGSLPIMLKSKVCHLADLSPEELVKKNEHADEWGGYFIIKYYFLLSQPGPAKGSSELAIKRSGWRMRGNLFTDYGVLMRCVKPDQTSTNNVLHFLQNGTCKLMFSHRKVMYYAPLVLILKCLVDWPDHYIYRLLLHGKKNDLYYVNCVQNMLRELHEQDLHTSVECRSYMGRMFRARLDLPPDATDLDAANFLLVRCIMIHLNDYKDKFYGLVFMNQKLFDLVQNKCKVEGADAVMVQELQVGGHLYLQVLKERLQTLLYVLKANIIKKSKTSRLSLTSKELQQIIRSAGGLEQKMETFLATGNAPSNNVNLAQYKGLTIVAENLNRMRYMSHFKAIHRGSFFMEMRTTEARQLLPDAWGFVCPVHTPDGAPCGLLNHLTASAQVTQQPDPKQVSSLPAVLEKCGMDPISSVAHTPLTHDVYKYPVFIDGRLVGYFNEDTALKSVSYLRTLKVKGEDVPISTEIVMVPKKQIPAQYAGVFLFTSEARMMRPVINLSTGQLELVGTMEQLYLDIAVAQTEIIKGIIAYATSVIRGFVCVLVDID-