Monarch geneset OGS2.0

DPOGS202328
TranscriptDPOGS202328-TA1602 bp
ProteinDPOGS202328-PA533 aa
Genomic positionDPSCF300032 + 593429-596658
RNAseq coverage203x (Rank: top 47%)
Annotation
HeliconiusHMEL0050996e-5340.53% 
BombyxBGIBMGA004994-TA2e-14696.00% 
DrosophilaRpII33-PA5e-13084.80% 
EBI UniRef50UniRef50_F4X4P82e-11577.20%DNA-directed RNA polymerase II subunit RPB3 n=2 Tax=Myrmicinae RepID=F4X4P8_ACREC
NCBI RefSeqNP_001040388.16e-14596.00%DNA-directed RNA polymerase II polypeptide [Bombyx mori]
NCBI nr blastpgi|406450831e-14496.40%homologue of DNA-directed RNA polymerase II subunit [Antheraea pernyi]
NCBI nr blastxgi|406450832e-14095.65%homologue of DNA-directed RNA polymerase II subunit [Antheraea pernyi]
Group
Gene OntologyGO:00038994.4e-87DNA-directed RNA polymerase activity
GO:00063514.4e-87transcription, DNA-dependent
GO:00081522.1e-47metabolic process
GO:00038242.1e-47catalytic activity
GO:00301702.1e-47pyridoxal phosphate binding
GO:00469834.6e-42protein dimerization activity
GO:00036772.6e-23DNA binding
KEGG pathwayame:5524584e-132 
 K03011 (RPB3)maps-> Huntington's disease
    Purine metabolism
    Pyrimidine metabolism
    RNA polymerase
InterPro domain[18-252] IPR0112634.4e-87DNA-directed RNA polymerase, RpoA/D/Rpb3-type
[281-484] IPR0019262.1e-47Pyridoxal phosphate-dependent enzyme, beta subunit
[44-174] IPR0112624.6e-42DNA-directed RNA polymerase, insert domain
[4-265] IPR0090252.6e-23DNA-directed RNA polymerase, RBP11-like
[20-249] IPR0112615.8e-07DNA-directed RNA polymerase, dimerisation
Orthology groupMCL14950 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202328-TA
ATGCCGTACGCAAATCAACCATCAGTACATATAACAGAATTAACAGACGACAATGTTAAATTTGTCGTCGAAGATACAGAGCTCAGCGTTGCCAACAGCATGCGACGAGTTTTCATAGCAGAGACGCCGACAATGGCTATTGACTGGGTTCAACTGGAAGCAAACTCAACAGTTCTAAGTGACGAATTCCTCGCTCATCGTATTGGTTTAATACCCCTGATATCTGATGACGTCGTTGATAAAATACGTTACTCCAGGGACTGCATGTGTGCAGATTTCTGTACTGAATGCAGTGTGGAATTTACATTGGATGTAAAGTGCACTGGCGATCAAACAAGACATGTGACAACTGCTGATTTAAAATCTAGTGACCCACGAGTGGTGCCGGTCACTTCACGCCATAGAGATGAAGACCAGGCAGATTATGGTGAACTTGATGAAATCTTGATTATCAAGCTACGTAAGGGACAAGAATTAAAGTTGCGAGCATATGCCAAAAAGGGGTTCGGAAAAGAGCATGCCAAATGGAATCCAACATCCGGTGTGTGTTTTGAGTATGATCCTGACAATATAATGAGACATACATTATACCCAAAGCCAGACGAGTGGCCGAAGAGTGAGCATACAGAGCTGGATGAGGATCAGTATGAAGCTGATTTCAACTGGGAAGCAAAACCAAACAAATTCTTCTATAATGTAGAATCCTCGGGAGCATTGAAACCGGAGAATATAGTGTTGATGGGCATTGTTGAGTCTAAGTATGTAGGGAATCATTTTTGCAATGTGCTACTAAAATGCGAAAACCTACAACTTACCGGCAGTTCTGACAGGTTGGACTTGTTAGCTGGATATGGGACATTAGGGTTGGAGATTCTTTCTCAGGTTGATAAGTTAGATGCTATAATATGCCCCGTCGGCACCGGAGGCCTCATAGCTAGCATATTGGTGGCAGTTAAATCCTTAAAACCCCAATGTTTGATTTATGGTGTTGAAAGTTCAGGAGCACCAACTATGAGTAAAGCCTTGCAAGAGAAGAAACCAACGATAATTCTAGTCAAACCAACAATTGCTGAAAGCATTGCCGATAAAATAGCAAGCAACAATGCTTTTCATATCATTAAAAGATATCTAACCAAAATGATCACTGTTGATGATTTGTGGATATCAAGAGCTATGGTTAACTTATTGGAGAGAGAAAAGATAATAGTGGAAGCGGCTGCACCTACACCAGTCGCCGCCATAATGGCGGGAAAAGTTCCTGAACTACGAGGAAAAAATGTCGTGTGCGTGTTGACTGGTGGTAATATAAAATTATCTCGTCTGCCTTACATCGTGGATCGCGGTCTAATGGCTGAAGGGCGGCTCGTTGGGTTTTCGATCACTTTGCCGGATGGACCAGCTGAGATTGCCCGTCTGCTGACTAAAGTTGTGGATACCGGTGCGGACGTGAGGAGTTTCGAACCGGAACGGTCTTGGATTAAGCGAGACGCTCTTAATGTCACCGTATTCATGTTACTTGAAACCAGTGATCCCAGTCACGCGAAGGAATTGGAGAAGAAATTACTTAAAGATTATCCTCATTCTCAAATCATTTCAGCATAA

Protein sequence:

>DPOGS202328-PA
MPYANQPSVHITELTDDNVKFVVEDTELSVANSMRRVFIAETPTMAIDWVQLEANSTVLSDEFLAHRIGLIPLISDDVVDKIRYSRDCMCADFCTECSVEFTLDVKCTGDQTRHVTTADLKSSDPRVVPVTSRHRDEDQADYGELDEILIIKLRKGQELKLRAYAKKGFGKEHAKWNPTSGVCFEYDPDNIMRHTLYPKPDEWPKSEHTELDEDQYEADFNWEAKPNKFFYNVESSGALKPENIVLMGIVESKYVGNHFCNVLLKCENLQLTGSSDRLDLLAGYGTLGLEILSQVDKLDAIICPVGTGGLIASILVAVKSLKPQCLIYGVESSGAPTMSKALQEKKPTIILVKPTIAESIADKIASNNAFHIIKRYLTKMITVDDLWISRAMVNLLEREKIIVEAAAPTPVAAIMAGKVPELRGKNVVCVLTGGNIKLSRLPYIVDRGLMAEGRLVGFSITLPDGPAEIARLLTKVVDTGADVRSFEPERSWIKRDALNVTVFMLLETSDPSHAKELEKKLLKDYPHSQIISA-