Monarch geneset OGS2.0

DPOGS211146
Transcript: DPOGS211146-TA (3531 bp)
Protein: DPOGS211146-PA (1176 aa)
Genomic position: DPSCF300007 - 132324-135854
RNAseq coverage: 617x (Rank: top 21%)
Annotation
Heliconius        HMEL017204          0.0    95.32%
Bombyx            BGIBMGA003018-TA    0.0    94.73%
Drosophila        RpII140-PA          0.0    93.53%
EBI UniRef50      UniRef50_P30876     0.0    87.23%    DNA-directed RNA polymerase II subunit RPB2 n=1111 Tax=root RepID=RPB2_HUMAN
NCBI RefSeq       XP_313416.3         0.0    94.87%    AGAP003648-PA [Anopheles gambiae str. PEST]
NCBI nr blastp    gi|324120674        0.0    97.96%    RNA polymerase II second largest subunit [Papilio polytes]
NCBI nr blastx    gi|324120674        0.0    97.96%    RNA polymerase II second largest subunit [Papilio polytes]
Group
Gene Ontology
GO:0032549    0           ribonucleoside binding
GO:0003899    3.4e-126    DNA-directed RNA polymerase activity
GO:0003677    3.4e-126    DNA binding
GO:0006351    3.4e-126    transcription, DNA-dependent
KEGG pathway
aga:AgaP_AGAP003648    0.0
K03010 (RPB2) maps to:
    Huntington's disease
    Purine metabolism
    Pyrimidine metabolism
    RNA polymerase
InterPro domain
[1-1177]       IPR015712    0           DNA-directed RNA polymerase, subunit 2
[707-1081]     IPR007120    3.4e-126    DNA-directed RNA polymerase, subunit 2, domain 6
[38-451]       IPR007644    1.1e-69     RNA polymerase, beta subunit, protrusion
[202-394]      IPR007642    4.1e-53     RNA polymerase Rpb2, domain 2
[1083-1173]    IPR007641    5.2e-31     RNA polymerase Rpb2, domain 7
[567-630]      IPR007646    5.5e-29     RNA polymerase Rpb2, domain 4
[806-930]      IPR014724    1.2e-24     RNA polymerase Rpb2, OB-fold
[468-533]      IPR007645    4.8e-23     RNA polymerase Rpb2, domain 3
[653-701]      IPR007647    2.7e-20     RNA polymerase Rpb2, domain 5
Orthology group: MCL10170 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211146-TA
ATGTATGATACAGAAGATGATCAGTATGAGGAAGAAGAAGTCGAAGATATTTCGTCTGAATTATGGCAGGAGGCCTGCTGGATAGTAATAAACGCATATTTCGATGAGAAAGGTCTAGTAAGGCAACAACTCGATAGTTTCGATGAATTCATACAAATGTCAGTCCAACGAATTGTCGAAGATTCCCCTCCCATAGAACTGCAAGCTGAAGCTCAACATTCATCCGGTGAAATAGAGACACCGCCAAAATACCATTTAAAATTTGATCAAATTTATCTTTCTAAACCAACTCATTGGGAAAAAGACGGAGCGCCATCCCCTATGATGCCTAATGAAGCTCGCCTACGTAATTTAACTTACTCTGCACCTTTGTATGTTGATATAACAAAAACCATAGTCAAAGAAAATGAAGATCCTATTGAGACGCAACATCAAAAAACGTTTATTGGAAAAATTCCAATTATGCTCAGATCTACATATTGTTTACTGAGCAATTTGACTGACCGTGATTTGACTGAGTTAAATGAATGTCCTTTAGACCCTGGTGGTTATTTTATTATCAATGGCTCTGAAAAGGTGCTAATTGCTCAAGAAAAAATGGCTACAAATACTGTGTATGTTTTCAGTATGCAGGGTGGTAAATATGCTTATAAAACTGAGATAAGATCTTGCCTTGAACATAGCTCAAGGCCTACATCTACTCTATGGGTTAATATGATGGCAAGAGGAGGACAGAGTATTAAAAAGTCGGCAATTGGTCAGAGGATTGTGGCTATTGTTCCATATATCAAACAGGAAATTCCTATCATGATAGTATTTAGAGCATTGGGTTTTGTGGCAGACAGAGATATTCTAGAACATATCATTTATGACTTTGATGACCCCGAAATGATGGAAATGGTTAAGCCTTCTTTGGATGAAGCTTTTGTTATTCAAGAACAAAATGTTGCTCTTAGCTTCATTGGTGCCAGAGGAGCCCGTCCTGGTGTCACTAAAGAGAGGCGTATCAAATATGCAAGAGAAATTTTGCAAAAGGAAATGCTGCCTCATGTTGGTGTATCTGATTTTTGTGAAACAAAAAAAGCATACTTTCTAGGTTACATGGTACATAGATTACTTTTAGCTGCTTTGGGTAGAAGAGAGTTGGATGACAGAGATCATTATGGAAATAAACGACTTGATTTAGCTGGACCATTATTAGCATTTCTGTTCAGAGGTCTCTTCAAGAATTTATTAAAAGAAGTAAGAATGTACGCTCAGAAATTCATTGACAAAGGAAAAGATTTTAATCTGGAATTGGCAATCAAAACAAAAATTATTACCGATGGTTTGAGATATTCTTTGGCTACTGGAAATTGGGGTGACCAAAAGAAAGCTCATCAGGCAAGAGCCGGAGTATCACAGGTATTGAACAGACTAACCTTTGCCTCTACTTTATCTCACTTGAGGCGTGTCAACTCCCCAATTGGTCGTGACGGCAAACTAGCAAAACCACGTCAGTTACACAATACTTTGTGGGGAATGATATGTCCTGCTGAAACACCAGAAGGAGCTGCTGTCGGTTTGGTCAAGAATTTGGCATTAATGGCTTACATTTCTGTCGGAAGTCAGCCATCTCCCATATTAGAGTTCTTGGAAGAGTGGTCTATGGAAAATTTGGAGGAAATAGCTCCATCAGCCATTGCAGATGCTACAAAAATTTTCGTTAATGGCTGTTGGGTCGGTATACACAGAGATCCAGAGCAATTAATGGCTACATTGCGTAAACTCAGACGTCAAATGGACATTATAGTCTCTGAAGTAAGTATGATCCGAGACATAAGAGATAGAGAAATAAGAATTTATACTGATGCTGGAAGAATTTGTAGACCATTACTTATTGTTGAGAATGGATCTTTACTATTGAAGAAGAAACATATTGATCAATTAAAAGAAAGAGATTATAATAATTATGGTTGGCAGAACTTGGTAGCAAGTGGTGTCGTTGAATATATTGACAC
CCTGGAAGAAGAAACTGTAATGATTGCTATGAACCCTGATGATTTACAACAAATAAAAGAATATGCTTATTGTACTACATACACTCATTGTGAGATTCACCCTGCTATGATATTAGGTGTATGCGCCTCTATTATTCCATTCCCAGATCATAATCAAAGTCCGAGAAACACTTACCAAAGTGCTATGGGCAAACAAGCTATGGGAGTATATATCACAAACTTCCATGTTAGAATGGACACATTAGCTCATGTTCTGTTCTATCCACATAAACCCTTGGTTACTACCAGATCTATGGAATATCTTCGCTTCAGAGAGCTGCCAGCTGGAATCAATTCAATTGTAGCCATTTTATGTTACACTGGATATAATCAAGAGGACAGTGTCATCTTAAACGCTTCAGCTGTAGAGAGAGGGTTCTTCAGATCAGTGTTCTATCGTTCTTATAAAGACTCGGAATCTAAGAGAATCGGTGATCAAGAAGAGCAATTTGAAAAACCAACAAGACAGACGTGTCAAGGGATGAGGAATGCTTTGTATGACAAATTGGATGATGACGGAATTATTGCTCCGGGTATAAGAGTTTCTGGAGATGATGTAGTAATTGGAAAAACAATTACGTTACCAGAAAATGACGATGAGTTGGAAGGCACCACGAAACGCTTCACCAAAAGAGACGCTTCGACATTTTTACGTAACAGTGAAACTGGAATTGTCGATCAAGTTATGTTAACGTTGAATAGCGAAGGATATAAGTTCTGCAAAATTAGGGTTAGATCAGTACGCATACCACAGATTGGCGACAAGTTTGCATCACGGCACGGACAAAAAGGAACCTGTGGGATCCAATACAGGCAAGAAGACATGCCCTTCACTTGTGAGGGGATCACTCCAGACATTATTATTAACCCACACGCCATCCCATCCCGTATGACAATTGGTCACTTGATTGAATGTATTCAGGGGAAAGTGTCATCGAACAAAGGCGAAATAGGTGACGCAACACCGTTTAACGACGCTGTTAACGTGCAAAAGATTTCTTCACTTCTACAAGAATATGGTTATCATCTTAGAGGTAATGAAGTAATGTATAACGGTCACACTGGCAGAAAGATCAACGCCCAAGTGTTCCTGGGGCCCACGTACTATCAACGGTTGAAGCATATGGTGGACGACAAAATTCACTCCAGAGCCCGAGGACCAGTACAGATTTTAGTTCGACAGCCCATGGAGGGTAGGGCTCGGGACGGTGGATTGCGTTTCGGGGAAATGGAGCGTGATTGTCAAATAGCTCACGGAGCCGCTCAGTTTTTGAGAGAGCGATTGTTCGAGGTTTCAGATCCTTACCGCATACACGTTTGCAATTTCTGCGGTTTGATAGCAATAGCCAACCTCCGTAACAATACATTCGAATGCAAAGGATGCAAAAATAAAACACAGATTTCTCAAGTGAGGCTGCCTTACGCTGCAAAGTTGTTGTTCCAAGAACTCATGTCTATGAACATCGCCCCCAGACTTATGGTCGTAAATTAA
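
One way to check the transcript against the protein record is to translate it with the standard genetic code. The sketch below is illustrative only: `CODON_TABLE` and `translate` are hypothetical names, not part of any MonarchBase tooling. It assumes the transcript is a complete coding sequence read in frame from its first base, and it writes the stop codon as `-` to match the protein record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 and last 12 bases of DPOGS211146-TA as listed in this record.
print(translate("ATGTATGATACAGAAGATGATCAGTATGAG"))  # MYDTEDDQYE
print(translate("GTCGTAAATTAA"))                    # VVN-
```

Both fragments agree with the start and end of DPOGS211146-PA. Under the same assumptions, translating the full transcript should reproduce the full protein sequence.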

Protein sequence:

>DPOGS211146-PA
MYDTEDDQYEEEEVEDISSELWQEACWIVINAYFDEKGLVRQQLDSFDEFIQMSVQRIVEDSPPIELQAEAQHSSGEIETPPKYHLKFDQIYLSKPTHWEKDGAPSPMMPNEARLRNLTYSAPLYVDITKTIVKENEDPIETQHQKTFIGKIPIMLRSTYCLLSNLTDRDLTELNECPLDPGGYFIINGSEKVLIAQEKMATNTVYVFSMQGGKYAYKTEIRSCLEHSSRPTSTLWVNMMARGGQSIKKSAIGQRIVAIVPYIKQEIPIMIVFRALGFVADRDILEHIIYDFDDPEMMEMVKPSLDEAFVIQEQNVALSFIGARGARPGVTKERRIKYAREILQKEMLPHVGVSDFCETKKAYFLGYMVHRLLLAALGRRELDDRDHYGNKRLDLAGPLLAFLFRGLFKNLLKEVRMYAQKFIDKGKDFNLELAIKTKIITDGLRYSLATGNWGDQKKAHQARAGVSQVLNRLTFASTLSHLRRVNSPIGRDGKLAKPRQLHNTLWGMICPAETPEGAAVGLVKNLALMAYISVGSQPSPILEFLEEWSMENLEEIAPSAIADATKIFVNGCWVGIHRDPEQLMATLRKLRRQMDIIVSEVSMIRDIRDREIRIYTDAGRICRPLLIVENGSLLLKKKHIDQLKERDYNNYGWQNLVASGVVEYIDTLEEETVMIAMNPDDLQQIKEYAYCTTYTHCEIHPAMILGVCASIIPFPDHNQSPRNTYQSAMGKQAMGVYITNFHVRMDTLAHVLFYPHKPLVTTRSMEYLRFRELPAGINSIVAILCYTGYNQEDSVILNASAVERGFFRSVFYRSYKDSESKRIGDQEEQFEKPTRQTCQGMRNALYDKLDDDGIIAPGIRVSGDDVVIGKTITLPENDDELEGTTKRFTKRDASTFLRNSETGIVDQVMLTLNSEGYKFCKIRVRSVRIPQIGDKFASRHGQKGTCGIQYRQEDMPFTCEGITPDIIINPHAIPSRMTIGHLIECIQGKVSSNKGEIGDATPFNDAVNVQKISSLLQEYGYHLRGNEVMYNGHTGRKINAQVFLGPTYYQRLKHMVDDKIHSRARGPVQILVRQPMEGRARDGGLRFGEMERDCQIAHGAAQFLRERLFEVSDPYRIHVCNFCGLIAIANLRNNTFECKGCKNKTQISQVRLPYAAKLLFQELMSMNIAPRLMVVN-
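
The lengths reported at the top of this record can be cross-checked with a little arithmetic. The sketch below uses a hypothetical helper, `check_lengths`, and assumes the genomic coordinates are 1-based and inclusive, which the record itself does not state.

```python
def check_lengths(transcript_bp: int, protein_aa: int, start: int, end: int) -> dict:
    """Compare transcript, protein, and genomic-span lengths for one gene model."""
    span_bp = end - start + 1  # assumes 1-based, inclusive coordinates
    codons, remainder = divmod(transcript_bp, 3)
    return {
        "span_bp": span_bp,
        "span_matches_transcript": span_bp == transcript_bp,
        "in_frame": remainder == 0,
        # one codon is the stop, so the protein is one residue shorter
        "codons_match_protein": codons - 1 == protein_aa,
    }


# Values taken from the record: 3531 bp, 1176 aa, DPSCF300007 132324-135854.
print(check_lengths(3531, 1176, 132324, 135854))
```

With these values the genomic span is 3531 bp, the same as the transcript length, and 3531 / 3 = 1177 codons corresponds to 1176 residues plus a stop. If the coordinate assumption holds, the matching span would be consistent with a gene model without introns.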