Monarch geneset OGS2.0

DPOGS203050
TranscriptDPOGS203050-TA3258 bp
ProteinDPOGS203050-PA1085 aa
Genomic positionDPSCF300206 + 25605-30863
RNAseq coverage227x (Rank: top 44%)
Annotation
HeliconiusHMEL0161530.088.37% 
BombyxBGIBMGA014045-TA0.087.85% 
DrosophilaRpIII128-PA0.072.75% 
EBI UniRef50UniRef50_Q7Q4160.070.47%DNA-directed RNA polymerase (Fragment) n=2 Tax=Bilateria RepID=Q7Q416_ANOGA
NCBI RefSeqXP_001958974.10.073.31%GF12287 [Drosophila ananassae]
NCBI nr blastpgi|3123747840.073.45%hypothetical protein AND_15508 [Anopheles darlingi]
NCBI nr blastxgi|3838629310.072.92%PREDICTED: DNA-directed RNA polymerase III subunit RPC2 [Megachile rotundata]
Group
Gene OntologyGO:00038990DNA-directed RNA polymerase activity
GO:00325490ribonucleoside binding
GO:00063510transcription, DNA-dependent
GO:00036778.1e-122DNA binding
KEGG pathwaydan:Dana_GF122870.0 
 K03021 (RPC2)maps-> Cytosolic DNA-sensing pathway
    Purine metabolism
    Pyrimidine metabolism
    RNA polymerase
InterPro domain[28-1086] IPR0157120DNA-directed RNA polymerase, subunit 2
[622-995] IPR0071208.1e-122DNA-directed RNA polymerase, subunit 2, domain 6
[28-365] IPR0076441.4e-38RNA polymerase, beta subunit, protrusion
[997-1081] IPR0076411.5e-27RNA polymerase Rpb2, domain 7
[162-337] IPR0076429.4e-24RNA polymerase Rpb2, domain 2
[392-457] IPR0076453e-22RNA polymerase Rpb2, domain 3
[493-554] IPR0076465.9e-20RNA polymerase Rpb2, domain 4
[575-616] IPR0076478.2e-10RNA polymerase Rpb2, domain 5
[734-845] IPR0147241.8e-09RNA polymerase Rpb2, OB-fold
Orthology groupMCL10170 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203050-TA
ATGGGTGAATTAAATAAAAATCACGGTAAAGAGTGGGATTCAGGCAAAAGCCTCAGAGAACCTATAAAAACCTTAGAGAAAAAAAAAATTGTTCAAGCAAATGAAAAAGTATTCTGTGATTCCGATCCATTATTCTACATAAAGTATTTAAACGCCTATGTCGGAACACCAGATCTGGAAGAAGGATTTAATGTGACCAAGCCAACAACACCTCATGAATGTCGACTGAGAGACATGACTTATTCTGCACCAATCACTGTTGACATAGAATATATAAGAGGCAATCAAAGAGTCATTTTTAAAAATAAGCAATTAATTGGAAGAATGCCTTTAATGCTGAGATCCTCGAATTGTGTTTTAACGAATAAATCAGATTTCGAGTTAGCTCAATTAAATGAATGTCCACATGATCCCGGTGGCTATTTTATTATAAGAGGACAAGAAAAAGTAATTTTAATTCAAGAACAACTCTCTAGGAATAGAATGATTGTTGATGAATTTAAAGGTGCCATCCAGTGTCAAGTAACGAGTTCGACCCATGAGAAGAAAACTAGGACTATAGTTATTGTTAAAAATGGGAAATATGTTTTGAGGCACAATGCTTTATCTGATGATATACCAATATGTGTAGTTTTTAAAGCCATGGGTATATGTAGTGACCAAGAAATAATGCAGTTAGTGGGTACAGATGATGCTATTGCAAAAAAAATGGCACCCTGCATCATGGACTGCCATAATTTAAAAGTTTTCACACAGAATCAGGCGCTTGCTTATATTGGTAGCAAATTAAAAGTTAAGAGGTTCCAATCAGCAACATCAAAATTTAGAACTCCGGTGGACGAAGCAAGAGATCTGATAGCAACAACCATTTTAGCACATGTGGTAGTAGAAAATTATAATTTCTATGTGAAAGCTATTTACTTGGCGATAATGGTGAAAAGAGTCATAGAGGCAGAGACTAACAAAGCGGCTATAGATGACCCAGACTATTATGGAAATAAGAGGCTGGAACTGGCAGGGTCACTACTGGCGCTCATGTTTGAAGATCTGTTCAAAAGATTTAATTGGGAATTGAAATCTATAGCCGATAAAATAATACCGAGAGTAAAAGCAGCGCCGTTTGATGGAAATTGGACTATCAAGAGATTCAAAATGGAACGTCATGGCGTCACCCAAGTATTGAGTCGTCTCAGCTATATTTCGGCTCTCGGTATGATGACGAGAGTTAATTCACAGTTCGAAAAGACCAGGAAGGTTTCTGGGCCTAGATCGCTGCAACCGTCACAGTGGGGAATGCTCTGTCCTTCTGACACCCCTGAGGGAGAAGCTTGTGGCTTAGTGAAAAATCTGGCTTTAATGACGCATATTACAACAGAGTGCTCAGAAGGTCCGATATCAAGGCTAGCTTGTAACGCCGGGGTTGAGGACGTTAGACTACTAGGAGGTGAAGAAATAAACCATCCAGCTCTCTATATGGTCTTTTTAAACGGTAACATCTTAGGCGTCACTAGAGAATATAAGAAACTAATAAAAATCTTCAGGATGTTTAGAAGACGAGGTCTGATCTCTGCGTTTGTGTCTATATATCCTAATCACAATCAAAGAACGGTGTATATATGCAGTGACGGTGGTAGACTCTGCCGTCCGTATATAATAGTGGAGAAAGGTTTGCCTTTGGTTCAACAGCATCATATCAATGAGCTCAACAGGGGTATACGAAAGTTTCAAGATTTTCTCAACGATGGTCTCATAGAATATTTGGACGTGAATGAAGAAAATGACAGTCATATAGCGACTGTTGAGACCGAGATAGATCCGTACGTGACAACGCATCTCGAGATAGAGCCGTTCACTATACTTGGAGTTTGTGCGGGTCTTGTGCCTTATCCCCACCACAATCAGAGCCCCAGGAACACATACCAATGTGCTATGGGCAAACAGGCCATGGGTACAATAGGCTATAATCAGAAGAATAGAATAGACACGCTTATGTATAATTTAGTGTACCCGCAGTGTCCGATGGTTAAAACGAGGACTATAGAATTGACAAATTTTGATAAGCTGCCGGCCGGGCAGAACGCTACTGTCGCTGTTATGAGTTACAGTGGCTATGATATAGAAGATGCTTTAATTTTGAACAGAGCATCGATAGACCGTGGTTACGGACGTTGTTTGGTATACAAAAGTGCCAAAACAATTATGAAGCGATACAGCAATCAGACGTCGGACAGAATCCTAGGACCATCAAGAGACGCCAACACTGGAAAAGTTATTAAGGCTCACGAAGTTCTGGATTCCGATGGCATAGCAGCTCCGGGTGAAATGGTCGAGAACCGTCAAGTGTTGATAAACAAACAAATGCCTCCCGCCACGCTCAATCCTATAAATCAAGGTCAACCGCAACAAATAGATTATAAAGACGTGCCAATAACGTACAAAGGGCCAGTCGAATCTTATATAGAAAAGGTCATGGTGTCCTCCAACTCTGAAGATGCGTTTCTAATAAAAATATTACTGAGGCAAACTAGAATACCAGAGATCGGTGATAAATTTAGTTCCAGGCATGGACAGAAAGGAGTGACAGGTTTGATAGTTCAACAGGAAGACATGCCGTTCAATGATAGAGGGATATGTCCCGATATGATCATGAATCCACATGGTTTCCCCTCGAGAATGACGGTGGGAAAAACTATAGAGCTATTGGCGGGAAAAGCTGGACTGATGGAGGGAAAATTTCATTACGGCACAGCATTTGGTGGGTCGAAAGTTCGCGATGTATGTCAGGAGTTGGAAAAGCATGGCTATAACTACCACGGAAAGGATATATTCTATTCTGGTATAACAGGAGAACCTTTGGAAGCGTACATTTATTCCGGGCCAGTTTACTATCAGAAGTTGAAACATATGGTGCAAGACAAAATGCACGCTCGTGCTAGAGGACCGAGAGCTGTACTGACGAGACAACCCACTGAAGGAAGGTCGAGGGATGGAGGACTACGACTTGGGGAGATGGAACGAGATTGCCTCATTGGTTATGGTGCAAGTATGCTGTTGATGGAACGTCTGATGTTAGCGTCTGACGCGTTCAGTGCGGACATTTGCAGTAGTTGCGGCAGGCTGGCCTCCCGTGCTTGGTGCCACGCCTGCCGGTCCTCGGCCGTCTCGGCGGTCGATATGCCTTACGCCTGTAAGCTGTTGTTCCAGGAGCTGGCCTCCATGAATATAGTCCCGAGGCTGAAATTGAAGAAGTACTCGTGA

Protein sequence:

>DPOGS203050-PA
MGELNKNHGKEWDSGKSLREPIKTLEKKKIVQANEKVFCDSDPLFYIKYLNAYVGTPDLEEGFNVTKPTTPHECRLRDMTYSAPITVDIEYIRGNQRVIFKNKQLIGRMPLMLRSSNCVLTNKSDFELAQLNECPHDPGGYFIIRGQEKVILIQEQLSRNRMIVDEFKGAIQCQVTSSTHEKKTRTIVIVKNGKYVLRHNALSDDIPICVVFKAMGICSDQEIMQLVGTDDAIAKKMAPCIMDCHNLKVFTQNQALAYIGSKLKVKRFQSATSKFRTPVDEARDLIATTILAHVVVENYNFYVKAIYLAIMVKRVIEAETNKAAIDDPDYYGNKRLELAGSLLALMFEDLFKRFNWELKSIADKIIPRVKAAPFDGNWTIKRFKMERHGVTQVLSRLSYISALGMMTRVNSQFEKTRKVSGPRSLQPSQWGMLCPSDTPEGEACGLVKNLALMTHITTECSEGPISRLACNAGVEDVRLLGGEEINHPALYMVFLNGNILGVTREYKKLIKIFRMFRRRGLISAFVSIYPNHNQRTVYICSDGGRLCRPYIIVEKGLPLVQQHHINELNRGIRKFQDFLNDGLIEYLDVNEENDSHIATVETEIDPYVTTHLEIEPFTILGVCAGLVPYPHHNQSPRNTYQCAMGKQAMGTIGYNQKNRIDTLMYNLVYPQCPMVKTRTIELTNFDKLPAGQNATVAVMSYSGYDIEDALILNRASIDRGYGRCLVYKSAKTIMKRYSNQTSDRILGPSRDANTGKVIKAHEVLDSDGIAAPGEMVENRQVLINKQMPPATLNPINQGQPQQIDYKDVPITYKGPVESYIEKVMVSSNSEDAFLIKILLRQTRIPEIGDKFSSRHGQKGVTGLIVQQEDMPFNDRGICPDMIMNPHGFPSRMTVGKTIELLAGKAGLMEGKFHYGTAFGGSKVRDVCQELEKHGYNYHGKDIFYSGITGEPLEAYIYSGPVYYQKLKHMVQDKMHARARGPRAVLTRQPTEGRSRDGGLRLGEMERDCLIGYGASMLLMERLMLASDAFSADICSSCGRLASRAWCHACRSSAVSAVDMPYACKLLFQELASMNIVPRLKLKKYS-