Monarch geneset OGS2.0

DPOGS202263
TranscriptDPOGS202263-TA969 bp
ProteinDPOGS202263-PA322 aa
Genomic positionDPSCF300032 - 514197-516060
RNAseq coverage78x (Rank: top 65%)
Annotation
HeliconiusHMEL0056003e-16584.29% 
BombyxBGIBMGA004913-TA2e-14778.31% 
DrosophilaCG3756-PA2e-10958.64% 
EBI UniRef50UniRef50_B4I1972e-9554.13%GM18508 n=1 Tax=Drosophila sechellia RepID=B4I197_DROSE
NCBI RefSeqXP_001658447.15e-11862.20%DNA-directed RNA polymerase [Aedes aegypti]
NCBI nr blastpgi|1571163771e-11662.20%DNA-directed RNA polymerase [Aedes aegypti]
NCBI nr blastxgi|1571163774e-11362.20%DNA-directed RNA polymerase [Aedes aegypti]
Group
Gene OntologyGO:00038991.2e-78DNA-directed RNA polymerase activity
GO:00063511.2e-78transcription, DNA-dependent
GO:00036773.2e-35DNA binding
GO:00469835e-35protein dimerization activity
KEGG pathwayaag:AaeL_AAEL0075681e-117 
 K03027 (RPC5)maps-> Cytosolic DNA-sensing pathway
    Purine metabolism
    Pyrimidine metabolism
    RNA polymerase
InterPro domain[51-317] IPR0112631.2e-78DNA-directed RNA polymerase, RpoA/D/Rpb3-type
[37-320] IPR0090253.2e-35DNA-directed RNA polymerase, RBP11-like
[77-205] IPR0112625e-35DNA-directed RNA polymerase, insert domain
[52-315] IPR0112615.6e-16DNA-directed RNA polymerase, dimerisation
Orthology groupMCL14983 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202263-TA
ATGCCGAAACTTGATGAGAAGCCCAGAGTTTTCTTAGAAGAATTTCGAGTTAAGAATGCACCTGACGATTATGGAATGGCAGACGAAAAATGGAATTTTAAGAAATTCACAAAGAAATTTCGTATTGTTATAGTCCGTATGGATAGTACTGAAATGGAATTTGATTTGATTGGCATTCAACCTGCTTTTGCCAATGCTTTTCGAAGGCTCATGTTAAGTGAAGTACCCAGTATGGCAATTGAAAAGGTGATGATAAAAAATAATACATCCATTATACAGGACGAAGTGCTTGCACATAGGTTGGGGTTGATTCCACTAAAGGCAGACCCGCGGTTATTTGAATTTCGTCCTGAAAATGCTGAAGAAGGGACTGAGTTTGACACTTTAGAGTTTTCATTAAAAATAAAATGCACAAATAATAAATATGGACCCAAAGACTCTTTCCGTGCTGAGGACTTGTATGAAAATCATAGCGTATACTCGTCTTCAATTAAATGGCATCCCATTGGAAATCAGGCGTCAATCCACAAGGAAGCTGATGTTGGTCCAGTGGATGATGACATCCTCATATCCAAGATGAGACCCGGCCATGAACTCGATATGCATCTGGTTGTACATATTTTTGACACAACTGCCTCCTACCGCCTACTTCCTGAAGTAACTCTAACCCGTGAAGTGGATGGAAGTGAAGCGACCTTGCTACAGAGCTGCTTTTCACCTGGAGTCATCGGCCTGGATTCTGATGGCAAAGCTTTTGTGAGGGACGCCAGATATGACAGCTGCAGCAGAAATGCTTACAGATATGACTGTATAAAAGACGCCGTAATATTAAGTAGAATACGTGATCATTTTATATTTAATGTGGAGTCAGTGGGTGCCATGCCTCCGAATGTCATATTTGTTGAAGCTGTTAAAATATTAAGAGATAAATGTAAAACATTACTGGATGAATTAAATAATTTTTCTTAA

Protein sequence:

>DPOGS202263-PA
MPKLDEKPRVFLEEFRVKNAPDDYGMADEKWNFKKFTKKFRIVIVRMDSTEMEFDLIGIQPAFANAFRRLMLSEVPSMAIEKVMIKNNTSIIQDEVLAHRLGLIPLKADPRLFEFRPENAEEGTEFDTLEFSLKIKCTNNKYGPKDSFRAEDLYENHSVYSSSIKWHPIGNQASIHKEADVGPVDDDILISKMRPGHELDMHLVVHIFDTTASYRLLPEVTLTREVDGSEATLLQSCFSPGVIGLDSDGKAFVRDARYDSCSRNAYRYDCIKDAVILSRIRDHFIFNVESVGAMPPNVIFVEAVKILRDKCKTLLDELNNFS-