Monarch geneset OGS2.0

DPOGS202262
Transcript: DPOGS202262-TA, 774 bp
Protein: DPOGS202262-PA, 257 aa
Genomic position: DPSCF300032 - 517710-520043
RNAseq coverage: 30x (Rank: top 75%)
Annotation
Heliconius       HMEL005600        4e-129  82.89%
Bombyx           BGIBMGA004913-TA  3e-115  78.71%
Drosophila       CG3756-PA         2e-85   57.53%
EBI UniRef50     UniRef50_O15160   8e-74   53.17%  DNA-directed RNA polymerases I and III subunit RPAC1 n=51 Tax=Opisthokonta RepID=RPAC1_HUMAN
NCBI RefSeq      XP_001844509.1    8e-94   60.84%  DNA-directed RNA polymerase I 40 kDa polypeptide [Culex quinquefasciatus]
NCBI nr blastp   gi|170033286      1e-92   60.84%  DNA-directed RNA polymerase I 40 kDa polypeptide [Culex quinquefasciatus]
NCBI nr blastx   gi|170033286      7e-90   60.84%  DNA-directed RNA polymerase I 40 kDa polypeptide [Culex quinquefasciatus]
Group
Gene Ontology:
  GO:0003899  6.4e-42  DNA-directed RNA polymerase activity
  GO:0006351  6.4e-42  transcription, DNA-dependent
  GO:0046983  6.4e-32  protein dimerization activity
  GO:0003677  3.5e-14  DNA binding
KEGG pathway:
  cqu:CpipJ_CPIJ003123  2e-93
  K03027 (RPC5) maps ->
    Cytosolic DNA-sensing pathway
    Purine metabolism
    Pyrimidine metabolism
    RNA polymerase
InterPro domain:
  [51-256]  IPR011263  6.4e-42  DNA-directed RNA polymerase, RpoA/D/Rpb3-type
  [82-203]  IPR011262  6.4e-32  DNA-directed RNA polymerase, insert domain
  [37-148]  IPR009025  3.5e-14  DNA-directed RNA polymerase, RBP11-like
  [52-207]  IPR011261  1.2e-07  DNA-directed RNA polymerase, dimerisation
Orthology group: MCL44097 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202262-TA
ATGCCGAAACTTGATGAGAAGCCCAGAGTTTTCTTAGAAGAATTTCGAGTTAAGAATGCACCTGACGATTATGGAATGGCAGACGAAAAATGGAATTTTAAGAAATTCACAAAGAAATTTCGTATTGTTATAGTCCGTATGGATAGTACTGAAATGGAATTTGATTTGATTGGCATTCAACCTGCTTTTGCCAATGCTTTTCGAAGGCTCATGTTAAGTGAAGTACCCAGTATGGCAATTGAAAAGGACGAAGTGCTTGCACATAGATTGGGGTTGATACCACTAAAGGCAGACCCGCGGTTATTTGAATTTCGTCCTGAAAATGCTGAAGAAGGGACTGAGTTTGACACTTTAGAGTTTTCATTAAAAATAAAATGCACAAATAATAAATATGGACCCAAAGACTCTTTCCGTGCTGAGGACTTGTATGAAAATCATAGCGTATACTCGTCTTCAATTAAATGGCATCCCATTGGAAATCAGGCGTCAATCCACAAGGAAGCTGATGTTGGTCCAGTGGATGATGACATCCTCATATCCAAGATGAGACCCGGCCATGAACTCGATATGCATCTGGTTGCTGTTAAAGGCATCGGCAAGGATCACGCCAAGTTCTCACCAGTGGCAACTGCCTCCTACCGCCTACTTCCTGAAGTAACTCTAACCCGTGAAGTGGATGGAAGTGAAGCGACCTTGCTACAGAGCTGCTTTTCACCTGGAGTCATCGGCCTGGATTCTGATGGCAAAGCTTTTCTACTTAAACGACCTCAATAA

Protein sequence:

>DPOGS202262-PA
MPKLDEKPRVFLEEFRVKNAPDDYGMADEKWNFKKFTKKFRIVIVRMDSTEMEFDLIGIQPAFANAFRRLMLSEVPSMAIEKDEVLAHRLGLIPLKADPRLFEFRPENAEEGTEFDTLEFSLKIKCTNNKYGPKDSFRAEDLYENHSVYSSSIKWHPIGNQASIHKEADVGPVDDDILISKMRPGHELDMHLVAVKGIGKDHAKFSPVATASYRLLPEVTLTREVDGSEATLLQSCFSPGVIGLDSDGKAFLLKRPQ-
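
The transcript and protein records above are related by codon translation: 774 nt is 258 codons, which is 257 amino acids plus a stop codon, shown as the trailing "-" in the protein record. The sketch below illustrates one way to check that relationship. It assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly, and the function name translate and its stop_symbol parameter are hypothetical, chosen only for this illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in TCAG order. Assumption: the record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are rendered as stop_symbol, matching the trailing "-"
    used in the protein record above.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First six codons and last five codons of DPOGS202262-TA.
print(translate("ATGCCGAAACTTGATGAG"))  # MPKLDE
print(translate("AAACGACCTCAATAA"))     # KRPQ-
```

Passing the full DPOGS202262-TA sequence to the same function should give a 258-character string to compare against DPOGS202262-PA.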