Monarch geneset OGS2.0

DPOGS202271
TranscriptDPOGS202271-TA810 bp
ProteinDPOGS202271-PA269 aa
Genomic positionDPSCF300032 - 451590-452664
RNAseq coverage152x (Rank: top 53%)
Annotation
HeliconiusHMEL0076422e-14690.71% 
BombyxBGIBMGA004985-TA7e-12777.99% 
DrosophilaCG5380-PB7e-8757.25% 
EBI UniRef50UniRef50_Q9VD259e-8557.25%Probable DNA-directed RNA polymerase III subunit RPC6 n=25 Tax=Endopterygota RepID=RPC6_DROME
NCBI RefSeqXP_624984.16e-9661.94%PREDICTED: similar to DNA-directed RNA polymerase III 39 kDa polypeptide (RNA polymerase III C39 subunit) [Apis mellifera]
NCBI nr blastpgi|3838637733e-9561.57%PREDICTED: DNA-directed RNA polymerase III subunit RPC6-like [Megachile rotundata]
NCBI nr blastxgi|3838637732e-9261.57%PREDICTED: DNA-directed RNA polymerase III subunit RPC6-like [Megachile rotundata]
Group
Gene OntologyGO:00038992e-92DNA-directed RNA polymerase activity
GO:00036772e-92DNA binding
GO:00063512e-92transcription, DNA-dependent
KEGG pathwayame:5526072e-95 
 K03025 (RPC34)maps-> Cytosolic DNA-sensing pathway
    Purine metabolism
    Pyrimidine metabolism
    RNA polymerase
InterPro domain[12-269] IPR0160497e-121RNA polymerase Rpc34-like
[1-269] IPR0078322e-92RNA polymerase Rpc34
Orthology groupMCL13511 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202271-TA
ATGGCAGCTGCAGTGCCTGAATTAACTTCTGCTGAACTCGTAGCTGCAATTAACAAATTACTTCAGCAAGGGGTTTTCGATCTATATAATCAAGGAGGCTCTTTACTATATAGATTGAAAAATCAAAACTCCAAACAAGCAATAAAAGGGGCTGATAATGAAGAAAAAGTTGTATATAATTTAATAGAGGAAGCTGGTAATAAGGGAATATGGATAAGAGATATAAGAATCAGATCAAATCTGGCTAATACACAATTAACAAAAGTTTTAAAAAGTTTGGAAGGCAAGAAAGTTATTAAAGCCGTCAAATGTGTTAATGCATCCAAGAAAAAAGTTTATATGCTATTTAATTTAGAGCCCGACAGATCTGTATCTGGTGGTGCGTGGTATCAGGACCAGGATTTTGAATCTGAATTTGTTGACATTCTGAATCGTCAGTGCTTGAGATTCCTTCAACAAAGAGCCGATAAAATAAAAAATAATCCGAGAGGTCCCATAGTTGGCCGTACACAGTCATATGCCACAGCTGCCGAGGTTCAGAAATACATTACTGACTTAGGTATAAGTAAAGTTAAACTAGAAGTTGAAGACGTTATAACTATTCTGAACACATTAGTTTACGACGGGAAAGCTGAAAGTAATGTTTATCCCGACGGAAACAGAGTTTACAGGGCTATAGAATCTCTCCTACCTCCTCCAGGTTTAGTGCAGGTACCATGCGGAGTATGCCCTCTCATACACAAATGCAATTCAACAGGATTAATAACACCTCAAGATTGTAAATATATGGATGAGTGGCTCGAGCAATAA

Protein sequence:

>DPOGS202271-PA
MAAAVPELTSAELVAAINKLLQQGVFDLYNQGGSLLYRLKNQNSKQAIKGADNEEKVVYNLIEEAGNKGIWIRDIRIRSNLANTQLTKVLKSLEGKKVIKAVKCVNASKKKVYMLFNLEPDRSVSGGAWYQDQDFESEFVDILNRQCLRFLQQRADKIKNNPRGPIVGRTQSYATAAEVQKYITDLGISKVKLEVEDVITILNTLVYDGKAESNVYPDGNRVYRAIESLLPPPGLVQVPCGVCPLIHKCNSTGLITPQDCKYMDEWLEQ-