Monarch geneset OGS2.0

DPOGS205979
TranscriptDPOGS205979-TA1494 bp
ProteinDPOGS205979-PA497 aa
Genomic positionDPSCF300164 - 111775-114916
RNAseq coverage288x (Rank: top 38%)
Annotation
HeliconiusHMEL0148696e-14257.30% 
BombyxBGIBMGA009408-TA0.065.08% 
DrosophilaCG12267-PA2e-8635.10% 
EBI UniRef50UniRef50_Q171Y01e-10239.38%DNA-directed RNA polymerase III, 62-kS-subunit, putative n=4 Tax=Culicidae RepID=Q171Y0_AEDAE
NCBI RefSeqXP_001652793.12e-10339.38%DNA-directed RNA polymerase III, 62-kS-subunit, putative [Aedes aegypti]
NCBI nr blastpgi|1571162014e-10239.38%DNA-directed RNA polymerase III, 62-kS-subunit, putative [Aedes aegypti]
NCBI nr blastxgi|1571162016e-9939.10%DNA-directed RNA polymerase III, 62-kS-subunit, putative [Aedes aegypti]
Group
Gene OntologyGO:00038991.4e-18DNA-directed RNA polymerase activity
GO:00036771.4e-18DNA binding
GO:00063511.4e-18transcription, DNA-dependent
KEGG pathwayaag:AaeL_AAEL0074966e-103 
 K03023 (RPC3)maps-> Cytosolic DNA-sensing pathway
    Purine metabolism
    Pyrimidine metabolism
    RNA polymerase
InterPro domain[194-309] IPR0088061.4e-18RNA polymerase III Rpc82, C -terminal
[8-65] IPR0131971.4e-12RNA polymerase III subunit RPC82-related, helix-turn-helix
Orthology groupMCL13301 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205979-TA
ATGTCTCGACAACTTGGTAGAGTAGTTTCGCAAATTTTGAATAGGTATTTTGGAGAAATCGTCGAAAAAGTGGGAAACGATCTGTTCATGTATGGTTGTAAACCCATGGGTATGATAATAAAGAGTACAGGCCTACCTCGGACACAGGTCATCGAGAGTTTGAGAACTCTCCTTAAATTTGATCTAGCTACATTTGAAGCTAAAGATGTTATTGTTGATTACAAATTACAGCCAGAAAATATATTACTGCTCATACGATACCCAAGGTATCTTTTGCAAATGAAAACTAAGTATGGCAGTGAGGCTGAAATGTTGGTTGAGGAATTGTTACAACAGGCATCTTGTACTGCAACGGTACTAATAGTCAGCGTCATGAACAAATACAAAGACGATAAGGAAAAGAATTTAAATATAGTGAAATTAAAAGACACATTCATTAGTCTGGCGACTGCAGGATATATTCAGCAAGCTCCGGTGGCAGAAATTAACAATGAAATACCTGTTTTAGTACCCGTGGCCACCATAGTACCAGATTTAGATGTTAGAGAATTAATTAATGCTATGAACAGCAATCTCAACAATGTTAGTGACAATATTTACTGGAAAGTAAACTATGATAGATTCCACCTTGATTTCCGAGATGAAGTTATGATAAAGGCAGTGACACGTCGCATTGATGATAATGCCGGGGAATTGTTAAAGTTGATGTTAGAGCAGATGTATTTATCGTCATCATCGTGGGCATCTGACTCGTCACCGGTCCCGCTAACTGATCTAAAGGATGGATGTCAGAGGATCGGAGAGAATATGAAAAACCATGTGGAACAATATTTAAGAGTTATCGAGGAAAGTACAGGATTCATTCGTCGTACGGGTGACGCCGGGGGCGGGCAATATAGTGTGAGGATCCAACACGCTCTGATACAACTCGTGGAAGCTGTGTGTGATCATACGGTCACTGAGAGACTCGGGGGAAAAGCAGCCAGAATATTTAGATTGATAAGATCCAAAAAGTACTTAGAAGAGGACGATATACAGAAGAATGCGATGCTACCCAACAAGGAGTGTAAGGAATTGACTTACAAATTGTTGGAGGAACATTTTATTAGTGTTCAGCCGATGAGAAAAACAGCTTCAGCGGGCGGTATGGCCAAAGCGATATATTTATACCACGTCAAGTTACATGATGTAGCACACGTCGTCCGTGACATGTGCTATCGGTCGCTCCACAACGTGATGTCGGTGTCGCGTCACAGGCGCGTGTTAAACGCGCGTCTCTTGGAAAAAAAACGCCGCGTCCGGACGATCGTGCACGGCATGAGGCTTAGAGGGGAACCCCAGCAGAATATTGACGATGTGCTGGATACCCTGACTCCGCCGGAAGTGACGTCAGCGGCGGAAGCGGAGCGTCGCCTTAGTACGCTCGCAGCATCCGAGCTCCACCTGGACCGCGCCCTGTTCATACTCACCAGCTACTTCACGTACCAGAGATGA

Protein sequence:

>DPOGS205979-PA
MSRQLGRVVSQILNRYFGEIVEKVGNDLFMYGCKPMGMIIKSTGLPRTQVIESLRTLLKFDLATFEAKDVIVDYKLQPENILLLIRYPRYLLQMKTKYGSEAEMLVEELLQQASCTATVLIVSVMNKYKDDKEKNLNIVKLKDTFISLATAGYIQQAPVAEINNEIPVLVPVATIVPDLDVRELINAMNSNLNNVSDNIYWKVNYDRFHLDFRDEVMIKAVTRRIDDNAGELLKLMLEQMYLSSSSWASDSSPVPLTDLKDGCQRIGENMKNHVEQYLRVIEESTGFIRRTGDAGGGQYSVRIQHALIQLVEAVCDHTVTERLGGKAARIFRLIRSKKYLEEDDIQKNAMLPNKECKELTYKLLEEHFISVQPMRKTASAGGMAKAIYLYHVKLHDVAHVVRDMCYRSLHNVMSVSRHRRVLNARLLEKKRRVRTIVHGMRLRGEPQQNIDDVLDTLTPPEVTSAAEAERRLSTLAASELHLDRALFILTSYFTYQR-