Monarch geneset OGS2.0

DPOGS204856
TranscriptDPOGS204856-TA642 bp
ProteinDPOGS204856-PA213 aa
Genomic positionDPSCF300227 + 203935-204918
RNAseq coverage80x (Rank: top 64%)
Annotation
HeliconiusHMEL0138988e-11087.79% 
BombyxBGIBMGA011753-TA1e-7090.51% 
DrosophilaCG7339-PA7e-7560.85% 
EBI UniRef50UniRef50_Q9VTL68e-7360.85%CG7339 n=11 Tax=Pancrustacea RepID=Q9VTL6_DROME
NCBI RefSeqXP_394665.13e-7770.00%PREDICTED: similar to polymerase (RNA) III (DNA directed) polypeptide H isoform 1 [Apis mellifera]
NCBI nr blastpgi|480963244e-7670.00%PREDICTED: DNA-directed RNA polymerase III subunit RPC8-like isoform 1 [Apis mellifera]
NCBI nr blastxgi|1571307593e-7864.09%DNA-directed RNA polymerase III 25 kDa polypeptide [Aedes aegypti]
Group
Gene OntologyGO:00038991.5e-39DNA-directed RNA polymerase activity
GO:00036771.5e-39DNA binding
GO:00063511.5e-39transcription, DNA-dependent
KEGG pathwayame:4111917e-77 
 K03022 (RPC25)maps-> Cytosolic DNA-sensing pathway
    Purine metabolism
    Pyrimidine metabolism
    RNA polymerase
InterPro domain[1-163] IPR0045191.5e-39DNA-directed RNA polymerase
[79-196] IPR0132382.5e-37RNA polymerase III, subunit Rpc25
[1-80] IPR0055762.5e-26RNA polymerase Rpb7, N-terminal
[79-193] IPR0160273.3e-18Nucleic acid-binding, OB-fold-like
Orthology groupMCL14058 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204856-TA
ATGTTTGTACTGTCGGAAATGAAAGATGTGCTAAGAGTTACACCTGAACACTTTCACCAAAGTCTAACAGAGTCAATAACAACATTATTAAACAGAAAACTTGCTAATAAGGTCGTTCTAAATGTGGGGCTCTGTATAGCATTATTTGATATAACCAATATAGGTCACTCATACATATTTCCTGGTGACGGATCGTCACACACTGAAGTGATATTTAGGTATATTGTGTTTCGTCCACTTGTTGAGGAAATACTTATAGGAAAAATAAGAAGTTGTAGCCGGGATGGGGTACATGTTACAATGGGTTTTTTTGATGACATTTTAATACCTGTGAATGCACTCCAACATCCATCTAGGTTTGATGAAACAGATCAAGCCTGGGTTTGGGAATATCCCAAAGAAGATGGAGAAAAGCATGATCTATTCATGGATTCAGGTGAATCTATTAGATTTAGAGTAACAAGTGAGGAGTTTGAAGAAAGTTTGCCAACTGGTCCACCAGGATCTGAACGTCCCACTCAAGCAGTGGCACCATATAGATTAATTGGTGGTATTAATGAACCGGGCCTAGGATTATTGACCTGGTGGGTGACACCAGAACAGGATGAAGGTGATGAAGAAGAACAAGAAGATGTTGAATAA

Protein sequence:

>DPOGS204856-PA
MFVLSEMKDVLRVTPEHFHQSLTESITTLLNRKLANKVVLNVGLCIALFDITNIGHSYIFPGDGSSHTEVIFRYIVFRPLVEEILIGKIRSCSRDGVHVTMGFFDDILIPVNALQHPSRFDETDQAWVWEYPKEDGEKHDLFMDSGESIRFRVTSEEFEESLPTGPPGSERPTQAVAPYRLIGGINEPGLGLLTWWVTPEQDEGDEEEQEDVE-