Monarch geneset OGS2.0

DPOGS215212
TranscriptDPOGS215212-TA1071 bp
ProteinDPOGS215212-PA356 aa
Genomic positionDPSCF300143 + 152245-156288
RNAseq coverage313x (Rank: top 36%)
Annotation
HeliconiusHMEL0038721e-14569.75% 
BombyxBGIBMGA008718-TA3e-10456.49% 
DrosophilaCG5147-PA9e-2628.27% 
EBI UniRef50UniRef50_B0WKN91e-3934.16%DNA-directed RNA polymerase III subunit D n=4 Tax=Culicidae RepID=B0WKN9_CULQU
NCBI RefSeqXP_001659518.12e-4334.28%hypothetical protein AaeL_AAEL008794 [Aedes aegypti]
NCBI nr blastpgi|1571198113e-4234.28%hypothetical protein AaeL_AAEL008794 [Aedes aegypti]
NCBI nr blastxgi|1571198112e-4233.71%hypothetical protein AaeL_AAEL008794 [Aedes aegypti]
Group
Gene OntologyGO:00038991.7e-27DNA-directed RNA polymerase activity
GO:00036771.7e-27DNA binding
GO:00056661.7e-27DNA-directed RNA polymerase III complex
GO:00063831.7e-27transcription from RNA polymerase III promoter
KEGG pathwayaag:AaeL_AAEL0087945e-43 
 K03026 (RPC4)maps-> Cytosolic DNA-sensing pathway
    Purine metabolism
    Pyrimidine metabolism
    RNA polymerase
InterPro domain[20-356] IPR0078111.7e-27RNA polymerase III Rpc4
Orthology groupMCL15900 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215212-TA
ATGTCCAATTCCGGTGACAAACCAAACGCAAATGGTGTCAACACGGATCCTCTACAGAGACTAGCTTCGTTTAAGGCCCCAAGAGACCTATCACTAGGTGGATTAAAGCCAAATAAAAAAATCTTCACTCCAAACTTAAATGTAACTAGGAATAAAAATAAAGGTGCTTCGTCAACAAGTAATCGTGATTCTAAAAAAGATGAGAGAAACAGACGGGATCGCAAGAACGATAAAAATAAGAACTTCAAAAACGGACCAAATATTATAAAGTCCAGCGGAGTATTCTCTGAAGGTCTGGGTAGTGCTGAACGACATCACTCCAGTAGGGTGTCCTATGGACGTGACGTTGATTCCTCACACACTCTCCAAAAACCTACACTAAGGGTCAAGGATGTAGTAAAGATAGATAAAGAGTTGGAGGAACAGAAAATCCAAGAGGCTTTTGGTGAGAACAATGCTTTTGATGATGAAGGAGAAGACTTCAAACAGGTGTTGGACATTGAAGCTCCTATCAAGTTACCCATGGACGACGGAAGCATTAAGCAATTAAAAGCCGCTGTGAAGATAAAAAATGAGGTGATAGTGAAACAGGAGCCGAGCGAGGAAGCATCTACGGCTGTGTTGGAACAGAAGCCCTTGGCTGATGTGAAGGAAGTGTATGAAGATTCGAGTGTTATGAACCTGTTGCGGAGTGACAAACCAACACTCATATTGTTACAGTTAGCCGACTCCCTGCCGGGCAGAGGTGGTGCTAGTGATGATAGGAAACATGAGCCTGGGACGTCTGAGGCTGCGGAACAAGAAACTGATAGCAGATGTCACCTGACTGATCTCGAGGAAGGAAGAATAGGGAAGATGAGGATACACAGGTCCGGCAGGGTCACCATGGCCTTAGGAGATACGATATTTGAGGTGAGTGCTGGCACTAAAGCCTCCTTCCACCAGGAGGTCGTGTCGGTAGGAGTGGATGAAGCGTCCCGTTCCGCGAGCCTGGTCTCCTTGGGTCCCCTTCATCACAAACTGAACATAACACCGCACTGGCAGTCAATGTTCAATAAAATGTCCGTGTGA

Protein sequence:

>DPOGS215212-PA
MSNSGDKPNANGVNTDPLQRLASFKAPRDLSLGGLKPNKKIFTPNLNVTRNKNKGASSTSNRDSKKDERNRRDRKNDKNKNFKNGPNIIKSSGVFSEGLGSAERHHSSRVSYGRDVDSSHTLQKPTLRVKDVVKIDKELEEQKIQEAFGENNAFDDEGEDFKQVLDIEAPIKLPMDDGSIKQLKAAVKIKNEVIVKQEPSEEASTAVLEQKPLADVKEVYEDSSVMNLLRSDKPTLILLQLADSLPGRGGASDDRKHEPGTSEAAEQETDSRCHLTDLEEGRIGKMRIHRSGRVTMALGDTIFEVSAGTKASFHQEVVSVGVDEASRSASLVSLGPLHHKLNITPHWQSMFNKMSV-