Monarch geneset OGS2.0

DPOGS210797
TranscriptDPOGS210797-TA1467 bp
ProteinDPOGS210797-PA488 aa
Genomic positionDPSCF300027 - 1112195-1116603
RNAseq coverage201x (Rank: top 47%)
Annotation
HeliconiusHMEL0141791e-7756.39% 
BombyxBGIBMGA007112-TA0.061.38% 
DrosophilaSin-PA3e-8035.80% 
EBI UniRef50UniRef50_E3XA883e-7736.87%Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3XA88_ANODA
NCBI RefSeqXP_397129.32e-8836.72%PREDICTED: similar to RNA polymerase III 80 kDa subunit RPC5 [Apis mellifera]
NCBI nr blastpgi|3838549559e-8938.70%PREDICTED: DNA-directed RNA polymerase III subunit RPC5-like, partial [Megachile rotundata]
NCBI nr blastxgi|3838549551e-8838.70%PREDICTED: DNA-directed RNA polymerase III subunit RPC5-like, partial [Megachile rotundata]
Group
Gene OntologyGO:00038991.9e-110DNA-directed RNA polymerase activity
GO:00056341.9e-110nucleus
GO:00063511.9e-110transcription, DNA-dependent
KEGG pathway 
InterPro domain[2-424] IPR0068861.9e-110DNA-directed RNA polymerase III subunit Rpc5
Orthology groupMCL13205 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210797-TA
ATGGACGAAGAGGATCCAGTTGTCCAAGAGATCCCGGTCTATCTTTCACAAGCGCTTTCAGAGCATTTATATATTTATCAATACCCGGTTAGACCGGCGAATAGGGATTGGAAAGATATCAAAGTTATCAATGCCTCTATAAAACCAAAAAATCAACTTGTGCGAATGGAAATCGGTCTGGATACGTATAGTGAAAAATATTGTCCCTCTAAAGGAGAACAGATAGCTTTGAATACTGATGGACCACAGGAATCAAAATACATTAAAGATAAGGAAAAAGAGAGATCTCAGTATTTTAGAAATGGTATAATGGACAAGATTGTGTATGAAAGCAGCTCGCCATGTTTAGAAACCAAGCATTACGCTGTAGCTATTCTACAGGATAAGGAATTACATTGTACACCTATAAAAGGTATTTGTCAGCTGAGACCGTCATATTCATATTTTGATAAACAAGACAAAAGGAAGATCGACAAGAGCAAAGCCGAGAATTCCGATGACGAGGAAAAAGAATCGGAACCCCAACAGGTGACTGTAAAGTTCTCAAGAGCGGAAACGGATGTGGCGAAGAAAGCGAGGGAAAAATCATATGAATCCATCTCACAGAAGATAGCCAATGAACCTTGGTACGATGCATTCTGGAGAAAATTGGATGACGATCATGCTGATTTGGAGCGTCTAAAGTTATTCAGCTCAACAACATCAGACGGTTCAGCTCTAACTTTGGGTGCGACAGAGTATATAAACACCTTGGTGCCTTCGCTCACTGACGAAGCTGAGATGCCACCAGTTAAGAAGACCTCCTTACAGGATCAGATTAAGGAAATTTTGTTAAATGCTAAACTGATGACCTTCAACGAGCTCCGTTCCCTGGTCCGCAACGATGAGGGTAGTTTTGTGAGTGAGAGCGCCCTCCTCGCGGCTTTGGGCGGCGTCGCGTGCTGTGTCCGCGGCCTCTGGACCGCGCGCTCGCAGCAGATGTACACTCGACCAGCCCCCGCGCCGCCTAGACTGATGTGCGCGGCCAGGGATCACGTGCTCTACTTATTCACGCAGCACTCGTACGTGGATCGTCGGAAAATAGCGGCGGCTGTACGTCTACCGGCTCAGGAAGTATTGGAAATACTTCGCTCCGTCGCTAAGTTGAATCCACAGACTGGATGGGAACTTCTCTTGCCACCGGATTCAGCATTCGAAGCCAAGTATCCGGAGGTGATTCAGAGGCAGAACCTCTATTGGGAGGCCTGCCAGCGGCAGTTCAACGAGATGCTGATAGGTGAAAACCTTCCAAAGCGGCAAAGAAAAAAGTCCCAAAGGGATTCAATAAGTTCGGATTCAATGCTGAGCCCCAGACCGAGAAATTACAGTGTTAGTGAGGATGATGATAGGAAAAGGAAGATCAAAATGGCCTCAGGTTCTAAAAGGACTAGAAATATGAGTTCAAGTAGTGCCCAGGACGCAACGTGA

Protein sequence:

>DPOGS210797-PA
MDEEDPVVQEIPVYLSQALSEHLYIYQYPVRPANRDWKDIKVINASIKPKNQLVRMEIGLDTYSEKYCPSKGEQIALNTDGPQESKYIKDKEKERSQYFRNGIMDKIVYESSSPCLETKHYAVAILQDKELHCTPIKGICQLRPSYSYFDKQDKRKIDKSKAENSDDEEKESEPQQVTVKFSRAETDVAKKAREKSYESISQKIANEPWYDAFWRKLDDDHADLERLKLFSSTTSDGSALTLGATEYINTLVPSLTDEAEMPPVKKTSLQDQIKEILLNAKLMTFNELRSLVRNDEGSFVSESALLAALGGVACCVRGLWTARSQQMYTRPAPAPPRLMCAARDHVLYLFTQHSYVDRRKIAAAVRLPAQEVLEILRSVAKLNPQTGWELLLPPDSAFEAKYPEVIQRQNLYWEACQRQFNEMLIGENLPKRQRKKSQRDSISSDSMLSPRPRNYSVSEDDDRKRKIKMASGSKRTRNMSSSSAQDAT-