Monarch geneset OGS2.0

DPOGS207920
TranscriptDPOGS207920-TA2265 bp
ProteinDPOGS207920-PA754 aa
Genomic positionDPSCF300349 + 65433-74576
RNAseq coverage1171x (Rank: top 11%)
Annotation
HeliconiusHMEL0037910.065.43% 
BombyxBGIBMGA000069-TA0.059.83% 
DrosophilaRtf1-PA3e-10545.19% 
EBI UniRef50UniRef50_E2A1P91e-10450.97%RNA polymerase-associated protein Rtf1 n=10 Tax=Neoptera RepID=E2A1P9_CAMFO
NCBI RefSeqXP_968097.22e-11844.65%PREDICTED: similar to Rtf1 CG10955-PA [Tribolium castaneum]
NCBI nr blastpgi|1892351653e-11744.65%PREDICTED: similar to Rtf1 CG10955-PA [Tribolium castaneum]
NCBI nr blastxgi|1700402693e-14437.99%conserved hypothetical protein [Culex quinquefasciatus]
Group
Gene OntologyGO:00056344.2e-46nucleus
GO:00036774.2e-46DNA binding
GO:00165704.2e-46histone modification
GO:00063524.2e-46transcription initiation, DNA-dependent
KEGG pathway 
InterPro domain[364-472] IPR0181444.2e-46Plus-3 domain, subgroup
[368-472] IPR0043431.3e-32Plus-3
Orthology groupMCL12969 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207920-TA
ATGGCTAAAAGAAAAAATAAGCCCTTAATCGATTCAGATTCAAGCAGCGACTGTTCAGATCTAGATTCGCAATTTTTAAACTTGGCGAAGAAGAAAAAGAAACCAGAAGAGATAGCAGCACAGAAGGTGAACAGCAAAACATCGGGCAGCGAGTCAGACTGGGACGATACCGAGAAGAAAAATGACAAATCATCATCATCGGACTCTGAATCAAACAGTGATTCCCACAGTGATACATCCAAAAAGAAATCACCAGAGAATCCCAGCAGAAAGAGCTCATCAGATCATTACGAGGAACATAAGAATCCTGAGCATGAGGTTAAAAGGTCGGAGGTCAAACAGAATGAGCTACGTAGAAGTACAGACAGCTCAGGGTCAAAGAAGAAAGACACATACAGTGAGCCCGAGGAAGGTGAAGTGTCATCACACTCATCGGACAACGACTCCATAGACTCCGAAGAAGAATTTGATGATGGGTGGGAGATAGAACGCAAGCTGCGACTAGCGAGGCGGTCAGCGGCCGAGAGAGACGTGTCTCCTACTGAGCTGCAGAGGAGGAGGGAGGCGAGGAGGCGGCGGAGGGAGAGGCGGGGTAGGAGGGGGGAGAGAGAAGCCGTTGTAGAAGAAAAGAGGAAAGAGGAACGCGAAGAGGAGAAAGAGAGAGAGAAGCCTCCGCCCAGCCCCGGGGAGGTGACAGACGATCAAAAAGATACAGAGAGAGATCAGGACCGGTCCGCCTCCCCGCTGTTCGGTGCCAAGACTGAGAGGAAGAGGAACGTGGACGACAGGAGAGTGAACGCTATGGCGGCGCTCAGGGCCCAGAGAGACGCGCGACAGAGGAACGTGGAGACCAAACAGAAAAAGAGGGCGCTGGAGAGGAAGGAGGAGGACGACGAAGCGGATCCGGAAATAATAGGAGGCACCAGCAAACAGAGCGTCAAGCTGAAGGCGTCCGACATATACTCTGACGACTCGGGCTCGGACTCCGAGGACAAGTCACAGGGAAAAAGAAGCTCCTCGAGTTCCTCCACATCAGACGCCGAGGAAGAAGAGAAGAAGAGAGAGAGAGAGGAAGTTGAAGTGAAGTACGCGGACACCAGGGAACAGATAAATAAGCTGAGGCTTAGTAGGTTCAAGTTAGAGCGTCTCGTACATTTACCTTTCTTCTCGCGCGTCGTGTCCGGGTGTTTCGTTCGTATCGGCATCGGCAATAACAACGGAAACCCGGTGTACAGGGTCGCCGAAATTATAGATGTATACGAGACGGCAAAGGTGTATAACTTAGGAAACACGAGGACTAACAAGGGCTTCAAGCTGAGACACGGCACGCAGGACAGGGTGTTTAGGCTGGAGTTCGTGAGCAATCAGGAGTTCACAGAAAATGAATTCCAGAAATGGCATCGAGCCATCAAGGAAGCCAACAAGAAGCCTCCCACCATGGACTTCGTTAGGAACAAGATACTGGAGGTTAAGGACGCGCTCATGTACGAGTTCAAGGAGTTCACAGAAAATGAATTCCAGAAGTGGCATCGAGCCATCAAGGAGGCCAACAAGAAGCCTCCCACCATGGACTTCGTTAGGAATAAGATACTGGAGGTTAAGGACGCGCTCATGTACGAGTTTAAGGAAGAGGATATAGAGAAGATTGTAGCGGAGAAGGAGAGGTTCAGGTCGCACCCGACCAACTACGCCATGAAGAAAACCCAGCTCATGAAGGAGAGAGATGTAGCACAGCTGAGAGGTGACGAGGAATTGGTTCTAGAATTAAACTCCAAGCTTCAGGAGCTGGAAGAGAGAGCCAGCGCCCTGGACAAGACGAGGACCAGCTCCATACAGAGCATCAGCTACATCAACAACAGGAACCGGAAACTCAACGTGGAGACGGCCGAGAAGGCCATCATGGAGGAGGTGAAAGCTATGAAGGGGAAGAAGATGGACGATCCCTTCACCAGGAGACACACCAAGCCCGAACTACTGAAGAACGAGCAGCAAGCGGCGGAGCAGCAGAAACAGAAGGACGAAGAAGAGAGGATAGAGAAGGAGAAAGAGGAAGAGATACTGAACCGGCCGGTCGCGCCCCGCCCGCTCCCGCCGGACGGCAGTTTGTATTCTTTACACGACTTCGACATCAACATAGAAATAGATCTCCCCGCGCCCAAGCCGGTGACGTCACACTCCAAACAGATAACCATAAAGGTGAAGGACGCCGGCCCAAAAAGGTCATTGAACCTGGACGATTACAAGAAGAGACACGGCCTCATATAG

Protein sequence:

>DPOGS207920-PA
MAKRKNKPLIDSDSSSDCSDLDSQFLNLAKKKKKPEEIAAQKVNSKTSGSESDWDDTEKKNDKSSSSDSESNSDSHSDTSKKKSPENPSRKSSSDHYEEHKNPEHEVKRSEVKQNELRRSTDSSGSKKKDTYSEPEEGEVSSHSSDNDSIDSEEEFDDGWEIERKLRLARRSAAERDVSPTELQRRREARRRRRERRGRRGEREAVVEEKRKEEREEEKEREKPPPSPGEVTDDQKDTERDQDRSASPLFGAKTERKRNVDDRRVNAMAALRAQRDARQRNVETKQKKRALERKEEDDEADPEIIGGTSKQSVKLKASDIYSDDSGSDSEDKSQGKRSSSSSSTSDAEEEEKKREREEVEVKYADTREQINKLRLSRFKLERLVHLPFFSRVVSGCFVRIGIGNNNGNPVYRVAEIIDVYETAKVYNLGNTRTNKGFKLRHGTQDRVFRLEFVSNQEFTENEFQKWHRAIKEANKKPPTMDFVRNKILEVKDALMYEFKEFTENEFQKWHRAIKEANKKPPTMDFVRNKILEVKDALMYEFKEEDIEKIVAEKERFRSHPTNYAMKKTQLMKERDVAQLRGDEELVLELNSKLQELEERASALDKTRTSSIQSISYINNRNRKLNVETAEKAIMEEVKAMKGKKMDDPFTRRHTKPELLKNEQQAAEQQKQKDEEERIEKEKEEEILNRPVAPRPLPPDGSLYSLHDFDINIEIDLPAPKPVTSHSKQITIKVKDAGPKRSLNLDDYKKRHGLI-