Monarch geneset OGS2.0

DPOGS214946
Transcript: DPOGS214946-TA  522 bp
Protein: DPOGS214946-PA  173 aa
Genomic position: DPSCF300280 - 65016-66388
RNAseq coverage: 422x (Rank: top 29%)
Annotation
Heliconius: HMEL015584  4e-77  83.24%
Bombyx: (no hit listed)
Drosophila: Rpb7-PA  2e-88  88.44%
EBI UniRef50: UniRef50_Q9VFB5  2e-86  88.44%  IP02321p n=17 Tax=Bilateria RepID=Q9VFB5_DROME
NCBI RefSeq: XP_002423690.1  1e-89  89.82%  mixed-lineage leukemia protein, mll, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|383865506  2e-89  92.98%  PREDICTED: DNA-directed RNA polymerase II subunit RPB7-like [Megachile rotundata]
NCBI nr blastx: gi|383865506  5e-90  92.98%  PREDICTED: DNA-directed RNA polymerase II subunit RPB7-like [Megachile rotundata]
Group
Gene Ontology:
    GO:0003899  5.7e-28  DNA-directed RNA polymerase activity
    GO:0006351  5.7e-28  transcription, DNA-dependent
    GO:0003723  8e-14  RNA binding
KEGG pathway: nvi:100119037  4e-89
    K03015 (RPB7) maps->
    Huntington's disease
    Purine metabolism
    Pyrimidine metabolism
    RNA polymerase
InterPro domain:
    [83-170]  IPR012340  3.2e-35  Nucleic acid-binding, OB-fold
    [1-82]  IPR005576  5.7e-28  RNA polymerase Rpb7, N-terminal
    [79-171]  IPR016027  2.3e-22  Nucleic acid-binding, OB-fold-like
    [78-157]  IPR003029  8e-14  Ribosomal protein S1, RNA-binding domain
Orthology group: MCL14582  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214946-TA
ATGTTCTACCACATATCGTTGGAGCATGAAATACTGCTGCATCCAAGATATTTTGGACCGCAATTATTAGATACTGTTAAACAGAAGTTGTATACAGAGGTGGAAGGAACTTGCACTGGCAAATATGGTTTTGTTATAGCAGTGACCACAATAGACAGCATTGGTGCAGGTCTCATCCAACCAGGACAAGGATTCGTTGTGTATCCAGTTAAATATAAAGCCATAGTTTTCAGACCATTTAAAGGAGAGGTTCTTGACGCTATAGTCACACAAGTGAATAAGGTTGGAATGTTTGCTCAAATCGGTCCGTTAAGTTGTTTTATATCACATCATTCCATACCCACTGATATGGAATTCTGTCCCAATGTAAATCCCCCGTGCTATAAAAGTAAACAGGAAGATAATGTCATACAGGAAGAGGATGTCATTAGATTAAAGATTGTTGGCACCAGAGTTGATGCCACGGGCATTTTTGCCATCGGAACACTAATGGACGACTACTTAGGATTAGTAACACAATGA

Protein sequence:

>DPOGS214946-PA
MFYHISLEHEILLHPRYFGPQLLDTVKQKLYTEVEGTCTGKYGFVIAVTTIDSIGAGLIQPGQGFVVYPVKYKAIVFRPFKGEVLDAIVTQVNKVGMFAQIGPLSCFISHHSIPTDMEFCPNVNPPCYKSKQEDNVIQEEDVIRLKIVGTRVDATGIFAIGTLMDDYLGLVTQ-
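
The listed transcript and protein lengths can be cross-checked with simple arithmetic. The sketch below assumes the transcript is coding sequence only (it begins with ATG and ends with TGA) and that the trailing "-" in the protein sequence marks the stop codon. The function name `cds_matches_protein` is hypothetical and not part of any database tool.

```python
def cds_matches_protein(transcript_bp: int, protein_aa: int,
                        stop_codon: bool = True) -> bool:
    """Check that a coding sequence of transcript_bp bases can encode
    protein_aa residues (plus one stop codon when stop_codon is True)."""
    # A complete coding sequence must be a whole number of codons.
    if transcript_bp % 3 != 0:
        return False
    codons = transcript_bp // 3
    expected_codons = protein_aa + (1 if stop_codon else 0)
    return codons == expected_codons


# Values from this record: 522 bp transcript, 173 aa protein.
# 522 / 3 = 174 codons = 173 residues + 1 stop codon.
print(cds_matches_protein(522, 173))  # True
```

A mismatch here would usually point to untranslated regions in the transcript, a partial gene model, or a length typo in the record.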