Monarch geneset OGS2.0

DPOGS205428
TranscriptDPOGS205428-TA1077 bp
ProteinDPOGS205428-PA358 aa
Genomic positionDPSCF300504 + 33757-34833
RNAseq coverage220x (Rank: top 45%)
Annotation
HeliconiusHMEL0108237e-14266.94% 
BombyxBGIBMGA001686-TA2e-10552.35% 
DrosophilaCG18600-PA6e-0822.08% 
EBI UniRef50UniRef50_D6WSU62e-3231.22%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WSU6_TRICA
NCBI RefSeqXP_001601651.18e-3429.30%PREDICTED: hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|3838589598e-3730.06%PREDICTED: DNA-directed RNA polymerase I subunit RPA49-like [Megachile rotundata]
NCBI nr blastxgi|3838589592e-3929.62%PREDICTED: DNA-directed RNA polymerase I subunit RPA49-like [Megachile rotundata]
Group
Gene OntologyGO:00038997.3e-23DNA-directed RNA polymerase activity
GO:00056347.3e-23nucleus
GO:00036777.3e-23DNA binding
GO:00063517.3e-23transcription, DNA-dependent
KEGG pathwaynvi:1001173982e-33 
 K03005 (RPA49)maps-> Purine metabolism
    Pyrimidine metabolism
    RNA polymerase
InterPro domain[42-354] IPR0096687.3e-23RNA polymerase I associated factor, A49-like
Orthology groupMCL17921 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205428-TA
ATGACACAATTACACATCGAAGAGGTTTATCCGAAAAGTATAACGAATCCCGTACTAATTAACTTTCAAAATGGCTACGCGACAGACAATTTCACAACCGAACCGTGTTTTATATACGACAATGACGAAAAGCATAACAAAACTATAGCCACTACATTAGACGGCTTAGTCTACGCTGGCGAGGAGGATACAGAAGATCTTGGCAGAACTTTGATATTAGCCAGAAATAAATGCACCGGTAAAGTACGCGTGATCGAATCCAGCTATGTAGACCTGAAACCAGTCTTCAAAACTAACACAGAGCCAGCTCCGTTAGAAACCAGTACCTTGGAACTTAGTAGGAAATTCGGTTCAAAGAAACAAAAACAAAAAATGGAACAACGCGAGAAAATGAAAGTCAACATTGAAACTGTTACCGAGCAAATGCAAAATGTGACACAGGAAATAACTGAAGACAAAGTCGATTTATCGTCATACAACAAGACCGATTCAGATGATTTTTATATACCGATTATAAATCGGCAAGCCGATAAAGCGGAGGACGTTTACGACATCAATAATATATTAACCGAGGAACAGTACGAAAAAATAAGCTCTGAGCTGGAAGGCAAAGACTATGAAAACAATTTGATTCCGATCATCAAGAGTATTGTTAAAAACAATTTATCACAAAAGATGACAGTTTTAGCTGTATACGCCAATTCGTTATTACAGCTGTATGTAACAATGATGAAAGAAATAAGTAAAAAAAGTTTTGTAATATGCCCACATTCAGTTACATTAAACAAGCATGTATTAGACCACTTCCTACTCACGACCAATGGTAAACGGACTCGCCCGGCCCCATACAAAGACAAGTCACTATGTCATGCTATGGTGATAATACTAATGATAAATAATCTTAAATTCGACCTGAACAGTCTCTGTGAATCTATTAAGATCACTCCAAACACAGCATCCATGAAAGTCCGAGTGACCGGAGCGTCCGTTACAACATCGGGCAGCAAAAAAGTAGTTCAGTTGAAGCTACCTCTGAACACGAAGTCCAGTTTCAGAAGAAGAAGTGCTAAGTTTTAA

Protein sequence:

>DPOGS205428-PA
MTQLHIEEVYPKSITNPVLINFQNGYATDNFTTEPCFIYDNDEKHNKTIATTLDGLVYAGEEDTEDLGRTLILARNKCTGKVRVIESSYVDLKPVFKTNTEPAPLETSTLELSRKFGSKKQKQKMEQREKMKVNIETVTEQMQNVTQEITEDKVDLSSYNKTDSDDFYIPIINRQADKAEDVYDINNILTEEQYEKISSELEGKDYENNLIPIIKSIVKNNLSQKMTVLAVYANSLLQLYVTMMKEISKKSFVICPHSVTLNKHVLDHFLLTTNGKRTRPAPYKDKSLCHAMVIILMINNLKFDLNSLCESIKITPNTASMKVRVTGASVTTSGSKKVVQLKLPLNTKSSFRRRSAKF-