Monarch geneset OGS2.0

DPOGS211616
TranscriptDPOGS211616-TA1335 bp
ProteinDPOGS211616-PA444 aa
Genomic positionDPSCF300232 + 278028-279410
RNAseq coverage130x (Rank: top 56%)
Annotation
HeliconiusHMEL0162450.082.39% 
BombyxBGIBMGA008237-TA0.078.60% 
DrosophilaCG3703-PA2e-5651.23% 
EBI UniRef50UniRef50_D7EIJ87e-10243.11%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D7EIJ8_TRICA
NCBI RefSeqXP_967703.16e-10343.11%PREDICTED: similar to RUN domain-containing protein 1 [Tribolium castaneum]
NCBI nr blastpgi|910935491e-10143.11%PREDICTED: similar to RUN domain-containing protein 1 [Tribolium castaneum]
NCBI nr blastxgi|2700155932e-9943.11%hypothetical protein TcasGA2_TC001458 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[262-431] IPR0040122.2e-29RUN
Orthology groupMCL13794 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211616-TA
ATGTTACAAGCATTGGAGGAATTTACTTCTCGTGGGGTTGATTCTCGAGCTGCTCCTCAAGGTAGCGCAGGAGAACCAGCGTGTAACGAGTGTGTAGAACTGGAACGGAAGATAAGGAATCAAAGAGTAAAACAAGCGCACTTTATCGAAAGACTAAAAAATCAATTAAAAGAATTGGAAAGATACACTGCTGGAACTTCAAATTCAAACGATGATACTAAACAGAGATCATTAATAGAACATTTGAGAAATGAAATTGACCATAGCTTGGAGGAAGGTTGCCATCACCCGCTTACAACAGAAGAACTGAGACATCAGATAGACTGTGCCGTTCGTCAATATACTGACAGACATCTTTCCAAAGAAGAGATCATTACAAAACTACAGACACATATCGACGATATGCAGAAATTTATAAAATTGATGAAAAATGTTGGTAAATCAGAATCATTTGAAAAGAATTTCAACCGCAACAATGTAAATAATGAACTAAAAACAGAAACAATAAATCTTATGAGGAAAGCTTCTACATTAATACAAATATTCACAGTGTCACAGTTTGGCTGTTCACCGAGTAGCAGTAAGAGATATGATAGATCATCAACAATTGTTACTCACTGGGGTAATTTAAGAGCAAAATTAGAAATGTCTATAGACGCAGTACTTCATATAGTGTCAAGGGAAAGGGCTTCACACACTTATATTGATAATTATGACAGTGAGTCAGAGGAAGGGGGCTTAGTGATCAACGATGTACCTCTCACAACCGCAGTAAGAAGACATCTTGCTCTTAACCTTAGAGACTTACTGCAACATGGCTTAGTACACCCTGAATCTAAATCCTTAGTGCCTATAATTGGTTGCTTCCCTGTCCGTAGATCCTCTACATCAAGATGTGTCCATATATGGGAATTAATTTTACGTTATTATGATTTAAATGATGGTGATAAATTTAATTCAACACCAGCGAGGAAACTCAGTCAGAGTTTCAATCTAGATATTGTTGGGAGCACATCCATTACAAACAAGCAAAGTCTTTTAAGTGCAATCGGTAGTATTTTAGCATCACACACTCCGTACAAAAGAAGCTATGATGCACACTTTAAAGCCTTTGTATGTGCGGCTCTCAATGTACACAAACTTGTTATATGGTTGAACATCATGTTGAAATGTAGACTGTTAATAGACTCATACTATCTGTCCACTAGTTATGTTGTCAACACAGGTTTTCAGGATACACTGCAGAGTCTCGACCGTCTCACACCATATAATTTTGATTTGCCCGTCGATCTTGCCGTCAAACAGTTTCAGAACATCAAAGATGTATTTATGTAA

Protein sequence:

>DPOGS211616-PA
MLQALEEFTSRGVDSRAAPQGSAGEPACNECVELERKIRNQRVKQAHFIERLKNQLKELERYTAGTSNSNDDTKQRSLIEHLRNEIDHSLEEGCHHPLTTEELRHQIDCAVRQYTDRHLSKEEIITKLQTHIDDMQKFIKLMKNVGKSESFEKNFNRNNVNNELKTETINLMRKASTLIQIFTVSQFGCSPSSSKRYDRSSTIVTHWGNLRAKLEMSIDAVLHIVSRERASHTYIDNYDSESEEGGLVINDVPLTTAVRRHLALNLRDLLQHGLVHPESKSLVPIIGCFPVRRSSTSRCVHIWELILRYYDLNDGDKFNSTPARKLSQSFNLDIVGSTSITNKQSLLSAIGSILASHTPYKRSYDAHFKAFVCAALNVHKLVIWLNIMLKCRLLIDSYYLSTSYVVNTGFQDTLQSLDRLTPYNFDLPVDLAVKQFQNIKDVFM-