Monarch geneset OGS2.0

DPOGS208734
TranscriptDPOGS208734-TA1806 bp
ProteinDPOGS208734-PA601 aa
Genomic positionDPSCF300043 + 263650-274667
RNAseq coverage1443x (Rank: top 9%)
Annotation
HeliconiusHMEL0152440.072.37% 
BombyxBGIBMGA003399-TA0.074.29% 
Drosophilapum-PA2e-2441.27% 
EBI UniRef50UniRef50_D7EK091e-2741.73%Pumilio n=2 Tax=Tribolium castaneum RepID=D7EK09_TRICA
NCBI RefSeqXP_391849.32e-3333.88%PREDICTED: similar to pumilio CG9755-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3838481372e-2939.94%PREDICTED: pumilio homolog 2-like [Megachile rotundata]
NCBI nr blastxgi|3838481375e-3638.32%PREDICTED: pumilio homolog 2-like [Megachile rotundata]
Group
KEGG pathway 
Orthology groupMCL21926 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208734-TA
ATGAAGTGGCCGGGTACTGGCGGAGAGGAAGAGGCGGGCGAAGCTGGTCTGCAGACAGCGCGAGTATCGCAAAGTCGTGCCCAGGACGACGCCGCAGTACATTATGTGTTCCAGCGCGAGAATCAAGAAGCTGATTTGTCTACACTAGCTCCTAAACAGCGCTGGGCGGTTTGCGAGGATTCAATAGCAGAGAATCAAGATAAATGGAAGAATCATACTCCTATAACAATGAATAACATACAGCATCAAGGTCATATACAGTTAAAGAATTCACAAACCAATCAACAATATATTAACCAGGCAATAGCATCCTCGGTGGGCACACTAGCTATAAACCCGTCTGTGATTGGAAGCCATCCGGCACATTTGACTACAAACATGGTCCATACCCAATTGGCTCCATTACAAAACCACACATTGGGTAACAATTTGGCCATACAGAACAGCATGATGCTGCAACCTCTTCAAGGAATGCAGAATGCTCCTCAAATGGGACAGCAAGGTTTATATGATATTCATCAACACCCAGCCACTCCCATGCAAGCTCCAATGAATGGCATGGGCAAGAGTGCTGAGCACCTCATGTATCTAAGTCAGGCTGGGATGCTTGCTCCGGGAACAAATCAATTTCTACAACAAGGACAGAATCAACTTGGGCTTACACCAGTCGCTGCAATCAGAAATCAGATTGCACCATCTGCAAAAAAGTTGTGGGACAAAGGACCTGGAGCTGGAGGTGATATCAAAGGACCACCACATTTACCACCCTTGCAGCTGAACCCTGAACAGATGTGGCGAGACCCCACTTGGTCTGCTCAAGCAGTGGAACACAATGTGGGTGTACCGATGATGGGCACTCGTCGTGGGGTAGCGTTTCCTGGCGGGGACGCCAGCAGCATCCTCTCGCCGAGAGACACCGCCGGCCTGGGGGTCAAGATGGTGGAGTACGTGCTGAGAGGCTCCCCAACGGAGGGTGGCGTGGCGGGGGCAGTGGCGGGCGGCGGAGATAAGGTGGCGGGCGTGGTGGCGGGCCTGCGAGGCATGGTGCTGGACGAGCGCTCAGAGGACAAGCCGGCCGCCGCCTCGCCCTTCGACAAGGACCTGCACGAGCTGCACGACCACGCCGCGCTGCAGAACGGACTGCACAACGGCCAGGACGACGACAAGGCCTTCAATCGCACCCCTGGCTCCCGGCAGCCGTCTCCCGCTGAGGAGGAGGGTGGTGTGGGCGGCATGAACAACGTGGGCACTGGCGGGGTGAACGCGGGCGGCGTGGGCGTCGGGGTAGGTGTGGGCGTTCGCAACGGCGAGGCGGGCGCCTTCCCACTGCTGCCGCACGCGCTGCCCGCGCCACAGCCGCAGCTGCACCACTTGCCGCATCTGCAGCACCCTCAGCATCATCCCGGTATGCTTGGAGTTGCTCCTCTCCAAGGATTGCCACATCCACAGCATCCTCAACTTCACCCTGGCATGAATCTGAACGACTCTATGAACCAGCACATTGAACTCGTGTTACAAATGGAGCAGCACCCGCAGTTTGACAACAACAGTTTTGCAAACACCCAGCAGTACGCGGGCGGCGTGGGCGGGGTGGGCGGCGTCGGTGGAGTGGGCGGCGTGAGCGCGTCCAGTAGCGGCGTGGGCGGCGCGGGGCTCGTGAACGGAGCGTCCGTGGTGCCCGCCGCGCCGGACTCGGCCCAGCACCACCAACCCTTCGAGCTGCAGTCAGGGTCGGTGGTCCGCGGGGATGCGCTACGGAACAACGCGCTTCAGAAGAAACTAAGGGATCTCTCTGTTGGGGAATAA

Protein sequence:

>DPOGS208734-PA
MKWPGTGGEEEAGEAGLQTARVSQSRAQDDAAVHYVFQRENQEADLSTLAPKQRWAVCEDSIAENQDKWKNHTPITMNNIQHQGHIQLKNSQTNQQYINQAIASSVGTLAINPSVIGSHPAHLTTNMVHTQLAPLQNHTLGNNLAIQNSMMLQPLQGMQNAPQMGQQGLYDIHQHPATPMQAPMNGMGKSAEHLMYLSQAGMLAPGTNQFLQQGQNQLGLTPVAAIRNQIAPSAKKLWDKGPGAGGDIKGPPHLPPLQLNPEQMWRDPTWSAQAVEHNVGVPMMGTRRGVAFPGGDASSILSPRDTAGLGVKMVEYVLRGSPTEGGVAGAVAGGGDKVAGVVAGLRGMVLDERSEDKPAAASPFDKDLHELHDHAALQNGLHNGQDDDKAFNRTPGSRQPSPAEEEGGVGGMNNVGTGGVNAGGVGVGVGVGVRNGEAGAFPLLPHALPAPQPQLHHLPHLQHPQHHPGMLGVAPLQGLPHPQHPQLHPGMNLNDSMNQHIELVLQMEQHPQFDNNSFANTQQYAGGVGGVGGVGGVGGVSASSSGVGGAGLVNGASVVPAAPDSAQHHQPFELQSGSVVRGDALRNNALQKKLRDLSVGE-