Monarch geneset OGS2.0

DPOGS214516
TranscriptDPOGS214516-TA1503 bp
ProteinDPOGS214516-PA500 aa
Genomic positionDPSCF300287 - 373375-378893
RNAseq coverage67x (Rank: top 67%)
Annotation
HeliconiusHMEL0178350.071.90% 
Bombyx% 
DrosophilaSobp-PA3e-5050.23% 
EBI UniRef50UniRef50_D6WA415e-5556.22%Sine oculis-binding protein n=1 Tax=Tribolium castaneum RepID=D6WA41_TRICA
NCBI RefSeqXP_967801.11e-5556.22%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|910774562e-5456.22%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastxgi|1571308287e-7638.48%hypothetical protein AaeL_AAEL011893 [Aedes aegypti]
Group
KEGG pathway 
Orthology groupMCL20634 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214516-TA
ATGAATGAGCTGCTCGGCTGGTATGGCTATGAGCGTTTGGAGCTTCGGAGATGGGCAGCTTCCAGAGCCAGGGACAGTCTTGAACAGGAACAAAAAGGAAAAGCAGATGAATGTTCGTGGTGCAACAAGAGTGTCGCCAGTGAGAGTGGCGCCTTACAGCAAGCTGGGGCGTTGTTCTGTTCGGAGCTGTGCTTCAGTCAGTCACGGCGAGCCAACTTCAAACGCGCCAAGACATGCGATTGGTGTCGTCACGTGAGACACACTGTCGCCTATGTGGATTTCCAGGACGGAGCCACTCAGCTTCAGTTCTGCTCAGACAAATGTCTGAACCAATACAAGATGCACATCTTCTGTCGCGAGACACAGGCCCACCTCGACCTCAACCCGCACTTAGTGAACGCGGCGTCGTCGTCGAACCTCATCACGCCAGAACTATGGCTCAAGAATTGCAGGAGCAGATCTATATCACCCACATCAGAAGGATCTGGAACACCGAATGACAAAAACGACGACACATGCCAAAGAAAATCCCCACTCCCACTCATAACTATAGCGCCACCAGCCAAGTTAATGAATCCGAAACCACCAGAAGACAGGCCGGTGCAGAAGTCTCCCGAGACCAAGAAGGATCTGAGAACCAAAATGAATTTACGTAAACGTAGAACGTCCAAGTGCTCGACGGTGACGTCACAAACTGTCCGGCAACGAAGTATAACACCAAAAACCCAAGACTTCAGGATGCTGAGTCCCTCGATGGACGGTTCGTCGCCCGCGTCCTGCGTCACGAGCAACAATCCCGTACACTCTCCACCACACATGAACCAACCGATCCCACCACCGTTTCCGAATCCCATGTTCGGGATGCCGCCGCCGGTCTTCATGGACAGCAACCACATGAATGACCACAGAAATCCCATGTTTCAACCGAGAGTGAATTTCATGCCACCGCCTGGCATGCACCAAGAGAGACCGAGATTATTCCCACCGTTGAATTTCCACCAGCCAATAAACCAGGCTCCGCCAGTGACGGTACTGGTTCCTTACCCCGTCGTCATACCCGTCCCGATACCCATCCCGATACCTCTACCCCTGAGCTCGTTCATTCAAGCCCATTGCACCAACAAAGTCAAAACTGAAGTCAACACCGACGACGCCGAGGGTCCTTTAGACTTCACGATGAACCCCGCCAAGAAAAACGAATGCAACCAACCAGAGGTCCACGAAGAGATCGATCCGGCGGCCACGGAACAAGCTAGTCAACAAATAAATAATCATAACGAGAGAGTGGACGACGACTCCCAAAATAATACAGAGACCAACCCCGAACAGACGCTGCCGAAGTTCAAGATAACGCGATTGGGTAACAAGATGGCGAAAATCGTTTCAAAAACGAGAGAGAACGCCGAGTCGTCCAGACCTTTGAGGAAAAGACGGAGACTCGTGGAAGTCGCTACCGACGAAGAGACGCTCATTCCTAAAACAAGGAAAATTGTACAAGTTTAA

Protein sequence:

>DPOGS214516-PA
MNELLGWYGYERLELRRWAASRARDSLEQEQKGKADECSWCNKSVASESGALQQAGALFCSELCFSQSRRANFKRAKTCDWCRHVRHTVAYVDFQDGATQLQFCSDKCLNQYKMHIFCRETQAHLDLNPHLVNAASSSNLITPELWLKNCRSRSISPTSEGSGTPNDKNDDTCQRKSPLPLITIAPPAKLMNPKPPEDRPVQKSPETKKDLRTKMNLRKRRTSKCSTVTSQTVRQRSITPKTQDFRMLSPSMDGSSPASCVTSNNPVHSPPHMNQPIPPPFPNPMFGMPPPVFMDSNHMNDHRNPMFQPRVNFMPPPGMHQERPRLFPPLNFHQPINQAPPVTVLVPYPVVIPVPIPIPIPLPLSSFIQAHCTNKVKTEVNTDDAEGPLDFTMNPAKKNECNQPEVHEEIDPAATEQASQQINNHNERVDDDSQNNTETNPEQTLPKFKITRLGNKMAKIVSKTRENAESSRPLRKRRRLVEVATDEETLIPKTRKIVQV-