Monarch geneset OGS2.0

DPOGS208799
TranscriptDPOGS208799-TA1281 bp
ProteinDPOGS208799-PA426 aa
Genomic positionDPSCF300036 - 330101-333479
RNAseq coverage373x (Rank: top 32%)
Annotation
HeliconiusHMEL0055570.092.73% 
BombyxBGIBMGA007660-TA0.084.42% 
DrosophilaCG13966-PA6e-6440.78% 
EBI UniRef50UniRef50_D6WEM62e-8852.59%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WEM6_TRICA
NCBI RefSeqXP_969467.21e-8952.04%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|1892356562e-8852.04%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastxgi|3838485214e-7648.53%PREDICTED: uncharacterized protein LOC100881413 [Megachile rotundata]
Group
KEGG pathway 
Orthology groupMCL15862 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208799-TA
ATGCATGAATCTCAACCCGACCGTGGCGACATGACCTCTGACCAGCTGACCTCCGAAACAGTTCAAACATATCAGTTTTCATTGTCCCACAGCCGGAGCGAGTCGGCTCTCAACAGCCCCACGGAGCCAGATATTTCATTAGCAGCATCCCTAACAGATCCTCTTGAGGCGCATACCCTTGACGAAGCTCGGAGAACTATCAGAGATCTTCGTATGAAATATCGAGCTCAGGCGCATCAGTTGTTGACATGGCGCCGCGCGCATCGAACCCAAGAGGAACTAGTCACCCGTCTTCAGCGAGAGAAGGCTGAGCAGCTCAAATCATTATCCAGCCAACTCCTCTTATTCGAATCGCGACTGGTTCGTAAACAGAAGGAGATCACTAGCATGTTGGCGTTACGAGAGACGATTATAATTAAACAGCAAAAGGTAATCGAATCTCTCCAGGCCAAGCTGCTGGATAATGGCATTGATACGTCACAGAACATCCCAGATTTCAGGGATATGTTACAGGACACTCACATCATAGGTGACTTTGACTCCCTTAACGATTCAGATTCAGCTGTCATAATGGAAGACGTCGATTCGGACTGCAGCAACTTGCCCAATGTGCCGAGATTTAGATCCGCCAACCCAGATACCGTCACTATAGTCAGATCTATCTCTGACGCCATCGATCCTAATCTAAAGTACAGCATCGTCAGACGCTCGAACGGGTTTCTGAGGAGACCTGAGATTTTGGAGACCGTTTACAGTGTGGAAGAGGAAGCGGACAATGAATCAACAAAAGGCCTAAGTGCACAGAACAGCACGGAGAAAGAATTGAAAGACGGAAACAAAACTGACGAACAGAAATGTAAAGAAACAACTCTTCTTGCACAGAGACGAGATAACTTCCGCATGAGGAGCGTAGTGCTGACGGGAGAAGTGAAAAGTATAGATAACGACAAGGAGGTGAAGCCGAAAACTAGCGGCGAACTGTGGTCCTATAGCTACGTGCCAAAAAGAATGATGCCCGCCAACGATTCGGACGAGGAAGCTACTTCGAACCCAGAAAGCGACGAGGGAGAGGCGGAACCGCGCTCCAACCCGGTAGTGACTTACAACCGGGTCATGTCTAACCACAGGAACGTGACTAAACCTAAGGACGTGAAATACAAGAGGATCAATAAAGCCAAGTCGAAGAGCTTGGAGGAACTGCGAGGCAGATTAAAGAATTGGGTTGAAAAGGGGAATAAGTTAACCGACATGCCATTGGAGCACGCTCAGAGCTACGCTTGA

Protein sequence:

>DPOGS208799-PA
MHESQPDRGDMTSDQLTSETVQTYQFSLSHSRSESALNSPTEPDISLAASLTDPLEAHTLDEARRTIRDLRMKYRAQAHQLLTWRRAHRTQEELVTRLQREKAEQLKSLSSQLLLFESRLVRKQKEITSMLALRETIIIKQQKVIESLQAKLLDNGIDTSQNIPDFRDMLQDTHIIGDFDSLNDSDSAVIMEDVDSDCSNLPNVPRFRSANPDTVTIVRSISDAIDPNLKYSIVRRSNGFLRRPEILETVYSVEEEADNESTKGLSAQNSTEKELKDGNKTDEQKCKETTLLAQRRDNFRMRSVVLTGEVKSIDNDKEVKPKTSGELWSYSYVPKRMMPANDSDEEATSNPESDEGEAEPRSNPVVTYNRVMSNHRNVTKPKDVKYKRINKAKSKSLEELRGRLKNWVEKGNKLTDMPLEHAQSYA-