Monarch geneset OGS2.0

DPOGS209161
TranscriptDPOGS209161-TA1689 bp
ProteinDPOGS209161-PA562 aa
Genomic positionDPSCF300061 - 163660-169848
RNAseq coverage195x (Rank: top 48%)
Annotation
HeliconiusHMEL0097510.064.83% 
BombyxBGIBMGA011478-TA5e-1483.78% 
DrosophilaHPS1-PA1e-3223.45% 
EBI UniRef50UniRef50_E0VH591e-5727.61%Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VH59_PEDHC
NCBI RefSeqXP_970712.22e-5827.21%PREDICTED: similar to HPS CG12855-PA [Tribolium castaneum]
NCBI nr blastpgi|3454914156e-5826.96%PREDICTED: Hermansky-Pudlak syndrome 1 protein homolog isoform 2 [Nasonia vitripennis]
NCBI nr blastxgi|1892348265e-5526.64%PREDICTED: similar to HPS CG12855-PA [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL14037 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209161-TA
ATGAAGGCTATATTACTTTTTGATACTGTTAACGACTTGATATTTTCTAAGTGGGATGACGATTTTTTGGAACGTATGAAATCGTTTAATGGTCAGGAAAAAGATGTAAACATCACTGATAATCACCACATATCCCAACTCCTGTCTCCAATCATCACCTCCCAGCGGGTGATGGCGGCGCAGTTCAGTAATACATACACTTCCATGCAATGTAAAGATAACACCACTATCGTTTTTGACGAGTTGCTGGACCATGTGTTAATGATCATTTGTGAAGACAGAGTAGACGATGCCCAGCGAGAGCTGATGGACTGCAAAACTCTTGTGCAGCATATATGCGGACAGAATATGAATCTCCTACACTCGCAAGTGTATCGAGAATGGCTATCTGTGTTGTTGGAGTCTCGCGGCAAGGGTGATTCAATCCCGGGAGCTAGTGGTGTCATCGGTGAAAGTGGGGCCACGGCAGCTGCACTGAGTGCCTTGAAGACCATCTCCAAAGAAATAAAATGGTCGCATCCACATTACCACCTGTTATTATATGTGGGCGACAAAATGTTGGCTCTTTACTCTAGTCGCGGTTGTGAAGATCTTTTGCCGCCTGATTTGATCCTGCTGAGCATTCAGTGCATTGCAGCCCAGGAGTACTGGAGTGAACAACAGGAAGAGGAAAAAACTACCCTTTGTGATGATATTCACCTGCCATGGCTGTCAACTGAAAACAGTGCTATAGTACACTTGATATCCTCGGCTGGTAGGGCTTGCATCCCTCACTCGATGCATCTGGCACCCCTGGATACCAGGATTGTGTTTGTGGTCCTCATTGACATGGAGATGAGAGATGTCGGTGTGTCAGTGCAAATGTCAAGTCAAATCTTATCAAATCTACGGAGGCTGCTATTACAAAGGAATCTAGAGATGCTTCCCAACACACTGGACTCGTTGGAACAAGCTTTGAAGAAGACAACTGATGCTCTCCGTAAGAACAAATCAAATTCCAATCTATGCGCTCGTCTCACAAGCAGGATGTTGGAACTGAGAAAGTCATGCAACACAACCACACCTCTAACCCCTGAAACGACAGCTACGGCGATGCAAACAGCTTTAGAAGCTGTCATAGAACAGCTCAAACCAGACATACCCAGCTTGAAGATGAAACAGCCTTTGAAGGAGTTGAGGAACCTCCTCACGCCTTACATAGACTTCCTGAAATACGTGGAGGAGTTCCCGGGGCTGATCCACTTCATTTACATCGACCGCGGCACCGGCAGGTTCCTGGCGCCGGACATGGCGGACTGTGTGGACATGCTGACACCCGACACGGTCCGGGAAATCGTGTCCCGGGCGACGAGCGCCGTCCGCGAGGGTTACGGGGCGTGCGTGTGGCGTCGCGGAGCGCTACATGTATGTGCCGTGAGATGGTTCGAGCGGCGAGGGTCGAGCGTGCGCCCCTCCGCCCCGCCGCACCCCGCCGCCGTCCGCGCGCTACCCCCACCCGGAGACATCTGCGCCGCCTTCTACAGGCAGCTGGTTGAACTGTCGTTCCCGGAGGAGTCGTGTGTGTGTGTGAAGGAGCTGGTGTGTGTGCACGTGGGTCTGGTTCCTCCGGCCACGGCGGTGCAACAGGCGCGGGCGCTGGCTCACGCGGTCATTGACCTGGCCGGGGACGATCATCACCACCTCCTGTGA

Protein sequence:

>DPOGS209161-PA
MKAILLFDTVNDLIFSKWDDDFLERMKSFNGQEKDVNITDNHHISQLLSPIITSQRVMAAQFSNTYTSMQCKDNTTIVFDELLDHVLMIICEDRVDDAQRELMDCKTLVQHICGQNMNLLHSQVYREWLSVLLESRGKGDSIPGASGVIGESGATAAALSALKTISKEIKWSHPHYHLLLYVGDKMLALYSSRGCEDLLPPDLILLSIQCIAAQEYWSEQQEEEKTTLCDDIHLPWLSTENSAIVHLISSAGRACIPHSMHLAPLDTRIVFVVLIDMEMRDVGVSVQMSSQILSNLRRLLLQRNLEMLPNTLDSLEQALKKTTDALRKNKSNSNLCARLTSRMLELRKSCNTTTPLTPETTATAMQTALEAVIEQLKPDIPSLKMKQPLKELRNLLTPYIDFLKYVEEFPGLIHFIYIDRGTGRFLAPDMADCVDMLTPDTVREIVSRATSAVREGYGACVWRRGALHVCAVRWFERRGSSVRPSAPPHPAAVRALPPPGDICAAFYRQLVELSFPEESCVCVKELVCVHVGLVPPATAVQQARALAHAVIDLAGDDHHHLL-