Monarch geneset OGS2.0

DPOGS205665
Transcript           DPOGS205665-TA    1251 bp
Protein              DPOGS205665-PA    416 aa
Genomic position     DPSCF300023 + 493005-494899
RNAseq coverage      316x (Rank: top 36%)
Annotation
Heliconius         HMEL006203          0.0       88.07%
Bombyx             BGIBMGA001014-TA    0.0       78.42%
Drosophila         CG5850-PA           2e-141    66.67%
EBI UniRef50       UniRef50_Q0E8R8     4e-140    66.67%    CG5850, isoform B n=18 Tax=Endopterygota RepID=Q0E8R8_DROME
NCBI RefSeq        XP_001603133.1      4e-145    65.69%    PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastp     gi|322788930        3e-145    66.93%    hypothetical protein SINV_05023 [Solenopsis invicta]
NCBI nr blastx     gi|322788930        7e-142    66.93%    hypothetical protein SINV_05023 [Solenopsis invicta]
Group
KEGG pathway
InterPro domain    [1-349]    IPR005178    4e-207    Protein of unknown function DUF300
Orthology group    MCL15386    Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205665-TA
ATGTTTATAGCTACTTATGTTCTATTAGTCATAATATTAGTGCCGCTGTTGATAGTTCATTCGATAAATAATGGTTTTACAAAGTCAGACCAAAGCACTTTGATAGGAGGAGGCTTTGTTCTTCTAGCTGTGCCTATCTCAATATGGCAGATCACTCAACACATAGTTCACTATACAAAACCATCTTTACAGAAACATATTATTAGGATATTATGGATGGTACCAATCTATGCTTTGAATGCATGGATAGGATTAGAATTTCCAGAGCAATCTATTTATATGGATGCATTAAGAGAGTGTTACGAGGCTTATGTTATATACAACTTTATGAAGTATTTATTTAACTACCTCAACGATGGTCAAGATCTGGAGGCATTATTGGAGACCAAACCTCAAGTTAATCATATATTTCCACTATGCTGTTTAACACCATGGGAGATGGGAAGTGAGTTTGTTCACAACTGTAAACATGGCATTCTTCAATATACCCTAATAAGGCCTCTAACAACCGTCATATCAATCATATGCGACCTTTGCGGAGTTTATGGAGAGAGTGATTTCAGTCCTAATGTGGCTTTCCCATATATAATAGCCATCAATAATTTGTCACAGTTCGTAGCTATGTACTGTTTGGTATTGTTCTATCGGGCAAATCGTGCCGAGTTAAAACCAATGAAACCCATTGGCAAATTTTTATGTATAAAAGCTGTTGTCTTCTTCTCGTTTTTCCAAGGAGTTATTATCAATATACTGGTATACTGTGGAGTGATATCTACTATATTTGATATATCAGACAATGACAAAATAAAAATTATATCGTCAAAACTACAGGATTTTCTTATTTGCATTGAGATGTTCTTAGCGGCTATCGCACATCACTATAGCTTCTCATATAAGCCGTACATATCACCATTGGCTCCGACAAATTCCTGTCTTGGTTCTTTCCTGGCTATGTGGGATGTGTCTGATGTGAAGAGAGATATATCGGAACATTTGGGTGTTGTTGGGAGTTCATTTAGTAGACGATGGCGTGGCAAATCTATGTATCATATGGCCAGAGGATATGATGAAAGTTCTAGACTAAACGAACCAACTGCCAGTTCGGCACCAAATATATCAGTTGAGAGTGTGGAACCAACTGTTGAAGTAAGGACATCTGTGTACGGATCCCTAGAGAACACTATGACGACGGGTTCATTGACCTCCGAAACAGAACCTTTGATACGGAACGATTCGGACCCCAATGTCTAA

Protein sequence:

>DPOGS205665-PA
MFIATYVLLVIILVPLLIVHSINNGFTKSDQSTLIGGGFVLLAVPISIWQITQHIVHYTKPSLQKHIIRILWMVPIYALNAWIGLEFPEQSIYMDALRECYEAYVIYNFMKYLFNYLNDGQDLEALLETKPQVNHIFPLCCLTPWEMGSEFVHNCKHGILQYTLIRPLTTVISIICDLCGVYGESDFSPNVAFPYIIAINNLSQFVAMYCLVLFYRANRAELKPMKPIGKFLCIKAVVFFSFFQGVIINILVYCGVISTIFDISDNDKIKIISSKLQDFLICIEMFLAAIAHHYSFSYKPYISPLAPTNSCLGSFLAMWDVSDVKRDISEHLGVVGSSFSRRWRGKSMYHMARGYDESSRLNEPTASSAPNISVESVEPTVEVRTSVYGSLENTMTTGSLTSETEPLIRNDSDPNV-
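
The lengths and coordinates in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the transcript is coding sequence only (start codon through stop codon, no UTRs), that the genomic coordinates are inclusive, and that translation uses the standard genetic code. The names `translate`, `first_codons`, and `last_codons` are illustrative, and the two short nucleotide strings are copied from the start and end of DPOGS205665-TA above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# Values copied from the record above.
TRANSCRIPT_BP = 1251
PROTEIN_AA = 416
GENOMIC_START, GENOMIC_END = 493005, 494899

# Assumption: the transcript is CDS only, so its length should equal
# three bases per residue plus one stop codon.
cds_matches_protein = TRANSCRIPT_BP == 3 * (PROTEIN_AA + 1)

# Assumption: coordinates are inclusive on both ends.
genomic_span = GENOMIC_END - GENOMIC_START + 1

# Bases inside the genomic span that are not in the transcript
# (presumably intronic, under the assumptions above).
non_transcript_bp = genomic_span - TRANSCRIPT_BP

# First 30 and last 15 nucleotides of DPOGS205665-TA.
first_codons = "ATGTTTATAGCTACTTATGTTCTATTAGTC"
last_codons = "GACCCCAATGTCTAA"

print(cds_matches_protein, genomic_span, non_transcript_bp)
print(translate(first_codons), translate(last_codons))
```

Under these assumptions the translated fragments can be compared with the start and end of DPOGS205665-PA, where the protein record writes the terminal stop as `-` rather than `*`.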