Monarch geneset OGS2.0

DPOGS209917
TranscriptDPOGS209917-TA1356 bp
ProteinDPOGS209917-PA451 aa
Genomic positionDPSCF300684 - 8135-11633
RNAseq coverage1x (Rank: top 94%)
Annotation
HeliconiusHMEL0149190.072.61% 
BombyxBGIBMGA011579-TA0.085.23% 
DrosophilaGr21a-PA4e-15865.84% 
EBI UniRef50UniRef50_Q9VPT13e-15665.84%Gustatory and odorant receptor 21a n=19 Tax=Endopterygota RepID=GR21A_DROME
NCBI RefSeqXP_319142.13e-16968.14%gustatory receptor (AGAP009999-PA) [Anopheles gambiae str. PEST]
NCBI nr blastpgi|312348945e-16868.14%AGAP009999-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|312348942e-16268.14%AGAP009999-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00509093.4e-51sensory perception of taste
GO:00160213.4e-51integral to membrane
KEGG pathway 
InterPro domain[57-420] IPR0136043.4e-517TM chemoreceptor
Orthology groupMCL18376 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209917-TA
ATGTATACGGAAAGACATTTCAATTTATATAAACGAGAAAAACTACACAATAGAGACTTTAATGGCAAGGAAGGAAAAGCGTATGAAGCTAAGGACGTCTACGGACCGGAGATTACAGACAAAGACGGAGAGCTGCTTGATCAGCACGACAGCTTCTACATTAATACAAAGAGCCTGCTGGTGTTGTTCCAAATCATGGGCGTGATGCCAATTATGAGAGTTCCAAAAGGTGCTCAAACTACGAAACGAACGACTTATAATTGGATATCCAAGGCAACATTATGGGCGTATTTAGTTTGGAGTCTTGAGAGTGTCATCGTTATTAAAGTTGGCAGAGAGCGATACTACAATTTTCAACAGAACACGAATAAGAGATTTGACGAAGTTATTTACAACATCATATTTTTAAGTATACTTATCCCTCATTTTCTTCTCCCAATCGCGTCTTGGCGTCATGGACCAGAAGTCGCTATATTTAAAAATATGTGGACACACTACCAGCTGAAATATCTCAAGATCACTGGAACGCCAATCGTTTTTCCTAAACTATATTCTCTTACCTGGGGTTTATGTGTTTTTTCTTGGGCCTTGAGTTTTGCCGTTATTCTCTCTCAAAACTACCTGCAAGACGATTTTGAATTATGGCAATCTTTCGCTTACTATCACATTATAGCCATGTTAGATGGTTTCTGTTCGCTATGGTATATTAATTGTAATGCGTTTGGAACAGCATCCAAAGGATTAGCGACAAATCTTCATAAGGCTCTTGAAGCAGAACATCCAGCTTTGAAATTGGCTCAGTACCGTCACCTCTGGGTAGATTTATCTCACATGATGCAACAGCTAGGCAGAGCTTACTCGAACATGTATGGAATTTATTGCATGGTGATTTTCTTCACAACCACAATATCATTATACGGATCTCTCTCTGAAATATTAGATCATGGCTTTAGTTACAAGGAAATGGGACTATTTGTAATAGTGGGTTACTGCATGACTCTATTATTTATAATTTGCAACGAAGCCTATCATGCTACGAGGAAGGTTGGATTGGAATTTCAAGTTCGACTTCTCAACGTAAATCTTGGCGCAATAGATCGCAGCACACAGCGAGAGGTTGAGATGTTTCTTGTGGCCATTGAAAAAAACCCACCGATTATGAACTTGGATGGCTTCACAAACATTAACAGAGAATTGTTTGCAGCTAATATATCCTTTATGTCAACCTATTTGATCGTGTTGATGCAATTTAAGCTGACGCTAGTGAGACAAGGGACGAAGAAAGTTTTCAAATCAATCGTAGATGCCATATTTAATATAACAACTACAGTGTCACAAGATGATGTCGAAGAGTAA

Protein sequence:

>DPOGS209917-PA
MYTERHFNLYKREKLHNRDFNGKEGKAYEAKDVYGPEITDKDGELLDQHDSFYINTKSLLVLFQIMGVMPIMRVPKGAQTTKRTTYNWISKATLWAYLVWSLESVIVIKVGRERYYNFQQNTNKRFDEVIYNIIFLSILIPHFLLPIASWRHGPEVAIFKNMWTHYQLKYLKITGTPIVFPKLYSLTWGLCVFSWALSFAVILSQNYLQDDFELWQSFAYYHIIAMLDGFCSLWYINCNAFGTASKGLATNLHKALEAEHPALKLAQYRHLWVDLSHMMQQLGRAYSNMYGIYCMVIFFTTTISLYGSLSEILDHGFSYKEMGLFVIVGYCMTLLFIICNEAYHATRKVGLEFQVRLLNVNLGAIDRSTQREVEMFLVAIEKNPPIMNLDGFTNINRELFAANISFMSTYLIVLMQFKLTLVRQGTKKVFKSIVDAIFNITTTVSQDDVEE-