Monarch geneset OGS2.0

DPOGS206703
TranscriptDPOGS206703-TA1596 bp
ProteinDPOGS206703-PA531 aa
Genomic positionDPSCF300048 + 1935871-1949982
RNAseq coverage157x (Rank: top 52%)
Annotation
HeliconiusHMEL0100341e-17973.96% 
BombyxBGIBMGA008534-TA1e-14091.42% 
Drosophilamoody-PA1e-9554.28% 
EBI UniRef50UniRef50_E2BBZ32e-9457.70%Gustatory receptor trehalose 1 n=10 Tax=Neoptera RepID=E2BBZ3_HARSA
NCBI RefSeqNP_569970.24e-9454.28%moody [Drosophila melanogaster]
NCBI nr blastpgi|3454789911e-9455.73%PREDICTED: G-protein coupled receptor moody-like [Nasonia vitripennis]
NCBI nr blastxgi|3454789911e-9356.09%PREDICTED: G-protein coupled receptor moody-like [Nasonia vitripennis]
Group
Gene OntologyGO:00071866.2e-47G-protein coupled receptor protein signaling pathway
GO:00160216.2e-47integral to membrane
KEGG pathwaybfo:BRAFLDRAFT_2146701e-21 
 K04163 (HTR7)maps-> Neuroactive ligand-receptor interaction
    Calcium signaling pathway
InterPro domain[190-436] IPR0002766.2e-47GPCR, rhodopsin-like, 7TM
Orthology groupMCL16503 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206703-TA
ATGATCATCGGTTTATTTGGCAACCTTCTCACTGTTGTAGCTCTCTTGAAATGTCCGAAAGTAAGAAACGTCGCCGCTGCGTTCATAATAAGTTTATGCATTGCTGACTTCCTCTTCTGTGCCATGGTTTTGCCTTTCGCCATATCCGGCTTCTGGACGAGAACTTGGTCCCACGGGGGGGCTCTTTGCAAACTCGTGCCGTTCCTCAGATATGGAAATGTAGGAGTATCTCTCTTGAGTATCGCTCTGATTACTTTAAACAGATATATCATGATAGCCCATCACAGCTGGTACGGGCGAGTCTACCGCAAGCACAACATAGCGCTAATGATTATCTTCTCGTGGATGTTTTCTTATGGGATGCAAATACCTACTCTCATCGGAGTCTGGGGTGAGTACGAACCAGCTGTATCTAGAGGTAAATTTGACTATGATCCAGAGCTGGGAACCTGTTCTATAGTTACCGATGAATTTGGACGTTCAGCTAAAACTGCGCTGTTTGTTATTGCATTCATAGTGCCAGCCTTACTCATTTTTATCTGCTATGCAAGAATATTTTGGGTTGTGCACAGTTTATGCATTGCCGACTTCCTCTTCTGTGCCATGGTTTTGCCTTTCGCCATATCCGGCTTCTGGACGAGAACTTGGTCCCACGGGGGGGCTCTTTGCAAACTCGTGCCGTTCCTCAGATATGGAAATGTAGGAGTATCTCTCTTGAGTATCGCTCTGATTACTTTAAACAGATATATCATGATAGCCCATCACAGCTGGTACGGGCGAGTCTACCGCAAGCACAACATAGCGCTAATGATTATCTTCTCGTGGATGTTTTCTTATGGGATGCAAATACCTACTCTCATCGGAGTCTGGGGTAAATTTGACTATGATCCAGAGCTGGGAACCTGTTCTATAGTTACCGATGAATTTGGACGTTCAGCTAAAACTGCGCTGTTTGTTATTGCATTCATAGTGCCAGCCTTACTCATTTTTATCTGCTATGCAAGAATATTTTGGGTTGTGCACAGTTCAGAGCAAAGGATGAGAGAACACCAGCGCTCTCAAAGCACGAATGCTGGCAGCCTCAATAATGATAAACGTTCTACAATAAAGGACACACGGGAAACGAAGGCTCGTCGTAACGAGTGGAGGATAACGAAGATGGTCCTTGCCATATTCCTTTCGTTCCTTGTTTGCTACCTTCCCATCACTATCGCGAAGGTCGCCGATAGTCACGTACATTTCCCTGTATTCCACATAGCGGGCTACCTCCTGCTGTACGCGAGCGCGTGTGTGAATCCCATCATCTATGTTATAATGAACGCTCAGTATCGCGCTGCTTACAAGGCTGCTCTGTGCTGCTCGCTGCCTTCGCCCTCCAGCGAAATGGAAAGAGCGTCACGGGTACAGCTTCAGCAACACACGAACGGTTTTAAGCCAAGTGTCGTTGGGGGAGTCTCGGGCCGACCATCGGGGATCAATGACATCAGGAAGATAGAGCATGGATCCAGATCGAGCTTCGGGCCCAAGAACGAGACGGCTACCAGCGGTTTAGGGTCCCAAAGGAACCTGAAAATAACGAGATTTGACTTAGATTAA

Protein sequence:

>DPOGS206703-PA
MIIGLFGNLLTVVALLKCPKVRNVAAAFIISLCIADFLFCAMVLPFAISGFWTRTWSHGGALCKLVPFLRYGNVGVSLLSIALITLNRYIMIAHHSWYGRVYRKHNIALMIIFSWMFSYGMQIPTLIGVWGEYEPAVSRGKFDYDPELGTCSIVTDEFGRSAKTALFVIAFIVPALLIFICYARIFWVVHSLCIADFLFCAMVLPFAISGFWTRTWSHGGALCKLVPFLRYGNVGVSLLSIALITLNRYIMIAHHSWYGRVYRKHNIALMIIFSWMFSYGMQIPTLIGVWGKFDYDPELGTCSIVTDEFGRSAKTALFVIAFIVPALLIFICYARIFWVVHSSEQRMREHQRSQSTNAGSLNNDKRSTIKDTRETKARRNEWRITKMVLAIFLSFLVCYLPITIAKVADSHVHFPVFHIAGYLLLYASACVNPIIYVIMNAQYRAAYKAALCCSLPSPSSEMERASRVQLQQHTNGFKPSVVGGVSGRPSGINDIRKIEHGSRSSFGPKNETATSGLGSQRNLKITRFDLD-