Monarch geneset OGS2.0

DPOGS206470
Transcript        DPOGS206470-TA  1302 bp
Protein           DPOGS206470-PA  433 aa
Genomic position  DPSCF300070 + 206509-212501
RNAseq coverage   3x (Rank: top 90%)
Annotation
Heliconius        HMEL011935        2e-130  75.96%
Bombyx            BGIBMGA005420-TA  8e-133  85.04%
Drosophila        Gr21a-PA          2e-70   37.41%
EBI UniRef50      UniRef50_Q7PQT4   2e-161  66.67%  AGAP003098-PA n=13 Tax=Endopterygota RepID=Q7PQT4_ANOGA
NCBI RefSeq       XP_001654839.1    7e-163  66.13%  Gustatory receptor 21a, putative [Aedes aegypti]
NCBI nr blastp    gi|157167525      1e-161  66.13%  Gustatory receptor 21a, putative [Aedes aegypti]
NCBI nr blastx    gi|312372530      5e-163  67.05%  hypothetical protein AND_20046 [Anopheles darlingi]
Group
Gene Ontology     GO:0050909  5.7e-26  sensory perception of taste
                  GO:0016021  5.7e-26  integral to membrane
KEGG pathway
InterPro domain   [64-429] IPR013604  5.7e-26  7TM chemoreceptor
Orthology group   MCL19515  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206470-TA
ATGATACCTGACCATCTCTTTGATGAGGGGATTAATAACTCATTGTTACGTAATGATATGAAACATGTTCATTTAAATAGAATTGTTTACAACAAAACTCAAAAGGATTATGAGCGTGATCAACGTAACCTGCTGTCATCCCAGGATGGTGACACTTGTGAGATACACGACCAGTTTTACAGAGACCACAAATTACTTCTGGTGTTGTTCAGAGCACTAGCGGTTATGCCGATCACACGATCAAGACCTGGCACGATTACATTTAGCTGGAAGTCTCGAGCGACGACGTACGCCATCTTTTTCTACATCGTCACCACAATCATTGTCCTGGTCGTCGGTTATGAACGTCTTATGATACTGCGTTCAATTAAAAAGTTTGATGATTACATATATTCCGTGCTATTTGTCGCTTTCCTTGTCCCGCACTTTTGGATCCCTTTCGTCGGATGGGGCGTCGCGCATCAAGTCGCAATATATAAAACGAATTGGGGAAAGTTTCAAGTGAGATATTATCGGGTGACTGGTGAAAATCTCAAATTTCCGAATCTTAAGACGTCTATCGTCATAATAAGTGTGGGATGTCTGTTACTCGCTGTCTGCTTTCTTCTCAGCCTGTGCGCATTGTTGGATGGTTTTCTTCTGCGTCACACGACAGCGTATTATCATATCATTACTATGATTAATATGAATTGCGCGTTGTGGTATATAAACTGCAAGGGGATTAAAATTGCATCACAGAGTCTCTCCAATTGCTTTAGCAGGGATGTGTCCATAGAATGTACGGCCAGTTTGATATCGAGCTATCGTTTCCTTTGGCTAAATTTGTCAGAATTGTTACAATCATTAGGAAACGCTTACGCCAGAACATATTCGACATATTGCCTTTTCATGTTTTTCAACATTACTATTGCTGTTTATGGCGCGTTGTCAGAAATTGTGGATCATGGTTTTCGGTTTAGTTTCAAGGAGATGGGCTTGATCGTTGATGCAGCTTATTGTTCAACGTTGCTTTTCATTTTCGCTGATTGCTCCCACAAGTCCACCCTGAAGGTCGCAGCTGGTGTCCAAGACTGTCTTCTATCCATCGATGTGTTGTCTGTTGATAGACCGACTCAAAAAGAGGTCGCAGCTGGTGTCCAAGACTGTCTTCTATCCATCGATGTGTTGTCTGTTGATAGACCGACTCAAAAAGAGATAGACCACTTCATTCAAGCTATAGAAATGAACCCAGCTGTGGTTAGCTTGAAGGGATACGCTCACGTAAATAGAGAACTACTAACATCGGTTTGTATAAACAGCTAG

Protein sequence:

>DPOGS206470-PA
MIPDHLFDEGINNSLLRNDMKHVHLNRIVYNKTQKDYERDQRNLLSSQDGDTCEIHDQFYRDHKLLLVLFRALAVMPITRSRPGTITFSWKSRATTYAIFFYIVTTIIVLVVGYERLMILRSIKKFDDYIYSVLFVAFLVPHFWIPFVGWGVAHQVAIYKTNWGKFQVRYYRVTGENLKFPNLKTSIVIISVGCLLLAVCFLLSLCALLDGFLLRHTTAYYHIITMINMNCALWYINCKGIKIASQSLSNCFSRDVSIECTASLISSYRFLWLNLSELLQSLGNAYARTYSTYCLFMFFNITIAVYGALSEIVDHGFRFSFKEMGLIVDAAYCSTLLFIFADCSHKSTLKVAAGVQDCLLSIDVLSVDRPTQKEVAAGVQDCLLSIDVLSVDRPTQKEIDHFIQAIEMNPAVVSLKGYAHVNRELLTSVCINS-
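
The transcript and protein lengths listed above are consistent with a complete coding sequence: 433 codons plus one stop codon give 3 x 434 = 1302 bp. The sketch below shows one way to check this kind of consistency and to translate a coding sequence into the protein notation used in this record. It assumes the standard genetic code and that the trailing "-" in the protein sequence marks the stop codon; the function names `translate` and `expected_cds_length` are hypothetical and not part of the geneset or any library.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
# Assumption: this record uses the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`; "-" is assumed here to match
    the trailing character of the protein sequence in this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_cds_length(protein_aa):
    """Length in bp of a CDS encoding `protein_aa` residues plus one stop codon."""
    return 3 * (protein_aa + 1)


if __name__ == "__main__":
    # First six and last four codons of DPOGS206470-TA, copied from the record.
    print(translate("ATGATACCTGACCATCTC"))
    print(translate("ATAAACAGCTAG"))
    print(expected_cds_length(433))
```

Running `translate` on the full DPOGS206470-TA sequence and comparing the result with DPOGS206470-PA would extend the same check to the whole record.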