Monarch geneset OGS2.0

DPOGS200211
Transcript: DPOGS200211-TA (1224 bp)
Protein: DPOGS200211-PA (407 aa)
Genomic position: DPSCF300328 - 199862-201615
RNAseq coverage: 13x (Rank: top 82%)
Annotation
Heliconius: HMEL010767; 2e-137; 58.29%
Bombyx: BGIBMGA010296-TA; 6e-60; 37.00%
Drosophila: Or59a-PA; 1e-11; 22.41%
EBI UniRef50: UniRef50_C4B7Y1; 8e-83; 43.90%; Olfactory receptor n=1 Tax=Bombyx mori RepID=C4B7Y1_BOMMO
NCBI RefSeq: NP_001166615.1; 1e-83; 43.90%; olfactory receptor 53 [Bombyx mori]
NCBI nr blastp: gi|379070050; 8e-123; 50.37%; putative odorant receptor OR28 [Cydia pomonella]
NCBI nr blastx: gi|379070050; 8e-122; 50.37%; putative odorant receptor OR28 [Cydia pomonella]
Group
Gene Ontology:
GO:0016020; 3e-45; membrane
GO:0007608; 3e-45; sensory perception of smell
GO:0005549; 3e-45; odorant binding
GO:0004984; 3e-45; olfactory receptor activity
KEGG pathway:
InterPro domain: [4-405] IPR004117; 3e-45; Olfactory receptor, Drosophila
Orthology group: MCL34345 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200211-TA
ATGGCTTCTAAAATAATTAGAATTTTTGAACGATTGGAAAGCCCCAAATACCCTCTCCTGGGCCCAAATTTACAAGGACTTTATTGGTTCGGTCTCTGGCAATGTGGGAATAGGATCCGTGACGGTTTATTTAATATTTTGCATTTAGCCTCTGTTCTCTTCGTTCTGTCTGAATTCGTAGAACTATTTGCTATGGAGATCGATCTTATGAAAATTCTCTTTAACGTTTCTGTAACCGCATTAAGTTTGGTAACAGTATGCAAAACCGTCTTGTTTATTTATTATCTTCCGCATTGGAAGAATCTAGTAAATAGCATATCAAAATTAGAACAAGAACAATTAAAAAGCAATAATTTCAAATTAGTTGCAATAATTAAAAGATATACATTATATTCTAGAGTAATAACCTATTCATTTTGGTCAGTGATTTGTATAACTAGCCTCTTGACTGTTACGGCACCGTTTTTAAAATATATTACTTCGCCAAGCTACAGACAAAGCATTCAAAATGGCACCGAGCTCTATCCCCAGATTTTGAGTTCTTGGTTTCCATTTGATAAGACTAAAATGCCCGGTTACCTAATAGCTGTTTCTATTCATATTATCATGACTACCCAAGGGGCTGCTATTGTAGCCGTTTATGACTCGACCGCAGTCGCTATAATGTCATTTTTAAAAGGACAACAGATTTTATTGCGATATAAATGTGAAAGAATTTTTGGTTTAAATGAGGTGATACCAACAGAAAAAGTTTTAGCTAATATTGAAGAATGTCATCGTTTGCATTGCTTTCTACTGGAGCAACACCATAGATTCAACTCAATTACATCACCTGTTATGATCCTTTATGTTTTGGTATGTTCTGTTATGATGTGTTGTAGTGTTGTGCAGTTAAGTCTGGGTCATTTAAGCACGTCTGAAAAGTTATGGGTTATAGAATTTACAACGGCATTGATTACGCAGCTCTTTTTGTATTGTTGGCATAGTAATGAAATTACGTATGAGAGTAATTTAGTAGACCGTGGGGTCTACGCTAGTAATTGGTGGAGAGGTGATGTTAAAGTTAAGAGACAAATCTTGATTCTAGCTGGAAAGTTAGCCCCCTCATTAATACTCAAGGCCGGGCCAGTCACGACACTTTCTATGGCTACATTTATAAGCATCCTGAAGGGTTCATATAGCTTCTACACTCTAGTAACACAGATGCAGGAAAATCAAATATAG

Protein sequence:

>DPOGS200211-PA
MASKIIRIFERLESPKYPLLGPNLQGLYWFGLWQCGNRIRDGLFNILHLASVLFVLSEFVELFAMEIDLMKILFNVSVTALSLVTVCKTVLFIYYLPHWKNLVNSISKLEQEQLKSNNFKLVAIIKRYTLYSRVITYSFWSVICITSLLTVTAPFLKYITSPSYRQSIQNGTELYPQILSSWFPFDKTKMPGYLIAVSIHIIMTTQGAAIVAVYDSTAVAIMSFLKGQQILLRYKCERIFGLNEVIPTEKVLANIEECHRLHCFLLEQHHRFNSITSPVMILYVLVCSVMMCCSVVQLSLGHLSTSEKLWVIEFTTALITQLFLYCWHSNEITYESNLVDRGVYASNWWRGDVKVKRQILILAGKLAPSLILKAGPVTTLSMATFISILKGSYSFYTLVTQMQENQI-
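
The transcript and protein records above can be checked against each other. The following is a minimal Python sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and assuming the trailing "-" in the protein record marks the stop codon. The names `CODON_TABLE` and `translate` are illustrative and are not part of the gene set or its tooling. It translates the first seven and last three codons of DPOGS200211-TA and confirms that the stated lengths (1224 bp, 407 aa) agree once the stop codon is counted.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon, stopping at the first stop.

    The stop codon is rendered as `stop_symbol` (assumed here to be "-",
    matching the protein record above).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            protein.append(stop_symbol)
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of DPOGS200211-TA.
print(translate("ATGGCTTCTAAAATAATTAGA"))  # MASKIIR
# Last three codons of DPOGS200211-TA (ends in the stop codon TAG).
print(translate("CAAATATAG"))  # QI-
# 407 residues plus one stop codon, three bases each, gives the 1224 bp transcript.
print(3 * (407 + 1) == 1224)  # True
```

Passing the full nucleotide sequence to `translate` in the same way should reproduce the full protein record, provided the assumptions above hold.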