Monarch geneset OGS2.0

DPOGS209376
Transcript: DPOGS209376-TA, 1260 bp
Protein: DPOGS209376-PA, 419 aa
Genomic position: DPSCF300118 + 144848-148435
RNAseq coverage: 8x (Rank: top 86%)
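The lengths and coordinates in this record can be cross-checked with simple arithmetic. The sketch below assumes that the genomic coordinates are 1-based and inclusive, and that the transcript length counts the coding sequence plus one stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 1260
PROTEIN_AA = 419
GENOMIC_START, GENOMIC_END = 144848, 148435


def expected_cds_length(protein_aa: int) -> int:
    """Coding length in bp for a protein, counting one stop codon (assumption)."""
    return (protein_aa + 1) * 3


def genomic_span(start: int, end: int) -> int:
    """Span in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end - start + 1


print(expected_cds_length(PROTEIN_AA))             # 1260
print(genomic_span(GENOMIC_START, GENOMIC_END))    # 3588
```

Under these assumptions the 419 aa protein accounts for the full 1260 bp transcript, and the genomic span (3588 bp) is longer than the transcript, which would be consistent with the gene containing introns.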
Annotation
Heliconius: HMEL018114, 1e-92, 51.88%
Bombyx: BGIBMGA011379-TA, 6e-23, 25.35%
Drosophila: Or92a-PA, 5e-07, 27.14%
EBI UniRef50: UniRef50_C4B7U7, 3e-43, 28.00%, Olfactory receptor n=6 Tax=Obtectomera RepID=C4B7U7_BOMMO
NCBI RefSeq: NP_001091792.1, 3e-44, 28.00%, candidate olfactory receptor [Bombyx mori]
NCBI nr blastp: gi|51127352, 2e-60, 33.42%, putative chemosensory receptor 20 [Heliothis virescens]
NCBI nr blastx: gi|51127352, 4e-61, 33.42%, putative chemosensory receptor 20 [Heliothis virescens]
Group:
Gene Ontology: GO:0016020, 2.6e-33, membrane
GO:0007608, 2.6e-33, sensory perception of smell
GO:0005549, 2.6e-33, odorant binding
GO:0004984, 2.6e-33, olfactory receptor activity
KEGG pathway:
InterPro domain: [72-384] IPR004117, 2.6e-33, Olfactory receptor, Drosophila
Orthology group: MCL34801 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209376-TA
ATGGCGGATGTAACCGAAAATGGAACGAAGTTTAAGAATATTCGTGAAACTTATAAAATAAGTACGTTCTTGTTGACGCTGGGTTTGATGTACCCGAATCCAGTTTACGACGAGAAAAGAAAGAAACTGATAGTTATGGCAATTTTGTCTGTTCTTCCTGCATTTACTCTAGCAGGCAACGATATCCGTCTCCGCATCTTGAACAACGACATGGCGAATGTCGTCCGTCAGTCAATCATCTTAGTATCTATCACCTTCGTCATCATAAAATTCATTATCTCGATAGCACGTAAGGATGAATTCAGATCTCTCTTAGAAGAAATCGACGCAGACTATGAAAGGTTCAACGGAATGTCCGAACAGTATCAGGACCTGGTTGAGGATACCATCAGAAAGACCAGGAAAGTGGAAAAGTCTTGGTGTTTTGTGTTGCTAGTGACTACAGCGTCGTATCCGGTACTAGCTGGTGCATGCACTATATACTCCCAGTTATTTTCTGACTATCCCAGACGCTACATGGTGCACGAGTTGAAGGCTATCCTGATATCAGAGGAACAAAAGTATCAGAGTCCTTACTTCGAAATAGTCAGTCTCTACACCATGTACATAGTTATTTTTCTTTTTATTGGCTTCACGGGATTTGATGGCATGTTCTCGGTATGCCTCCTCCACGTTTCCTTAAAATTGAAAATTTACCAGGAAAACCTAAGAAACCTTTTCAATGAGAAAGACATTCAAAAAATAAAGTATAACATCGGCTTATTTGTAAAAAACCATTGTGGAGTGTTGAGACTAATAGCTAAAATCCAGACATGTTTCGAAGTGTGGCTCGTCGGCATTTTCATCAACGCGGTTGTACAAATTGGAATGGCGTTCACACAAATAACAAATCAAACCGAAAGTGACATCAATCAAATGTACTACTTATACGCCCTGGCGACCGTTGTACATATATACCTCCCTTGCTACTTCGCGTCCGACGTTACATACAACGCAGCTGAAATCGCTAATGTGGCGTACAGCAGTTCGTGGGAGCGGGTCCAAGACTCTAAGATCAGGAGTTCCATCTGCTTTATCATAGCCAAATGTCAGACGCCAGTACGATTAACAGCTCTTGATATGCTCACCTTTAACATGGAATTGTTCGTCTCGCATGACTTCCTTCATCCTACCATCCATCCATTGTCTGTGAACCCACACGGTGCTGTTGATACAACACACGATCGACAAAAGAACAGCACACACGCTCATAATACATAA

Protein sequence:

>DPOGS209376-PA
MADVTENGTKFKNIRETYKISTFLLTLGLMYPNPVYDEKRKKLIVMAILSVLPAFTLAGNDIRLRILNNDMANVVRQSIILVSITFVIIKFIISIARKDEFRSLLEEIDADYERFNGMSEQYQDLVEDTIRKTRKVEKSWCFVLLVTTASYPVLAGACTIYSQLFSDYPRRYMVHELKAILISEEQKYQSPYFEIVSLYTMYIVIFLFIGFTGFDGMFSVCLLHVSLKLKIYQENLRNLFNEKDIQKIKYNIGLFVKNHCGVLRLIAKIQTCFEVWLVGIFINAVVQIGMAFTQITNQTESDINQMYYLYALATVVHIYLPCYFASDVTYNAAEIANVAYSSSWERVQDSKIRSSICFIIAKCQTPVRLTALDMLTFNMELFVSHDFLHPTIHPLSVNPHGAVDTTHDRQKNSTHAHNT-