Monarch geneset OGS2.0

DPOGS208503
TranscriptDPOGS208503-TA1041 bp
ProteinDPOGS208503-PA346 aa
Genomic positionDPSCF300064 - 733474-739991
RNAseq coverage24x (Rank: top 78%)
Annotation
HeliconiusHMEL0087442e-16785.97% 
BombyxBGIBMGA010605-TA3e-11574.33% 
DrosophilaCG8784-PA3e-9554.38% 
EBI UniRef50UniRef50_A7YII33e-15175.71%Pheromone biosynthesis-activating neuropeptide receptor isoform B n=15 Tax=Obtectomera RepID=A7YII3_HELVI
NCBI RefSeqNP_001036977.18e-15577.18%pheromone biosynthesis activating neuropeptide receptor [Bombyx mori]
NCBI nr blastpgi|1885362735e-15678.40%pheromone biosynthesis-activating neuropeptide receptor [Spodoptera exigua]
NCBI nr blastxgi|1885362735e-15180.49%pheromone biosynthesis-activating neuropeptide receptor [Spodoptera exigua]
Group
Gene OntologyGO:00071866e-47G-protein coupled receptor protein signaling pathway
GO:00160216e-47integral to membrane
KEGG pathwaytgu:1002223133e-52 
 K05053 (NMUR2)maps-> Neuroactive ligand-receptor interaction
InterPro domain[47-321] IPR0002766e-47GPCR, rhodopsin-like, 7TM
Orthology groupMCL12874 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208503-TA
ATGGATTTGGACGACGAGGAGTTGCAATCGTTGAATGAAACCAATGACACGCAGTCCGGTTTCGCTGAACCCGAATCGTTGGATGTGATAGTTCCTTTGAGCGTGATATATGCCATTATATTTGTAACTGGAATCTTAGGAAACATAAGCACATGTGTTGTCATCGGTCGGAACAGATCTATGCACACAGCTACAAACTTTTACCTATTTAGCTTAGCGATATCCGATCTTCTCCTGCTTATTTGCGGACTACCGTTGGAAGTGCATAGACTGTGGAACCCACTGTCTTATCCACTAGGAGAAGCGTTATGTATCACAGTCGGTTTGATATCAGAAACATCAGCCAACGCTACTGTGTTGACAATAACAGCGTTCACTGTGGAAAGATATATCGCCATATGCCGTCCATTCATGTCGCACAAGATGTCCAAGTTGTCTAGAGCAGTGAGATACATTATAGCTATATGGATTTGTGCTTTGTGTTCCGCGGTACCTCAAGCTATGCAGTTCGGTGTTGTCTCCTATAAAGAGAACGGTCAGAATATAAGTGCATGTACAGTAAAAGGTCACGGTGTGCACCAAGTCTTTGTTATATCTAGTTTTGTGTTTTTCGTTGCTCCCATGTCTTTGATAACTGTGTTATATGCGTTGATTGGTCTCAAGTTACACACGTCACGGGTTTTGCATCCAGTCAAGAAGTCATCGGTTGAAAGCGGTGACCGTCCTAATGGTACTCCTCGATACAGAAACGGCGCCTCCCAAAGAAGAGTCATTAGGATGTTAGTCGCAGTGGCGTTGTCGTTTTTCCTCTGTTGGGCGCCATTCCACGTTCAGAGGCTGATAGCGATATATGGCAAGAATATGGAACATCCAACAGACACATTCTATAAGGTGTACATAGTCCTGACGTACGTGTCTGGTGTGCTTTACTTCCTATCCACTTCAATTAATCCGTTCCTGTACAACATCATGTCGAACAAATTTAGAAATGCCTTCAAGGCGATGTTCGGTAAACTGGGGATAACATATAATGGACACTAA

Protein sequence:

>DPOGS208503-PA
MDLDDEELQSLNETNDTQSGFAEPESLDVIVPLSVIYAIIFVTGILGNISTCVVIGRNRSMHTATNFYLFSLAISDLLLLICGLPLEVHRLWNPLSYPLGEALCITVGLISETSANATVLTITAFTVERYIAICRPFMSHKMSKLSRAVRYIIAIWICALCSAVPQAMQFGVVSYKENGQNISACTVKGHGVHQVFVISSFVFFVAPMSLITVLYALIGLKLHTSRVLHPVKKSSVESGDRPNGTPRYRNGASQRRVIRMLVAVALSFFLCWAPFHVQRLIAIYGKNMEHPTDTFYKVYIVLTYVSGVLYFLSTSINPFLYNIMSNKFRNAFKAMFGKLGITYNGH-