Monarch geneset OGS2.0

DPOGS204003
TranscriptDPOGS204003-TA1032 bp
ProteinDPOGS204003-PA343 aa
Genomic positionDPSCF300419 + 31556-33706
RNAseq coverage19x (Rank: top 80%)
Annotation
HeliconiusHMEL0052613e-14494.94% 
BombyxBGIBMGA000094-TA6e-5892.37% 
DrosophilaSPR-PA1e-13364.74% 
EBI UniRef50UniRef50_B4NCZ34e-13667.63%GK10126 n=8 Tax=Endopterygota RepID=B4NCZ3_DROWI
NCBI RefSeqNP_001108346.10.091.57%sex peptide receptor [Bombyx mori]
NCBI nr blastpgi|1688234230.091.57%sex peptide receptor [Bombyx mori]
NCBI nr blastxgi|3070064110.091.28%sex peptide receptor [Helicoverpa armigera]
Group
Gene OntologyGO:00071863.3e-14G-protein coupled receptor protein signaling pathway
GO:00160213.3e-14integral to membrane
KEGG pathwaydre:5693821e-09 
 K04282 (TRHR)maps-> Neuroactive ligand-receptor interaction
    Calcium signaling pathway
InterPro domain[8-315] IPR0194272.3e-247TM GPCR, serpentine receptor class w (Srw)
[1-24] IPR0002763.3e-14GPCR, rhodopsin-like, 7TM
Orthology groupMCL17947 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204003-TA
ATGTATGGATACATAGCACCGTTTCTCCTCGCGACCACAATTGTTGCGAACACTCTGATAGTTGTGGTCCTTTCTCGACGTCATATGAGAACTCCAACAAATGTTGTACTTATGGCCATGGCATTGTGCGATATGTTCACCATGCTTTTCCCAGCACCGTGGCTATTTTATATGTATACTTTTGGCAACCATTACAAACCACTGAGTCCTGTCCATGCATGTCAGGCTTGGCATTACATGAATGAGGTGATCCCAGCAATGTTTCACACAGCAAGTATATGGCTCACTCTTGCTCTAGCAGTTCAGCGATATATCTACGTTTGTCATGCACCTGTAGCTAGGACATGGTGCACAATGCCTAGAGTCAAGAAGTGTCTGATTTATATCGGCGTAGCAGCCTTCCTTCACCAGCTACCCCGGTTTTTTGATCACCAGTACATCCCCCATCTGACATACTGGCGAGGGCATATGGAACACGTTTGCGAGGTGGAGATAGCGCCCTGGGTGAAAACAATATCTCTGGATGCGTACTTCATCACCTACTTCGCATTCAGAGTTCTGTTTGTCCACTTAATACCATGTACGTCGCTCGTCGTTCTGAATGTACTGCTGTTTCGAGCGATGAGAACAGCACAAATAAATAGACAAAAATTGTTCAAAGAAAACCGAAAGTCAGAATGCAAGAGACTCAGAGATTCGAATTGCACAACCCTTATGCTCATTGTTGTAGTCACAGTTTTTCTCATGGTAGAAATACCAGTGGCCGTTGTGACTATACTACATATTATATCAAGCACAATCGTTGAGATTCTGGACTACCATATTGCAAATATTCTGATATTAATAACGAATTTTTTTATAATTGTCTCCTATCCTATAAATTTTGCTATTTATTGCGGTATGTCGAGACAATTTAGGGAGACATTTAAAGAATTGTTCATTAGAGGCGCCGTCACCACTCGTAACGGAGGTTCGAGCAGATATTCACTAGTTAACGGCCCTAGAACTTGCACGAATGAAACAGTACTGTAA

Protein sequence:

>DPOGS204003-PA
MYGYIAPFLLATTIVANTLIVVVLSRRHMRTPTNVVLMAMALCDMFTMLFPAPWLFYMYTFGNHYKPLSPVHACQAWHYMNEVIPAMFHTASIWLTLALAVQRYIYVCHAPVARTWCTMPRVKKCLIYIGVAAFLHQLPRFFDHQYIPHLTYWRGHMEHVCEVEIAPWVKTISLDAYFITYFAFRVLFVHLIPCTSLVVLNVLLFRAMRTAQINRQKLFKENRKSECKRLRDSNCTTLMLIVVVTVFLMVEIPVAVVTILHIISSTIVEILDYHIANILILITNFFIIVSYPINFAIYCGMSRQFRETFKELFIRGAVTTRNGGSSRYSLVNGPRTCTNETVL-