Monarch geneset OGS2.0

DPOGS207789
TranscriptDPOGS207789-TA1578 bp
ProteinDPOGS207789-PA525 aa
Genomic positionDPSCF300042 + 184462-187002
RNAseq coverage118x (Rank: top 58%)
Annotation
HeliconiusHMEL0175746e-16880.19% 
BombyxBGIBMGA005482-TA0.072.64% 
DrosophilaCG32447-PA1e-8744.17% 
EBI UniRef50UniRef50_Q9VNZ52e-8544.17%CG32447, isoform A n=12 Tax=Drosophila RepID=Q9VNZ5_DROME
NCBI RefSeqXP_969870.11e-10745.11%PREDICTED: similar to CG32447 CG32447-PB [Tribolium castaneum]
NCBI nr blastpgi|910924602e-10645.11%PREDICTED: similar to CG32447 CG32447-PB [Tribolium castaneum]
NCBI nr blastxgi|910924605e-10444.70%PREDICTED: similar to CG32447 CG32447-PB [Tribolium castaneum]
Group
Gene OntologyGO:00071864.7e-34G-protein coupled receptor protein signaling pathway
GO:00160214.7e-34integral to membrane
GO:00049304.7e-34G-protein coupled receptor activity
KEGG pathwaybfo:BRAFLDRAFT_1940269e-12 
 K04606 (GRM3)maps-> Neuroactive ligand-receptor interaction
InterPro domain[173-374] IPR0179784.7e-34GPCR, family 3, C-terminal
Orthology groupMCL15797 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207789-TA
ATGGCGGACGTAGCAGTGCGAGCCCTGCTCTTCACCGCGTTACTGGCCACCCTCGCTGACGCTGGTGCGGCACGGTTTGGAGATCGCCTAGCACCACGTGTTCCTGTTCCGTACGCACCACGCAACATTCCAGCTGAACTCCATTATTTAGAGAATGAGAACAATAGGGTGATAGATCAGAATTATGATGAAAACGTCAATGATGAACGAGTCGAGCCACAGAAAGACATTTTACCCTCACCGCGACGACCCACTCGTACGGAGGCACATCGACTAGAAATGAGCACAGAATATTTTGTCATACCTCATCGGAAGTCATCGGTACCAGCGGTTTCCACTTCAAGAGTAGTCGTCGTTTCCACATCACGTCCCGCAAATACAACCGCCTCAGTAATAATAGCACCTCACAATGTACAGATATTAAGAGCGCAACCATGGGCCGTACCAATATTGGCATTAGCGTGTGTGAGCATGGCAATTATTGGAGGTTTTGAAGCATTTATAGTGTGGGGTGCAAGTCGGAAAGCGCCAAGTCAACGTCATCTACTTCTAGGTCAATGTTTATTGTTCGGACTTTTTACTTGTGCAGCGACCGCAGCACTCTTTACCGCAGCGCCAACTGCATTTACATGTGGTGCCGTACGATTTGGAACAGGCGTAGCCTATGTGATTGTTTTTGCGTCCCTATTAGTGAAATGTGTATTTTTGTTAAGCTTAAATGGTGGAGTGTACTTACCAGCAGCATACCAAGGACTTTTATTATTCTTTGCCGTAATGATTCAAGTTGCAATAGGAGCACAGTGGTTAGGAGGGTCGCCGCCTAAAGTGGCCAACGGGGCCATGAAATGTGATTCACCACTTGCTGACCTATTACTTTCATTATGCTATGCCGCTTTTCTGATTGCTGTGGTTTGTGGTGTAGCGTTAAGATCACGAGGAATTCGCGATAATTATAGAGAAGCGACTCACATTGCTGGTGCTGGCGGTGCCACTGCAGCCGTGTGGATCTGTTGGATAGCGGCTGCATTAGGGGCACCGGAACAGCATCGTGAAGCATGTGTAGGAGCGGGCCTGATAAGCACATGTGCAGTCGTATTCGCTTTAATGTTTGCACCAAAAGGCAGACGATTAGCTGCACTAGGCCGCGAAGGAAGGTGGGATGCCGATAGAGAGGAAGGACTCAGCTCCATAGGCGCTGGTGGTTCTGGTTACTCACCGTCCTTTTTCCACTTCAAACCAGTTAAATACGGAATGGTGTCTGCCGCTGCCCCAGTTCCAACGCCAGCGCCTGTGATAGACCGCAAGGAGCACCAAGCTCAGCCAGCCGATTATTACGGAGCTCTGTACGCCGGTTCGCACCCGCACTCACGTTCACTGTGTCCACCGCCACATTACCCCCTGCACCCACTAACGCACTACCACTACAATTACCAGCACTATAATTATCTCCTGCCACCAGGCGTTATGATGCGTCCAGAGGAGGGCAACGTGTACACGAGCGTGGAGCCCACCTTCAGCAGCAACCCCAACGTGTATTTCCAACGACAGGAACCTTTGCACGTCGGCATGATGTACTGA

Protein sequence:

>DPOGS207789-PA
MADVAVRALLFTALLATLADAGAARFGDRLAPRVPVPYAPRNIPAELHYLENENNRVIDQNYDENVNDERVEPQKDILPSPRRPTRTEAHRLEMSTEYFVIPHRKSSVPAVSTSRVVVVSTSRPANTTASVIIAPHNVQILRAQPWAVPILALACVSMAIIGGFEAFIVWGASRKAPSQRHLLLGQCLLFGLFTCAATAALFTAAPTAFTCGAVRFGTGVAYVIVFASLLVKCVFLLSLNGGVYLPAAYQGLLLFFAVMIQVAIGAQWLGGSPPKVANGAMKCDSPLADLLLSLCYAAFLIAVVCGVALRSRGIRDNYREATHIAGAGGATAAVWICWIAAALGAPEQHREACVGAGLISTCAVVFALMFAPKGRRLAALGREGRWDADREEGLSSIGAGGSGYSPSFFHFKPVKYGMVSAAAPVPTPAPVIDRKEHQAQPADYYGALYAGSHPHSRSLCPPPHYPLHPLTHYHYNYQHYNYLLPPGVMMRPEEGNVYTSVEPTFSSNPNVYFQRQEPLHVGMMY-