Monarch geneset OGS2.0

DPOGS209412
TranscriptDPOGS209412-TA3444 bp
ProteinDPOGS209412-PA1147 aa
Genomic positionDPSCF300346 + 142334-150357
RNAseq coverage2x (Rank: top 92%)
Annotation
HeliconiusHMEL0085820.069.58% 
BombyxBGIBMGA014261-TA1e-17176.58% 
Drosophilamtt-PD0.069.49% 
EBI UniRef50UniRef50_B3MFX40.069.49%GF12584 n=7 Tax=Drosophila RepID=B3MFX4_DROAN
NCBI RefSeqXP_001958834.10.069.49%GF12584 [Drosophila ananassae]
NCBI nr blastpgi|3838554300.068.32%PREDICTED: uncharacterized protein LOC100881789 [Megachile rotundata]
NCBI nr blastxgi|1947530550.069.30%GF12584 [Drosophila ananassae]
Group
Gene OntologyGO:00071862.5e-98G-protein coupled receptor protein signaling pathway
GO:00160212.5e-98integral to membrane
GO:00049302.5e-98G-protein coupled receptor activity
KEGG pathwaydan:Dana_GF125840.0 
 K04611 (GRMN)maps-> Neuroactive ligand-receptor interaction
InterPro domain[227-239] IPR0003372.5e-98GPCR, family 3
[780-1019] IPR0179784.1e-72GPCR, family 3, C-terminal
[258-655] IPR0018287.3e-69Extracellular ligand-binding receptor
[424-435] IPR0001622.2e-13GPCR, family 3, metabotropic glutamate receptor
[697-748] IPR0115009.2e-13GPCR, family 3, nine cysteines domain
Orthology groupMCL10076 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209412-TA
ATGGTGCTAGGGTGGATGTTCGCTGTCCTCCTGCTGGCCGCCGTCTCGTCTCGCCATCATCGGCCTCGTCTATCTATCACCGACCTCAACAATTCATCTGATATTTCTATTATTATGTCTAACATCGAATTTGTAGATGTCGAAGAATATTATACACTTACAGATAAATCTCGTCACAAGCGAGAATCTCACCCTCTGATCGGTCCCTTCGACATGTCGAAGATGTCTAGAAATAGAAAACGTCCCAGAGCCGACAAGCAGCTGCAGCCGACCGCTTCCCTGCTTCTCTCAAACGTATCTGTATCTGATGTCTCTCAAGCTACGTCGTCTACTCCTGATAACACTAAAAATGAAATCACAACCGAGTACATCACCGACACTAACAATACACCCAAAGAAAATATACTATTGAATAATTTAGAAAGAAGTACAATGAATAGTTTACTTGAAGGTGACACAAAAAATATTAAACTCGGGAACATTACTTCGATAGTTGAAGGAAAATCAAAAATAAGAAATAAAATGCCAAGGAAATCTAAAAGTGTTGCGGCAAGAAAACGTGTTTTAAATATTAGAAATAGTGCTGAAAGTTCAGTGAGAACCAGTGACTTACGACTGATGAGTAAGAAAAAAACTGACAATGAAAGTTTATGGGCAGTGAAACATGCAGCGGTCGTGGAAGGTGACATCATACTTGGAGGACTTATGATGGTCCACGGACGGGCCGGCGGCTCCTCCAGCTGCGGTCCACTGATGGCTCAAGGCGGGGTGCAGGCGCTCGAGGCCATGTTGTTTTCCTTGGACGAAGCTCAGAGAGCTGGACTCGCGCCGCCGGGAGTCAGGCTCGGAGCGCTCGTGCTGGATGACTGTGACAGTGACACTCGGGGACTGGAGATGGCCTTGGACTTCATTAAAGGTTCGATTGGCAACATCGACGACGAGGAGTACGCTTGTAACGCCACAACCGTCAGGAAAGTGATCACGGGAGTAGTGGGCGCCTCTTCTTCAGTGACTTCCGTACAAGTAGCGAATCTGTTGCGACTCTTTAAGATACCCCAGGTATCATTCTTCTCGACTTCTCCGGAATTGTCTAACAAAGCTCGTTTCGAGTATTTTACAAGAACTATTCCTTCCGACCTTCATCAAGTCCGAGCTTTGGTTGAAATTGTGAAAAAGTTCGGTTGGAGATATGTCTCTATCATATATGAAGAATCCAATTATGGTATAAAGGCATTTGAAGAGTTGGAAACTTTGCTTTTACGTGAAGACATCTGTATCGCAGTCAAAGAAAGATTGGTGAAGGACTCAGGTGCGGCCGACGACCGAGCCTACGATACTATAGTAGAACGCTTACTATCTCGACCGCGGGCCCGAGGTGTTATCGTGTTTGGGTCGGACCAAGAAGTGGCAGGAGTGATGGCGGCGGTGGGTAGGCGCGGGGCGGCGGGGTCTTTTGGCTGGGTCGGGTCGGACGGCTGGAGCGCCCGTGCGTTGGTCGCTGCTGGAAACGAACCTGTCGTCGAGGGCACAATCAGCGTGCAGCCACAATCCAATCCCGTCAGAGGCTTCCGAGACTATTTTCTTTCGCTCACTCCTCGGAACAACATTCGTAACCCCTGGTTTGTCGAATTTTGGGAAGAACAATTTCGCTGTCGTTACCCCGGCAGCCCACGCTCTCGTACCAACGCTCAGTACGAGCCCTGTTCTGGTACGGAACGTCTTTCTGTCGACAACACGGAATTCGAAGCCCAGCTACAGTTCGTCACTGACGCCGTGTGGGCGTTTGTTTATGCTATTCGGGACATGCATAAGGATGCGTGCGGTGGTAAACCGGGACTGTGTGACGAGATGCGACCTGTTAGCGGCCCCGTATTGTTGCGATACTTGCGACAAGTGCGATTTATAGGTCTTAGCGGAGACGAGTTTCATTTCGACTCACATGGTGACGGACCAGCTCGATACAATATTTTACATTTCAAACAAGTTTCACAAGGAGTCTATCGTTGGGTTAAAGTGGGTCGCTACCTTGATGGAGTATTGGAATTGAATATGGACGAAATTCAATTTAAATGGGATCAACCAAAACACCCAGAATCAGTTTGCAGCGCTGAATGTGACACGGGACAAGCAAAACAATATGTGGAAGGAGAGAGCTGTTGTTGGCACTGTTTCAATTGCACTCAATATGAGATCCGGTCTCCATCGCTGGAGACGTCCTGTGTGGCTTGTCCTCTTGGCACTCTTCCTGACGCGCGCCGCATGCACTGCGCTCCCGTGCCCGAGCTCTACCTCCGACCCGACACGCCCGCAGCTATCGGCGCCATGGCTTTTTCATCATTGGGGATACTCTTGACATTGGTCGTGGGTGGTGTGTGGATCGCCCGTCGTGGTACGCCTGTGGTACGAGCGAGTGGTCGTGAGCTGAGCGGTGTATTACTCGCCGGTATATTCATGTGTTACCTCGTCACCTTTGCGCTCGTATTGCGACCTACTGACTTCCTCTGTGCTGTACAGCGATTCGCAACAGGTTTTTGTTTCACCGTGATTTATGCAGCTCTCCTCACCAAAACGAATAGAATAGCTCGTATATTCGATGCCAGCAAACAATCTGCGCGTAGGCCCTCACTAATTTCACCCAAATCTCAACTTCTTATTTGTTCTATTTTGGTGTCAATTCAGGTCGTCATCGTGGTCGTATGGCAGTCGGCGTCACCGGCCCGCGCTATTCACCACTATCCTACTCGCGAAGACAACATGCTGGTTTGTGACTCTTACGTCGATGCGTCTTATACGATCGCTTTCTTTTATCCCGTCGTACTCATCGTCGTCTGTACCGTGTACGCCGTTCTCACCAGGAAGATACCTGAAGCTTTTAATGAGAGCAAACACATTGGATTCACAATGTACACGACCTGTGTCATCTGGTTAGCATTCGTACCTCTATACTTCGGCACCTCAAGTCAAGTGCCGCTGCGCGTTACGAGCATGGCGGTCACCATATCACTGAGCGCCAGTGTCACGTTGGCCTGTCTGTTCGCACCTAAGATGCACATCATACTGTTTCACCCAGAACGTAATGTTCGTGGCACCTTGACGGCGTCCCGGTGGCGCGGGAGAGGCACGGCGCCGGGCGCGGTGTGCGCCGCTCTGGTGGGAGCCGCGCCTCTCCCTCGCTCCGCTCACACTCCGTCCACTGACCTCTCCACACTCGACGTCACGGAGCGGTCGACGTCGACGGATCGTCAGGTACAGACGGACGAAATCGAGCCACTGCCTCGTAATTGTGCCGTAGAACGGAATGGTCTCCATTTAGATCCAAACAAATTATCCGAACTGACTTGCGGACTTCCCACTTTCGAGGGTGTGGCCATTAGACGCCGGCCAAGAGACGAGTGCACAGCGAGACTCGCGGTCGCTGTGGTCGGAGCTCGGACGGGGTTATAG

Protein sequence:

>DPOGS209412-PA
MVLGWMFAVLLLAAVSSRHHRPRLSITDLNNSSDISIIMSNIEFVDVEEYYTLTDKSRHKRESHPLIGPFDMSKMSRNRKRPRADKQLQPTASLLLSNVSVSDVSQATSSTPDNTKNEITTEYITDTNNTPKENILLNNLERSTMNSLLEGDTKNIKLGNITSIVEGKSKIRNKMPRKSKSVAARKRVLNIRNSAESSVRTSDLRLMSKKKTDNESLWAVKHAAVVEGDIILGGLMMVHGRAGGSSSCGPLMAQGGVQALEAMLFSLDEAQRAGLAPPGVRLGALVLDDCDSDTRGLEMALDFIKGSIGNIDDEEYACNATTVRKVITGVVGASSSVTSVQVANLLRLFKIPQVSFFSTSPELSNKARFEYFTRTIPSDLHQVRALVEIVKKFGWRYVSIIYEESNYGIKAFEELETLLLREDICIAVKERLVKDSGAADDRAYDTIVERLLSRPRARGVIVFGSDQEVAGVMAAVGRRGAAGSFGWVGSDGWSARALVAAGNEPVVEGTISVQPQSNPVRGFRDYFLSLTPRNNIRNPWFVEFWEEQFRCRYPGSPRSRTNAQYEPCSGTERLSVDNTEFEAQLQFVTDAVWAFVYAIRDMHKDACGGKPGLCDEMRPVSGPVLLRYLRQVRFIGLSGDEFHFDSHGDGPARYNILHFKQVSQGVYRWVKVGRYLDGVLELNMDEIQFKWDQPKHPESVCSAECDTGQAKQYVEGESCCWHCFNCTQYEIRSPSLETSCVACPLGTLPDARRMHCAPVPELYLRPDTPAAIGAMAFSSLGILLTLVVGGVWIARRGTPVVRASGRELSGVLLAGIFMCYLVTFALVLRPTDFLCAVQRFATGFCFTVIYAALLTKTNRIARIFDASKQSARRPSLISPKSQLLICSILVSIQVVIVVVWQSASPARAIHHYPTREDNMLVCDSYVDASYTIAFFYPVVLIVVCTVYAVLTRKIPEAFNESKHIGFTMYTTCVIWLAFVPLYFGTSSQVPLRVTSMAVTISLSASVTLACLFAPKMHIILFHPERNVRGTLTASRWRGRGTAPGAVCAALVGAAPLPRSAHTPSTDLSTLDVTERSTSTDRQVQTDEIEPLPRNCAVERNGLHLDPNKLSELTCGLPTFEGVAIRRRPRDECTARLAVAVVGARTGL-