Monarch geneset OGS2.0

DPOGS214534
Transcript: DPOGS214534-TA  2679 bp
Protein: DPOGS214534-PA  892 aa
Genomic position: DPSCF300287 + 137200-165460
RNAseq coverage: 5x (Rank: top 88%)
Annotation
Heliconius: HMEL010054  5e-138  96.34%
Bombyx: BGIBMGA010995-TA  3e-134  92.74%
Drosophila: mAcR-60C-PB  2e-134  63.02%
EBI UniRef50: UniRef50_Q7Q8W8  0.0  59.91%  AGAP010513-PA n=2 Tax=Endopterygota RepID=Q7Q8W8_ANOGA
NCBI RefSeq: XP_314486.1  0.0  59.91%  putative muscarinic acetylcholine receptor 1 (AGAP010513-PA) [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|31211039  0.0  59.91%  AGAP010513-PA [Anopheles gambiae str. PEST]
NCBI nr blastx: gi|189234226  0.0  64.20%  PREDICTED: similar to putative muscarinic acetylcholine receptor 1 (AGAP010513-PA) [Tribolium castaneum]
Group
Gene Ontology: GO:0007186  1.4e-78  G-protein coupled receptor protein signaling pathway
    GO:0016021  1.4e-78  integral to membrane
    GO:0016020  5.7e-31  membrane
    GO:0004981  5.7e-31  muscarinic acetylcholine receptor activity
KEGG pathway: aga:AgaP_AGAP010513  0.0
    K04131 (CHRM3) maps-> Salivary secretion
        Regulation of actin cytoskeleton
        Neuroactive ligand-receptor interaction
        Calcium signaling pathway
        Gastric acid secretion
InterPro domain: [341-859]  IPR000276  1.4e-78  GPCR, rhodopsin-like, 7TM
    [326-338]  IPR000995  5.7e-31  Muscarinic acetylcholine receptor
Orthology group: MCL14000  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214534-TA
ATGCTCATCGCTTTTAACCAAACAGCTAACTTGTCAGACCTGGCGTTCGATTCCGACCGTCCTGCCAGTCCCTTCAGCCTCGCCCAGAAGATCGTCATAGCCATCATCGCCAGCGTCCTCTCCGTACTCACAGTAGTCGGCAACTCCATGGTCATGATAAGCTTTAAGATCGACAAACAGCTGCAGACAATCAGCAATTACTTCCTGTTCTCGCTAGCGGTCGCCGACTTCGCTGTTGGTTTGATATCTATGCCACTATTCACAATGTTCACGATATACGGCTACTGGCCGCTGGGACCTCATATTTGCGACACCTGGCTAGCGTTGGATTACTTAGCGTCGAACGCATCCGTACTTAACCTTTTAATAATAAGTTTCGATCGATATTTTAGTGTGACGCGGCCCTTAACGTACAGGGCTAAGAGGACTACACGCCGCGCGATGGTTATGATAGGATGTGCGTGGGGTTTCAGTTTGGTGTTATGGCCGCCGTGGATTTACGCATGGCCGTATATAGATGGCGAGAGGAAGGTACCACCTCACGAATGTTACATACAGTTCATAGAGACAAACCAGTTTATCACGTTCGGAACAGCCATAGCTGCGTTTTATGTGCCAGTGACAGTGATGTGTATATTGTACTATAAAATTTGGAGGGAAACCAAAAAAAGACAGAAGGATCTGCCTAATCTACAAGGCGGCAAGAAACACGATTCATCGAAGAGATCTAATTCTAGAGCGGTTCTTCCAATCGAACGTGATATCGTCGGACGGGTCGGCCATGGACACGTCGGGCGCGGGCGTGGGGTCGGTGGGACGCGTCCGCTGTCGGTCGGCGAACGAGCGGCGGGGCGGGTGCCGCGGACTCGCCGAGTTGACGTCATGCTCATCGCTTTTAACCAAACAGCTAACTTGTCAGACCTGGCGTTCGATTCCGACCGTCCTGCCAGTCCCTTCAGCCTCGCCCAGAAGATCGTCATAGCCATCATCGCCAGCGTCCTCTCCGTACTCACAGTAGTCGGCAACTCCATGGTCATGATAAGCTTTAAGATCGACAAACAGCTGCAGACAATCAGCAATTACTTCCTGTTCTCGCTAGCGGTCGCCGACTTCGCTGTTGGTTTGATATCTATGCCACTATTCACAATGTTCACGATATACGGCTACTGGCCGCTGGGACCTCATATTTGCGACACCTGGCTAGCGTTGGATTACTTAGCGTCGAACGCATCCGTACTTAACCTTTTAATAATAAGTTTCGATCGATATTTTAGTGTGACGCGGCCCTTAACGTACAGGGCTAAGAGGACTACACGCCGCGCGATGGTTATGATAGGATGTGCGTGGGGTTTCAGTTTGGTGTTATGGCCGCCGTGGATTTACGCATGGCCGTATATAGATGGCGAGAGGAAGGTACCACCTCACGAATGTTACATACAGTTCATAGAGACAAACCAGTTTATCACGTTCGGAACAGCCATAGCTGCGTTTTATGTGCCAGTGACAGTGATGTGTATATTGTACTATAAAATTTGGAGGGAAACCAAAAAAAGACAGAAGGATCTGCCTAATCTACAAGGCGGCAAGAAACACGATTCATCGAAGAGATCTAATTCTAGTGACGAAACCAAAGAAATAGATGGTCGAGCAAGATCCGAGTCCGGGGATGCTGATTCAGTGTATCACGTGAGGGGTGCACTCCACGACGCCAGGTGGAGAGACAATCAGGCCTTATCCCAACGTCCAAAGCGAGGCTGGGCTGCGGTAAGGGACTGGTGTGTCGCTTGGTGGCACTCTGGTAGAGAAGACCTCGAGGATACAGAACCGGAGGAAGAGCCATCTGACCCTGGGTATGCCACACCCGTGTCAGTTGAGACGCCGTTGCAGAGTACTGTGTCCAGATGCACATCTCTGAATGTAATAAGAGATCCATACGCTGGCCGCGGGGGATCGGGGGGGTCGAGTGTCACGGATGGAGGGACTTCTCCACTCCGACGTAA
TTTCGAGACGCCTGCCCCTATACCAGCTGCCAGAGACAGCCGATCGCTGCCACCGAACACCAGAATCAACACCTCCGCGTCACCAGCCCCAAAATCAGCATCTGCTGATTCGGTTTACACCATCCTTATCAGATTACCAGATGCTGATACAGAAAGACCCAGCATTAAAATGATCACCGAAGAGTCTCCACCGACGAATACAAGAACACACTATCGACCTGCTCGAGGGGATTCCGAACTAAACATACACCCAGCTGGTCACGCTGCACTAACCAGACGGACATCACACATACAAGACGTGAGAATTCCTCTAAATGCGAAAATTATACCGAAACAGCTGGCTGGCAAAGGGATTACTTCAAAACAGCCAAAGAAAAAGAAAACTCAAGAGAAGAAACAGGAATCAAAAGCTGCGAAAACGCTCTCAGCGATATTGTTATCTTTCATCATCACTTGGACGCCTTATAATATCCTCGTGCTATTGAAACCACTCACAGCATGTACCAAGTGTGATGAACTTTGGTCTTTCTTTTACGCGCTATGCTACATCAACTCCACTATAAATCCCGTGTGTTATGCCCTATGCAACGCGACGTTCAGGAGAACGTACGTTAGAATTTTGACTTGTAAATGGCATAATAGAAATAGGGAAGCAATGACAAGAGGAGTGTACAATTAG

Protein sequence:

>DPOGS214534-PA
MLIAFNQTANLSDLAFDSDRPASPFSLAQKIVIAIIASVLSVLTVVGNSMVMISFKIDKQLQTISNYFLFSLAVADFAVGLISMPLFTMFTIYGYWPLGPHICDTWLALDYLASNASVLNLLIISFDRYFSVTRPLTYRAKRTTRRAMVMIGCAWGFSLVLWPPWIYAWPYIDGERKVPPHECYIQFIETNQFITFGTAIAAFYVPVTVMCILYYKIWRETKKRQKDLPNLQGGKKHDSSKRSNSRAVLPIERDIVGRVGHGHVGRGRGVGGTRPLSVGERAAGRVPRTRRVDVMLIAFNQTANLSDLAFDSDRPASPFSLAQKIVIAIIASVLSVLTVVGNSMVMISFKIDKQLQTISNYFLFSLAVADFAVGLISMPLFTMFTIYGYWPLGPHICDTWLALDYLASNASVLNLLIISFDRYFSVTRPLTYRAKRTTRRAMVMIGCAWGFSLVLWPPWIYAWPYIDGERKVPPHECYIQFIETNQFITFGTAIAAFYVPVTVMCILYYKIWRETKKRQKDLPNLQGGKKHDSSKRSNSSDETKEIDGRARSESGDADSVYHVRGALHDARWRDNQALSQRPKRGWAAVRDWCVAWWHSGREDLEDTEPEEEPSDPGYATPVSVETPLQSTVSRCTSLNVIRDPYAGRGGSGGSSVTDGGTSPLRRNFETPAPIPAARDSRSLPPNTRINTSASPAPKSASADSVYTILIRLPDADTERPSIKMITEESPPTNTRTHYRPARGDSELNIHPAGHAALTRRTSHIQDVRIPLNAKIIPKQLAGKGITSKQPKKKKTQEKKQESKAAKTLSAILLSFIITWTPYNILVLLKPLTACTKCDELWSFFYALCYINSTINPVCYALCNATFRRTYVRILTCKWHNRNREAMTRGVYN-
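
The lengths reported for this record (2679 bp transcript, 892 aa protein) are consistent with a coding sequence of three nucleotides per residue plus one stop codon: 3 x 892 + 3 = 2679. The Python sketch below shows one way such a check could be run on FASTA records like the two above. The helper names `read_fasta` and `cds_matches_protein` and the toy record are hypothetical and are not part of the Monarch geneset tooling. It assumes the transcript is coding sequence only and that a trailing `-` or `*` in the protein marks the stop codon.

```python
def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            name = line[1:]
            records[name] = ""
        elif name is not None:
            records[name] += line
    return records


def cds_matches_protein(cds, protein):
    """Return True if the CDS length is 3 nt per residue plus one stop codon."""
    residues = protein.rstrip("-*")  # drop a trailing stop marker, if present
    return len(cds) == 3 * len(residues) + 3


# Toy record (hypothetical data): a wrapped 12-nt CDS and its 3-residue protein.
toy = ">toy-TA\nATGCTC\nATCTAG\n>toy-PA\nMLI-\n"
recs = read_fasta(toy)
print(cds_matches_protein(recs["toy-TA"], recs["toy-PA"]))

# The same arithmetic applied to the lengths stated in this record's header.
print(3 * 892 + 3 == 2679)
```

To apply the check to this record, the text of the two FASTA blocks above would be passed to `read_fasta` and the entries `DPOGS214534-TA` and `DPOGS214534-PA` compared with `cds_matches_protein`.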