Monarch geneset OGS2.0

DPOGS209840
TranscriptDPOGS209840-TA2163 bp
ProteinDPOGS209840-PA720 aa
Genomic positionDPSCF300117 + 844388-847945
RNAseq coverage2x (Rank: top 92%)
Annotation
HeliconiusHMEL0039261e-17878.27% 
BombyxBGIBMGA008065-TA0.076.49% 
DrosophilaCG7918-PA1e-9353.80% 
EBI UniRef50UniRef50_E2B7470.048.04%Probable muscarinic acetylcholine receptor gar-2 n=5 Tax=Formicidae RepID=E2B747_HARSA
NCBI RefSeqXP_001606246.17e-17846.05%PREDICTED: similar to g-protein-linked acetylcholine receptor gar-2a [Nasonia vitripennis]
NCBI nr blastpgi|3504016640.049.33%PREDICTED: hypothetical protein LOC100749279 [Bombus impatiens]
NCBI nr blastxgi|3227794720.051.09%hypothetical protein SINV_07997 [Solenopsis invicta]
Group
Gene OntologyGO:00071861.4e-36G-protein coupled receptor protein signaling pathway
GO:00160211.4e-36integral to membrane
KEGG pathwaynvi:1001226352e-177 
 K04134 (CHRMN)maps-> Neuroactive ligand-receptor interaction
InterPro domain[28-187] IPR0002761.4e-36GPCR, rhodopsin-like, 7TM
Orthology groupMCL14689 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209840-TA
ATGCTATTGCGATGTATTAGATATATTACTCGAAAGCCTCCATCCATAGCAAAACCGGTCGATTACTTTTTGCCTGAAAGTGGAAATGTTCTGGTACTACTCGCATTCCTCGTGGATAGAACTATACGACAGCCAAGTAATTATTTTATTGCTTCGCTGGCTGCCACTGATCTTCTGATAGGTACTTTGTCGCTCCCATTCTATACGGACTACGTATTAAAAGGGTACTGGCATTTAGGACCCTTGCTATGTGACCTTTGGCTTTCTGTCGACTACACAGTTTGTTTAGTGTCACAGTATACAGTACTTCTTATAACTGTAGACAGATTTTGTAGCGTAAAAATAGCAGCTAGATACAGAGCATGGAGAACAAAAAACAGAGTGATATGGATGGTTACCGTAACTTGGATAATACCAGCATTGTTATTTTTTACAACTATATTTGGTTGGGAACATTTCGTTGGATATAGAGACTTGCAAGAGGGTATTTTTAAAACAGCTTATGATATGCAAAAAAAAAGCGAAGCTAAACAGAGGAAAATGCAGTCAATGGTTGCTTTAAGTGCTGGTGTTATGACTGGCATGGCAGGAAGAGCTGCTGGAATGGCAGCATTATCAAAGGCTACAATATCAAGTGAAGATCAAATAAAAGTTTCAGAAGCAATCCGTAAGGATTCAACGTCTAAAGGTTCTTTAGCAGCGCCTTTATTAAATCTAGGTGGGTCAACCGATTCAAATTCCAGCCAAAAGTCATCGGCTCATAAAAGTAGTGCCACTGCAACAGGTACCTCAACAACGGCCGAATCAAATGAAGAAAAAAAAGAAGAACAAGTAGAAGCAGAAAGATCGAGCAGCCCTGCCTTTGATTCGGATGATGAAAGTACCACACAAACTAATATAAAAAAACGACCCTCGGTTGCGAATTTAGTCATGCAGACTGGTGCTATACAATTACTTAATAATATGCGACTTAATGGTAGTATGCTTATGAAACCTGAATTAAAACAATCTCCTATTATGAAGCATGAACTTGAAAAACCGAAGTCATCTTTATCACAAATATCAGAGAAGAGCCTTTTAGATACTGAAAACCCTGATACCTCACACAGTAATGGACAAACCCCTGCAGTAAGTGTACCCCTGTCCTCAGCTAGCTTTCTGTCCCCTCCATCTTCGGGGATGGCTACTCCGCCCGAAAGAGATGTTATCACTACAATTGCTCAAAGAATCATCCCACCTCCTTCAGAATTTAGAGGCAGTCCACCTGTGTCTGAAAGTCCTCCATCATATTCAGAATCATCTAGAACTAGAATCTTTAAGTTAATAAAGAGTGATGGTTGTGCTTCTGATGTTCTTATGGGTATGGATGGAGCCGATTTAAGGTATATGGATGAAAGTTCTATAATTGTCCCTACACCCACGAATGAAAGTCCACCATCTTCAATCACCTTTCCAACAGCTACCGAACCATCATCACCTCCAACCCTTAATTCAGGTATTATAACAAGTACTTCATTATTACAGGCTGCTCTTATACGAGCAACAGCTGAAGCTCAAGTAACACAACCTAAAACAAATTTATCTCCTCCACCACCTGTAAAAGTTAACACAGAGGTAAGTTTATCTACAGGAGAACGCCCAAAACCTATTATTACCCTCGTAACTTCGTCTTCCAATAATAATGAGACTCAAACTTTAAAACCACCTGATAAATCGTCAATATCGCGGTTAAACGAAACATCAGCTATTGAAAATAGTGAAATTTCTAGTCCTGCTGTAAGTGGATCTGGTCGAGGTGATTCCAGAAGAGATTTTGTAAAAAATATAGGCAAAAAATTAAAAGTAAAACGTTCAAAACGAGATGGATTATTCTCTAGCATAGGAAGGCAAAAATCGAAATCAGAAAATAGGGCTAGGAAAGCTTTTAGAACTATTTCTTTTATATTGGGGGCTTTTGTTGTATGTTGGACACCATATCATGTTTTGGCTTTAGTAGAAGGTTTCTGCACTGATCCTCCATGCATAAATCAACATCTTTATATGTTTTCATACTTTCTCTGCTATGCCAATAGTCCTATTAATCCTTTTTGTTATGCCTTGGCTAACCAACAATTCAAAAAAACTTTCACAAGACTTCTTAGGGGGGATTTCCATATAACCTAG

Protein sequence:

>DPOGS209840-PA
MLLRCIRYITRKPPSIAKPVDYFLPESGNVLVLLAFLVDRTIRQPSNYFIASLAATDLLIGTLSLPFYTDYVLKGYWHLGPLLCDLWLSVDYTVCLVSQYTVLLITVDRFCSVKIAARYRAWRTKNRVIWMVTVTWIIPALLFFTTIFGWEHFVGYRDLQEGIFKTAYDMQKKSEAKQRKMQSMVALSAGVMTGMAGRAAGMAALSKATISSEDQIKVSEAIRKDSTSKGSLAAPLLNLGGSTDSNSSQKSSAHKSSATATGTSTTAESNEEKKEEQVEAERSSSPAFDSDDESTTQTNIKKRPSVANLVMQTGAIQLLNNMRLNGSMLMKPELKQSPIMKHELEKPKSSLSQISEKSLLDTENPDTSHSNGQTPAVSVPLSSASFLSPPSSGMATPPERDVITTIAQRIIPPPSEFRGSPPVSESPPSYSESSRTRIFKLIKSDGCASDVLMGMDGADLRYMDESSIIVPTPTNESPPSSITFPTATEPSSPPTLNSGIITSTSLLQAALIRATAEAQVTQPKTNLSPPPPVKVNTEVSLSTGERPKPIITLVTSSSNNNETQTLKPPDKSSISRLNETSAIENSEISSPAVSGSGRGDSRRDFVKNIGKKLKVKRSKRDGLFSSIGRQKSKSENRARKAFRTISFILGAFVVCWTPYHVLALVEGFCTDPPCINQHLYMFSYFLCYANSPINPFCYALANQQFKKTFTRLLRGDFHIT-