Monarch geneset OGS2.0

DPOGS203809
Transcript         DPOGS203809-TA    3327 bp
Protein            DPOGS203809-PA    1108 aa
Genomic position   DPSCF300010 + 2007149-2017195
RNAseq coverage    46x (Rank: top 71%)
Annotation
Heliconius         HMEL013313        0.0  93.22%
Bombyx             BGIBMGA003715-TA  0.0  91.74%
Drosophila         nAcRbeta-64B-PA   0.0  89.02%
EBI UniRef50       UniRef50_G6CM84   0.0  99.31%  Nicotinic acetylcholine receptor subunit beta 1 isoform 1 n=6 Tax=Endopterygota RepID=G6CM84_DANPL
NCBI RefSeq        NP_001166819.1    0.0  93.23%  nicotinic acetylcholine receptor subunit beta 1 isoform 1 [Bombyx mori]
NCBI nr blastp     gi|290560948      0.0  93.23%  nicotinic acetylcholine receptor subunit beta 1 isoform 1 precursor [Bombyx mori]
NCBI nr blastx     gi|290560948      0.0  94.74%  nicotinic acetylcholine receptor subunit beta 1 isoform 1 precursor [Bombyx mori]
Group
Gene Ontology      GO:0016021  6.2e-305  integral to membrane
                   GO:0006811  6.2e-305  ion transport
                   GO:0016020  1.9e-75   membrane
                   GO:0006810  1.1e-71   transport
                   GO:0005230  1.1e-71   extracellular ligand-gated ion channel activity
                   GO:0004889  5.7e-22   nicotinic acetylcholine-activated cation-selective channel activity
                   GO:0005216  5.7e-22   ion channel activity
                   GO:0045211  5.7e-22   postsynaptic membrane
KEGG pathway
InterPro domain    [13-497]    IPR006201  6.2e-305  Neurotransmitter-gated ion-channel
                   [821-1097]  IPR006029  1.9e-75   Neurotransmitter-gated ion-channel transmembrane domain
                   [616-820]   IPR006202  1.1e-71   Neurotransmitter-gated ion-channel ligand-binding
                   [649-665]   IPR002394  5.7e-22   Nicotinic acetylcholine receptor
Orthology group    MCL16326    Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203809-TA
ATGGCGGGCCGCCGGGCGCGCCGCTCGCTGCTGGCGGCGCCCGCGGGCCTGCTGCTACTGCTCGCTCTGCTCTGGCCGGGGGAGGTAAGCTGCGGCTATCACGAGAAGCGGCTGCTGCACCATCTGCTGGACCATTACAACGTGCTGGAGCGACCCGTAGTCAACGAGAGCGATCCACTCCAGCTCTCCTTCGGCCTCACGCTTATGCAGATCATTGACGTGGAATGGAATGACATGAATTTGAGATGGAACACTTCGGACTTTGGCGGAGTCAAAGATTTACGCGTTCCACCACATAGATTATGGAAACCAGACGTGCTTATGTACAACAGTGCAGACGAAGGTTTTGATAGCACCTACCCTACAAATGTAGTAGTGCGAAACAACGGCTCCTGTCTCTACGTACCGCCTGGAATCTTTAAAAGCACTTGCAAAATAGACATAACCTGGTTTCCTTTTGACGACCAAAGATGCGAAATGAAGTTTGGGAGTTGGACATACGATGGTTATCAACTGGATCTTCAATTGCAAGATGAAGCCGGCGGAGATATAAGTAGTTTTGTTACAAACGGAGAATGGGAACTTATAGGAGTACCAGGAAAACGTAATGAGATCTACTACAACTGTTGCCCCGAGCCATATATTGACATAACCTTCGCAGTAGTTATTCGGCGGAAGACTTTATACTATTTTTTCAATCTTATCGTGCCGTGTGTTCTTATTGCTTCCATGGCATTGTTAGGGTTCACGTTGCCACCGGATTCTGGTGAAAAACTCTCGTTAGGTGTTACAATATTGCTGTCTCTGACAGTATTTTTGAACATGGTAGCAGAGACGATGCCAGCAACGTCAGATGCAGTACCTCTACTAGGCACATATTTTAACTGCATTATGTTTATGGTGGCATCTTCTGTCGTATCAACAATACTTATCCTTAATTACCATCATAGACATGCGAATACACACGAAATGAGTGATTGGATTCGGTGCGTGTTTTTGTATTGGCTACCTTGGATTCTTCGAATGTCGCGACCGGGTTCAGCGACTACTCCACCGCCAGCTCGCGCGCCTCCTCCCCCAGACTTGGAATTGCGCGAGCGATCTTCAAAGTCGCTATTAGCCAATGTGCTCGACATCGATGATGACTTCCGGCATACCCACGCGCAACAACCACCCTGCTGCCGATACTACAGGTCCCTCGACGATCTACACGAACACTACTCTCCAAGTGGTGAAGAAAATGGCGCGGGTTTAGCAGCGCATAGCTGTTTTGGTGTCGACTACGAACTCTCTTTGATACTAAAAGAGCTTCGAGTCATCACAGATAAGATGCGCACAGACGACGAAGAGGCGGACATTTCGCGCGACTGGAAATTCGCTGCCATGGTCGTGGACAGACTGTGCCTTATTATCTTTACCCTGTTCACAATCATTGCCACGCTAGCCGTGCTACTGTCCGCACCACACATCATGGTGTCCGGTGCGGGTGTCGAAGCCGCGACGTTGGAGTTGCCGCCGCCTGCCTCGTCACCCACGCCCCCCTATAGAAATAAGTTACCGCTGACTGCCAACCTAATGCATTCCACCAAAGAACTGCCTATCCAGTTTCTTGTCTTCTTCTTCCTGGCCTCTCATTCAGCGGCTCAATCTATGCTCTTCTATATACATGACTGTGTAGGTTCTAGCTGGAAGGGTGTAGGCTGTATAGGTTTGGGCGCGGCACGGGAGGAATCGCGTTTGCCGCGTGCGGCAGTGACCGCCGCCATGACGGCTAGGTCGCGCGCTCCCTTACTCGTCGTCTTACTTTTAGCGGCACTATATTCAGGATGGTGTTCCGAAGACGAAGAACGACTTGTCCGAGATCTTTTCCGAGGTTACAACAAGCTCATACGACCCGTTCAGAATATGACACAGAAAGTGGACGTTAGATTCGGCCTCGCCTTTGTACAGCTCATTAACGTTAATGAAAAAAATCAAATAATGAAATCGAACGTATGGCT
AAGGCTGGTTTGGATGGATTACCAGCTGATGTGGGATGAGGCTGACTACGGCGGCATCGGTGTGCTGCGGCTACCACCTGATAAGGTCTGGAAACCAGATATAGTGTTATTCAACAACGCCGACGGTAATTACGAAGTGAGATACAAATCTAACGTGCTCATTTATCCAAACGGCGAGGTTCTTTGGGTTCCACCGGCAATTTACCAAAGCTCGTGCACCATTGACGTCACATACTTTCCATTCGATCAGCAGACTTGCATAATGAAGTTCGGATCGTGGACATTCAACGGCGATCAAGTGTCACTTGCACTCTACAACAATAAAAACTTTGTAGACCTCTCGGATTACTGGAAGTCGGGCACTTGGGACATTATAGAAGTACCAGCTTATTTAAATATCTATGAGGGCAATCATCCGACTGAAACAGACATCACTTTCTACATAATTATCCGAAGAAAGACATTATTTTATACCGTTAACTTAATCTTGCCGACAGTTTTAATATCATTTCTCTGTGTACTTGTGTTCTACTTGCCCGCCGAGGCTGGTGAAAAAGTGACATTGGGCATCAGTATTTTACTGTCACTGGTCGTGTTCCTGCTGCTTGTTTCAAAAATTCTTCCTCCCACGTCCCTGGTTCTGCCGTTAATAGCAAAATATTTACTGTTTACATTTATTATGAACACCGTCAGTATCCTCGTTACAGTAATCATAATCAACTGGAACTTTAGGGGACCTCGGACTCATAGAATGCCTCTCTGGATAAGGAGCGTCTTTTTACACTACTTACCAGCAGCTCTGCTTATGCGACGACCGCGCAAAACAAGGCTGCGTTGGATGATGGAAATGCCGGGAATGGGCGCTCCGCCACATCCAACAGCCACTCATGACTTGCCCAAACATATAAGCTCAATTGGTTCTAAACAAGGTAAAATGGAGGCTATGGAACTTTCGGACTTACATCATCCCAACTGTAAAATAAACCGCGCGTCGGGTGGAGGCGAGGTGGGCGCGCTGGGAGGACTCGGAGCGCTCGGCCTCGACAACCGAAGGGAATCCGAGAGTTCCGACTCACTACTCTTATCTCCCGAAGCGGCCAAGGCCACGGAAGCTGTGGAATTCATCGCAGAACATTTACGGAATGAAGATCTCTATATTCAAACTAGAGAAGATTGGAAATATGTGGCTATGGTCATAGATCGATTACAACTATACATATTCTTCATAGTGACCACAGCAGGGACCGTCGGTATATTAATGGACGCGCCTCACATCTTTGAATATGTCGATCAAGACCGCATCATTGAAATATATAGAGGAAAATAA

Protein sequence:

>DPOGS203809-PA
MAGRRARRSLLAAPAGLLLLLALLWPGEVSCGYHEKRLLHHLLDHYNVLERPVVNESDPLQLSFGLTLMQIIDVEWNDMNLRWNTSDFGGVKDLRVPPHRLWKPDVLMYNSADEGFDSTYPTNVVVRNNGSCLYVPPGIFKSTCKIDITWFPFDDQRCEMKFGSWTYDGYQLDLQLQDEAGGDISSFVTNGEWELIGVPGKRNEIYYNCCPEPYIDITFAVVIRRKTLYYFFNLIVPCVLIASMALLGFTLPPDSGEKLSLGVTILLSLTVFLNMVAETMPATSDAVPLLGTYFNCIMFMVASSVVSTILILNYHHRHANTHEMSDWIRCVFLYWLPWILRMSRPGSATTPPPARAPPPPDLELRERSSKSLLANVLDIDDDFRHTHAQQPPCCRYYRSLDDLHEHYSPSGEENGAGLAAHSCFGVDYELSLILKELRVITDKMRTDDEEADISRDWKFAAMVVDRLCLIIFTLFTIIATLAVLLSAPHIMVSGAGVEAATLELPPPASSPTPPYRNKLPLTANLMHSTKELPIQFLVFFFLASHSAAQSMLFYIHDCVGSSWKGVGCIGLGAAREESRLPRAAVTAAMTARSRAPLLVVLLLAALYSGWCSEDEERLVRDLFRGYNKLIRPVQNMTQKVDVRFGLAFVQLINVNEKNQIMKSNVWLRLVWMDYQLMWDEADYGGIGVLRLPPDKVWKPDIVLFNNADGNYEVRYKSNVLIYPNGEVLWVPPAIYQSSCTIDVTYFPFDQQTCIMKFGSWTFNGDQVSLALYNNKNFVDLSDYWKSGTWDIIEVPAYLNIYEGNHPTETDITFYIIIRRKTLFYTVNLILPTVLISFLCVLVFYLPAEAGEKVTLGISILLSLVVFLLLVSKILPPTSLVLPLIAKYLLFTFIMNTVSILVTVIIINWNFRGPRTHRMPLWIRSVFLHYLPAALLMRRPRKTRLRWMMEMPGMGAPPHPTATHDLPKHISSIGSKQGKMEAMELSDLHHPNCKINRASGGGEVGALGGLGALGLDNRRESESSDSLLLSPEAAKATEAVEFIAEHLRNEDLYIQTREDWKYVAMVIDRLQLYIFFIVTTAGTVGILMDAPHIFEYVDQDRIIEIYRGK-
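
The transcript and protein records above are consistent with a standard-code translation: 3327 bp corresponds to 1108 codons plus one stop codon (3 x 1109 = 3327), and the trailing "-" in the protein record appears to mark that stop. The following is a minimal Python sketch of such a check, not the method used to build this geneset. The function name `translate` and the `stop_symbol` parameter are illustrative assumptions, and the standard genetic code is assumed.

```python
# Minimal sketch: translate a coding sequence with the standard genetic code.
# Assumption: the record uses "-" for the stop codon (inferred from the
# trailing "-" in the protein sequence above).
from itertools import product

BASES = "TCAG"
# Amino acids in standard-code order for codons enumerated over T, C, A, G.
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon; stops become stop_symbol."""
    cds = "".join(cds.split()).upper()  # drop line breaks inside the FASTA body
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 and last 12 nucleotides of DPOGS203809-TA, copied from the record.
head = translate("ATGGCGGGCCGCCGGGCGCGCCGCTCGCTG")
tail = translate("AGAGGAAAATAA")
print(head, tail)
```

Passing the full DPOGS203809-TA sequence to `translate` and comparing the result with DPOGS203809-PA would extend the same check to the whole record.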