Monarch geneset OGS2.0

DPOGS208586
Transcript: DPOGS208586-TA; 1554 bp
Protein: DPOGS208586-PA; 517 aa
Genomic position: DPSCF300064 + 1936044-1939170
RNAseq coverage: 101x (Rank: top 61%)
Annotation
Heliconius: HMEL016536; 2e-168; 60.08%
Bombyx: BGIBMGA002556-TA; 0.0; 78.11%
Drosophila: nAcRalpha-80B-PC; 0.0; 67.22%
EBI UniRef50: UniRef50_P91766; 0.0; 90.56%; Acetylcholine receptor subunit alpha-like n=46 Tax=Bilateria RepID=ACH1_MANSE
NCBI RefSeq: NP_001103387.2; 0.0; 91.77%; nicotinic acetylcholine receptor subunit alpha 3 [Bombyx mori]
NCBI nr blastp: gi|157367289; 0.0; 91.94%; nicotinic acetylcholine receptor subunit alpha 3 [Bombyx mori]
NCBI nr blastx: gi|2764515; 0.0; 90.02%; nicotinic acetylcholine receptor alpha1 subunit [Heliothis virescens]
Group
Gene Ontology: GO:0016020; 1e-85; membrane
GO:0006810; 1e-85; transport
GO:0005230; 1e-85; extracellular ligand-gated ion channel activity
GO:0006811; 2.1e-79; ion transport
GO:0004889; 5.3e-25; nicotinic acetylcholine-activated cation-selective channel activity
GO:0005216; 5.3e-25; ion channel activity
GO:0016021; 5.3e-25; integral to membrane
GO:0045211; 5.3e-25; postsynaptic membrane
KEGG pathway 
InterPro domain: [8-499]; IPR006201; 0; Neurotransmitter-gated ion-channel
[30-243]; IPR006202; 1e-85; Neurotransmitter-gated ion-channel ligand-binding
[244-495]; IPR006029; 2.1e-79; Neurotransmitter-gated ion-channel transmembrane domain
[63-79]; IPR002394; 5.3e-25; Nicotinic acetylcholine receptor
Orthology group: MCL10360; Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208586-TA
ATGCGCGCTCGCGCTCTGCCCGCCCGCGCGGCGAGACTGCTGCTGCTGGCGTTACTGCTGGCAGGCTGTGCTGGTAATCCCGACGCCAAGCGCTTGTATGACGACCTTCTCAGTAATTACAACAAGTTAGTTCGGCCCGTTCTTAACGTCAGCGACGCGCTCACCGTACGCATCAAACTCAAACTCAGTCAGCTCATAGACGTCAATCTCAAAAATCAAATTATGACAACTAATTTGTGGGTAGAGCAGAGTTGGTATGATTACAAGTTATCTTGGGAGCCTCGTGAGTACGGTGGAGTAGAGATGCTTCATGTTCCCTCTGATCACATTTGGCGTCCGGATATAGTACTCTACAATAATGCGGATGGGAACTTCGAGGTAACTTTAGCTACAAAAGCAACTCTGAACTACACCGGTCGTGTGGAATGGCGCCCTCCAGCCATCTATAAGTCATCTTGCGAAATTGATGTCGAATATTTTCCATTCGACCAACAGACATGTATTATGAAGTTTGGGTCTTGGACTTACGACGGCTTTCAGGTGGACTTACGGCATATTGACGAAGCACGTGGCACAAACGTGGTAGAACTAGGAGTCGACCTTAGCGAGTTCTATACATCTGTTGAGTGGGACATCTTAGAAGTACCCGCTGTTAGAAATGAAAAGTTCTACACATGTTGTGATGAACCATATTTAGATATAACATTCAACATAACCATGCGCCGCAAAACTCTCTTCTACACCGTAAATCTTATCATACCTTGTATGGGCATTTCATTCCTGACAGTTCTCGTTTTCTATTTGCCTTCCGACAGCGGAGAGAAAGTTTCACTTTCTATTTCCATACTTCTCTCTCTGACCGTGTTCTTTTTACTGTTGGCTGAGATAATTCCTCCAACATCACTTGTCGTACCGCTACTTGGGAAATTCGTTCTCTTCACGATGATACTTGACACATTCAGTATTTGCGTGACAGTGGTAGTGTTGAATGTACACTTCCGGTCACCACAGACACACACCATGGCGCCGTGGGTGCGGCGTGTATTCATCCATGTACTGCCAAGACTCTTAGTGATGCGTCGCCCCCACTACCGCCTTGATCCACATCGCAGTCGATTCGCCGGAGTCGTTGCTAGCGAAGGTTGGTCACCCATAGTCGGTCCAGTGGGAATGGGACCGACGGGCTCTATGGGACAAATGGATCTCTCGCCGACACCGGAAGCTTGCCGTGTTCATGACGCTCCTGCGTTGTGTGACGCTTTGCGACGCTGGCACCGCTGCCCTGAACTACATAAAGCCATTGACGGCATTAATTACATCGCTGAGCAGACACGCAAAGAAGAAGAATCCACCAGGGTGAAAGAGGACTGGAAGTACGTCGCTATGGTGTTGGATCGGCTGTTTCTTTGGATATTTACCCTGGCTGTATTAGTAGGATCTGCCGGGATAATTCTTCAGGCGCCCACGCTATACGACGAGCGCGCACCCATCGATGTGCGTCTCTCAGAGATCGCCTACGCGGCGGCCAAGCCTCGTCCGCCTCCGCCGCGCTAG

Protein sequence:

>DPOGS208586-PA
MRARALPARAARLLLLALLLAGCAGNPDAKRLYDDLLSNYNKLVRPVLNVSDALTVRIKLKLSQLIDVNLKNQIMTTNLWVEQSWYDYKLSWEPREYGGVEMLHVPSDHIWRPDIVLYNNADGNFEVTLATKATLNYTGRVEWRPPAIYKSSCEIDVEYFPFDQQTCIMKFGSWTYDGFQVDLRHIDEARGTNVVELGVDLSEFYTSVEWDILEVPAVRNEKFYTCCDEPYLDITFNITMRRKTLFYTVNLIIPCMGISFLTVLVFYLPSDSGEKVSLSISILLSLTVFFLLLAEIIPPTSLVVPLLGKFVLFTMILDTFSICVTVVVLNVHFRSPQTHTMAPWVRRVFIHVLPRLLVMRRPHYRLDPHRSRFAGVVASEGWSPIVGPVGMGPTGSMGQMDLSPTPEACRVHDAPALCDALRRWHRCPELHKAIDGINYIAEQTRKEEESTRVKEDWKYVAMVLDRLFLWIFTLAVLVGSAGIILQAPTLYDERAPIDVRLSEIAYAAAKPRPPPPR-
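
The transcript and protein lengths in this record are consistent with each other: 1554 bp is 518 codons, that is, 517 amino acids plus one stop codon, which the protein sequence shows as a trailing "-". The following is a minimal sketch of how that relationship could be checked, assuming the standard genetic code applies and that the transcript is a complete coding sequence starting at position 1. The function name `translate` and the `stop_symbol` parameter are illustrative and are not part of any MonarchBase tool. Only the first 30 and last 12 nucleotides of DPOGS208586-TA are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1 (hypothetical helper).

    Stop codons are written as `stop_symbol`, matching the trailing "-"
    used in the protein sequence of this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 and last 12 nucleotides of DPOGS208586-TA, copied from the record.
head = "ATGCGCGCTCGCGCTCTGCCCGCCCGCGCG"
tail = "CCGCCGCGCTAG"

print(translate(head))   # MRARALPARA
print(translate(tail))   # PPR-
print((1554 - 3) // 3)   # 517 residues once the stop codon is excluded
```

Running the same function on the full nucleotide sequence would be expected to reproduce the DPOGS208586-PA sequence, provided the assumptions above hold.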