Monarch geneset OGS2.0

DPOGS208620
Transcript        DPOGS208620-TA  708 bp
Protein           DPOGS208620-PA  235 aa
Genomic position  DPSCF300052 + 654234-657324
RNAseq coverage   37x (Rank: top 73%)
Annotation
Heliconius        HMEL016583        9e-125  96.17%
Bombyx            BGIBMGA005723-TA  3e-122  94.78%
Drosophila        HisCl1-PA         6e-102  82.13%
EBI UniRef50      UniRef50_Q9VGI0   8e-100  82.13%  Histamine-gated chloride channel subunit 1 n=28 Tax=Coelomata RepID=Q9VGI0_DROME
NCBI RefSeq       XP_624072.1       1e-108  80.51%  PREDICTED: similar to Histamine-gated chloride channel subunit 1 CG14723-PA, isoform A [Apis mellifera]
NCBI nr blastp    gi|380027122      1e-107  80.51%  PREDICTED: glycine receptor subunit alpha-4-like [Apis florea]
NCBI nr blastx    gi|328720498      3e-109  81.86%  PREDICTED: glycine receptor subunit alpha-4-like [Acyrthosiphon pisum]
Group
Gene Ontology     GO:0016021  6.6e-133  integral to membrane
                  GO:0006811  6.6e-133  ion transport
                  GO:0016020  1.7e-38   membrane
                  GO:0006810  1.6e-07   transport
                  GO:0005230  1.6e-07   extracellular ligand-gated ion channel activity
KEGG pathway      dre:192124  4e-41
                  K05195 (GLRA3) maps-> Neuroactive ligand-receptor interaction
InterPro domain   [1-231]   IPR006201  6.6e-133  Neurotransmitter-gated ion-channel
                  [66-235]  IPR006029  1.7e-38   Neurotransmitter-gated ion-channel transmembrane domain
                  [69-89]   IPR006028  9.3e-35   Gamma-aminobutyric acid A receptor
                  [1-66]    IPR006202  1.6e-07   Neurotransmitter-gated ion-channel ligand-binding
Orthology group   MCL26686  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208620-TA
ATGCTGTTAGTTTTCTTATCACATACAGTGCACGACTTAGTATTCATCTGGAACCTAACAGATCCTTTGGTTGTAAATCCAGACATCGAGCTCCCTCAACTAGATATAGCTAACAACTTTACCTCCGACTGCACTATCGAGTACTCCACAGGTAACTTCACATGCCTAGCGGTAGTTTTCAACTTACGCCGACGTCTTGGCTACCACCTGTTCCACACATACATACCATCCGCGCTCATTGTAGTGATGTCCTGGATATCTTTCTGGATTAAGCCGGAAGCTATCCCTGCGAGGGTTACCCTCGGAGTAACCTCTCTTTTGACCTTGGCAACCCAAAATACTCAGTCCCAACAAAGTTTACCACCAGTCTCCTATGTCAAAGCTATAGATGTCTGGATGTCTTCATGTTCCGTCTTCGTTTTCCTGTCCCTATTCGAGTTCGCTGTGGTCAATAACTACATGGGCCCTGTAGCTACAAAGGCCATGAAAGGATACTCTGATGAAGATTTGAGTAGAGATTTAGATGCTTACAAGCACATATTTCCAAACTCGGTGGACCCTCGAGCGAGTACGTCTGCCTCGCTCCCACAATACGAAACCTTCTGTAACGGGAGAGAGACAGCATTATACATAGACCGTTTCTCCCGTTTCTTCTTCCCATTCTCATTCTTCATACTAAACGTCGTATATTGGTCCACGTTTCTATAA

Protein sequence:

>DPOGS208620-PA
MLLVFLSHTVHDLVFIWNLTDPLVVNPDIELPQLDIANNFTSDCTIEYSTGNFTCLAVVFNLRRRLGYHLFHTYIPSALIVVMSWISFWIKPEAIPARVTLGVTSLLTLATQNTQSQQSLPPVSYVKAIDVWMSSCSVFVFLSLFEFAVVNNYMGPVATKAMKGYSDEDLSRDLDAYKHIFPNSVDPRASTSASLPQYETFCNGRETALYIDRFSRFFFPFSFFILNVVYWSTFL-
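
The lengths reported in this record are consistent with each other: 708 bp is 236 codons, that is, 235 residues plus one stop codon, which the protein sequence writes as a trailing "-". The following is a minimal Python sketch of how the transcript could be checked against the protein. It assumes the standard genetic code (NCBI translation table 1), which the record itself does not state. The function name `translate` and the "-" stop symbol are illustrative choices, not part of any MonarchBase tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
# Assumption: the record does not say which translation table was used.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence, writing stop codons as stop_symbol."""
    cds = "".join(cds.split()).upper()  # drop whitespace and line breaks
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First five and last four codons of DPOGS208620-TA, copied from the record.
print(translate("ATGCTGTTAGTTTTC"))  # MLLVF
print(translate("ACGTTTCTATAA"))     # TFL-
```

Passing the full DPOGS208620-TA sequence to `translate` and comparing the result with DPOGS208620-PA would confirm the whole open reading frame under the same assumption.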