Monarch geneset OGS2.0

DPOGS209172
TranscriptDPOGS209172-TA2481 bp
ProteinDPOGS209172-PA826 aa
Genomic positionDPSCF300061 - 39375-46087
RNAseq coverage85x (Rank: top 63%)
Annotation
HeliconiusHMEL0097410.060.71% 
BombyxBGIBMGA011484-TA0.047.31% 
Drosophilaclumsy-PB5e-11836.84% 
EBI UniRef50UniRef50_E2C8U32e-12738.84%Glutamate receptor, ionotropic kainate 2 n=7 Tax=Formicidae RepID=E2C8U3_HARSA
NCBI RefSeqXP_001655465.15e-13739.43%glutamate receptor, ionotropic kainate 1, 2, 3 (glur5, glur6, glur7) [Aedes aegypti]
NCBI nr blastpgi|1571297051e-13539.43%glutamate receptor, ionotropic kainate 1, 2, 3 (glur5, glur6, glur7) [Aedes aegypti]
NCBI nr blastxgi|1571297051e-13439.43%glutamate receptor, ionotropic kainate 1, 2, 3 (glur5, glur6, glur7) [Aedes aegypti]
Group
Gene OntologyGO:00160207.3e-73membrane
GO:00049707.3e-73ionotropic glutamate receptor activity
GO:00052347.3e-73extracellular-glutamate-gated ion channel activity
GO:00068103.7e-21transport
GO:00302883.7e-21outer membrane-bounded periplasmic space
GO:00052153.7e-21transporter activity
GO:00048728.9e-08receptor activity
GO:00068118.9e-08ion transport
GO:00052168.9e-08ion channel activity
KEGG pathwaytgu:1002319292e-104 
 K05201 (GRIK1)maps-> Neuroactive ligand-receptor interaction
InterPro domain[335-705] IPR0013207.3e-73Ionotropic glutamate receptor
[336-703] IPR0016383.7e-21Extracellular solute-binding protein, family 3
[345-409] IPR0195942.7e-08Glutamate receptor, L-glutamate/glycine-binding
[465-490] IPR0015088.9e-08NMDA receptor
Orthology groupMCL20952 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209172-TA
ATGTTACATTTGGTCGCAATTATAGTAAATTCACAGACTGGCTACGCTGAAGTTAGTACACGTCTACCTATCGGAGGGCTATTTAATTCGAAGACATTACCGAATTCCGCTTTGGCCTTCGAAAATATACTCAAGATCGGTACAACTAACGCCTATCATGGGAAATTAATTGCAGCACAAGTCACTGATAGCTATTCTACGGCATTAGAGATGTGCGCGAACACTTCGTCTGACGATGGAATAGTGGCACTAGTTGATGCCAGACCAACGAACGGAATTTGTGATGTCACGTGTTCGATATGCAACAAACTTAACATATCACATTTAATCCTGGGATGGCAGCCGCCAGCGACACTCGAAAATGATATGTACTCATTTTTCTATCACCCGTCACCGGAAATAATATCAAGAGCCTTTGCAACTCTCATAAAGACTCTAAATTGGGACAAGTTCACCATTCTTTACGAAGACGAAGGCACACTTGATGCCTATACAATAAATCTTGAAGATCTCGCCGACTTCAGGGCCAATGTATCAACATTGCACCTAACTATGCCCAACGACATACGATGGATAGACAAAGATATGAGCAATTATAATATTCGGTTGGAGACGGCCTTGACTGCGGATGCGTTGGGTCATCTAGACAAAGCTATAAGAAGTATGCTGATGGAGATCGAGGATCAAGAAGAATATTCTAGAAGACTAATACCGATAGCAGATCCGCCATCATTATGTTTTATGAAGAGTAAGGAGTATGAAGAAGCAGCCTGGCCTCAAGGGGCAGCATTGCGGGACGCTTTATTAAAGACCACTTATAAAGGTTTCACTGGTAATGTAAACTTCGACAAATATGGAAAAAGAACAAATTTCGTCCTCCATTACTCCAAACTAAGTAACGAGAGTCAGTTTATTTACGTCGGCAAATGGGACTATAAGACTGACACACTGTACACTGAAAAAGACATCACGGAAAGATCTTCAGCAAAAAGTTCCAAGTCAGTTATACGGATTGTGTCAAGAAAAGGCAAACCTTATTTCGATTTCTCCAACGAAACCACAACATTCCGAGGATATGCTGTGGATCTGATAGACAAGATATTTGAGCACATGAGAAACAATGGGAAGGATCTGAAATATGAGTTTTATAGAGTTAGCGGTGATGACTATGGCCATCCCATTGCGGGCACGAAAAAATGGAGTGGACTTATAGGAGAGGTCCTGGATCATAATGCCGATCTTGCAATATGTGATCTTGCTATCACATCAGAAAGGAACGCCCTAGTAGATTTCTCGACACCCTTTATGTCTTTGGGGATCGGTTTGTTGACCAAAGAACCGGAGCCTGAAGAACCTGACATGTTCTCTTTCATAAAACCTTTGTCTTTGGATGTGTGGCTGTATTTAGCTACGATATATATTATTGTATCTTTTGTCCTTCTAATATGTGCTCGTATGAGTCAAGATGATTGGGTGAATCCCCACCCCTGCAATCAGAATCCGGAGAACTTACAAAACATATGGAGCTTGTACAATTGCATGTGGCTCACCATGGGTTCCATCATGACCCAGGGCTGCGATATATTACCCAGGGCGGTGGGATCTCGCTGGGTCGCTGGTATGTGGTGGTTCTTCGCTCTTATCGTCACAGCATCTTATACAGCCAACATGTCCACCTTTCTCAGTGCAAGTCGTCGCAGTAATGATTTACAGGAAGTTTCGGACCTCGTTGATCAAAATTCCATTTCCTATGGTACTGTCGACAATGCATCTACTTACAGATTTTTTGAAACATCTAACGATACTTTATACAAGAAGCTGTGGAATGTGATGAAGTCAGCAAGACCTACCGTCTTCACCACCTCCAACGAGGAAGGTCGCGATCGTGTCTTACGTAGTGAAGGAAAATACGCGTTTTTCATGGAGTCCACGTCTATTGAATATTACATGCAGAGATTCTGTAGTCTTAAAATGACTGGAGGGAAACTAGATTCGAAGGATTATGGGATAGCAATGCCGAAAAATTCACCATACAAAAGAGGAATTGACAATGCGATACTAGCGCTCCAAGAATCAGGAGAATTGTTGAAGTTAAAAACGAAATGGTGGGAAAAAGAGGACAATGCTCTGGATTGTAAGAAAACTGAAACCGAGGAAAACAGCGGCTCTGTGCAAATGAAAAATACAAGCGGGATCTTTATTGTCCTTGCTTCTGGGGGCCTTATAGGATTCTTAGTAGCAATTATAGACTTCCTGTTGCATGCTAAGAAGATTTGTGTCACTGAAAAGGTGTCGTTTAAAGAAGCGGTTGTGAGCGAGTGGAATGCATCCTTGGATCCTCGTGCCTTACACCGCCTGGCTGCGCCGCCACGTTCTGCAGCCCCTTCTACAGCATCACCGTCTCGGGAGCGTTCACAGTCTCGGGCAGTCTCAGTACTCCGAGCAGCCACAAGTTTCATCAACTTCGATGAAATATATTGA

Protein sequence:

>DPOGS209172-PA
MLHLVAIIVNSQTGYAEVSTRLPIGGLFNSKTLPNSALAFENILKIGTTNAYHGKLIAAQVTDSYSTALEMCANTSSDDGIVALVDARPTNGICDVTCSICNKLNISHLILGWQPPATLENDMYSFFYHPSPEIISRAFATLIKTLNWDKFTILYEDEGTLDAYTINLEDLADFRANVSTLHLTMPNDIRWIDKDMSNYNIRLETALTADALGHLDKAIRSMLMEIEDQEEYSRRLIPIADPPSLCFMKSKEYEEAAWPQGAALRDALLKTTYKGFTGNVNFDKYGKRTNFVLHYSKLSNESQFIYVGKWDYKTDTLYTEKDITERSSAKSSKSVIRIVSRKGKPYFDFSNETTTFRGYAVDLIDKIFEHMRNNGKDLKYEFYRVSGDDYGHPIAGTKKWSGLIGEVLDHNADLAICDLAITSERNALVDFSTPFMSLGIGLLTKEPEPEEPDMFSFIKPLSLDVWLYLATIYIIVSFVLLICARMSQDDWVNPHPCNQNPENLQNIWSLYNCMWLTMGSIMTQGCDILPRAVGSRWVAGMWWFFALIVTASYTANMSTFLSASRRSNDLQEVSDLVDQNSISYGTVDNASTYRFFETSNDTLYKKLWNVMKSARPTVFTTSNEEGRDRVLRSEGKYAFFMESTSIEYYMQRFCSLKMTGGKLDSKDYGIAMPKNSPYKRGIDNAILALQESGELLKLKTKWWEKEDNALDCKKTETEENSGSVQMKNTSGIFIVLASGGLIGFLVAIIDFLLHAKKICVTEKVSFKEAVVSEWNASLDPRALHRLAAPPRSAAPSTASPSRERSQSRAVSVLRAATSFINFDEIY-