Monarch geneset OGS2.0

DPOGS209173
TranscriptDPOGS209173-TA2769 bp
ProteinDPOGS209173-PA922 aa
Genomic positionDPSCF300061 + 29283-38619
RNAseq coverage304x (Rank: top 37%)
Annotation
HeliconiusHMEL0155220.077.83% 
BombyxBGIBMGA011528-TA0.045.09% 
Drosophilaclumsy-PB0.045.08% 
EBI UniRef50UniRef50_E0VK110.048.83%Predicted protein n=1 Tax=Pediculus humanus corporis RepID=E0VK11_PEDHC
NCBI RefSeqXP_311343.40.048.49%AGAP000803-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582896720.048.49%AGAP000803-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1571297030.050.34%glutamate receptor, ionotropic kainate 1, 2, 3 (glur5, glur6, glur7) [Aedes aegypti]
Group
Gene OntologyGO:00160202.8e-96membrane
GO:00049702.8e-96ionotropic glutamate receptor activity
GO:00052342.8e-96extracellular-glutamate-gated ion channel activity
GO:00068106.8e-22transport
GO:00302886.8e-22outer membrane-bounded periplasmic space
GO:00052156.8e-22transporter activity
GO:00048722e-10receptor activity
GO:00068112e-10ion transport
GO:00052162e-10ion channel activity
KEGG pathwaymdo:1000208892e-167 
 K05202 (GRIK2)maps-> Neuroactive ligand-receptor interaction
InterPro domain[430-804] IPR0013202.8e-96Ionotropic glutamate receptor
[76-391] IPR0018282.3e-36Extracellular ligand-binding receptor
[456-802] IPR0016386.8e-22Extracellular solute-binding protein, family 3
[440-508] IPR0195942.5e-21Glutamate receptor, L-glutamate/glycine-binding
[473-501] IPR0015082e-10NMDA receptor
Orthology groupMCL10026 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209173-TA
ATGCCCGTAAATAAAGCTGAAGGAAGCTTAACGTGGGGGCTAAAATTGTCACTTCTAATTATTGTGCTATGTGGCGGTGTGCAATGTGCTTTTCGGAATTTTCAAGCATTGAAAACGAATATATATATCGGTGTTATACTACCCAACAACTCTGTGACCGAAGTAGCTTTTGCCTCGGCACTAGCGAGAGCCTCGATGGAAAGCGAACAATATCGTTATTCAATGAAAATAGTCTACGTGCCCTACGGCGACAGCTTCGCAGCTTCAAAAGCCGCTTGTGAGCTTTTATCAGCTGGTGTGATTGCGATCTTCGGACCCACTGACACCACTTCAGCTGCGGCTGTCGAAGCACGCTGCAGGTCTGCAGGTGTTCCGCACATTCAAGCATTATGGCGCCCACCTCACGTCCGAGGTTTGGAACGTCTCTCGCCCCCTAGCATTAATTTACACCCAGAATCCGTAGCGCTCTCAAAGGCAGTCGCCATCTTTATAAAAGACAGCGATTGGAACACTTATACGCTTCTTTATGACGATGATCAAGGACTGATTCGGTTGCAAGAAATATTAAAGAATGCCCAACCAGGACACAAATGGTTAGCACGCCGTCTGCGTCCAGGGGAAGATAATAGACCCTTGCTCAAATTATTAAAAGCCTACGGTGAAACAAGAGTTATAATAGATTGTCCCGCTAACAGAGTGCTCGAATATTTGCGACAGGCGCATGAAGTTAAATTCTTTGAGGACTACATGAGTTACATCCTAATGTCACTCGATGCTCATACGTTAGATCTACAAGAGCTGAGGTATGGTCTATCAAATGTCACATGCTTGAGAATCTTTGACCATTCAGACGGTCGAACTAAGTCCTATCTGGCTGATTGGAAGGCGAGAACTTCAAATGACATCAAAATGCCAAAGAAAACGCACGAGATTACTATCGAAGCAGCCCTTGCTGGTGACGCAGCAAGACTGATAACAGATTCTGTGGAAAATGCACCAAAACAGTTTAAAATAGCAGCACAGTCGATTGAATGTAACACAAAATCTAAATGGGAAGATGGAGAGACATTCACAAATCACCTTTTGACGAATCCAATACAAGGCATAACGGGACGTGTGCAAGTAGATAATATAACTGGGGAGAGAACAAACTTCAACGTAGAAGTTATGGAGCTATCTAACAGCGGATTTAACAGCATAGCAAAATGGAACGCGAAAACGGGCTTCGACTATGCACGGACGGCAACCGAGGTTTCCGATCTATTAGCAGAAAAATGGCAGAACAAAACATTCAAAGTGGTTTCTAGGATCGGTGCTCCTTATCTCGTCGAGAAAACACCAGCTGAAGGCGAAGTGTTGGTCGGGAACGATCGATACGAGGGTTATTCGAAGGATCTCATACACGAAATCTTAAAAGAAACGCTTCACTTAAATTACGTAATAGAAATAGTTCCTGGCAACGAGTACGGAAAGTATAATAAGGACACCAAGAAATGGAACGGCCTTATTGGACATCTCCTCGAAAGGAAAGCTGATTTAGCTATTTGTGATCTGACTATAACGTACGAAAGAAGAGCCTTTGTGGATTTTACGACGCCTTTTATGAGTTTAGGGATCAGTATTCTATATTCAAAGGCAACTCCGCCAGAGCCGGAACTTTTCTCATTTCTAAAGCCATTCTCCGTGGATGTCTGGATTTATATGGCCGCTGCATATTTGATGGTTTCATTATTGCTACATATTTTAGCAAGATTCGCTCCAAACGACTGGGAGAACCCGCATCCCTGTGACAAATCCCCTAAGGAATTGGAAAATATTTGGCATATCAAGAACTCTTGCTGGCTTACCGTCGGATCGATTATGACCCAAGGATCTGATATATTGCCCAAAGGATACTCCACAAGATGGGTGTGTGGCATGTGGTGGTTCTTTGCCCTCATCATGTGTTCCTCCTATACCGCCAATCTCGCGGCTTTCCTCACAAACGCTGCCATGGACGACTCCATTAAAAATGTTGAAGATCTTGCTTTGCAAACTAAAATCAAATACGGAACTGTAGATGGAGGTTCTACTTATTCATTTTTTAAGAGATCCAACGTGTCCACATATCAGAGGATGTGGACTGCAATGGAAGCAGCAAGACCATCAGTCTTTGTAAAAAATAATGATGAAGGTGTAGAAAGGGTTGTTAAATCAAAACGAGGATACGCTTTCCTAATGGAGTCAACCGCTATCGAATATCAACTTGAACGAAACTGCAATTTAATGCAAGTTGGCAACGAACTCGATTCTAAGGGATATGGTATTGCCATGCCTTTTTTGTCGTCTTACAGAACAGCGGTTGATAATGCCCTTCTAAAATTAGCTGAAGGTGGTAAATTGTTGGAACTTAAAAATCGTTGGTGGAAGCCAGCAGAGAAACGGTGTACGTCAGAAGAGGTTGGAGATAAGGGAGGTAGTGCCGTGGAGCTTGGTGTAGACAACGTGGGTGGAGTCTTTGTTGTTCTGGCTGTTGGTTGCGGCCTAGCAGCTTGTATGGGAGGATTCGAATTCCTCTGGCACGTTAGAGATGTTGCTGTTGAACAGAAGATTACTCAATCGGAAGTTTTTTGGGCGGAATTGAAATTCGCTTTGAGCTTTTGGGAAACTGAGAAGCCTGTCAACATTTCTCGATCATCGTCAGCTAAATCAGAAAACATTGCTTCTAGAGCATCGTCGGTGTTACGCTCTGTACTGGATTTAGCCCATCTTGATGTTTTTAATAAATGA

Protein sequence:

>DPOGS209173-PA
MPVNKAEGSLTWGLKLSLLIIVLCGGVQCAFRNFQALKTNIYIGVILPNNSVTEVAFASALARASMESEQYRYSMKIVYVPYGDSFAASKAACELLSAGVIAIFGPTDTTSAAAVEARCRSAGVPHIQALWRPPHVRGLERLSPPSINLHPESVALSKAVAIFIKDSDWNTYTLLYDDDQGLIRLQEILKNAQPGHKWLARRLRPGEDNRPLLKLLKAYGETRVIIDCPANRVLEYLRQAHEVKFFEDYMSYILMSLDAHTLDLQELRYGLSNVTCLRIFDHSDGRTKSYLADWKARTSNDIKMPKKTHEITIEAALAGDAARLITDSVENAPKQFKIAAQSIECNTKSKWEDGETFTNHLLTNPIQGITGRVQVDNITGERTNFNVEVMELSNSGFNSIAKWNAKTGFDYARTATEVSDLLAEKWQNKTFKVVSRIGAPYLVEKTPAEGEVLVGNDRYEGYSKDLIHEILKETLHLNYVIEIVPGNEYGKYNKDTKKWNGLIGHLLERKADLAICDLTITYERRAFVDFTTPFMSLGISILYSKATPPEPELFSFLKPFSVDVWIYMAAAYLMVSLLLHILARFAPNDWENPHPCDKSPKELENIWHIKNSCWLTVGSIMTQGSDILPKGYSTRWVCGMWWFFALIMCSSYTANLAAFLTNAAMDDSIKNVEDLALQTKIKYGTVDGGSTYSFFKRSNVSTYQRMWTAMEAARPSVFVKNNDEGVERVVKSKRGYAFLMESTAIEYQLERNCNLMQVGNELDSKGYGIAMPFLSSYRTAVDNALLKLAEGGKLLELKNRWWKPAEKRCTSEEVGDKGGSAVELGVDNVGGVFVVLAVGCGLAACMGGFEFLWHVRDVAVEQKITQSEVFWAELKFALSFWETEKPVNISRSSSAKSENIASRASSVLRSVLDLAHLDVFNK-