Monarch geneset OGS2.0

DPOGS213629
TranscriptDPOGS213629-TA3255 bp
ProteinDPOGS213629-PA1084 aa
Genomic positionDPSCF300033 + 1094864-1100805
RNAseq coverage64x (Rank: top 68%)
Annotation
HeliconiusHMEL0066500.094.08% 
BombyxBGIBMGA011691-TA0.093.67% 
DrosophilaNmdar2-PC0.066.25% 
EBI UniRef50UniRef50_Q8MM140.068.26%Glutamate NMDA receptor subunit variant NR2-a n=42 Tax=Arthropoda RepID=Q8MM14_DROME
NCBI RefSeqXP_971730.20.071.40%PREDICTED: similar to glutamate receptor, ionotropic, n-methyl d-aspartate epsilon (nmda epsilon) [Tribolium castaneum]
NCBI nr blastpgi|3407169660.061.75%PREDICTED: glutamate [NMDA] receptor subunit epsilon-2-like [Bombus terrestris]
NCBI nr blastxgi|1892356870.071.40%PREDICTED: similar to glutamate receptor, ionotropic, n-methyl d-aspartate epsilon (nmda epsilon) [Tribolium castaneum]
Group
Gene OntologyGO:00160201.1e-63membrane
GO:00049701.1e-63ionotropic glutamate receptor activity
GO:00052341.1e-63extracellular-glutamate-gated ion channel activity
GO:00068102.8e-26transport
GO:00302882.8e-26outer membrane-bounded periplasmic space
GO:00052152.8e-26transporter activity
GO:00048724.5e-13receptor activity
GO:00068114.5e-13ion transport
GO:00052164.5e-13ion channel activity
KEGG pathway 
InterPro domain[504-866] IPR0013201.1e-63Ionotropic glutamate receptor
[526-864] IPR0016382.8e-26Extracellular solute-binding protein, family 3
[154-402] IPR0018282.1e-16Extracellular ligand-binding receptor
[515-571] IPR0195941.1e-15Glutamate receptor, L-glutamate/glycine-binding
[542-570] IPR0015084.5e-13NMDA receptor
Orthology groupMCL10194 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213629-TA
ATGCATGCGCGTACCGCGCTAATGGCGCTGGCTGCGCTGGGGGCGCTGGCGTGGAGCGAGGCGGAGCGTGGCGCTGGCATCAAAGTCGGCGGCGGAGTGACGGGTGCTGGTCGCGAGGCAGCCCGGGGGGGCGGAGTACGGATCGGTGGCGACGGGCTGCGGCGACCTCGGACCTCTCCCGCCCCACAGGCTCCGCGCGCGCCGTCTGTTATCACCGCTGCCCTCGTTGTGCCTCACAAGGCCTTCGGTGCACGTGATTATACCAGAGCTGAGAAGGCTGCTCTGTCCAAACTGCCGCGCAAACTTAAACTTTTTTCACAAGTGCGCCTAAACGTTACGCTTTCGATGCAAGGCCTTACGCCTAGTCCCATGTCTATCTTGGACTCGCTATGTAAGGAATTTTTAGCAGTTAACGTATCTGCTATACTTTATCTTATGAATCATGAACAATACGGGCGATCTACTGCCTCTGCACAATATTTTCTACAACTCGCCGGATATCTTGGCATACCGGTCATTGCATGGAATGCTGACAATAGCGGTCTTGAAAAGCATGCGTCTCACGCGTCACTGCGGTTGCAATTAGCGCCCACTATCGAACACCAGACGGCCGCTATGCTCTCTATACTAGAGAGATACAAATGGCATCAGTTTAGTGTCGTCACATCAGCTATTGCTGGTCACGACGACTTCATACAAGCCGTGCGGGAGAGAGTCTCGGCTCTTCAAGACCGCTTTAAATTTACGATCCTCAATGCTGTGGTTGTGAAAAAACCAGCAGATTTAAACGAGCTGGTAACGAGTGAAGCGCGCGTGATGTTGTTGTACGCGACACGGGAGGAAGCGGCTGACATACTATCTTCAGCTGGTGATCTTCACCTCACTGGAGAAAATTTCGTTTGGATTGTGACGCAAAGTGTGCTGGGCTCTATGCAACAGCCAAACAAATTTCCTGTTGGCATGCTAGGTGTACACTTCGACACATCAAGCTCGTCAATTATATCTGAGATTGCGACCGCTGTCAAAGTTTTCGCTTACGGTGTGGAGTCCTACGTGTCTGAACCGGAAAACATTAGATATCCTCTGGGAACTAGGTTGTCTTGTAGCGGAGTGGGCACGGGTGAAGCGCGCTGGTCTACTGGTGAAAGATTTTATCGGCATCTACGTAATGTCAGCGTGGAGGGAGAGTCGGGAAGACCGAGTATAGAATTTACTCCAGATGGCGAACTTCGGGCTGCTGAGTTAAAAATAATGAACTTAAGACCAACTCTCGGTGAACAGCTTGTTTGGGAAGAAATTGGAACTTGGAATTCATATCCCAAGGAACGGTTGGTAATTAAGGACATTGTTTGGCCTGGTGGGTTACACACTCCACCACAGGGTGTACCAGAAAAGTTCCATATGCGTATAACGTTTCTAGAGGAACCGCCTTACATTAATCTAGCACCACCGGACCCCGTCAGTGGGAGATGCTCTTTAGATCGTGGAGTCATTTGTAGGGTCGCACCAGAGATTGAAGTAGCAGGACTAGAAGCGGGGACGGCGCACAGAAACAGTTCGCTGTATCAGTGTTGTAGTGGATTTTGCATAGATTTACTTCAACAGTTAGCGGAACATCTCGGATTCACTTACGAACTCGTTCGGGTAGAGGACGGCCGCTGGGGTACCTTACACCATGGAAAATGGAACGGCTTGATCGCGGAACTTGTAAACAAAAAAACTGACATGGTTTTAACATCATTGATAATCAATTCAGACCGAGAAGCTGTTGTAGATTTTAGTGTGCCGTTCATGGAAACTGGTGTGGCCATAGTGGTTGCTAAACGAACTGGAATTATTTCACCCACCGCATTCCTTGAACCATTCGATACAGCTTCTTGGATGCTGGTCGGAGCGGTTGCCATTCAAGCCGCCACATTTTCTATATTTTTTTTCGAATGGCTATCGCCTAGCGGGTTTGATTGCTCAACGGGAACTAATTCCAAACGAATTCCACAGAATAGATTTTCCCTGTGTCGGACTTACTGGATCGTGTGGGCGGTGTTGTTTCAGGCATCAGTCCACGTGGACTCGCCGAGAGGATTCACTGCGCGGTTTATGACGAATATGTGGGCGATGTTCGCGGTGGTGTTCCTAGCTATATACACGGCCAACCTGGCCGCGTTCATGATCACCCGGGAAGAATTCCATGAGCTGAGCGGGCTGGACGACCCGCGCATCGCCCGCCCTCTCACTCAACGACCCGCACTCAAGTTCGGGACAGTACCGTGGTCCCATACCGACGCTACGCTCGCGAAATACTTTCCCGAGCCCCACGCTTATATGGCTTCATACAATAGAAGTACTGTGAGTGCCGGTGTGACCAGCGTCCTGACGGGGGATCTCGATGCGTTTATCTATGATGGCACAGTGTTGGATTATCTCGTCTCCCAAGATGAGGACTGCCGATTATTAACCGTAGGCTCGTGGTACGCGATGTCGGGTTATGGGTTGGCATTCACCAGAAATTCTAAATATCTAAGCATGTTTAATAAAAGATTACTCGATCTACGCTCCAATGGAGACCTAGAGCGGTTACGAAGATACTGGATGACGGGGACGTGTAAGCCAAACAAGCAAGAGCACAAATCATCTGACCCGCTGGCGTTGGAGCAGTTCCTCTCCGCGTTCCTGCTGTTGATGGCGGGCATTCTGTTGGCAGCACTACTGCTGCTGCTGGAACACGTATACTTCAGATACATGAGAGAACACCTGGCTGCGTCTAGCGCCAGTGCGTGCTGCGCTCTCGTGTCATTATCAATGGGACAATCATTAACCTTCCACGGGGCAGTAGTTGAAGCGGCAGCACGGGGCTTCGGTCCCGGGAAGCGTGGACACTGCCGTTCCGCCGTATGCGCTGCACAGGTGTGGCGAGCTCGTCACGAGCGTGACGCCGCGGTGGCTCGTGCTCGTCAGCTGGCGGCGGCACTGGCGGCTCACGGGCTGCAGCCGCCTCCTCGGCGCCTGGCCTCGGCCGCCGCGCTGCTCGCCGCTGGACGAGCTCATGACGCGACTCGGCCGCGGACTCTACACGCTCCCGCCGATTTGTTGCCCGACCTCGAGCGACCGCTCTCCTGCGGCGATCTGCGCGCCAGAGAGATGGACGTAGATAGCGAAGTTGTTGTGGGAAGCGCGCGCGGGGCGGGCTGGGCGCCGGGCGCGCCTCCTCGTGTTGTCTACTACAATAAAATATATTTTTTGGACAATACAGTGTAG

Protein sequence:

>DPOGS213629-PA
MHARTALMALAALGALAWSEAERGAGIKVGGGVTGAGREAARGGGVRIGGDGLRRPRTSPAPQAPRAPSVITAALVVPHKAFGARDYTRAEKAALSKLPRKLKLFSQVRLNVTLSMQGLTPSPMSILDSLCKEFLAVNVSAILYLMNHEQYGRSTASAQYFLQLAGYLGIPVIAWNADNSGLEKHASHASLRLQLAPTIEHQTAAMLSILERYKWHQFSVVTSAIAGHDDFIQAVRERVSALQDRFKFTILNAVVVKKPADLNELVTSEARVMLLYATREEAADILSSAGDLHLTGENFVWIVTQSVLGSMQQPNKFPVGMLGVHFDTSSSSIISEIATAVKVFAYGVESYVSEPENIRYPLGTRLSCSGVGTGEARWSTGERFYRHLRNVSVEGESGRPSIEFTPDGELRAAELKIMNLRPTLGEQLVWEEIGTWNSYPKERLVIKDIVWPGGLHTPPQGVPEKFHMRITFLEEPPYINLAPPDPVSGRCSLDRGVICRVAPEIEVAGLEAGTAHRNSSLYQCCSGFCIDLLQQLAEHLGFTYELVRVEDGRWGTLHHGKWNGLIAELVNKKTDMVLTSLIINSDREAVVDFSVPFMETGVAIVVAKRTGIISPTAFLEPFDTASWMLVGAVAIQAATFSIFFFEWLSPSGFDCSTGTNSKRIPQNRFSLCRTYWIVWAVLFQASVHVDSPRGFTARFMTNMWAMFAVVFLAIYTANLAAFMITREEFHELSGLDDPRIARPLTQRPALKFGTVPWSHTDATLAKYFPEPHAYMASYNRSTVSAGVTSVLTGDLDAFIYDGTVLDYLVSQDEDCRLLTVGSWYAMSGYGLAFTRNSKYLSMFNKRLLDLRSNGDLERLRRYWMTGTCKPNKQEHKSSDPLALEQFLSAFLLLMAGILLAALLLLLEHVYFRYMREHLAASSASACCALVSLSMGQSLTFHGAVVEAAARGFGPGKRGHCRSAVCAAQVWRARHERDAAVARARQLAAALAAHGLQPPPRRLASAAALLAAGRAHDATRPRTLHAPADLLPDLERPLSCGDLRAREMDVDSEVVVGSARGAGWAPGAPPRVVYYNKIYFLDNTV-