Monarch geneset OGS2.0

DPOGS213730
TranscriptDPOGS213730-TA3063 bp
ProteinDPOGS213730-PA1020 aa
Genomic positionDPSCF300278 - 27220-39582
RNAseq coverage27x (Rank: top 77%)
Annotation
HeliconiusHMEL0137100.094.50% 
BombyxBGIBMGA011528-TA0.092.50% 
DrosophilaCG3822-PA0.065.63% 
EBI UniRef50UniRef50_Q9VDH50.065.63%CG3822 n=54 Tax=cellular organisms RepID=Q9VDH5_DROME
NCBI RefSeqXP_974911.20.078.24%PREDICTED: similar to CG3822 CG3822-PA [Tribolium castaneum]
NCBI nr blastpgi|1892347740.078.24%PREDICTED: similar to CG3822 CG3822-PA [Tribolium castaneum]
NCBI nr blastxgi|1892347740.077.77%PREDICTED: similar to CG3822 CG3822-PA [Tribolium castaneum]
Group
Gene OntologyGO:00160205.5e-95membrane
GO:00052345.5e-95extracellular-glutamate-gated ion channel activity
GO:00049705.5e-95ionotropic glutamate receptor activity
KEGG pathwaymmu:148060.0 
 K05202 (GRIK2)maps-> Neuroactive ligand-receptor interaction
InterPro domain[469-912] IPR0013205.5e-95Ionotropic glutamate receptor
[95-431] IPR0018285.6e-63Extracellular ligand-binding receptor
[479-544] IPR0195943.2e-33Glutamate receptor, L-glutamate/glycine-binding
Orthology groupMCL10026 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213730-TA
ATGGTTGACACTGGGTTATGCAAGAAACCTCGCATCCGGTTTCCTAGTCTGATGAAGGTGGTGTCTGCTAGTGAGACCGCTGACTTGATCGTTGACGCTGTAAGGAGAGACATCTTGGAGATCACCGTCCCACAGGAACTGCACTTCATGAACAGAGTGAGTATCTCTATAATGACAGGAACGAAAGCTTTACTATTAGTGCTCTTGATTGGACATTTGTCCGCCCTACCGGACACAATTCGTATCGGAGGTCTCTTCCACCCTGAGGATGATAAACAAGAGGTTGCATTTCGTTACGCTGTTGAAAGGGTAAACGCTGACCGCGCTGTCCTTCCCAGAGCCAAATTACTAGCTCAAGTTGAAACAATATCTCCGCAAGACAGTTTCCACGCGTCCAAAAGAGTATGCCACCTTCTTCGAAGTGGAGTGGCGGCTATATTTGGCCCCCAATCTGCTCCAGCGGCTGCCCACATCCAATCCATCTGTGATACAATGGAACTGCCCCATCTGGAGACGAGATGGGATTATCGAACGCGACGTGAATCCTGTCTTGTCAACTTGTATCCTCACCCAGCTGCTTTGAGTCGGGCGTACGTTGATCTCGTACGAGCTTGGGGATGGAAGTCATTCACTATAGTATACGAGAACAGTGATGGATTGGTTCGTTTACAAGAACTATTGAAAGCCCATGGTCCATCAGAATTACCTGTTGCTGTCCGACAACTACCAGATTCATATGACTACAGGCCGCTATTGAAACAAATAAAGAACTCGGCCGAGTCTCATATAGTACTTGACTGCGCTACTGAGAGGATTAGGGATGTGCTTCAACAAGCACAGCAGATCGGAATGATGTCGGATTATCACAGCTACCTTATAACGTCGTTGGACTTACATAGCGTAGATTTAGAGGAATTTAAATATGGCGGTACAAATATAACCGCGTTGCGCCTTCTCGATCCCGAACGAGCCGAGGTACAAAGAGTCGTTCGAGATTGGGTTTACGACGAAGCCAGAAAGGGAAGGAAGCTGCAACTAGGGCACACATCGGCTAAGACTGAAACCGCTTTAATATACGATGCGGTTCATTTATTTGCGAAAGCGTTACACGACCTTGACACTTCGCAACAAATCGATGTAAGACCCTTGTCATGCGAAGCCGAAGACACATGGCCCCACGGGTACAGCCTCATTAACTACATGAAAATCGTCGAAATGAGGGGCTTAACAGGAGTTATAAAGTTTGACCACCAGGGGTTCAGAAGTGATTTTACTCTCGATATCATTGAACTAACTAGAGATGGACTTCAGAAAGCCGGTGTTTGGAACTCTTCGGAGGGTGTCAATTACACGAGATCTTACGGAGATAACCAAAAACAAATAGTCGAGATACTTCAAAACAAAACCCTTGTCGTCACAACGATCTTGAGCGCTCCATATTGCATGCGGAGAGAAGCGAGCGAAAAATTGACAGGCAACGCTCAGTTCGAAGGCTACGCTGTTGATCTCATTCATGAGATATCTAAAATTCTGGGTTTCAATTACACATTCAAGCTTGCGCCCGACGGTCGATACGGGTCTTACAACAGGGAGACTAAAGAGTGGGATGGCATGATCAGGGAACTGCTCGAACAGAGAGCTGATGTTGCTATAGCTGATCTCACAATAACGTATGACAGGGAACAAGTGGTAGACTTCACGATGCCCTTCATGAATCTTGGCATCTCAGTGCTCTACCGCAAACCTATTAAGCAGCCTCCAAACTTATTCTCATTCCTGTCACCCCTCTCCCTTGATGTATGGATATATATGGCCACGGCGTACCTGGGCGTCTCTGTACTGCTATTCATTTTAGCCAGGTTCACTCCATACGAATGGCATCAAACGCATACGCCGGACGGAGAAAAAATGGAAAATATTTTCTCCCTCTCCAACTGCTTGTGGTTTGCAATTGGATCTCTTATGCAGCAAAGTTGTGACTTTTTACCCAAGATTATAGTGACCTCAGCGAAGTCCACAGAAAACTTTATCGGGGATATGGGAACATCTAGAATATTTGTATTTTCGACAATTCGGTTCAGCCCGTACGAGTGGGACAGCCCCCGGAACTGTCTAGACGAGCCGCAGGTGTTGGAGAATCAGTTCACACTGTTGAACTCGCTGTGGTTCACAATCGGATCCTTGATGCAGCAAGGTTCGGATATCGCACCGAAAGCGGTGTCAACAAGGATGGTGGCAGGAATGTGGTGGTTTTTCACTTTGATCATGATATCTTCATATACTGCTAACTTGGCCGCATTCCTGACAGTGGAACGTATGGACTCACCCATTGAAAGCGCCGAAGATTTGGCCAAGCAAACAAAAATTAAATATGGTGCCCTTAAAGGAGGATCTACAGCAGCTTTCTTTAGGGATTCAAATTTTTCGACATACCAACGGATGTGGTCGTTCATGGAGTCGGCTCGACCTTCGGTATTCACAAGCAGCAATAAAGAGGGGGAAGAGAGGGTTATGAGGGGGAAAGGTGCTTATGCATATCTCATGGAGTCCACCACCATAGAGTATGTTGTGGAAAGAAACTGTGACCTCACTCAAGTAGGGGGCATGTTGGATTCCAAAGGATATGGCATTGCTATGCCACCCAATTCACCTTACCGTACCGCTATAAGCGGTGCTGTTTTGAAGTTACAAGAGGAGGGTAAACTTCACATATTAAAAACAAAATGGTGGAAAGAGAAACGCGGCGGAGGATCGTGTAGAGATGAAACATCAAAGTCCTCATCCACCGCCAATGAGTTGGGTTTGGCGAACGTGGGCGGCGTGTTTGTTGTTTTGATGGGCGGCATGGGCGTCGCCTGTGTAATCGCTGTCTGCGAATTTGTATGGAAATCAAGGAAAGTCGCTGTTGATGAACGGGCGTCTCTTTGTTCGGATATGGCCTCTGAGCTGCGTTCCGCTTTGAAGTGTCCGAGTGGAGCCGGCGGGGGCTCTGGGGGGGCGAGAGAGGGAGCGGATTCCCCCTACTTGCATTACGGTTTTAGTACTAAGAGCCAGCTACACTAA

Protein sequence:

>DPOGS213730-PA
MVDTGLCKKPRIRFPSLMKVVSASETADLIVDAVRRDILEITVPQELHFMNRVSISIMTGTKALLLVLLIGHLSALPDTIRIGGLFHPEDDKQEVAFRYAVERVNADRAVLPRAKLLAQVETISPQDSFHASKRVCHLLRSGVAAIFGPQSAPAAAHIQSICDTMELPHLETRWDYRTRRESCLVNLYPHPAALSRAYVDLVRAWGWKSFTIVYENSDGLVRLQELLKAHGPSELPVAVRQLPDSYDYRPLLKQIKNSAESHIVLDCATERIRDVLQQAQQIGMMSDYHSYLITSLDLHSVDLEEFKYGGTNITALRLLDPERAEVQRVVRDWVYDEARKGRKLQLGHTSAKTETALIYDAVHLFAKALHDLDTSQQIDVRPLSCEAEDTWPHGYSLINYMKIVEMRGLTGVIKFDHQGFRSDFTLDIIELTRDGLQKAGVWNSSEGVNYTRSYGDNQKQIVEILQNKTLVVTTILSAPYCMRREASEKLTGNAQFEGYAVDLIHEISKILGFNYTFKLAPDGRYGSYNRETKEWDGMIRELLEQRADVAIADLTITYDREQVVDFTMPFMNLGISVLYRKPIKQPPNLFSFLSPLSLDVWIYMATAYLGVSVLLFILARFTPYEWHQTHTPDGEKMENIFSLSNCLWFAIGSLMQQSCDFLPKIIVTSAKSTENFIGDMGTSRIFVFSTIRFSPYEWDSPRNCLDEPQVLENQFTLLNSLWFTIGSLMQQGSDIAPKAVSTRMVAGMWWFFTLIMISSYTANLAAFLTVERMDSPIESAEDLAKQTKIKYGALKGGSTAAFFRDSNFSTYQRMWSFMESARPSVFTSSNKEGEERVMRGKGAYAYLMESTTIEYVVERNCDLTQVGGMLDSKGYGIAMPPNSPYRTAISGAVLKLQEEGKLHILKTKWWKEKRGGGSCRDETSKSSSTANELGLANVGGVFVVLMGGMGVACVIAVCEFVWKSRKVAVDERASLCSDMASELRSALKCPSGAGGGSGGAREGADSPYLHYGFSTKSQLH-