Monarch geneset OGS2.0

DPOGS202815
Transcript: DPOGS202815-TA; 1650 bp
Protein: DPOGS202815-PA; 549 aa
Genomic position: DPSCF300018 + 272672-276259
RNAseq coverage: 536x (Rank: top 23%)
Annotation
Heliconius: HMEL003762; 0.0; 85.88%
Bombyx: BGIBMGA010513-TA; 0.0; 77.30%
Drosophila: Eaat2-PC; 1e-163; 62.50%
EBI UniRef50: UniRef50_E1JHQ6; 2e-161; 62.50%; Excitatory amino acid transporter 2, isoform C n=12 Tax=Endopterygota RepID=E1JHQ6_DROME
NCBI RefSeq: NP_001162844.1; 3e-162; 62.50%; excitatory amino acid transporter 2, isoform C [Drosophila melanogaster]
NCBI nr blastp: gi|359751403; 0.0; 77.30%; glutamate transporter [Bombyx mori]
NCBI nr blastx: gi|359751403; 0.0; 78.62%; glutamate transporter [Bombyx mori]
Group
Gene Ontology: GO:0016020; 1.4e-247; membrane
 GO:0006835; 1.4e-247; dicarboxylic acid transport
 GO:0017153; 1.4e-247; sodium:dicarboxylate symporter activity
KEGG pathway: bmy:Bm1_42930; 2e-104
 K05613 (SLC1A2, EAAT2) maps-> Amyotrophic lateral sclerosis (ALS)
InterPro domain: [60-507] IPR001991; 1.4e-247; Sodium:dicarboxylate symporter
Orthology group: MCL15947; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202815-TA
ATGGTTGGTGGTGCAGTAATGGAAACCTCTCTGTATAACAAAAAGCGTCAAGTGGATACAACAGTTCTAGAAATAGAAGTGCATCTACACGTTAAGGTCCAGTTACCTGGACCAAAGGGTTGCACTGCTCCTGTAATGGAGAAAAGTGGCCGTTTTTCAGAGTACCTGTTCGTGAAGATGGGCCCAGGCGATAGGGGCCCGAAGACCACAGAGGATCGATCAGCACCCATTGATCATTCCGGCGTCAAAAAATGGCTTTTAGACAATACCATGCTCGTAGTTACATTGGCGGGTGTTATAACTGGAATTGCTATTGGTTTTGGTCTTCGCCCCTACCACCTAGGTCCAGATGCATTGATGATAATATCATATCCAGGAGAGTTATTTATGAGATTATTAAAGCTAATGATTTTGCCTCTTATCATTGCTAGTCTCATTGCTGGCTCAGCAAGCCTTAACGCTAAGATGAGCGGGAAGCCGGAGTTAAAAGACGACTTTGGTACGTCATTTGAAAATAAAAGAGACCATAGCATTCTGGACAGTCTGCTTGATATTGGCAGAAACATTTTTCCGGACAATATTGTTCAAGCTGCATTCCAACAAGCACATACAGTATATGCGGAACCGAAGACATTATTTGCTAAAAATGCGACTGAAAATGGCACTGAGCCAGTTCTTGTTCGTGATATTAGCTACAGATCTGGTACAAACACTTTGGGCTTAGTATTTTTCTGTCTTGTATTTGGAAGTTTGTTAGGGACACTTGGGCCGAAAGGCAAAGTTGTTATTGACTTCTTCCAAGCCATCTTTGAAGTTATTATGAAAATGGTCACTGGTGTAATGTGGTTCACTCCTGTTGGGGTCAGTAGCGTGATAGCTGGAAAGATTCTTGGTGTAAGCAATGTTGGCCAAGTTATGTCCCAACTAGCGTGGTTTATCGCAACAGTCGCAGTTGGGATATTTCTTTACCAACTTATAGTTATGCAGTTAATCTATTTTATTTTCTTAAGAAGAAACCCGTATAAGTTCTATTGGGGACTATCTCAAGCCATGCTTACCGCGTCAGCTACGGCTTCCACGGCTGCAGCTCTCCCAGTAACCTTCCGTGCTATGGAAGGTCCGTTGAACATTGATTCACGCATCACTCGTTTCGTCCTTCCTATCGGTTGTAATATCAACATGGATGGCACAGCATTATTTCTGGCCGTTGCTAGCGTCTTCGTATGTCAGATGAACAATTTGCACCTTGGATTCGCCCAACTAGCTACTATATTTCTAACATCGACGGCAGCGTCAGTGTCTTCGGCTTCAGTGCCGTCCGCAGCGATGGTGCTGTTATTAGTCGTTTTAGCGGCAGTTGACGCACCGACACACGATGTATCTCTGCTATTCGCTGTGGATTGGCTTGTTGACCGCATCCGAACGACAAACAACATGCTTGGAGATTGCTACGCTGCGGCTGTTGTAGAGAAACTTTCCAAAAACGAACTCATGGCCTGTGATGCTGCTTCTATGGACCAATCGCTTCCCAATGGTTTGCCGACATCAAACACTGAGCTAGAAATAGGTATAGTAACGCCAGGGGACAAGTCTATAGCATCGGACGATGTTATTATTGATATGCATTTACATAATACTAATAGATTATAG

Protein sequence:

>DPOGS202815-PA
MVGGAVMETSLYNKKRQVDTTVLEIEVHLHVKVQLPGPKGCTAPVMEKSGRFSEYLFVKMGPGDRGPKTTEDRSAPIDHSGVKKWLLDNTMLVVTLAGVITGIAIGFGLRPYHLGPDALMIISYPGELFMRLLKLMILPLIIASLIAGSASLNAKMSGKPELKDDFGTSFENKRDHSILDSLLDIGRNIFPDNIVQAAFQQAHTVYAEPKTLFAKNATENGTEPVLVRDISYRSGTNTLGLVFFCLVFGSLLGTLGPKGKVVIDFFQAIFEVIMKMVTGVMWFTPVGVSSVIAGKILGVSNVGQVMSQLAWFIATVAVGIFLYQLIVMQLIYFIFLRRNPYKFYWGLSQAMLTASATASTAAALPVTFRAMEGPLNIDSRITRFVLPIGCNINMDGTALFLAVASVFVCQMNNLHLGFAQLATIFLTSTAASVSSASVPSAAMVLLLVVLAAVDAPTHDVSLLFAVDWLVDRIRTTNNMLGDCYAAAVVEKLSKNELMACDAASMDQSLPNGLPTSNTELEIGIVTPGDKSIASDDVIIDMHLHNTNRL-
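
The lengths listed in this record can be cross-checked with a little arithmetic. The sketch below is illustrative and not part of the OGS2.0 annotation. It assumes that the transcript consists only of coding sequence including the stop codon, and that the genomic coordinates are 1-based and inclusive. The function names `cds_length_bp` and `span_length_bp` are hypothetical helpers introduced here.

```python
def cds_length_bp(protein_aa: int, include_stop: bool = True) -> int:
    """Coding-sequence length in bp for a protein of the given length.

    Assumes three bases per codon and, optionally, one terminal stop codon.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


def span_length_bp(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record for DPOGS202815.
transcript_bp = 1650
protein_aa = 549
genomic_start, genomic_end = 272672, 276259

expected_bp = cds_length_bp(protein_aa)
genomic_span = span_length_bp(genomic_start, genomic_end)

# True when the transcript length equals 3 x (protein length + stop codon).
print(expected_bp == transcript_bp)
# Genomic span in bp under the coordinate assumption stated above.
print(genomic_span)
```

Under these assumptions the 549 aa protein plus one stop codon accounts for all 1650 bp of the transcript. The genomic span comes to 3588 bp, which is longer than the transcript and would be consistent with the presence of introns.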