Monarch geneset OGS2.0

DPOGS210424
Transcript: DPOGS210424-TA, 1317 bp
Protein: DPOGS210424-PA, 438 aa
Genomic position: DPSCF300062 - 477810-484526
RNAseq coverage: 552x (Rank: top 23%)
Annotation
Heliconius: HMEL011946, 2e-137, 76.76%
Bombyx: BGIBMGA010513-TA, 3e-64, 36.06%
Drosophila: Eaat1-PA, 1e-147, 61.22%
EBI UniRef50: UniRef50_G9BIL3, 0.0, 81.01%, Amino acid transporter-like protein n=1 Tax=Bombyx mori RepID=G9BIL3_BOMMO
NCBI RefSeq: XP_001842803.1, 5e-157, 65.06%, excitatory amino acid transporter 3 [Culex quinquefasciatus]
NCBI nr blastp: gi|2352298, 0.0, 86.53%, high-affinity Na+-dependent glutamate transporter [Trichoplusia ni]
NCBI nr blastx: gi|2352298, 0.0, 86.53%, high-affinity Na+-dependent glutamate transporter [Trichoplusia ni]
Group
Gene Ontology: GO:0016020, 1.7e-237, membrane
Gene Ontology: GO:0006835, 1.7e-237, dicarboxylic acid transport
Gene Ontology: GO:0017153, 1.7e-237, sodium:dicarboxylate symporter activity
KEGG pathway:
InterPro domain: [1-430] IPR001991, 1.7e-237, Sodium:dicarboxylate symporter
Orthology group: MCL17966, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210424-TA
ATGTACTTCCAGTACCCAGGGGAGCTCTTCCTCAGGATGTTGAAGAGTTTGATCGTGCCACTCCTTGTATCATCTATAGTTTCCGCTATCGGATCGTTGGACCTTAGTTTATCTGGCAAGGTTGGTCTTCGCGCTATCATCTACTACATGACGACCACCATATGCGCTGTGATGCTGGGTATAGCTCTCGTAACTACCATCAAGCCTGGCCAACACTCCGAGGAGTACAACACCAACTCCACCAAGAAACTGGTCACGAAGGACACCCTCACTTCGGACACTTTGTTGGATTTGATAAGAAATGTATTTCCAGAGAATCTAGCGCAAGCAACGATCGCGTCTTACCGCACAAGGCTGACTTACGACCAGAACGACACAAAAATCATTAAAGGCCAGTTGGAAACCTATAAGATAGAAGGTGACTATCAAAGTGGCAGCAACGTCTTAGGGTTGGTGTGTTTTAGCGTTGTGCTCGGCATTACGCTCGGAAAAATGGGGGATAAGAGCCAGCCGCTGCAGACTTTTTTCCATTCACTGTCTGAAGCGATGATGATTATCACGGGATGGGTTATTTGGTTATCGCCATTGGGAGTATTTTTCCTTGTTACGGCTAAGATCATGGAGATAGAATCCTTCGCTGAGCTGGTCGGGCGTCTGGGTCTGTACTTTATGACTGTACTTCTGGGCTTGTTTTTACACGGTTTTGGAACTCTTAGCGTTCTGTTTATATTGGCTACCAAGAAATTGCCCTGCCGGTACATCGCTAAAATGGGTCAAGTTATGGCTACCGCTATTCCCATGCTCATGCACATGATATCTGCCATCAGTTCAGCAACGATGCCCATAACGATTGGCTGCTGTGACGACATGGGTCTCGACCCGCGCATCACACGGTTCGTGATCCCCATCGGTGCTACCATCAACATGGACGGTACAGCACTGTACGAAGCCGTCGCTGCCATATTTATAGCACAGATGAGAAAAGTTGATATGTCCTTCGGGAAGATTGTTGCTGTTAGTGTGACAGCCACAGCTGCTAGTATTGGAGCGGCGGGCATACCGCAGGCGGGGCTGGTTACCATGGTGATGGTCTTGGACACCGTGAACCTGCCCGCGGAGGACGTCTCTATTATCCTGGCGGTGGATTGGCTACTTGACAGATTCAGAACAACGATAAACGTGGTTTGCGATGCTCTCGGCGCAATTATTGTGACGTCACTCTCACAGGGAGACATTGAGAAGACACGAGCGTTACAGAACGAGAGAGAAGTAGCTGCCGGCCACGAGCTCGCTGAGTTAGAGAAGGGTGAACATTGA

Protein sequence:

>DPOGS210424-PA
MYFQYPGELFLRMLKSLIVPLLVSSIVSAIGSLDLSLSGKVGLRAIIYYMTTTICAVMLGIALVTTIKPGQHSEEYNTNSTKKLVTKDTLTSDTLLDLIRNVFPENLAQATIASYRTRLTYDQNDTKIIKGQLETYKIEGDYQSGSNVLGLVCFSVVLGITLGKMGDKSQPLQTFFHSLSEAMMIITGWVIWLSPLGVFFLVTAKIMEIESFAELVGRLGLYFMTVLLGLFLHGFGTLSVLFILATKKLPCRYIAKMGQVMATAIPMLMHMISAISSATMPITIGCCDDMGLDPRITRFVIPIGATINMDGTALYEAVAAIFIAQMRKVDMSFGKIVAVSVTATAASIGAAGIPQAGLVTMVMVLDTVNLPAEDVSIILAVDWLLDRFRTTINVVCDALGAIIVTSLSQGDIEKTRALQNEREVAAGHELAELEKGEH-
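
Consistency check (illustrative sketch, not part of the original record):

The transcript and protein records above should agree: a 1317 bp coding sequence gives 439 codons, that is 438 amino acids plus one stop codon. The sketch below shows one way to check this by translating the nucleotide sequence. It assumes the standard genetic code (NCBI translation table 1) and uses "-" for the stop codon, as the protein record does. The names `translate`, `CODON_TABLE`, `cds_start` and `cds_end` are hypothetical helpers introduced only for this example. The two short fragments are copied from the start and end of DPOGS210424-TA.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
# Assumption: this nuclear gene is translated with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` to match the record's
    convention of ending the protein with "-".
    """
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First 27 and last 12 nucleotides of DPOGS210424-TA, copied from the record.
cds_start = "ATGTACTTCCAGTACCCAGGGGAGCTC"
cds_end = "GGTGAACATTGA"

print(translate(cds_start))   # compare with the start of DPOGS210424-PA
print(translate(cds_end))     # compare with the end of DPOGS210424-PA
print(1317 == 3 * (438 + 1))  # length check: 438 aa plus one stop codon
```

To check the whole record, pass the full nucleotide sequence to `translate` and compare the result with the protein sequence character by character.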