Monarch geneset OGS2.0

DPOGS209803
TranscriptDPOGS209803-TA1893 bp
ProteinDPOGS209803-PA630 aa
Genomic positionDPSCF300117 - 334734-343801
RNAseq coverage39x (Rank: top 73%)
Annotation
HeliconiusHMEL0089950.076.94% 
BombyxBGIBMGA008027-TA5e-15574.50% 
DrosophilaCG5549-PB3e-13944.11% 
EBI UniRef50UniRef50_E2AMG91e-17752.35%Transporter n=4 Tax=Endopterygota RepID=E2AMG9_CAMFO
NCBI RefSeqXP_968187.20.055.98%PREDICTED: similar to IP14091p [Tribolium castaneum]
NCBI nr blastpgi|1892415150.055.98%PREDICTED: similar to IP14091p [Tribolium castaneum]
NCBI nr blastxgi|1892415150.056.37%PREDICTED: similar to IP14091p [Tribolium castaneum]
Group
Gene OntologyGO:00160212.5e-259integral to membrane
GO:00053282.5e-259neurotransmitter:sodium symporter activity
GO:00068362.5e-259neurotransmitter transport
KEGG pathway 
InterPro domain[16-623] IPR0001752.5e-259Sodium:neurotransmitter symporter
Orthology groupMCL18333 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209803-TA
ATGCAGGAAAACGGTAATGCCGTGGCTAGTGGCGATGACAATACTGAAACAGCGCAGCGAGCTTCTTGGGGTCGACCTCTGGAATTCATATTGGCATGTCTGGGATACGCTGTCGGGCTTGGAAATGTGTGGCGATTCCCTTATCTATGTTACAGGAATGGAGGAGGTGCTTTCCTGATACCATTCTTTCTGATGCTTATTCTCGTTGGACTTCCAATATTTTATCTGGAACTGTACATTGGTCAATTCACTGCATTAGGACCTCTTAAAGCTTTTAATGCTATCAGTCCATTCTTCACTGGACTTGGCTACTGTACTTTAATGGTTATTTCGATGATATCAATATATTACATGATCATTGTAGCATGGACAATTTTCTACACTTTCATGTCGATAGGCGGCGATTTAGGATGGGGGTCGTGTGAAAATGATTATAATTCTCTATATTGTTACAGTGGCGCGTATGACAGTGAATGTAGAGTTAATAACACAGGAAACTACACGGATTTAACATACTATTTACAAAAATGTATCAGTATTGGAGAGATTTGTCAGCTTACGGGGTTCGAGCCATATGACGGTTCTAGTTGCTGGACGGGAAATGAAACTATCCCGTGGTATAGAAATGTGAATAGGACTTTGGCCTCAGAAGAATATTACAACGAACGTGTCCTTGGAAGAGGTGATTCCACTTGGGAAAACTGGGGAAGTATACAGTGGCACTTAGTTGGATGTCTATTTTTCTGTTGGGTAGTAGCATTCTTATGTGTCATTAAGGGAGTGCAGTCAGCAGGAAAAGTTGTGTATTTCACGGCGTTGTTCCCGTATGTAATGCTGACAGCTCTTCTTATAAGGGGTGTAACATTGGATGGAGCCGGAGAAGGCATCTTGTTTTATCTAACACCGAAGTGGGAAACACTTTTAGAAGCTCGTGTCTGGGGCGACGCAGCTTCACAGATATTTTATTCATTCGGCGTTGCTTGTGGCTCTCTGGTGACTTTGGCATCTTATAATAACTTTCACAATAACTGTCACTTCGATGCAGTGTTTGTCAGCTTAACAAACTTTATGACTTCGATATACGCTGGGTTTGCAATCTTCTCTGTTTTGGGATTTCAAGCAGAGCTCATGGGTGTTGGAGTCGATGATGTAGCGGAGCAAGGTCCTGGGCTAGCATTTGTTGTCTATCCCGAAGCCCTCCTACAAATGCCTATACCTCGACTATGGTGTATCCTATTTTTCTTCATGATGTTCATATTGGGATTAGGAAGTCAGTTCGCTGGTATCGAGGCTATAAATACAGCCATAGTCGATCGCTGGCCACAATTTCGGAATCGTTATTGGATGGTAACTGCATTCACGTGCTTCACGTGTTTTATTCTCGGTCTTCCAATGTGTTTCAGTGGCGGAGTATATCTATTTACATTGCTCGATTGGAACACCGCATCCTGGGCTATATTACTTATAGGTTTAGCTGAGTGTACCGCCGTATCTTGGTGCTACGGCATCAACCGAGCGATGCGAGACTTAGCTTCTATGGATATGAAGTTAAATGTGGTGATACAATTTTATTGGAAATTTGTGTGGATGATCAGTGTGCCAGTTGTTAGTTTAGGTATATTAATATTTTTGTTCATCGAATGGCAACCGCCGTCTTACGAAGGCTACGAATTTCCTCTCTACGCTGATCTTCTCGGATGGGGGATTGGTCTGTCTACACTGATGTTTTTCCCAGTTGGAGTTTGTTGGGCATTATACAAAGGATATAGAGGCAAGGAACTATTTCAACCAACAGAAGTTTGGAAACCGGCAGTGAAGCTATCAGATCAGTCACCTGACCCTAGACGGAATCCAAGACCCTCAGAATATGATAATCTCGGTTATATGATGTAA

Protein sequence:

>DPOGS209803-PA
MQENGNAVASGDDNTETAQRASWGRPLEFILACLGYAVGLGNVWRFPYLCYRNGGGAFLIPFFLMLILVGLPIFYLELYIGQFTALGPLKAFNAISPFFTGLGYCTLMVISMISIYYMIIVAWTIFYTFMSIGGDLGWGSCENDYNSLYCYSGAYDSECRVNNTGNYTDLTYYLQKCISIGEICQLTGFEPYDGSSCWTGNETIPWYRNVNRTLASEEYYNERVLGRGDSTWENWGSIQWHLVGCLFFCWVVAFLCVIKGVQSAGKVVYFTALFPYVMLTALLIRGVTLDGAGEGILFYLTPKWETLLEARVWGDAASQIFYSFGVACGSLVTLASYNNFHNNCHFDAVFVSLTNFMTSIYAGFAIFSVLGFQAELMGVGVDDVAEQGPGLAFVVYPEALLQMPIPRLWCILFFFMMFILGLGSQFAGIEAINTAIVDRWPQFRNRYWMVTAFTCFTCFILGLPMCFSGGVYLFTLLDWNTASWAILLIGLAECTAVSWCYGINRAMRDLASMDMKLNVVIQFYWKFVWMISVPVVSLGILIFLFIEWQPPSYEGYEFPLYADLLGWGIGLSTLMFFPVGVCWALYKGYRGKELFQPTEVWKPAVKLSDQSPDPRRNPRPSEYDNLGYMM-