Monarch geneset OGS2.0

DPOGS213718
TranscriptDPOGS213718-TA1488 bp
ProteinDPOGS213718-PA495 aa
Genomic positionDPSCF300310 - 160955-164433
RNAseq coverage18x (Rank: top 80%)
Annotation
HeliconiusHMEL0176615e-8268.30% 
BombyxBGIBMGA011849-TA1e-8163.59% 
DrosophilaCG10804-PC1e-2523.27% 
EBI UniRef50UniRef50_D6WF781e-2623.83%Transporter n=4 Tax=Coelomata RepID=D6WF78_TRICA
NCBI RefSeqXP_969026.13e-2825.15%PREDICTED: similar to sodium- and chloride-dependent neurotransmitter transporter [Tribolium castaneum]
NCBI nr blastpgi|910784966e-2725.15%PREDICTED: similar to sodium- and chloride-dependent neurotransmitter transporter [Tribolium castaneum]
NCBI nr blastxgi|3485435562e-3124.49%PREDICTED: sodium-dependent serotonin transporter-like [Oreochromis niloticus]
Group
Gene OntologyGO:00160213.1e-35integral to membrane
GO:00053283.1e-35neurotransmitter:sodium symporter activity
GO:00068363.1e-35neurotransmitter transport
KEGG pathway 
InterPro domain[28-377] IPR0001753.1e-35Sodium:neurotransmitter symporter
Orthology groupMCL25894 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213718-TA
ATGAGCGAAAGTCTGAGTAGTGTTACGAAGACAGTCACCGAGACCGTTGATACGGAATCCACTGAAGACAATTTGATCGTAGACCGCTGGACGAACAATATCAGTTATCGCCAGTTGGTGATGTCATCATGTATTGGCATCGTACAATTGTGGTGGCAACCTTACATTCACGACTTCCGAAAATTAGTCCCCTTCCTTTTCCTCTATAACGTGTTCAGTGTATTCTTTGCCTACCCCGCTTTCTATCTGGAGCTTGCTCTCGGTGTGGTGACAAAGAAAGGCGTTCTTAACTGCTGGGATTTGGCTCCTGTGGCCCGAGGTCTGGGTCTAGCGATGTTAATATTATGTACATTTACGGCGTTGTCTCTGAGTGCTGTGAGTTCATGGTGTTTGGCGTTGGTAGTTCATTCTTTTCATTCGTTTTTACCTTGGCTTCACTGCGCTTCAACGGCTAATCCGCCGTGTGCAGCTCGACATAGACCGCTACCAGCTGGTAGTGAAACCCCACCTCAGTCTTTCTTCTTTAACTTTGTTCTGAAGTTGAAACGTGACGGACTGCACGGGGGTCTAGGCAGTATTGTATCAGAATTATCCGTTTATTACGTCATATCCTGGATACTGGTGTACTTCATTGCGTGCAAGAAAATCTATAGCTATTCGAAACTGGTGCTCTTCAAGGATTCTTTAGCGTTTTTCGTTTTGGTGTGGAGTGCGTTCGGTGTAATCAGATTGAAGGGGTCATCGCGTATGTTTTATGATTGCGATTGGAATGTTCTCTTTGAAAGTTTTCAGATTTGGCGAGAGGCTTTGGAATTTGCATTTATTCAAATGTCGGTATCCCAAGGCACGCTCATAATGTTGGGATCCTATTGTCCTAAACAGAAACGTATGCTTGGGAATACATCAGTATTTGCTTTCGGCGTTTCTAAAATCAGTTGTTCAGCAACGGCTCTTATTCTTGGAGGTGCCCACGGCGCTTTGAATTGGGACTATGACAATAACAGCACTCACATTGTCAAAGGTTCTTCCGCGTCCATTATAATTTGGGCTGACTTTGTTGCAAGAGCACCTGGAAGCCAGTTTTGGTCGATTTTGACATTTTTTACACTGTTTGTTTTGTCCGTTTGCTCAACGGGTGGACTGTACATTTTGAATTTCCTTTTGACTTGGCCTGTAACAAAACCGCGAATTCCTATCGCAGCAGTCGTGGCCTTCGTAGTGACGTACCAATATGGACAGAGTACATTTTGTGAGGATGTATTTTATGCTGTGGGGGAATACCCTTGTGTATTCCTAAGAGTCTGTTGGGCCTTGACACCGATTTTCCTTTTGATAACTTTTGTATCTGGCATGGCTTCGTCGCCTGTGCCCGAGTCAGTCGCCGGCTGGTCGTTAGTGATGACGTCACTACTGCCATTGGCCGTATTAACGTTCCTTTTCCTCGTTTATAAATTTAGAGTTCGGAATATCGTCGCTACGGAGAAGTGA

Protein sequence:

>DPOGS213718-PA
MSESLSSVTKTVTETVDTESTEDNLIVDRWTNNISYRQLVMSSCIGIVQLWWQPYIHDFRKLVPFLFLYNVFSVFFAYPAFYLELALGVVTKKGVLNCWDLAPVARGLGLAMLILCTFTALSLSAVSSWCLALVVHSFHSFLPWLHCASTANPPCAARHRPLPAGSETPPQSFFFNFVLKLKRDGLHGGLGSIVSELSVYYVISWILVYFIACKKIYSYSKLVLFKDSLAFFVLVWSAFGVIRLKGSSRMFYDCDWNVLFESFQIWREALEFAFIQMSVSQGTLIMLGSYCPKQKRMLGNTSVFAFGVSKISCSATALILGGAHGALNWDYDNNSTHIVKGSSASIIIWADFVARAPGSQFWSILTFFTLFVLSVCSTGGLYILNFLLTWPVTKPRIPIAAVVAFVVTYQYGQSTFCEDVFYAVGEYPCVFLRVCWALTPIFLLITFVSGMASSPVPESVAGWSLVMTSLLPLAVLTFLFLVYKFRVRNIVATEK-