Monarch geneset OGS2.0

DPOGS204331
Transcript: DPOGS204331-TA; 1716 bp
Protein: DPOGS204331-PA; 571 aa
Genomic position: DPSCF300142 - 70333-76256
RNAseq coverage: 522x (Rank: top 24%)
Annotation
Heliconius: HMEL004634; 0.0; 64.46%
Bombyx: BGIBMGA007228-TA; 0.0; 73.90%
Drosophila: NAAT1-PA; 8e-127; 47.71%
EBI UniRef50: UniRef50_O76188; 0.0; 68.20%; Transporter n=6 Tax=Obtectomera RepID=O76188_MANSE
NCBI RefSeq: NP_001124343.1; 0.0; 63.10%; putative amino acid transporter [Bombyx mori]
NCBI nr blastp: gi|3252836; 0.0; 68.20%; potassium coupled amino acid transporter [Manduca sexta]
NCBI nr blastx: gi|3252836; 0.0; 68.20%; potassium coupled amino acid transporter [Manduca sexta]
Group:
Gene Ontology:
  GO:0016021; 2.8e-224; integral to membrane
  GO:0005328; 2.8e-224; neurotransmitter:sodium symporter activity
  GO:0006836; 2.8e-224; neurotransmitter transport
KEGG pathway:
InterPro domain: [11-544]; IPR000175; 2.8e-224; Sodium:neurotransmitter symporter
Orthology group: MCL10106; Insect specific

Nucleotide sequence:

>DPOGS204331-TA
ATGCCTATGTCCATTAACATCACGATGAATAATTCAGAGAAAGGAGGCAACGTCAACCCTGGGTTCGAGATATCTGAACCTAGGAAGTCACTTGACGCAAAGTTCCCTGACAATGTCATCAACGAAAAGGGCGAAGAAGACGATGAGGAAGACTCACGGCCGATGTGGGGAAACCAACTCGAATTTCTTATGTCATGTATTGCTACATCAGTCGGGCTTGGTCTCGGATATGCACAAGCTTTGGCCTGTGGTTATATTCTCTCCTACTACGTATCAATCATCGCTTTATGTATCTACTATCTTGCCATGAGCTTCCAAGCCCCCTTGCCGTGGGCGGTATGTGATCCTTCCTGGGTAAATTGTGTGCCCTCATCTAGTACAGGGGAGAATGCTAACGTGGTTAATGGAACAAGTAGTGCTGAATCATATTTCATAAAAACTGTTCTTCAGCGGAACAATGGACTTGAAGAGGGCCTTGGTCTTCCTGTTTGGTACCTGGTACTTTGCCTGTTAGCGTCCTGGATAATAATCTTTGTGATCGTTTCAAGAGGTGTTAAAAGTTCAGGCAAAGCATCATACTTCTTAGCACTCTTCCCTTACGTGGTGATGATTATTCTTCTTATCAGTACAGTAATTCTTCCCGGAGCTGGGAACGGTATTTTGTTTTTCATCACACCAGAATGGAATAAATTGCTGGAACTTGATGTATGGTATGCTGCTGTCACACAGGTGTTCTTCTCTTTGACAGTCTGCAACGGACCCATTATTATGTTCTCCTCTTATAATGCCTTCAAACAAAACGTATACAGAGACGCGATGATTGTTACTACTTTAGATACCTTCACCAGTTTGTTATCCGGAGTCACAATTTTCGGTATTCTTGGAAATTTGGCATACGAATTAAGAAGAGAAGTTGGTGAAGTTGTCGGTTCTGGAGGAACGGGACTCGCTTTCGTTTCCTACCCTGATGCTATTGCTAAGACTTTCCAACCACAGTTATTCTCAGTGCTCTTCTTCTTGATGATGACTGTACTTGGTATTGGATCAGCTGTAGCACTCTTGTCATCTATTAACACGCTTCTGTTGGACGCTTTCCCTCGCGTTAGAACTGTCTTCATGTCTGCCTTCTCATGCACTGTCGGTTTTGCTTGTGGTCTTGTGTACATCACGCCTGGTGGAGCATATGTATTGGAGTTGGTTGACTATTATGGTGGAACTTTCCTTGTTCTTTTCTGCGGTATCATTGAAGTTATAGGTTTCTTCTGGATTTATGGACTGGAAAACGTATGTTTGGACATAGAATTCATGTTGAACATAAAGACCTCTATATACTGGCGTTTCTGTTGGGGTTTCATTACACCAGCTATGATGGTTGTCGTCTTTGTTTACGCTCTCATGTCCTTCGATAGTTTAGAGTTCGCTGGATATACTTACCCATTAGCTGGTTATGTTTCTGGATACCTGATGTTGTTCGTTGGAGTTTTCTTTGTTCCTCTCGTTATTCTGCTGACATTCTACAAATACAGAAGCGGCAGTTTCTATGATACCCTGAAGAAGTCTTTTACACCCAAAGAATCTTGGGGTCCCAGGTCTGCAAAGACTAGACGCGAATGGAAACTTTTCAAAGAGGAAGTTGAAAGGGAAAGAAGTATGGTGCCGAGATCATGGCTGAAACACGTCGGTTTAAGTCTGATAGGAGGATACAAACGTCCTTGA

Protein sequence:

>DPOGS204331-PA
MPMSINITMNNSEKGGNVNPGFEISEPRKSLDAKFPDNVINEKGEEDDEEDSRPMWGNQLEFLMSCIATSVGLGLGYAQALACGYILSYYVSIIALCIYYLAMSFQAPLPWAVCDPSWVNCVPSSSTGENANVVNGTSSAESYFIKTVLQRNNGLEEGLGLPVWYLVLCLLASWIIIFVIVSRGVKSSGKASYFLALFPYVVMIILLISTVILPGAGNGILFFITPEWNKLLELDVWYAAVTQVFFSLTVCNGPIIMFSSYNAFKQNVYRDAMIVTTLDTFTSLLSGVTIFGILGNLAYELRREVGEVVGSGGTGLAFVSYPDAIAKTFQPQLFSVLFFLMMTVLGIGSAVALLSSINTLLLDAFPRVRTVFMSAFSCTVGFACGLVYITPGGAYVLELVDYYGGTFLVLFCGIIEVIGFFWIYGLENVCLDIEFMLNIKTSIYWRFCWGFITPAMMVVVFVYALMSFDSLEFAGYTYPLAGYVSGYLMLFVGVFFVPLVILLTFYKYRSGSFYDTLKKSFTPKESWGPRSAKTRREWKLFKEEVERERSMVPRSWLKHVGLSLIGGYKRP-
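
The transcript and protein lengths in this record (1716 bp and 571 aa) agree with each other if the coding sequence spans the whole transcript and ends in one stop codon: 3 x (571 + 1) = 1716. The following Python sketch shows one way to check this, and to translate a coding sequence for comparison with the protein record. It assumes the standard genetic code and renders the stop codon as "-", as the protein sequence above does; the function names `translate` and `lengths_consistent` are illustrative and not part of the geneset or any library.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# by first, second, then third base over T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Assumes the sequence starts in frame and contains only A, C, G, T.
    Stop codons are written as `stop_symbol`.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def lengths_consistent(nt_len, aa_len):
    """True if nt_len equals aa_len codons plus one stop codon."""
    return nt_len == 3 * (aa_len + 1)


if __name__ == "__main__":
    # First four codons of DPOGS204331-TA followed by its stop codon (TGA).
    print(translate("ATGCCTATGTCCTGA"))
    print(lengths_consistent(1716, 571))
```

To check the full record, pass the complete nucleotide sequence of DPOGS204331-TA to `translate` and compare the result with the DPOGS204331-PA sequence.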