Monarch geneset OGS2.0

DPOGS214114
TranscriptDPOGS214114-TA1548 bp
ProteinDPOGS214114-PA515 aa
Genomic positionDPSCF300014 - 1724731-1729801
RNAseq coverage47x (Rank: top 71%)
Annotation
HeliconiusHMEL0113990.064.24% 
BombyxBGIBMGA006164-TA0.079.16% 
DrosophilaNtl-PA1e-13449.35% 
EBI UniRef50UniRef50_D2KBC00.069.94%Transporter n=4 Tax=Endopterygota RepID=D2KBC0_9NEOP
NCBI RefSeqXP_001654961.11e-16056.93%sodium/shloride dependent amino acid transporter [Aedes aegypti]
NCBI nr blastpgi|2809836160.069.94%proline transporter [Chilo suppressalis]
NCBI nr blastxgi|2809836160.069.94%proline transporter [Chilo suppressalis]
Group
Gene OntologyGO:00160218.7e-228integral to membrane
GO:00053288.7e-228neurotransmitter:sodium symporter activity
GO:00068368.7e-228neurotransmitter transport
KEGG pathway 
InterPro domain[8-474] IPR0001758.7e-228Sodium:neurotransmitter symporter
Orthology groupMCL17197 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214114-TA
ATGTCGTATGCTGCACACGAGCGCTGGGCCAGTCAGGCCGAATACTTGTTGTCATGTCTCGGATACGCTGTTGGTATTGGAAACATATGGAGGTTTCCATATCTATGTTACAGGAATGGGGGCGGTGCATTTTTAGTTCCATACTTTTTAACCTTAATCGTGTGTGGCATCCCTTTGGTTTATTTGGAAACTCTGTTAGGACAATTCTCTAATGCTGGATGCATTTCAGTTTTCAATATAAATCCGTTGCTGAAAGGGGCCGGTTATGCCGCTGTAATCCTGAATGTTATAGCTGTAATATACTTCGCGTCTATAATGGCTTATCCAATACTATACATATATCACTCCTTCGGTTCGCCACTCCCCTGGCAAACTTGTTCCAATTCATGGAACACTGAAAACTGCACAGAGATTACTGTAAACTCGAGTTTATTTATGAATGGTACAATCACTACGCCTGAGGACGAATTTTTCCATATACGTCTATTACACATGTCTTCAGGCTTGAGTCAAATCGGTGGTATAGTCTGGCCAGTTTTCTGGTGTAACGTCATCTCGTGGATTCTAGTCTATCTTTGCATCTGCAACGGAGTTAAAAGCGTCGGAAAAATTGTTTACTTCACCGTCCTATTTCCTTATGTCGTGCTAACAGCGTTGTTCGTAAGAGGCATAACATTGCCTGGCTCTTGGCAAGGCATACTTTACTTTGTGCTGCCGGACTGGAAACAATTATTGAACCCAAAAGTGTGGGCTGATGCGGCTACCCAAATCTTCTATTCACTAGGTCCTGGATGGGGTGGACTCGTCAGCATGGCCAGTTTTAACAAATTCCATTATAATAACTTAAGATCATCAATAATAATTCCCTTAGTTAACAGCGGTACCAGCATCTGGGCTGGGTTCGTTGTGTTCTCGGTTTTGGGTTTTGTGGCTGAGCGGGCGGGTGTGCCGGTGGGCCGGGTAGCTACAGCTGGACCCGGGCTCGCTTTTGTAACTTACCCAGCCGCTGTATCCATGATGCCGGCTCCCAACTTTTGGGCTATAACCTTCTTTGTTATGCTTTTCTTTCTCGGTATCGACTCAATGTTCGTTACAATAGAATCGGTAATAGCTGGTGTGATGGACGAGTTCCCTCAACTGCGGCGTCGAAAGAGATTCATTACTTTGATGACCTGTCTGTGTCTCTTCTGTTTGTCAATCATTTGCAATACTGAGGGCGGGTTACATATAATAACTCTACTGGACGCTCATGTCGCGATAGCGGCCGTACCGGTAGTATGCGGTATGGAGATATTAGGGGCTGTGTACACATACGGACCGAGGAAGTTTAGTATCGATGTGTTATTCATGACCGGAAAACCTCTGATGCGATTTTGGTTGATTATGTGGAGGTATATAATACCAGTTCTGCTCGTGAGGATACGAGACAGTTGTGAGCCGAGTGAGAACTGGGGTCCCATTGAACCTGTTGTAAGGGAACAGTGGAAAGCCTTCCAGAACCTTCACAATGTCTCTGATATTCCGCTAAACTTAACAAAACCGAACTGA

Protein sequence:

>DPOGS214114-PA
MSYAAHERWASQAEYLLSCLGYAVGIGNIWRFPYLCYRNGGGAFLVPYFLTLIVCGIPLVYLETLLGQFSNAGCISVFNINPLLKGAGYAAVILNVIAVIYFASIMAYPILYIYHSFGSPLPWQTCSNSWNTENCTEITVNSSLFMNGTITTPEDEFFHIRLLHMSSGLSQIGGIVWPVFWCNVISWILVYLCICNGVKSVGKIVYFTVLFPYVVLTALFVRGITLPGSWQGILYFVLPDWKQLLNPKVWADAATQIFYSLGPGWGGLVSMASFNKFHYNNLRSSIIIPLVNSGTSIWAGFVVFSVLGFVAERAGVPVGRVATAGPGLAFVTYPAAVSMMPAPNFWAITFFVMLFFLGIDSMFVTIESVIAGVMDEFPQLRRRKRFITLMTCLCLFCLSIICNTEGGLHIITLLDAHVAIAAVPVVCGMEILGAVYTYGPRKFSIDVLFMTGKPLMRFWLIMWRYIIPVLLVRIRDSCEPSENWGPIEPVVREQWKAFQNLHNVSDIPLNLTKPN-