Monarch geneset OGS2.0

DPOGS204231
TranscriptDPOGS204231-TA1305 bp
ProteinDPOGS204231-PA434 aa
Genomic positionDPSCF300046 - 631658-634882
RNAseq coverage29x (Rank: top 76%)
Annotation
HeliconiusHMEL0151581e-7272.12% 
Bombyx% 
DrosophilaNAAT1-PA8e-0723.47% 
EBI UniRef50UniRef50_UPI0000E49DFE6e-0735.63%UPI0000E49DFE related cluster n=8 Tax=unknown RepID=UPI0000E49DFE
NCBI RefSeqXP_001196483.13e-0837.50%PREDICTED: similar to taurine transporter [Strongylocentrotus purpuratus]
NCBI nr blastpgi|1157440775e-0737.50%PREDICTED: similar to taurine transporter [Strongylocentrotus purpuratus]
NCBI nr blastxgi|1157440773e-0737.50%PREDICTED: similar to taurine transporter [Strongylocentrotus purpuratus]
Group
Gene OntologyGO:00160212.2e-07integral to membrane
GO:00053282.2e-07neurotransmitter:sodium symporter activity
GO:00068362.2e-07neurotransmitter transport
KEGG pathway 
InterPro domain[78-168] IPR0001752.2e-07Sodium:neurotransmitter symporter
Orthology groupMCL34536 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204231-TA
ATGGCTACCCGTAACCCAGTTTCTGTGGAGACTATACTTACGTTCGATAACAGAGATGCGCAAAGTCAATGGTCCGAAGTATGGTCGACTACACACATCCCGGGCATGGACAACGCATCCGGTGGTGACGGCTTGAGTGGTATCCAGGGTCGGAAGAAACTGCGATTTGGAGCTCTCATACTTGGGACTTTCCTAACGATGCATTGCCGAGTGACTGCGTCCGCTCTTACTGGCGGCGTCTTTGTTTTCATATTATTCCATATCCTGGCTACGGTGACGCTTGCCAATCCCTTGCGTCACTTTGAGGTGTATCTTGGTCAATGGAGTATTAGTGGACCCGGTCGTGCGTTCCGGATTTTACCAATGTTTGAAGGTCTTGGTATAGCGATATGCATAAACGCTTTGGTGCGGGCCATTATATGTTGTACAATGGCAGCAATATGTGCCATATACGTCGTATATTCCGTCAGTGATTCAAAACTTCCATTTAGTTACTGCAGGGATTTTGATCTTAGGCCGTATGACCCAATACAAAAAGATATATCCCTTCGAATGTTAAGAGAATCAAGATTTTCTTTGGTGACGGCTGGTACAGAAGAGGAAGAAATACACAATGAAGCTGTAACGGTTGGAATCCGTGATTTTGCATGGAGAAATGCGAGTAAATTAAAATCGAAAAGAAAAAGGCAAGTATTCCTAACTACCGGCCATATAGACCCGCATTTAAACGGTGAGAACGTATGGCACAGTGGCTTACTTCTATTGATGACAGGACTACACTCAAGTGGCGCTGCTATATGTGCAATAGTCGATAACATTCAGCCCAACAACAAGAATGGTTATGATATGCGCGAAATTATTTTATGCGGCAGATTCGCTGAACAGGTTGTAGTAGCTGGAATAATACTTACATGTATGGGTCTAAGCTTTCTTTTTGCTACAAGTGGTGGAGTTGCGTTGCTAGAAAGCGTGGATGCCTTAATGACAGGAATATCAACGCCATTCGTTTGCATATTGGAACTGGTAGCGATCCTTTACGTTTACCGCGGTCGGGATTTTGTGTCGGACATGAATATAGCGACCGAGGAAAACGCGTGCTCCTCCAGAATAGACACACAGTGGCAAATTATCCCTATTATAACTTTGGTTACGTTAATAATAAAGCTGAGCGTTGTGTTCGTGGCGGAATTGCCGAAGATGTCGTTAGCGTATGCGGCGGCGGCATTGGTGGCCGTGGTCATCGCCGGACCGCTGCGCGCCGCGAGGAACGCTGTGGTCTTCTGGCGAGCCAGGAGACACAAGTAG

Protein sequence:

>DPOGS204231-PA
MATRNPVSVETILTFDNRDAQSQWSEVWSTTHIPGMDNASGGDGLSGIQGRKKLRFGALILGTFLTMHCRVTASALTGGVFVFILFHILATVTLANPLRHFEVYLGQWSISGPGRAFRILPMFEGLGIAICINALVRAIICCTMAAICAIYVVYSVSDSKLPFSYCRDFDLRPYDPIQKDISLRMLRESRFSLVTAGTEEEEIHNEAVTVGIRDFAWRNASKLKSKRKRQVFLTTGHIDPHLNGENVWHSGLLLLMTGLHSSGAAICAIVDNIQPNNKNGYDMREIILCGRFAEQVVVAGIILTCMGLSFLFATSGGVALLESVDALMTGISTPFVCILELVAILYVYRGRDFVSDMNIATEENACSSRIDTQWQIIPIITLVTLIIKLSVVFVAELPKMSLAYAAAALVAVVIAGPLRAARNAVVFWRARRHK-