Monarch geneset OGS2.0

DPOGS207182
TranscriptDPOGS207182-TA1482 bp
ProteinDPOGS207182-PA493 aa
Genomic positionDPSCF300001 + 5095280-5129739
RNAseq coverage44x (Rank: top 72%)
Annotation
HeliconiusHMEL0103806e-5547.37% 
BombyxBGIBMGA007228-TA3e-0623.31% 
DrosophilaSerT-PA2e-0625.68% 
EBI UniRef50UniRef50_Q9N3C72e-0626.20%Transporter n=7 Tax=Chromadorea RepID=Q9N3C7_CAEEL
NCBI RefSeqNP_491095.33e-0726.20%Modulation Of locomotion Defective family member (mod-5) [Caenorhabditis elegans]
NCBI nr blastpgi|3418859775e-0622.92%hypothetical protein CAEBREN_01689 [Caenorhabditis brenneri]
NCBI nr blastxgi|2420141547e-0922.07%sodium-dependent nutrient amino acid transporter, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00160214.1e-05integral to membrane
GO:00053284.1e-05neurotransmitter:sodium symporter activity
GO:00068364.1e-05neurotransmitter transport
KEGG pathway 
Orthology groupMCL34694 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207182-TA
ATGGATTTCTTTATAAAACAATATACAAGAAAGCTGGATACTGGAAAATTGATGAATCCACTGCTGAAAGGAGTATCATACGGTATTATTATACAAACCGATCTTTCCGCTATTTTGCATGCAATCGATTTAGCCGACTCGGTCAGGTTTCTACTCACTAGTATGAATAAAACACCGGCGTGGTCCAAGTGTCAGAACCTCCCGCCAAATATTAGTTGTGTCTCTTCTCAAGATATAAAAATAAAATGCAATAACGAGGTCATTGTGAATCTGAAGCATACGTCAGCCCACCTCAACTACCTAAAACTGTTTGCCGGTCTTGACAGAAACAGTCTATCAATGAGATTTTTTATAATCGCAATTGTTTGGATATGTAATTTTTTCATCGCATCAATTACGGATACTGCTCTTTTAAGGAACAAGGGAATAAATCTAGTGACCGACCCTTTCGGAGTAGGCCTCATCGGGGTATATGATTTTGGGACCATGTCTCCTTTCACGATGGTTGACAACGCAGTGTTAATTTTCGCAATGGTGTTTATTGCTATGGCCTTCGCAAGATCTCTCATAGTACGAGTGCTGTATTTGAAACTTACTGAATGTGTTAAAGTAGACCTGGCCGAGTCGCCTCACTACCTTTTATTCGCAATTCTGCCTTTGAGTACGGAATTTATGGATGCTCACAAGATCTTCGTGTTGTACATTTATCTGTACATGATGGCTGCATTAGTGGCATATCTGGCAATGTTCACATCGACGATGTCAAGGCTACTTCACAGCGAATTCTGTTCCGTCAAAACAATTTATATAATTGGATTGGTCTGCTTTTTAGGTTTCATCCAATCATTGCCATTGACCTTGTATTCTGGCGACACTATGGGCTTATTCTTTGGCCTCAATACATGTACCTTATCTCTGGGCGCCATTAAGGTGGCCATAGTCATGTGGGTATATGGAGTCCAAAAGTTTTCAACAGATATACAATTTTGGCTTGGTTTTGAGCCTACAAGCTTCTGGCAGAATTTATGGACAGTCCTTCCATTATTCCTTACTGGCTTTGCTTTGCAACATATTAAGGATTTAATAACTTTTAAGCAAATAGGCCAAATCTATACCGCAATGTTTTGGTCTTTGATAACGTTTCTGGTTGTAATCATTTGCATGTTAAAAGCCGTGGCGCAGTGTATTGTAAAAAATAATTTAGCTGGCATACTTAAAAGTAGTTATAAACACGGTCCCCCGGAGATCGAAGATAGAAAGAAAAGAAGGAATTTTGACAAGGTTGCACAAAATAGAAAATGTAAACATAATTGCCTTATACTTGATGAAACTTTCGACTGCAATCATTTGCCGTTAACATTTAGGAGAAAATCAAACATAAATAATGATTCTTTGACTAATATATACGAGGCCGGACCGTCAAATGAAAGACACAGAACATCGAGTGTTTTAGATATTGCTAATTTAGATGCGCAAAAATAA

Protein sequence:

>DPOGS207182-PA
MDFFIKQYTRKLDTGKLMNPLLKGVSYGIIIQTDLSAILHAIDLADSVRFLLTSMNKTPAWSKCQNLPPNISCVSSQDIKIKCNNEVIVNLKHTSAHLNYLKLFAGLDRNSLSMRFFIIAIVWICNFFIASITDTALLRNKGINLVTDPFGVGLIGVYDFGTMSPFTMVDNAVLIFAMVFIAMAFARSLIVRVLYLKLTECVKVDLAESPHYLLFAILPLSTEFMDAHKIFVLYIYLYMMAALVAYLAMFTSTMSRLLHSEFCSVKTIYIIGLVCFLGFIQSLPLTLYSGDTMGLFFGLNTCTLSLGAIKVAIVMWVYGVQKFSTDIQFWLGFEPTSFWQNLWTVLPLFLTGFALQHIKDLITFKQIGQIYTAMFWSLITFLVVIICMLKAVAQCIVKNNLAGILKSSYKHGPPEIEDRKKRRNFDKVAQNRKCKHNCLILDETFDCNHLPLTFRRKSNINNDSLTNIYEAGPSNERHRTSSVLDIANLDAQK-