New model in OGS2.0 | DPOGS205278 |
---|---|
Genomic Position | scaffold445:+ 13981-21174 |
See gene structure | |
CDS Length | 3411 |
Paired RNAseq reads | 1632 |
Single RNAseq reads | 4009 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA011081 (0.0) |
Best Drosophila hit | bedraggled, isoform B (8e-101) |
Best Human hit | sodium- and chloride-dependent GABA transporter 1 (9e-20) |
Best NR hit (blastp) | hypothetical protein TcasGA2_TC000366 [Tribolium castaneum] (0.0) |
Best NR hit (blastx) | hypothetical protein TcasGA2_TC000366 [Tribolium castaneum] (0.0) |
GeneOntology terms | GO:0005326 neurotransmitter transporter activity GO:0005887 integral to plasma membrane GO:0005328 neurotransmitter:sodium symporter activity GO:0006836 neurotransmitter transport GO:0007464 R3/R4 cell fate commitment |
InterPro families | IPR000175 Sodium:neurotransmitter symporter |
Orthology group | MCL16352 |
Nucleotide sequence:
ATGGAAACAATTGCAATGGAAGAGGGTAGGTATGAACAAAAGGAAAGTTTTCCAGGAAGT
AGTGGTATTGACAGTGATACATCAAGTGTATTTGAGAGTAATGAAAGTAGTGAACAAATA
AGCGAAGATGAAGAAGATTTGAAAGATGACCTCACAAAACATTTACCTATATTTTATAAG
TCTGATGCAAACCATGAATCACATAACAACATCACAAACGATGAGGAATTTCAAGCAGCC
GCTCTTGGTCAGTTTATGGACATATTAAATGATTTAGATGAAGTTCTAGATAAATCCCTG
CTTGCATGTCTTGACGATGGTACTAAATCTTTAGACACTGACGAAGGGGATATTATCTGT
AAAATAAAGGAATGTATAGTGGATACTGAAAATATATCTGAACATTTAGGATCCCAGAAC
TCTTTAGATGATGATACGCAAGTCGTGTGCTCCGTTCTTCCGTCTTCGACAGCATCAATA
GATGATTTAGAACTTTCACAATGTGCTCCCATTTCACGCAATAATCTCATCCGTTCGAAA
AGTTTCTCTGAAATTCCACGAAATCATACACAGTTTACCACCGCTCAGGCTTTAGAAAGG
GCTAATACGGTAAACAACACGCTAAGAAATTCAATGAGAAGATTAGATCCTATAGTTCTA
CCAGCTATAACAAACCAGGAATCTGAACCTCTTACACTGCCCGTCATATTGTTTTTGGAG
CATCATGTTAATGCGCGACCAACAAGTTCTCCTATACAATTGCAAGTGACCGCAGCAAAT
TTAAGTAGCGATGGGAGTTCCGGTCCACTCATAGTTGGAAGACGGACACTGTTAATGAAC
AGAGCGTTATCATTACCGTCGCCTGTTGATAGTGATATCACGACTAATTGGAACGAACGA
ATATCGAGAAGTCTAGCGAGAAGCGCAAGTGCATCTAGTTCGTCAGAAGATTCACTGCCA
GGGTTAGCAGCTGTAAATCACAGTCCTGCTTCTGACGACCGACATGAGGATGCTGACATG
CCCCCGTTTGGAGTTTGGCCGCATAGAATGAGCGCGATGCTAGCGTGTTTCAGTTGCACG
ATAGGAATATTCAACATCTCAAGATTTGCCATATTTAGCGTTAACTTTGGAGCCAGTTTT
ATAGTGCAATTTATTATACTATCATTAATAGTAGGTATACCATTGTTTACATTGCACTTG
TGTTTGGGTCAAGTTTTGGAATCCGGGCCAGTTGACATGTGGAAGATATCTCCAATATTC
CAAGGTGTTGGTATATCATTATTGTTAACACAAGCTGTTATTGGGATGTATAGTATAATA
GGATTGTCCTGGATATTCGTTTACTTCAGAGATTCCTTTATAACATCAGATGATAGATAC
AAATGGGCATTACCAAATGAATACAACTTTGACAGTCACAGAAATAACACTAAAATATAT
GAGACACTGCCAAAGTATTTCCATACTGAAGTGCTTCAAAGAAATGGGAATTCAAACAGT
TTTGGTACTATAAAGTTTCAAGTGGCATTCAACTTGGCTGTTGTATGGATGATAGTTTTT
GTTTCTCTCAGCAAAGGATTGAGGTCGTACGGCAAGGCTGTGTATATGTTGATATTTTTA
CCTATCTGTGGTACTTTAGTGCTTTCTATCAAACTATTGACTCTAATACCTTATGATACT
GTGACCAATATATTCCCTGAGACTGAATGGAGCGAATTTTTCATAAATAGTAGTAGTTGG
GCTGCCGCTGCCCAAGAGACTTATCTGACATGGGGTTTGTTGTCAGCTTGTGTAATGCAG
TTGACTACACACAAACATCCGAAACACAAAACACATCTTATACTACAACGGGAGAGCGCA
TGTATAGTTGTGTTCACCATGAGCGTTCTATTTTTAGGAGCTTTCCTTGCTAATACATGT
GTCGTTATATTGAAGAGTTACGGTTTCACCTACGTGCCTAGTAGTTTCGAGACAGTTAAA
TCATCACAATTCTTATGGCCAGTCTCGGAACCGCTACCTGGTAACACAGTATCAACTCCC
TTGCGGTATATGGGGCATTATGGGAGTCTGGTAGGAGTTACAGTATGGAAGACTGGTAAC
ATTGCAAGAACTTTGAGTGGTTGGCAACCTTTACAACTGGCAACACAGATAGTTCCTGCA
ACACTGGCTGTGCTGCCGACAAATTTTCTGTCACCAGCGTGGGCTGTGATATTCTACTTC
ATCTTAATAATGTTTGGTATAGCCCAACAGCTTGCTATATGGCATTGCGTCATAACAGGA
ATCATGGCTATTAACGCTAAGGCCCTCAAAGTATGGGAGACGACTATAACTTTCCTAAGC
TGTGTTTTTGGTCTTGCTGTGGGATTGCTTTTATCTACTGATGCGGGGATACGTATAGTA
CATTTCATCGACTACGTGTGGGTGGGATGTTGGTGGCAGTGCATAGTACACGTGTCGCTA
GTCGTAGGTGTGTTCGTGGTACGAGGGCGGCCGTACTCGCCGGACGCGGTGGTGGGGGCG
CTGTACACCGCGGGCTCTCGTCTGTCCGCCACACTCGCCGCTCTATTAAGTTTCACGTGG
ACCGTGGTGCTGCCTGTGTTGCTCTGTGCAATATGCATAATGGATTTTCGGACTGGACAG
CAACGACAATTGTACAGTTGGCGGAAACCTATCAGTTACTGGCCAATATGGACACGCCAA
GTGGCAGTTTTCTTACAGCTGACCGCACTTCTGATTGTACCTGTAACAGCTTTCGTACAA
ACTTGGATATACATATATAAAGGACCTACCGATATATTAGAGGATGACGAGTCCATCATT
TGCGCCGATGACCCGAGAATTCAGAATCTGTATCGTCCTCGTATTGGTTCGTCGGGCTCC
ACGCCGATTGTTATCGGTGCGGTTGAAGATAGGCCTTCGCCCCCCGACCCTCCCCCGAAA
TATACGCCGCCGCCATCATACTCGACCGCCACAGGAGCCCGACTTATGCATACCTTGAGG
CGGAGCTTTAGAACACTAAGGAGAATAACATCAACACGCGAGGAGACGGTCGACGAGACA
TCACTACCCATCACACTGAGCGAGAGTGTCGACAGACAGCCGCAGACTAGCGACCAGCGG
ACCTTCACACCGGTGACACTCACCGACGAGCCGGACACGACACGACAAATCCGTCCGACG
TCCTCACGACAATCGCTCACACTCACAAGGGATTACCTCAGGCGATCCTTCGTCAGGAAG
AACGACTCCAAAGGCATACGGTCCAGCTTACGAAGAAGCTTCAAGTACGGTGGTAATCTG
ACGACCAGCCACGAACATTTGGTGAGGGATATTGAACCAATATCAAACACTGTAGCCATG
TCCGCCATGACCGAACCCTCTCATACAAGGAGCTTGGGATCAGTCTTATGA
Protein sequence:
METIAMEEGRYEQKESFPGSSGIDSDTSSVFESNESSEQISEDEEDLKDDLTKHLPIFYK
SDANHESHNNITNDEEFQAAALGQFMDILNDLDEVLDKSLLACLDDGTKSLDTDEGDIIC
KIKECIVDTENISEHLGSQNSLDDDTQVVCSVLPSSTASIDDLELSQCAPISRNNLIRSK
SFSEIPRNHTQFTTAQALERANTVNNTLRNSMRRLDPIVLPAITNQESEPLTLPVILFLE
HHVNARPTSSPIQLQVTAANLSSDGSSGPLIVGRRTLLMNRALSLPSPVDSDITTNWNER
ISRSLARSASASSSSEDSLPGLAAVNHSPASDDRHEDADMPPFGVWPHRMSAMLACFSCT
IGIFNISRFAIFSVNFGASFIVQFIILSLIVGIPLFTLHLCLGQVLESGPVDMWKISPIF
QGVGISLLLTQAVIGMYSIIGLSWIFVYFRDSFITSDDRYKWALPNEYNFDSHRNNTKIY
ETLPKYFHTEVLQRNGNSNSFGTIKFQVAFNLAVVWMIVFVSLSKGLRSYGKAVYMLIFL
PICGTLVLSIKLLTLIPYDTVTNIFPETEWSEFFINSSSWAAAAQETYLTWGLLSACVMQ
LTTHKHPKHKTHLILQRESACIVVFTMSVLFLGAFLANTCVVILKSYGFTYVPSSFETVK
SSQFLWPVSEPLPGNTVSTPLRYMGHYGSLVGVTVWKTGNIARTLSGWQPLQLATQIVPA
TLAVLPTNFLSPAWAVIFYFILIMFGIAQQLAIWHCVITGIMAINAKALKVWETTITFLS
CVFGLAVGLLLSTDAGIRIVHFIDYVWVGCWWQCIVHVSLVVGVFVVRGRPYSPDAVVGA
LYTAGSRLSATLAALLSFTWTVVLPVLLCAICIMDFRTGQQRQLYSWRKPISYWPIWTRQ
VAVFLQLTALLIVPVTAFVQTWIYIYKGPTDILEDDESIICADDPRIQNLYRPRIGSSGS
TPIVIGAVEDRPSPPDPPPKYTPPPSYSTATGARLMHTLRRSFRTLRRITSTREETVDET
SLPITLSESVDRQPQTSDQRTFTPVTLTDEPDTTRQIRPTSSRQSLTLTRDYLRRSFVRK
NDSKGIRSSLRRSFKYGGNLTTSHEHLVRDIEPISNTVAMSAMTEPSHTRSLGSVL