DPGLEAN01598 in OGS1.0

New model in OGS2.0DPOGS205278 
Genomic Positionscaffold445:+ 13981-21174
See gene structure
CDS Length3411
Paired RNAseq reads  1632
Single RNAseq reads  4009
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011081 (0.0)
Best Drosophila hit  bedraggled, isoform B (8e-101)
Best Human hitsodium- and chloride-dependent GABA transporter 1 (9e-20)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC000366 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC000366 [Tribolium castaneum] (0.0)
GeneOntology terms



  
GO:0005326 neurotransmitter transporter activity
GO:0005887 integral to plasma membrane
GO:0005328 neurotransmitter:sodium symporter activity
GO:0006836 neurotransmitter transport
GO:0007464 R3/R4 cell fate commitment
InterPro families  IPR000175 Sodium:neurotransmitter symporter
Orthology groupMCL16352

Nucleotide sequence:

ATGGAAACAATTGCAATGGAAGAGGGTAGGTATGAACAAAAGGAAAGTTTTCCAGGAAGT
AGTGGTATTGACAGTGATACATCAAGTGTATTTGAGAGTAATGAAAGTAGTGAACAAATA
AGCGAAGATGAAGAAGATTTGAAAGATGACCTCACAAAACATTTACCTATATTTTATAAG
TCTGATGCAAACCATGAATCACATAACAACATCACAAACGATGAGGAATTTCAAGCAGCC
GCTCTTGGTCAGTTTATGGACATATTAAATGATTTAGATGAAGTTCTAGATAAATCCCTG
CTTGCATGTCTTGACGATGGTACTAAATCTTTAGACACTGACGAAGGGGATATTATCTGT
AAAATAAAGGAATGTATAGTGGATACTGAAAATATATCTGAACATTTAGGATCCCAGAAC
TCTTTAGATGATGATACGCAAGTCGTGTGCTCCGTTCTTCCGTCTTCGACAGCATCAATA
GATGATTTAGAACTTTCACAATGTGCTCCCATTTCACGCAATAATCTCATCCGTTCGAAA
AGTTTCTCTGAAATTCCACGAAATCATACACAGTTTACCACCGCTCAGGCTTTAGAAAGG
GCTAATACGGTAAACAACACGCTAAGAAATTCAATGAGAAGATTAGATCCTATAGTTCTA
CCAGCTATAACAAACCAGGAATCTGAACCTCTTACACTGCCCGTCATATTGTTTTTGGAG
CATCATGTTAATGCGCGACCAACAAGTTCTCCTATACAATTGCAAGTGACCGCAGCAAAT
TTAAGTAGCGATGGGAGTTCCGGTCCACTCATAGTTGGAAGACGGACACTGTTAATGAAC
AGAGCGTTATCATTACCGTCGCCTGTTGATAGTGATATCACGACTAATTGGAACGAACGA
ATATCGAGAAGTCTAGCGAGAAGCGCAAGTGCATCTAGTTCGTCAGAAGATTCACTGCCA
GGGTTAGCAGCTGTAAATCACAGTCCTGCTTCTGACGACCGACATGAGGATGCTGACATG
CCCCCGTTTGGAGTTTGGCCGCATAGAATGAGCGCGATGCTAGCGTGTTTCAGTTGCACG
ATAGGAATATTCAACATCTCAAGATTTGCCATATTTAGCGTTAACTTTGGAGCCAGTTTT
ATAGTGCAATTTATTATACTATCATTAATAGTAGGTATACCATTGTTTACATTGCACTTG
TGTTTGGGTCAAGTTTTGGAATCCGGGCCAGTTGACATGTGGAAGATATCTCCAATATTC
CAAGGTGTTGGTATATCATTATTGTTAACACAAGCTGTTATTGGGATGTATAGTATAATA
GGATTGTCCTGGATATTCGTTTACTTCAGAGATTCCTTTATAACATCAGATGATAGATAC
AAATGGGCATTACCAAATGAATACAACTTTGACAGTCACAGAAATAACACTAAAATATAT
GAGACACTGCCAAAGTATTTCCATACTGAAGTGCTTCAAAGAAATGGGAATTCAAACAGT
TTTGGTACTATAAAGTTTCAAGTGGCATTCAACTTGGCTGTTGTATGGATGATAGTTTTT
GTTTCTCTCAGCAAAGGATTGAGGTCGTACGGCAAGGCTGTGTATATGTTGATATTTTTA
CCTATCTGTGGTACTTTAGTGCTTTCTATCAAACTATTGACTCTAATACCTTATGATACT
GTGACCAATATATTCCCTGAGACTGAATGGAGCGAATTTTTCATAAATAGTAGTAGTTGG
GCTGCCGCTGCCCAAGAGACTTATCTGACATGGGGTTTGTTGTCAGCTTGTGTAATGCAG
TTGACTACACACAAACATCCGAAACACAAAACACATCTTATACTACAACGGGAGAGCGCA
TGTATAGTTGTGTTCACCATGAGCGTTCTATTTTTAGGAGCTTTCCTTGCTAATACATGT
GTCGTTATATTGAAGAGTTACGGTTTCACCTACGTGCCTAGTAGTTTCGAGACAGTTAAA
TCATCACAATTCTTATGGCCAGTCTCGGAACCGCTACCTGGTAACACAGTATCAACTCCC
TTGCGGTATATGGGGCATTATGGGAGTCTGGTAGGAGTTACAGTATGGAAGACTGGTAAC
ATTGCAAGAACTTTGAGTGGTTGGCAACCTTTACAACTGGCAACACAGATAGTTCCTGCA
ACACTGGCTGTGCTGCCGACAAATTTTCTGTCACCAGCGTGGGCTGTGATATTCTACTTC
ATCTTAATAATGTTTGGTATAGCCCAACAGCTTGCTATATGGCATTGCGTCATAACAGGA
ATCATGGCTATTAACGCTAAGGCCCTCAAAGTATGGGAGACGACTATAACTTTCCTAAGC
TGTGTTTTTGGTCTTGCTGTGGGATTGCTTTTATCTACTGATGCGGGGATACGTATAGTA
CATTTCATCGACTACGTGTGGGTGGGATGTTGGTGGCAGTGCATAGTACACGTGTCGCTA
GTCGTAGGTGTGTTCGTGGTACGAGGGCGGCCGTACTCGCCGGACGCGGTGGTGGGGGCG
CTGTACACCGCGGGCTCTCGTCTGTCCGCCACACTCGCCGCTCTATTAAGTTTCACGTGG
ACCGTGGTGCTGCCTGTGTTGCTCTGTGCAATATGCATAATGGATTTTCGGACTGGACAG
CAACGACAATTGTACAGTTGGCGGAAACCTATCAGTTACTGGCCAATATGGACACGCCAA
GTGGCAGTTTTCTTACAGCTGACCGCACTTCTGATTGTACCTGTAACAGCTTTCGTACAA
ACTTGGATATACATATATAAAGGACCTACCGATATATTAGAGGATGACGAGTCCATCATT
TGCGCCGATGACCCGAGAATTCAGAATCTGTATCGTCCTCGTATTGGTTCGTCGGGCTCC
ACGCCGATTGTTATCGGTGCGGTTGAAGATAGGCCTTCGCCCCCCGACCCTCCCCCGAAA
TATACGCCGCCGCCATCATACTCGACCGCCACAGGAGCCCGACTTATGCATACCTTGAGG
CGGAGCTTTAGAACACTAAGGAGAATAACATCAACACGCGAGGAGACGGTCGACGAGACA
TCACTACCCATCACACTGAGCGAGAGTGTCGACAGACAGCCGCAGACTAGCGACCAGCGG
ACCTTCACACCGGTGACACTCACCGACGAGCCGGACACGACACGACAAATCCGTCCGACG
TCCTCACGACAATCGCTCACACTCACAAGGGATTACCTCAGGCGATCCTTCGTCAGGAAG
AACGACTCCAAAGGCATACGGTCCAGCTTACGAAGAAGCTTCAAGTACGGTGGTAATCTG
ACGACCAGCCACGAACATTTGGTGAGGGATATTGAACCAATATCAAACACTGTAGCCATG
TCCGCCATGACCGAACCCTCTCATACAAGGAGCTTGGGATCAGTCTTATGA

Protein sequence:

METIAMEEGRYEQKESFPGSSGIDSDTSSVFESNESSEQISEDEEDLKDDLTKHLPIFYK
SDANHESHNNITNDEEFQAAALGQFMDILNDLDEVLDKSLLACLDDGTKSLDTDEGDIIC
KIKECIVDTENISEHLGSQNSLDDDTQVVCSVLPSSTASIDDLELSQCAPISRNNLIRSK
SFSEIPRNHTQFTTAQALERANTVNNTLRNSMRRLDPIVLPAITNQESEPLTLPVILFLE
HHVNARPTSSPIQLQVTAANLSSDGSSGPLIVGRRTLLMNRALSLPSPVDSDITTNWNER
ISRSLARSASASSSSEDSLPGLAAVNHSPASDDRHEDADMPPFGVWPHRMSAMLACFSCT
IGIFNISRFAIFSVNFGASFIVQFIILSLIVGIPLFTLHLCLGQVLESGPVDMWKISPIF
QGVGISLLLTQAVIGMYSIIGLSWIFVYFRDSFITSDDRYKWALPNEYNFDSHRNNTKIY
ETLPKYFHTEVLQRNGNSNSFGTIKFQVAFNLAVVWMIVFVSLSKGLRSYGKAVYMLIFL
PICGTLVLSIKLLTLIPYDTVTNIFPETEWSEFFINSSSWAAAAQETYLTWGLLSACVMQ
LTTHKHPKHKTHLILQRESACIVVFTMSVLFLGAFLANTCVVILKSYGFTYVPSSFETVK
SSQFLWPVSEPLPGNTVSTPLRYMGHYGSLVGVTVWKTGNIARTLSGWQPLQLATQIVPA
TLAVLPTNFLSPAWAVIFYFILIMFGIAQQLAIWHCVITGIMAINAKALKVWETTITFLS
CVFGLAVGLLLSTDAGIRIVHFIDYVWVGCWWQCIVHVSLVVGVFVVRGRPYSPDAVVGA
LYTAGSRLSATLAALLSFTWTVVLPVLLCAICIMDFRTGQQRQLYSWRKPISYWPIWTRQ
VAVFLQLTALLIVPVTAFVQTWIYIYKGPTDILEDDESIICADDPRIQNLYRPRIGSSGS
TPIVIGAVEDRPSPPDPPPKYTPPPSYSTATGARLMHTLRRSFRTLRRITSTREETVDET
SLPITLSESVDRQPQTSDQRTFTPVTLTDEPDTTRQIRPTSSRQSLTLTRDYLRRSFVRK
NDSKGIRSSLRRSFKYGGNLTTSHEHLVRDIEPISNTVAMSAMTEPSHTRSLGSVL