DPGLEAN12018 in OGS1.0

New model in OGS2.0  DPOGS214017
Genomic Position  scaffold2072:- 4590-11828
CDS Length  1542
Paired RNAseq reads  2976
Single RNAseq reads  6952
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA010699 (6e-90)
Best Drosophila hit  CG11655 (3e-74)
Best Human hit  P3 protein isoform 2 (2e-26)
Best NR hit (blastp)  sodium-bile acid cotransporter [Aedes aegypti] (6e-112)
Best NR hit (blastx)  PREDICTED: similar to sodium-bile acid cotransporter [Tribolium castaneum] (5e-103)
GeneOntology terms
GO:0008508 bile acid:sodium symporter activity
GO:0006814 sodium ion transport
GO:0016020 membrane
InterPro families  IPR002657 Bile acid:sodium symporter
Orthology group  MCL12795

Nucleotide sequence:

ATGTGTCCATTATGGCCGCTGCACCTGATAGTGTTGTATCTCCTGGTGCTAGGCCCTATG
TGGGTTCTATGTCAGGCGGCACCGAATCTCATGGCCACGTACCTGCCAACCGAAGTCGAG
GAAGTGCACATGGGGGACACGTATTACGTCGATGTAAACGTTACAGGTGTAGGTCTTCGT
CCAGGTGCTCGTCTCCAGGTGAACGTTAGAGACGAACACGTGGCGGACACTAAATGGAAT
TCCTCATATCAGGTCACAGAAAATGACGTCAGTGAGGGGAAGTTTAAAGGGAGGTTGAGG
ATTATAGGGAATTTTCTTGGAAGGACGATATTATCATTGGAGTCCCATGGCGTTGGGGAC
ACCATAGAACCCGTGAATGGTACGCTAGCAGTCACCGTTACCAGACCCCAGAGAGTCATA
GACACTATATTCACTACTAGTATAGCGATATTCATATCAATTGTGTTCATAAACTTTGGT
TGTGCGATGCACTGGGATGAAGTTAAAGGGGTTGTGAGAAGACCTGTCGGACCTATCATA
GGTCTCTGTGGACAGTTCGTGTTTATGCCATTGATATCCTTCGGTCTTGGTTACCTGATC
TTCCCCTCATCTCCATCTCTCCACCTGGGTATGTTCTGCACGGGTGTAGCGCCGGGTGGC
GGTGCCTCTAACATATGGACCTTCATATTGGGAGGGAATCTGGATTTGAGCCTCACAATG
ACATCCATATCAACCTTGGCTGCGTTCGGTTTCATGCCGCTGTGGCTGTTCACGCTCGGT
CAAGTGGTGTTCGCTAACGCCAGTATAGTGGTTCCGTACAGTCGGATAGCTATGTTCGTG
GTGGGTCTGATAGTCCCCCTCATCATCGGCCTGGCTATGCAGAAATTCACCCCTCGACTA
TCAGCCTTCATGGTCCGGATATTGAAGCCTTTCTCGTCTTGTATATTGATTTTCATTATA
GTGTTCGCGATTGTCACCAATTTATACATATTCGAACTGTTCTCGTGGCAGATACTACTA
GCTGGTATGGGTATCCCGTGGCTGGGATACATATCGGGATACCTGGTAGCCTGGCTATTC
CGTCAACCTCATCCGGATGCACTGGCTATATCGATAGAAACGGGCATACAAAACACTGGC
ATCGCTATATTCCTACTGAGATACGCTCTGCCACAACCGGAAGCCGATATAACAACCGTG
GTACCCGTTGCCTGTGCCATAATGACACCAATCCCGATGACAGCAATATTCATATATCAA
AAATTAAGTTCATGCGGGATATGTCCGACATCAAAAACAGAACACAACAGAAGAAAGATG
TCGACCGCCCTGAGACCGTCGAGTCCGGCATCGAACCTGCCATTAATGGAAAATAAGAAA
GAACCGAAATTTCTCATCATTCCATGGACTCACCGTGCGGGCGCCGGGTCCCTCGTCGCA
GGGAGAGGAGCTTCAACAGTGCCTGGGACTTGCGACCTAGGTGTAGGTCTAAGGGGACCC
TATGGAACGGGCGGTAAACGCTCATCTCGACCGACTATCTAA

Protein sequence:

MCPLWPLHLIVLYLLVLGPMWVLCQAAPNLMATYLPTEVEEVHMGDTYYVDVNVTGVGLR
PGARLQVNVRDEHVADTKWNSSYQVTENDVSEGKFKGRLRIIGNFLGRTILSLESHGVGD
TIEPVNGTLAVTVTRPQRVIDTIFTTSIAIFISIVFINFGCAMHWDEVKGVVRRPVGPII
GLCGQFVFMPLISFGLGYLIFPSSPSLHLGMFCTGVAPGGGASNIWTFILGGNLDLSLTM
TSISTLAAFGFMPLWLFTLGQVVFANASIVVPYSRIAMFVVGLIVPLIIGLAMQKFTPRL
SAFMVRILKPFSSCILIFIIVFAIVTNLYIFELFSWQILLAGMGIPWLGYISGYLVAWLF
RQPHPDALAISIETGIQNTGIAIFLLRYALPQPEADITTVVPVACAIMTPIPMTAIFIYQ
KLSSCGICPTSKTEHNRRKMSTALRPSSPASNLPLMENKKEPKFLIIPWTHRAGAGSLVA
GRGASTVPGTCDLGVGLRGPYGTGGKRSSRPTI
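
Consistency check (sketch):

The CDS length, nucleotide sequence and protein sequence in this record can be cross-checked against one another. The sketch below is not part of the original annotation. It assumes the standard genetic code (NCBI translation table 1) and a reading frame starting at the first base. The helper name `translate` and the variables `first_line` and `last_line` are illustrative only. It translates the first and last lines of the nucleotide sequence and confirms that 513 residues plus one stop codon account for the stated CDS length of 1542.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: this record uses the standard code; the source does not say so.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS read from its first base; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence, copied from this record.
# The last line is in frame because the 1500 bases before it divide by 3.
first_line = "ATGTGTCCATTATGGCCGCTGCACCTGATAGTGTTGTATCTCCTGGTGCTAGGCCCTATG"
last_line = "TATGGAACGGGCGGTAAACGCTCATCTCGACCGACTATCTAA"

print(translate(first_line))  # MCPLWPLHLIVLYLLVLGPM
print(translate(last_line))   # YGTGGKRSSRPTI*

# 513 residues in the protein sequence plus one stop codon, 3 bases each.
print((513 + 1) * 3)          # 1542
```

The two translated fragments match the start and the end of the protein sequence listed above. Passing the full nucleotide sequence to `translate` should give the complete protein followed by `*`.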