New model in OGS2.0 | DPOGS214017  |
---|---|
Genomic Position | scaffold2072:- 4590-11828 |
See gene structure | |
CDS Length | 1542 |
Paired RNAseq reads   | 2976 |
Single RNAseq reads   | 6952 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA010699 (6e-90) |
Best Drosophila hit   | CG11655 (3e-74) |
Best Human hit | P3 protein isoform 2 (2e-26) |
Best NR hit (blastp)   | sodium-bile acid cotransporter [Aedes aegypti] (6e-112) |
Best NR hit (blastx)   | PREDICTED: similar to sodium-bile acid cotransporter [Tribolium castaneum] (5e-103) |
GeneOntology terms    | GO:0008508 bile acid:sodium symporter activity GO:0006814 sodium ion transport GO:0016020 membrane |
InterPro families   | IPR002657 Bile acid:sodium symporter |
Orthology group | MCL12795 |
Nucleotide sequence:
ATGTGTCCATTATGGCCGCTGCACCTGATAGTGTTGTATCTCCTGGTGCTAGGCCCTATG
TGGGTTCTATGTCAGGCGGCACCGAATCTCATGGCCACGTACCTGCCAACCGAAGTCGAG
GAAGTGCACATGGGGGACACGTATTACGTCGATGTAAACGTTACAGGTGTAGGTCTTCGT
CCAGGTGCTCGTCTCCAGGTGAACGTTAGAGACGAACACGTGGCGGACACTAAATGGAAT
TCCTCATATCAGGTCACAGAAAATGACGTCAGTGAGGGGAAGTTTAAAGGGAGGTTGAGG
ATTATAGGGAATTTTCTTGGAAGGACGATATTATCATTGGAGTCCCATGGCGTTGGGGAC
ACCATAGAACCCGTGAATGGTACGCTAGCAGTCACCGTTACCAGACCCCAGAGAGTCATA
GACACTATATTCACTACTAGTATAGCGATATTCATATCAATTGTGTTCATAAACTTTGGT
TGTGCGATGCACTGGGATGAAGTTAAAGGGGTTGTGAGAAGACCTGTCGGACCTATCATA
GGTCTCTGTGGACAGTTCGTGTTTATGCCATTGATATCCTTCGGTCTTGGTTACCTGATC
TTCCCCTCATCTCCATCTCTCCACCTGGGTATGTTCTGCACGGGTGTAGCGCCGGGTGGC
GGTGCCTCTAACATATGGACCTTCATATTGGGAGGGAATCTGGATTTGAGCCTCACAATG
ACATCCATATCAACCTTGGCTGCGTTCGGTTTCATGCCGCTGTGGCTGTTCACGCTCGGT
CAAGTGGTGTTCGCTAACGCCAGTATAGTGGTTCCGTACAGTCGGATAGCTATGTTCGTG
GTGGGTCTGATAGTCCCCCTCATCATCGGCCTGGCTATGCAGAAATTCACCCCTCGACTA
TCAGCCTTCATGGTCCGGATATTGAAGCCTTTCTCGTCTTGTATATTGATTTTCATTATA
GTGTTCGCGATTGTCACCAATTTATACATATTCGAACTGTTCTCGTGGCAGATACTACTA
GCTGGTATGGGTATCCCGTGGCTGGGATACATATCGGGATACCTGGTAGCCTGGCTATTC
CGTCAACCTCATCCGGATGCACTGGCTATATCGATAGAAACGGGCATACAAAACACTGGC
ATCGCTATATTCCTACTGAGATACGCTCTGCCACAACCGGAAGCCGATATAACAACCGTG
GTACCCGTTGCCTGTGCCATAATGACACCAATCCCGATGACAGCAATATTCATATATCAA
AAATTAAGTTCATGCGGGATATGTCCGACATCAAAAACAGAACACAACAGAAGAAAGATG
TCGACCGCCCTGAGACCGTCGAGTCCGGCATCGAACCTGCCATTAATGGAAAATAAGAAA
GAACCGAAATTTCTCATCATTCCATGGACTCACCGTGCGGGCGCCGGGTCCCTCGTCGCA
GGGAGAGGAGCTTCAACAGTGCCTGGGACTTGCGACCTAGGTGTAGGTCTAAGGGGACCC
TATGGAACGGGCGGTAAACGCTCATCTCGACCGACTATCTAA
Protein sequence:
MCPLWPLHLIVLYLLVLGPMWVLCQAAPNLMATYLPTEVEEVHMGDTYYVDVNVTGVGLR
PGARLQVNVRDEHVADTKWNSSYQVTENDVSEGKFKGRLRIIGNFLGRTILSLESHGVGD
TIEPVNGTLAVTVTRPQRVIDTIFTTSIAIFISIVFINFGCAMHWDEVKGVVRRPVGPII
GLCGQFVFMPLISFGLGYLIFPSSPSLHLGMFCTGVAPGGGASNIWTFILGGNLDLSLTM
TSISTLAAFGFMPLWLFTLGQVVFANASIVVPYSRIAMFVVGLIVPLIIGLAMQKFTPRL
SAFMVRILKPFSSCILIFIIVFAIVTNLYIFELFSWQILLAGMGIPWLGYISGYLVAWLF
RQPHPDALAISIETGIQNTGIAIFLLRYALPQPEADITTVVPVACAIMTPIPMTAIFIYQ
KLSSCGICPTSKTEHNRRKMSTALRPSSPASNLPLMENKKEPKFLIIPWTHRAGAGSLVA
GRGASTVPGTCDLGVGLRGPYGTGGKRSSRPTI