New model in OGS2.0 | DPOGS216072  |
---|---|
Genomic Position | scaffold214:+ 2239-10169 |
See gene structure | |
CDS Length | 1539 |
Paired RNAseq reads   | 1044 |
Single RNAseq reads   | 3280 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA008871 (0.0) |
Best Drosophila hit   | CG9864 (6e-100) |
Best Human hit | sialin (6e-57) |
Best NR hit (blastp)   | AGAP010370-PA [Anopheles gambiae str. PEST] (5e-129) |
Best NR hit (blastx)   | AGAP010370-PA [Anopheles gambiae str. PEST] (2e-130) |
GeneOntology terms    | GO:0005316 high affinity inorganic phosphate:sodium symporter activity GO:0055085 transmembrane transport |
InterPro families    | IPR016196 Major facilitator superfamily, general substrate transporter IPR020846 Major facilitator superfamily IPR011701 Major facilitator superfamily MFS-1 |
Orthology group | MCL15292 |
Nucleotide sequence:
ATGGTCCCAGGGCACTTATTTAAGTGCCGGACGAAGAAATCCTTAAAGCTATATGTTATA
CCGGCGAGGTTAAACGTGGCGTTAATGATGTTCTTCGCCTGCTGGGTCAATTACATGCTG
CGCGTTAACATGAGTGTCAATATCATTGCCATGGTTCCTGATCGTGGTGAAACTAAGTCG
GTACAAAGCGAATGTGAGGCCATCACTAATGACGACACTGCATTACATAATGGTACGACA
GCGGTTACACGACAAGTCCAACAGCCCGATGGATCTATTACTTTTGATTGGACAGCGCAA
CAACAGGCATACGTGCTCTCCGGATACTTCTGGGGTTACGCAATCACTTGCCTCTTCAGT
GGTATAGCAGCGGAGAGATGGGGTCCAAGGAATGTAGTCTTCATATCCATGTTGATATCT
GCGGTCCTCACAATTCTCATTCCCCCAGCAGCGAAAGTACATTTCATGATGCTAGTAGCT
ACAAGATTCGTCATAGGTCTTGCTGCGGGCTTCCTTTTCCCGTCACTACACGCGCTTGTT
GCTCATTGGGCTCCTCCAGCAGAGAAGGGGAAGTTTGTGAGCGCTCTCCTAGGAGGGGCC
ATAGGAACCGTCGTGACCTGGTCGCTTAGTGGACCTCTTATTGAGAACTTTGGGTGGACT
TACGCATTTTATGTACCAGGTATTATAGCCATCGTTTGGTGTGCTGCGTGGTGGTTCCTT
GTATACGATTCCCCCGTCATACATCCACGTATTAGCGAAGAAGAAAAAACGTACATCCTA
AGTGCCATTGGAGACAAAGTGCAACAGAGTTCCAAGGAGCATAAAATTGTTCCACCGTTT
AAAGATATATTCACGTCGTTTCCGTTCCTCGCCATGGTTATCCTCCACTATGGTAACACA
TGGGGTATATACTTCGTAATGACAGCCGCACCAAAATACGTGTCAAGTGCTCTCGGATAC
AATTTGACTTCAACGGGCACTCTGTCATCACTACCTTACCTTGCGAGGATGATATTTTCA
TTAATATTCGGAGCTATTGGTGACAGAATCGTCAAACAGAACGTTGTATCCACGACGTTT
ATGAGGAAGTTCTTCTGCTTGTTTTCCCACGTGGTGCCGGGTCTGCTGCTCATCGGTCTG
GGCTACACGGGCTGTGCCCCCATCTTGTCAGTGGCTCTTATAACATTCTCCATGGGCTCC
AATGGCGCCGCCACACTCACTAACTTAGTGAACCACCAGGATCTGGCGCCAAACTTTGCC
GGCACCATTTACGGCATAGCCAATGGTATTGGTAACACAGCTGGTTTCATAACACCGCTT
GTGACTGCCTACTTCACCAAACATGGGAATGGTTTTGCGGAATGGCGGCCAGTTTTCCTC
ACGGGAGCCTCAATATACATTGCCGCAGCAGTTTACTTCATTCTCTTCGGCACCGGTGAA
ACACAATCGTGGAATTACGTCGCCCCGGCGGAAGACGATAGGGACAAGAGGCCCAATAAC
AGCGAAGATACCACCGTCAACATACCAGTTAAAACATAA
Protein sequence:
MVPGHLFKCRTKKSLKLYVIPARLNVALMMFFACWVNYMLRVNMSVNIIAMVPDRGETKS
VQSECEAITNDDTALHNGTTAVTRQVQQPDGSITFDWTAQQQAYVLSGYFWGYAITCLFS
GIAAERWGPRNVVFISMLISAVLTILIPPAAKVHFMMLVATRFVIGLAAGFLFPSLHALV
AHWAPPAEKGKFVSALLGGAIGTVVTWSLSGPLIENFGWTYAFYVPGIIAIVWCAAWWFL
VYDSPVIHPRISEEEKTYILSAIGDKVQQSSKEHKIVPPFKDIFTSFPFLAMVILHYGNT
WGIYFVMTAAPKYVSSALGYNLTSTGTLSSLPYLARMIFSLIFGAIGDRIVKQNVVSTTF
MRKFFCLFSHVVPGLLLIGLGYTGCAPILSVALITFSMGSNGAATLTNLVNHQDLAPNFA
GTIYGIANGIGNTAGFITPLVTAYFTKHGNGFAEWRPVFLTGASIYIAAAVYFILFGTGE
TQSWNYVAPAEDDRDKRPNNSEDTTVNIPVKT