New model in OGS2.0 | DPOGS204659  |
---|---|
Genomic Position | scaffold1203:+ 7439-14628 |
See gene structure | |
CDS Length | 1395 |
Paired RNAseq reads   | 524 |
Single RNAseq reads   | 1456 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA007462 (5e-164) |
Best Drosophila hit   | CG4726 (4e-127) |
Best Human hit | sialin (2e-68) |
Best NR hit (blastp)   | AGAP009649-PA [Anopheles gambiae str. PEST] (2e-172) |
Best NR hit (blastx)   | AGAP009649-PA [Anopheles gambiae str. PEST] (3e-158) |
GeneOntology terms    | GO:0005316 high affinity inorganic phosphate:sodium symporter activity GO:0055085 transmembrane transport |
InterPro families    | IPR016196 Major facilitator superfamily, general substrate transporter IPR011701 Major facilitator superfamily MFS-1 IPR020846 Major facilitator superfamily |
Orthology group | MCL16880 |
Nucleotide sequence:
ATGGACGTACCAGCAAAAGGAAATATATTGGGGCGTTTTGTACCAGCCCGGTATATCCTC
GCGATCCTGGGCTCTTTAGGGATGGCCATAGTTTACGGGCTCAAGGTCAATCTGTCGGTG
GCTATGGTGGGCATGTTGAATCATACCGCGATCAAATCTATGGAACACCATAACACGGAA
TTCAATTCTACCGTCTCTGATGTCGAATGCCTGCCGGCTAAGAATGACACACATGGAGAG
GAAGCCGACGGTCCATTTACTTGGTCGTCTGAAGTTCAGGGTATTGTTCTCAGTTGCTAC
TTCTGGGGCTACTTCATTTCTCAAATTCCTGGTGGTCGCATAGCGGAGTTATTTTCCGCT
AAATGGGTGATGTTCTTCAGTGTTGCAATCAACGTCGTGTGCACGCTGCTGACTCCTGTC
ATGGCAGAGTTGCACTACCTGGCAGCCGTGGTGATGAGAGTGGGCGAGGGTATCGGAGGG
GGCGTGACGTTCCCTGCGATGCACGTGTTGCTGTCTCGCTGGGCGCCGCCCGCTGAGCGG
TCGTTGCTGTCGGCTCTGGTCTACGCTGGCACGAGCCTGGGCACCGTCGCGTCTATGCTG
CTCGCCGGTTTACTCACCGCAACCGCTGGTTGGGAGAGCGTATTCTACGTGATGGGCGGT
CTGTCCGTACTGTGGTGCGGTTTGTGGGTGACGTTAGTGGCGGACGATCCCAGAACACAG
AGACTCATCAGTTTAGAGGAGAGAGAGATGATTGTTAACTCTCTGGGGAGGAAAACTGCC
AGCGCTGAACGAAAGAAGCTGCCTGTACCGTGGAGGTCAGTCGTGACATCAGGTCCGTTC
CTCTCCATCCTGGTGTCCCACACGTGTTCCAACTGGGGCTGGTACATGCTGCTCATTGAA
CTGCCGTTTTATATGAAGCAGATATTGATCAATAATTATTTTTTGTATTATTATCTGTCA
CAGAACGCTGTAACCACAGCTCTGCCGTTTCTCTCGCTGTGGTTCTTCAGTATGGCGCTG
AGCAGGACATTGGACTGGTTGCGGGCTAAAGGCAGTATTACAACAACCACTGCTAGGAAG
ATAGGGACTTTGTTTGCATCAGCGGTGCCAGCTGTATGTTTGTTCTGTCTCTGTTTCGTT
GGTTGTAACCGGTCCTTGGCGGTGGCACTCACAGCGGTCGGCGTCACCTCAATCGGTGGA
ATGTTCTGTGGATTTCTCTCCAACCATATCGACATCGCCCCTAACTTCGCCGGTACGCTA
ATGGCAATAACGAACACGGTCGCAACGATCCCCGGTATAGTCGTGCCAGTTTTCGTCGGT
GTTTTGACACATGGGAACGTAAGTGACACCATGATGAACAAAAAACCCAAGTTTAATCTT
ATAAGTTATATTTGA
Protein sequence:
MDVPAKGNILGRFVPARYILAILGSLGMAIVYGLKVNLSVAMVGMLNHTAIKSMEHHNTE
FNSTVSDVECLPAKNDTHGEEADGPFTWSSEVQGIVLSCYFWGYFISQIPGGRIAELFSA
KWVMFFSVAINVVCTLLTPVMAELHYLAAVVMRVGEGIGGGVTFPAMHVLLSRWAPPAER
SLLSALVYAGTSLGTVASMLLAGLLTATAGWESVFYVMGGLSVLWCGLWVTLVADDPRTQ
RLISLEEREMIVNSLGRKTASAERKKLPVPWRSVVTSGPFLSILVSHTCSNWGWYMLLIE
LPFYMKQILINNYFLYYYLSQNAVTTALPFLSLWFFSMALSRTLDWLRAKGSITTTTARK
IGTLFASAVPAVCLFCLCFVGCNRSLAVALTAVGVTSIGGMFCGFLSNHIDIAPNFAGTL
MAITNTVATIPGIVVPVFVGVLTHGNVSDTMMNKKPKFNLISYI