DPGLEAN14123 in OGS1.0

New model in OGS2.0DPOGS216072 
Genomic Positionscaffold214:+ 2239-10169
See gene structure
CDS Length1539
Paired RNAseq reads  1044
Single RNAseq reads  3280
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008871 (0.0)
Best Drosophila hit  CG9864 (6e-100)
Best Human hitsialin (6e-57)
Best NR hit (blastp)  AGAP010370-PA [Anopheles gambiae str. PEST] (5e-129)
Best NR hit (blastx)  AGAP010370-PA [Anopheles gambiae str. PEST] (2e-130)
GeneOntology terms
  
GO:0005316 high affinity inorganic phosphate:sodium symporter activity
GO:0055085 transmembrane transport
InterPro families

  
IPR016196 Major facilitator superfamily, general substrate transporter
IPR020846 Major facilitator superfamily
IPR011701 Major facilitator superfamily MFS-1
Orthology groupMCL15292

Nucleotide sequence:

ATGGTCCCAGGGCACTTATTTAAGTGCCGGACGAAGAAATCCTTAAAGCTATATGTTATA
CCGGCGAGGTTAAACGTGGCGTTAATGATGTTCTTCGCCTGCTGGGTCAATTACATGCTG
CGCGTTAACATGAGTGTCAATATCATTGCCATGGTTCCTGATCGTGGTGAAACTAAGTCG
GTACAAAGCGAATGTGAGGCCATCACTAATGACGACACTGCATTACATAATGGTACGACA
GCGGTTACACGACAAGTCCAACAGCCCGATGGATCTATTACTTTTGATTGGACAGCGCAA
CAACAGGCATACGTGCTCTCCGGATACTTCTGGGGTTACGCAATCACTTGCCTCTTCAGT
GGTATAGCAGCGGAGAGATGGGGTCCAAGGAATGTAGTCTTCATATCCATGTTGATATCT
GCGGTCCTCACAATTCTCATTCCCCCAGCAGCGAAAGTACATTTCATGATGCTAGTAGCT
ACAAGATTCGTCATAGGTCTTGCTGCGGGCTTCCTTTTCCCGTCACTACACGCGCTTGTT
GCTCATTGGGCTCCTCCAGCAGAGAAGGGGAAGTTTGTGAGCGCTCTCCTAGGAGGGGCC
ATAGGAACCGTCGTGACCTGGTCGCTTAGTGGACCTCTTATTGAGAACTTTGGGTGGACT
TACGCATTTTATGTACCAGGTATTATAGCCATCGTTTGGTGTGCTGCGTGGTGGTTCCTT
GTATACGATTCCCCCGTCATACATCCACGTATTAGCGAAGAAGAAAAAACGTACATCCTA
AGTGCCATTGGAGACAAAGTGCAACAGAGTTCCAAGGAGCATAAAATTGTTCCACCGTTT
AAAGATATATTCACGTCGTTTCCGTTCCTCGCCATGGTTATCCTCCACTATGGTAACACA
TGGGGTATATACTTCGTAATGACAGCCGCACCAAAATACGTGTCAAGTGCTCTCGGATAC
AATTTGACTTCAACGGGCACTCTGTCATCACTACCTTACCTTGCGAGGATGATATTTTCA
TTAATATTCGGAGCTATTGGTGACAGAATCGTCAAACAGAACGTTGTATCCACGACGTTT
ATGAGGAAGTTCTTCTGCTTGTTTTCCCACGTGGTGCCGGGTCTGCTGCTCATCGGTCTG
GGCTACACGGGCTGTGCCCCCATCTTGTCAGTGGCTCTTATAACATTCTCCATGGGCTCC
AATGGCGCCGCCACACTCACTAACTTAGTGAACCACCAGGATCTGGCGCCAAACTTTGCC
GGCACCATTTACGGCATAGCCAATGGTATTGGTAACACAGCTGGTTTCATAACACCGCTT
GTGACTGCCTACTTCACCAAACATGGGAATGGTTTTGCGGAATGGCGGCCAGTTTTCCTC
ACGGGAGCCTCAATATACATTGCCGCAGCAGTTTACTTCATTCTCTTCGGCACCGGTGAA
ACACAATCGTGGAATTACGTCGCCCCGGCGGAAGACGATAGGGACAAGAGGCCCAATAAC
AGCGAAGATACCACCGTCAACATACCAGTTAAAACATAA

Protein sequence:

MVPGHLFKCRTKKSLKLYVIPARLNVALMMFFACWVNYMLRVNMSVNIIAMVPDRGETKS
VQSECEAITNDDTALHNGTTAVTRQVQQPDGSITFDWTAQQQAYVLSGYFWGYAITCLFS
GIAAERWGPRNVVFISMLISAVLTILIPPAAKVHFMMLVATRFVIGLAAGFLFPSLHALV
AHWAPPAEKGKFVSALLGGAIGTVVTWSLSGPLIENFGWTYAFYVPGIIAIVWCAAWWFL
VYDSPVIHPRISEEEKTYILSAIGDKVQQSSKEHKIVPPFKDIFTSFPFLAMVILHYGNT
WGIYFVMTAAPKYVSSALGYNLTSTGTLSSLPYLARMIFSLIFGAIGDRIVKQNVVSTTF
MRKFFCLFSHVVPGLLLIGLGYTGCAPILSVALITFSMGSNGAATLTNLVNHQDLAPNFA
GTIYGIANGIGNTAGFITPLVTAYFTKHGNGFAEWRPVFLTGASIYIAAAVYFILFGTGE
TQSWNYVAPAEDDRDKRPNNSEDTTVNIPVKT