DPGLEAN18725 in OGS1.0

New model in OGS2.0DPOGS208849 
Genomic Positionscaffold2789:+ 712-15235
See gene structure
CDS Length2370
Paired RNAseq reads  1855
Single RNAseq reads  4421
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011390 (0.0)
Best Drosophila hit  hoepel1, isoform A (1e-165)
Best Human hitP protein (1e-105)
Best NR hit (blastp)  AGAP009284-PA [Anopheles gambiae str. PEST] (0.0)
Best NR hit (blastx)  AGAP009284-PA [Anopheles gambiae str. PEST] (5e-176)
GeneOntology terms







  
GO:0005215 transporter activity
GO:0006810 transport
GO:0016020 membrane
GO:0005302 L-tyrosine transmembrane transporter activity
GO:0015105 arsenite transmembrane transporter activity
GO:0015746 citrate transport
GO:0016021 integral to membrane
GO:0055085 transmembrane transport
GO:0015137 citrate transmembrane transporter activity
InterPro families
  
IPR004680 Divalent ion symporter
IPR000802 Arsenical pump membrane protein
Orthology groupMCL10581

Nucleotide sequence:

ATGAATAAAGATAAGAGCAACAGTGTTTATTCTATGGTGTCATTCAGTGACCTGACGCCA
GGAAGTTTGGATGTGTGGGTGGACCTGCCGGACGCCATAAAATATGATCCGACCCTGGCT
CCCTTCAAGCAGATGTACGAACAGAAACATGGCAAAGATCTGTCTAACGTCGAGGTCGAA
GTACCAGCAGGTGATGACATAAACAAAAATAACAAAATAGCAGAAGAAAATTTAGTTTGC
GAAAAGCAAATACGTGATAAAGATGTTGAGAACAGATTGGTGGAAGATATTAGTCCTGAT
GACGAAAAAACATTACCGAAACGTAAGAAACAGCTGGACCGCACGCTACGAGTAATAAAA
CTGTCAGTACTGGTGGCCGGCTGGGTGATGCTAACTGTTTCTTTGCTGATGAACAGAGAG
AAGACCGACATTATTCTCCATACAGCTGTAAATGCTGGAGAAATCAAAGAATATTTTTTG
GGGTCGTCGAGTGAGGAGTTTAGGGTCGCTATTTCATTGACCGGTCCCTTCACTGACTCT
TCCACCAACGCGACCTCGTCACTCCAGCTCTGGCTGCACAAAACATCCAAATACAAAGAA
GATGAACAGGATTCCCCAGCGTGGAGTATTAATCTTCAGCCAGATGACGTCATAGACTTC
TCTCCCAGCGCATCCGAGGATAATGTCCTCATGATAGATAAGCAAACCTTCATAAACAAT
GAGACGTACGAAGAAAAAAATGCCTCCGAACACGGAGTGAACGAATCCAGAATCTTTCTA
TGTCTCAATAGCAGCAGTAGTCAAGCCGTTCCTCTTACCATCAGTCTTCACGGAAAACCA
CTGTCTGAAACCGAAGGACTTATATACGCGAGCGTCCTCTTAGCCACGCTGTATATTCTT
ATAATATTTGAGATAGTTAACCGAACGCTAGCAGCCCTATTGTCGTCTTCCTTGGGTGTG
GCAACCCTAGCGCTGGTCGGGGAACGTCCTTCCCTCCCCGAGCTGATCTCGTGGCTGGAT
GTGGAAACACTCTTGCTGCTCTTCAGCATGATGATACTCGTCGCCATAATCGCGGAAACC
GGATTGTTTGATTTCCTCGCTGTTAAGGCTTTCGAGATAACGGCGGGCAGGACCTGGCCT
TTGATTAACTGCCTCTGTTTCTTTACCGCATTCTTTTCAACATTCCTCGACAACGTAACC
ACAGTCCTTCTGATGACGCCAGTCACTATACGGTTATGCGAGGTGATGCAGCTGAATCCG
GTTCCAGTTCTGATGTCCATGGTCATTTTTAGCAATGTAGGCGGCGCGGCCACGCCTGTT
GGAGATCCTCCAAATGTGATCATAGCCAGTCACCCCTCCATACTCGCTGTGAACATAAAC
TTCACGTCTTTCACCCTCCACATGGGTCTGGGTATACTCCTGGTGTGCATACAGACATAC
GTACAGCTGAGGTTCATGTTCAGGGACATGAACAGTCTAAGACACTGCGTGCCACGCGAT
ATACTTGAATTGCGTCAAGAAATCAGCGTGTGGAAGCGCGCGGCCGCGTCATTATCATCT
TACTCGAGAGACGAAGACATCGTCAGACGAGCGCTGGAGAAGAAGGTGCAGAGACTGAAG
TCGACCCTCGGAAGAAGGGAGGCTGGCGGAGGCAATGACAAACTCTTCTGTTCAACTCTC
GCTCATATGAAGGATAAGTATCGAATAAGGGACAAGGCGTTGCTAGTGAAGAGTGGTGTG
TGTATTAGCTTCGTCGTTCTGGTCTTCTTCCTCCACGCTGTGCCTGAGCTACAGAGTTTG
TCTCTGGGCTGGACGGCCTTGCTGGGAGCCCTGCTACTTCTGCTGCTGTCTGAGCGCGAA
GACCTGGAACCTGTGCTGGCTAGAGTTGAATGGTCCACACTGCTGTTCTTTGCAGCTCTA
TTTGTGATGATGGAGGTGTTATCGAAGTTAGGTCTCATAGCGTGGATAGGAAGGATGACT
GAAACTGTGATATCCCAAGTCGGCGAGGACTCTAGACTGGCTGTGGCTGTCATGCTGATA
CTTTGGGTGTCAGGTCTGGCCTCGGCCTTCGTAGATAACATCCCGTTGACGACGATGATG
GTGCGCGTGGTGGCAGCCCTGGCAGACGGCGCGCTCGCCCTACCGCTGCCACCCCTGGCC
TGGGCACTCAGCTTCGGAGCTTGTTTAGGAGGTAACGGTACGTTGATTGGCGCCAGCGCC
AACGTGGTGTGCGCCCCGCAGCGACTTGTCGTGCCGATCCGGTTCACCTTCATACAGTTT
CTTAGAATAGGTTTCCCCATCATGATAGGTAACCTCCTGGTAGCGTCTGGCTATCTCCTG
CTCTGCCACTGTCTGTTCACCTGGCACTGA

Protein sequence:

MNKDKSNSVYSMVSFSDLTPGSLDVWVDLPDAIKYDPTLAPFKQMYEQKHGKDLSNVEVE
VPAGDDINKNNKIAEENLVCEKQIRDKDVENRLVEDISPDDEKTLPKRKKQLDRTLRVIK
LSVLVAGWVMLTVSLLMNREKTDIILHTAVNAGEIKEYFLGSSSEEFRVAISLTGPFTDS
STNATSSLQLWLHKTSKYKEDEQDSPAWSINLQPDDVIDFSPSASEDNVLMIDKQTFINN
ETYEEKNASEHGVNESRIFLCLNSSSSQAVPLTISLHGKPLSETEGLIYASVLLATLYIL
IIFEIVNRTLAALLSSSLGVATLALVGERPSLPELISWLDVETLLLLFSMMILVAIIAET
GLFDFLAVKAFEITAGRTWPLINCLCFFTAFFSTFLDNVTTVLLMTPVTIRLCEVMQLNP
VPVLMSMVIFSNVGGAATPVGDPPNVIIASHPSILAVNINFTSFTLHMGLGILLVCIQTY
VQLRFMFRDMNSLRHCVPRDILELRQEISVWKRAAASLSSYSRDEDIVRRALEKKVQRLK
STLGRREAGGGNDKLFCSTLAHMKDKYRIRDKALLVKSGVCISFVVLVFFLHAVPELQSL
SLGWTALLGALLLLLLSEREDLEPVLARVEWSTLLFFAALFVMMEVLSKLGLIAWIGRMT
ETVISQVGEDSRLAVAVMLILWVSGLASAFVDNIPLTTMMVRVVAALADGALALPLPPLA
WALSFGACLGGNGTLIGASANVVCAPQRLVVPIRFTFIQFLRIGFPIMIGNLLVASGYLL
LCHCLFTWH