New model in OGS2.0 | DPOGS208849  |
---|---|
Genomic Position | scaffold2789:+ 712-15235 |
CDS Length | 2370 |
Paired RNAseq reads   | 1855 |
Single RNAseq reads   | 4421 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA011390 (0.0) |
Best Drosophila hit   | hoepel1, isoform A (1e-165) |
Best Human hit | P protein (1e-105) |
Best NR hit (blastp)   | AGAP009284-PA [Anopheles gambiae str. PEST] (0.0) |
Best NR hit (blastx)   | AGAP009284-PA [Anopheles gambiae str. PEST] (5e-176) |
GeneOntology terms    | GO:0005215 transporter activity; GO:0006810 transport; GO:0016020 membrane; GO:0005302 L-tyrosine transmembrane transporter activity; GO:0015105 arsenite transmembrane transporter activity; GO:0015746 citrate transport; GO:0016021 integral to membrane; GO:0055085 transmembrane transport; GO:0015137 citrate transmembrane transporter activity |
InterPro families    | IPR004680 Divalent ion symporter; IPR000802 Arsenical pump membrane protein |
Orthology group | MCL10581 |
Nucleotide sequence:
ATGAATAAAGATAAGAGCAACAGTGTTTATTCTATGGTGTCATTCAGTGACCTGACGCCA
GGAAGTTTGGATGTGTGGGTGGACCTGCCGGACGCCATAAAATATGATCCGACCCTGGCT
CCCTTCAAGCAGATGTACGAACAGAAACATGGCAAAGATCTGTCTAACGTCGAGGTCGAA
GTACCAGCAGGTGATGACATAAACAAAAATAACAAAATAGCAGAAGAAAATTTAGTTTGC
GAAAAGCAAATACGTGATAAAGATGTTGAGAACAGATTGGTGGAAGATATTAGTCCTGAT
GACGAAAAAACATTACCGAAACGTAAGAAACAGCTGGACCGCACGCTACGAGTAATAAAA
CTGTCAGTACTGGTGGCCGGCTGGGTGATGCTAACTGTTTCTTTGCTGATGAACAGAGAG
AAGACCGACATTATTCTCCATACAGCTGTAAATGCTGGAGAAATCAAAGAATATTTTTTG
GGGTCGTCGAGTGAGGAGTTTAGGGTCGCTATTTCATTGACCGGTCCCTTCACTGACTCT
TCCACCAACGCGACCTCGTCACTCCAGCTCTGGCTGCACAAAACATCCAAATACAAAGAA
GATGAACAGGATTCCCCAGCGTGGAGTATTAATCTTCAGCCAGATGACGTCATAGACTTC
TCTCCCAGCGCATCCGAGGATAATGTCCTCATGATAGATAAGCAAACCTTCATAAACAAT
GAGACGTACGAAGAAAAAAATGCCTCCGAACACGGAGTGAACGAATCCAGAATCTTTCTA
TGTCTCAATAGCAGCAGTAGTCAAGCCGTTCCTCTTACCATCAGTCTTCACGGAAAACCA
CTGTCTGAAACCGAAGGACTTATATACGCGAGCGTCCTCTTAGCCACGCTGTATATTCTT
ATAATATTTGAGATAGTTAACCGAACGCTAGCAGCCCTATTGTCGTCTTCCTTGGGTGTG
GCAACCCTAGCGCTGGTCGGGGAACGTCCTTCCCTCCCCGAGCTGATCTCGTGGCTGGAT
GTGGAAACACTCTTGCTGCTCTTCAGCATGATGATACTCGTCGCCATAATCGCGGAAACC
GGATTGTTTGATTTCCTCGCTGTTAAGGCTTTCGAGATAACGGCGGGCAGGACCTGGCCT
TTGATTAACTGCCTCTGTTTCTTTACCGCATTCTTTTCAACATTCCTCGACAACGTAACC
ACAGTCCTTCTGATGACGCCAGTCACTATACGGTTATGCGAGGTGATGCAGCTGAATCCG
GTTCCAGTTCTGATGTCCATGGTCATTTTTAGCAATGTAGGCGGCGCGGCCACGCCTGTT
GGAGATCCTCCAAATGTGATCATAGCCAGTCACCCCTCCATACTCGCTGTGAACATAAAC
TTCACGTCTTTCACCCTCCACATGGGTCTGGGTATACTCCTGGTGTGCATACAGACATAC
GTACAGCTGAGGTTCATGTTCAGGGACATGAACAGTCTAAGACACTGCGTGCCACGCGAT
ATACTTGAATTGCGTCAAGAAATCAGCGTGTGGAAGCGCGCGGCCGCGTCATTATCATCT
TACTCGAGAGACGAAGACATCGTCAGACGAGCGCTGGAGAAGAAGGTGCAGAGACTGAAG
TCGACCCTCGGAAGAAGGGAGGCTGGCGGAGGCAATGACAAACTCTTCTGTTCAACTCTC
GCTCATATGAAGGATAAGTATCGAATAAGGGACAAGGCGTTGCTAGTGAAGAGTGGTGTG
TGTATTAGCTTCGTCGTTCTGGTCTTCTTCCTCCACGCTGTGCCTGAGCTACAGAGTTTG
TCTCTGGGCTGGACGGCCTTGCTGGGAGCCCTGCTACTTCTGCTGCTGTCTGAGCGCGAA
GACCTGGAACCTGTGCTGGCTAGAGTTGAATGGTCCACACTGCTGTTCTTTGCAGCTCTA
TTTGTGATGATGGAGGTGTTATCGAAGTTAGGTCTCATAGCGTGGATAGGAAGGATGACT
GAAACTGTGATATCCCAAGTCGGCGAGGACTCTAGACTGGCTGTGGCTGTCATGCTGATA
CTTTGGGTGTCAGGTCTGGCCTCGGCCTTCGTAGATAACATCCCGTTGACGACGATGATG
GTGCGCGTGGTGGCAGCCCTGGCAGACGGCGCGCTCGCCCTACCGCTGCCACCCCTGGCC
TGGGCACTCAGCTTCGGAGCTTGTTTAGGAGGTAACGGTACGTTGATTGGCGCCAGCGCC
AACGTGGTGTGCGCCCCGCAGCGACTTGTCGTGCCGATCCGGTTCACCTTCATACAGTTT
CTTAGAATAGGTTTCCCCATCATGATAGGTAACCTCCTGGTAGCGTCTGGCTATCTCCTG
CTCTGCCACTGTCTGTTCACCTGGCACTGA
Protein sequence:
MNKDKSNSVYSMVSFSDLTPGSLDVWVDLPDAIKYDPTLAPFKQMYEQKHGKDLSNVEVE
VPAGDDINKNNKIAEENLVCEKQIRDKDVENRLVEDISPDDEKTLPKRKKQLDRTLRVIK
LSVLVAGWVMLTVSLLMNREKTDIILHTAVNAGEIKEYFLGSSSEEFRVAISLTGPFTDS
STNATSSLQLWLHKTSKYKEDEQDSPAWSINLQPDDVIDFSPSASEDNVLMIDKQTFINN
ETYEEKNASEHGVNESRIFLCLNSSSSQAVPLTISLHGKPLSETEGLIYASVLLATLYIL
IIFEIVNRTLAALLSSSLGVATLALVGERPSLPELISWLDVETLLLLFSMMILVAIIAET
GLFDFLAVKAFEITAGRTWPLINCLCFFTAFFSTFLDNVTTVLLMTPVTIRLCEVMQLNP
VPVLMSMVIFSNVGGAATPVGDPPNVIIASHPSILAVNINFTSFTLHMGLGILLVCIQTY
VQLRFMFRDMNSLRHCVPRDILELRQEISVWKRAAASLSSYSRDEDIVRRALEKKVQRLK
STLGRREAGGGNDKLFCSTLAHMKDKYRIRDKALLVKSGVCISFVVLVFFLHAVPELQSL
SLGWTALLGALLLLLLSEREDLEPVLARVEWSTLLFFAALFVMMEVLSKLGLIAWIGRMT
ETVISQVGEDSRLAVAVMLILWVSGLASAFVDNIPLTTMMVRVVAALADGALALPLPPLA
WALSFGACLGGNGTLIGASANVVCAPQRLVVPIRFTFIQFLRIGFPIMIGNLLVASGYLL
LCHCLFTWH
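
The CDS length of 2370 nt corresponds to 789 codons plus one stop codon, which matches the 789-residue protein sequence above. As a minimal sketch of how that consistency could be checked, the snippet below translates a CDS using the standard genetic code. The function name `translate` and the variable names are hypothetical and are not part of the record; the sketch also assumes that the record uses the standard nuclear code and that the sequence contains only unambiguous bases (A, C, G, T).

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; stop codons are rendered as '*'.

    Whitespace and line breaks are removed first, so the sequence can be
    pasted directly from the 60-column layout used in the record.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGAATAAAGATAAGAGCAACAGTGTTTATTCTATGGTGTCATTCAGTGACCTGACGCCA"
last_line = "CTCTGCCACTGTCTGTTCACCTGGCACTGA"

print(translate(first_line))  # MNKDKSNSVYSMVSFSDLTP
print(translate(last_line))   # LCHCLFTWH*

# Length check: 789 residues plus one stop codon account for 2370 nt.
print(3 * (789 + 1) == 2370)  # True
```

Passing the full nucleotide sequence to `translate` and dropping the trailing `*` would allow a comparison against the whole protein sequence rather than only its first and last lines.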