New model in OGS2.0 | DPOGS200084  |
---|---|
Genomic Position | scaffold39:- 91096-116601 |
See gene structure | |
CDS Length | 3546 |
Paired RNAseq reads   | 2769 |
Single RNAseq reads   | 7090 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA002074 (0.0) |
Best Drosophila hit   | CG8177, isoform A (0.0) |
Best Human hit | anion exchange protein 2 isoform 2 (0.0) |
Best NR hit (blastp)   | SLC4-like anion exchanger [Aedes aegypti] (0.0) |
Best NR hit (blastx)   | AGAP006115-PB [Anopheles gambiae str. PEST] (0.0) |
GeneOntology terms    | GO:0015380 anion exchanger activity GO:0005452 inorganic anion exchanger activity GO:0006820 anion transport GO:0016021 integral to membrane |
InterPro families    | IPR003020 Bicarbonate transporter, eukaryotic IPR016152 Phosphotransferase/anion transporter IPR013769 Bicarbonate transporter, cytoplasmic IPR018241 Anion exchange, conserved site IPR011531 Bicarbonate transporter, C-terminal |
Orthology group | MCL10110 |
Nucleotide sequence:
ATGAACAGCGGCAAAAGAAAACTATCGTTCCTCGGGTTTGCTTCTCAGGGAGAGGATCAA
GACGAGCTGCGTCTTGATGAGGAGATGGAGCGAGTCGTTTGGCACGAGAATAACCCGCCA
ACTCGGCTACAGCCCAGCGGTGAGCCCGTTAGAGATGCAGCAGCCCCGCCACCTCCTTAC
TCGCCAGATGATATTATCAGGACAAGCAGTTCTGAGCAAACCCGTGCCAGTAGGGAACAA
CTTAATTCCAGTTCGGAAGGACAAATGTCAGGACAAATTTCTGGCTCTGGTTCAGGCAGT
ACGCCACGTGGAGAAGATCGCCATGTTCAGTTCGACAGTGAACGTGCCAGGGCTGATCCA
CTAGATGATGCAAGCGAAGAACGTCGTCATCAAAGAAACACTCGACACTTGCATCACAAA
TCACGAAAATATTCCCTACAAGAGGGGGCGAGAGGTGGTGGGGCAGATGGCGAGAGACAA
GTGCCTGCGGCATCAACGGATGAACCATTACCCGAAGCTGATCTCGATGAACTCCGGAGC
CATCGAATTGATGACCAGCGGGTTTTGCGGAGACTTAAACTTCAGCCCAGGAGTCCAACC
ATTCATGTAGGGCGCAAGGATGGAGGCGATAAAATACAAAACATTTTTTCTGACTTAACT
CTGAAAAAAATGTACGACCACAGTCCACATTCGGTGTTTGTTCAACTGGACGAATTACTA
GCCACAGAAGATGGTGATACGGAGTGGAAGGAAACTGCACGTTGGATTAAATATGAAGAA
GATGTTGAAGAAGGATCTGCCAGATGGGGTCGGCCCCACGTAGCCTCTCTTTCTTTCCAT
TCCCTATTAAACCTACGCCGTTGTTTAGAAACTGGAGTGGTACTGCTTGACCTCGATGAA
AAAGACCTTCCTGGTGTTGCATACAGGGTTGTAGAAAGTATGGTTAATGAAGGATTGATA
GAAGAAGATGACAAACCAGTCGTAATGAGATCTCTACTTCTTCGCCATCGACATGTACAT
GATGAAAGGTTCCGATTCTCCATAAGTGGTCGAAAGCACTCTTCCTATACAAGCCTACAG
TCGCTGTGGTTGGAGGAAGGTGGTGGCGCCCGCCAGCGATACTCCACATGCTCTGCCATC
GGTCCCTGTCGTCGACACAGCTCTCATATTCTCAATCTTTCGGATAACAAGCGACGAAGA
AGCTCCAATGCTCTACCACAAGATCGAACAGAAGCTCGAGCAAAAACATCAGTGGCGGGC
ATGGATACACGCGAAGTAGAATATTTAGCCACAGCCCCCGTGGGGTCTCAGGATGAATTA
AGACGGGGTCACAATGATTCAATCATGAAACGTATACCTGACGATGCGGAAGCCACAACA
GTCCTCGTTGGTGCAGTTGGATTTTTAGATCAACCAACGATTGCCTTTGTACGTCTCGCT
CAAGGCATATTAATGCCATCCATCACAGAGGTTCCCATACCAGTCCGCTTTATGTTCATA
TTACTTGGGCCAACATCAGCTGACCTTGACTATCATGAAGTGGGTAGATCCATTTCTACT
CTTATGTCAAACCCTTCCTTTCATTCTATTGCGTACAAGGCTGATGATCGACGTGAACTT
TTGTCGGCAATTAATGAATTCTTAGACGATTCGATAGTGTTACCGCCTGGTGATTGGGAG
CGGCAGGCTCTATTGCCTTTCGAAGAATTACGAGCTAAAAGTGAAATGATAAGAAAACGT
AAGCGTGATGCTTTGGAGCGTAAAAAGGGCATTGAAATTACAACAGCTTCGCCAATAGAT
GAAAAAAAGGCTTTGTTAGCTGGTGAAACTGGTGGATTGCCAGAAAAAGAACGTGATGAT
CCATTATCCAAGAGTGGTCGTCTCTTTGGTGGTGTTATAAGAGACATAAAAAGGCGTTAT
CCCCACTATATATCCGACTTTCGTGATGCATTAAATGGACAATGTGCGGCAGCTACAATA
TTCATGTACTTTGCTGCGCTTTCATCAGCCATTACTTTTGGAGGACTGTTAGCTGAAAAA
ACTGACAGACAGATTGGTATCTCGGAAACATTGGTATTTACTTGCGTAGGTGGATTATTT
TTCGCCCTAGTAGCAGGTCAACCAATGATGATTACTGGCGCTACTGGACCTTTGCTGCTT
CTCGACGAATCGCTTTTTGTATTTTGCCGCTCCTACGGTTTTGATTTTTTGGCCGCTAGA
ATGTACTGTGGTTTATGGATGATAGTGATTGCTTTGTGTGTTGCCTCTGTTGAAGGTAGT
GTCGCCGTAAAGAAAATTACGAGATTTACTGAAGACATCTTCGCATTTTTGATATCGCTT
ATTTTCATATCTGAGCCTGTGACGAATATAATAAATGTTTACCGTGCTCACCCGCTCGGT
TATGACTACTGCGGCAATTACACACTTGAAAATTCCACTGCTGGCGTTGATACGGTTAAC
TCAAATTTTACAGGAAACCTAACAGTTCCTCCAGTTTTACCGCCTACAAATATGTTACTT
ACACCGAAACCAAATACAGCTTTGTTTTGTACAATGTTGACTCTTTGTACCTTTATTCTT
GCTTACTATCTCCGCATATTCCGCAACGGAAAATTTCTTGGTCGAAGTGCTCGACGTGCA
CTTGGTGATTTCGGAGTTCCGATTGCGATTGTTTTAATGGTTGGAATATCCTGCTTAGTA
CCCGTTTGGACTGAAAAATTACAAGTACCGGATGGTCTGAGCCCAACCTCAAATCGTTCT
TGGCTTGTGCCCCTTAATAAGGGACTTGAAACAATACCACTGTGGGCAACAATTGCTATG
GTTTTACCGGCGCTCATGGTTTACATCATCGTCTTTATGGAAACCCACATCGCAGAGTTG
ATTATTGACAAACCAGAGAGAAAACTGAAGAAAGGCAGTGGATTCCACATGGACATAGTC
GTCATGTCGTTAGTGAACTCGGTGTGTGGCATGTTTGGGGCTCCGTGGCAGTGTGTAGCC
ACAGTACGATCTGTGAGCCATGTTTCCGCATTAACTGTTATGTCAACAACTCATGCCCCC
GGTGACAAACCTTATATTGTTGAAGTTAAGGAACAACGTCTTACTGGATTACTAGTTGCT
TTTCTCGTTGGCATATCTGTTTTGGCTTCCGGCTGGCTAAGATTAGTTCCAATGGCTGTA
TTATTTGGAGTTTTCCTCTATATGGGAATTTCTGCCCTCGGAGGAATTCAGTTCTGGGAT
CGATGTATTTTACTATTAAAACCTGTGAAGCATCACCCGCAAATACCTTACGTGAGACGA
GTACCGACATTTAAAATGCATCTCTACACTCTTATCCAAATAGCTGGTGTATGTGTATTG
TATGCTGTGAAGTCTTCGAAGTTTTCCCTCGCGCTTCCCTTCTTCTTGGTACTCATGGTG
CCGCTGCGAATGGCAATCAGTTACATTTTTACCCCGCTACAACTGCGTGCGTTGGATGGA
TCCCAAAAAGATATTGACGTCGATGATGAGCCAGATTTCTATGAAGAAGCGCCTTTGCCC
GGATAG
Protein sequence:
MNSGKRKLSFLGFASQGEDQDELRLDEEMERVVWHENNPPTRLQPSGEPVRDAAAPPPPY
SPDDIIRTSSSEQTRASREQLNSSSEGQMSGQISGSGSGSTPRGEDRHVQFDSERARADP
LDDASEERRHQRNTRHLHHKSRKYSLQEGARGGGADGERQVPAASTDEPLPEADLDELRS
HRIDDQRVLRRLKLQPRSPTIHVGRKDGGDKIQNIFSDLTLKKMYDHSPHSVFVQLDELL
ATEDGDTEWKETARWIKYEEDVEEGSARWGRPHVASLSFHSLLNLRRCLETGVVLLDLDE
KDLPGVAYRVVESMVNEGLIEEDDKPVVMRSLLLRHRHVHDERFRFSISGRKHSSYTSLQ
SLWLEEGGGARQRYSTCSAIGPCRRHSSHILNLSDNKRRRSSNALPQDRTEARAKTSVAG
MDTREVEYLATAPVGSQDELRRGHNDSIMKRIPDDAEATTVLVGAVGFLDQPTIAFVRLA
QGILMPSITEVPIPVRFMFILLGPTSADLDYHEVGRSISTLMSNPSFHSIAYKADDRREL
LSAINEFLDDSIVLPPGDWERQALLPFEELRAKSEMIRKRKRDALERKKGIEITTASPID
EKKALLAGETGGLPEKERDDPLSKSGRLFGGVIRDIKRRYPHYISDFRDALNGQCAAATI
FMYFAALSSAITFGGLLAEKTDRQIGISETLVFTCVGGLFFALVAGQPMMITGATGPLLL
LDESLFVFCRSYGFDFLAARMYCGLWMIVIALCVASVEGSVAVKKITRFTEDIFAFLISL
IFISEPVTNIINVYRAHPLGYDYCGNYTLENSTAGVDTVNSNFTGNLTVPPVLPPTNMLL
TPKPNTALFCTMLTLCTFILAYYLRIFRNGKFLGRSARRALGDFGVPIAIVLMVGISCLV
PVWTEKLQVPDGLSPTSNRSWLVPLNKGLETIPLWATIAMVLPALMVYIIVFMETHIAEL
IIDKPERKLKKGSGFHMDIVVMSLVNSVCGMFGAPWQCVATVRSVSHVSALTVMSTTHAP
GDKPYIVEVKEQRLTGLLVAFLVGISVLASGWLRLVPMAVLFGVFLYMGISALGGIQFWD
RCILLLKPVKHHPQIPYVRRVPTFKMHLYTLIQIAGVCVLYAVKSSKFSLALPFFLVLMV
PLRMAISYIFTPLQLRALDGSQKDIDVDDEPDFYEEAPLPG