DPGLEAN17607 in OGS1.0

New model in OGS2.0DPOGS200084 
Genomic Positionscaffold39:- 91096-116601
See gene structure
CDS Length3546
Paired RNAseq reads  2769
Single RNAseq reads  7090
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA002074 (0.0)
Best Drosophila hit  CG8177, isoform A (0.0)
Best Human hitanion exchange protein 2 isoform 2 (0.0)
Best NR hit (blastp)  SLC4-like anion exchanger [Aedes aegypti] (0.0)
Best NR hit (blastx)  AGAP006115-PB [Anopheles gambiae str. PEST] (0.0)
GeneOntology terms


  
GO:0015380 anion exchanger activity
GO:0005452 inorganic anion exchanger activity
GO:0006820 anion transport
GO:0016021 integral to membrane
InterPro families



  
IPR003020 Bicarbonate transporter, eukaryotic
IPR016152 Phosphotransferase/anion transporter
IPR013769 Bicarbonate transporter, cytoplasmic
IPR018241 Anion exchange, conserved site
IPR011531 Bicarbonate transporter, C-terminal
Orthology groupMCL10110

Nucleotide sequence:

ATGAACAGCGGCAAAAGAAAACTATCGTTCCTCGGGTTTGCTTCTCAGGGAGAGGATCAA
GACGAGCTGCGTCTTGATGAGGAGATGGAGCGAGTCGTTTGGCACGAGAATAACCCGCCA
ACTCGGCTACAGCCCAGCGGTGAGCCCGTTAGAGATGCAGCAGCCCCGCCACCTCCTTAC
TCGCCAGATGATATTATCAGGACAAGCAGTTCTGAGCAAACCCGTGCCAGTAGGGAACAA
CTTAATTCCAGTTCGGAAGGACAAATGTCAGGACAAATTTCTGGCTCTGGTTCAGGCAGT
ACGCCACGTGGAGAAGATCGCCATGTTCAGTTCGACAGTGAACGTGCCAGGGCTGATCCA
CTAGATGATGCAAGCGAAGAACGTCGTCATCAAAGAAACACTCGACACTTGCATCACAAA
TCACGAAAATATTCCCTACAAGAGGGGGCGAGAGGTGGTGGGGCAGATGGCGAGAGACAA
GTGCCTGCGGCATCAACGGATGAACCATTACCCGAAGCTGATCTCGATGAACTCCGGAGC
CATCGAATTGATGACCAGCGGGTTTTGCGGAGACTTAAACTTCAGCCCAGGAGTCCAACC
ATTCATGTAGGGCGCAAGGATGGAGGCGATAAAATACAAAACATTTTTTCTGACTTAACT
CTGAAAAAAATGTACGACCACAGTCCACATTCGGTGTTTGTTCAACTGGACGAATTACTA
GCCACAGAAGATGGTGATACGGAGTGGAAGGAAACTGCACGTTGGATTAAATATGAAGAA
GATGTTGAAGAAGGATCTGCCAGATGGGGTCGGCCCCACGTAGCCTCTCTTTCTTTCCAT
TCCCTATTAAACCTACGCCGTTGTTTAGAAACTGGAGTGGTACTGCTTGACCTCGATGAA
AAAGACCTTCCTGGTGTTGCATACAGGGTTGTAGAAAGTATGGTTAATGAAGGATTGATA
GAAGAAGATGACAAACCAGTCGTAATGAGATCTCTACTTCTTCGCCATCGACATGTACAT
GATGAAAGGTTCCGATTCTCCATAAGTGGTCGAAAGCACTCTTCCTATACAAGCCTACAG
TCGCTGTGGTTGGAGGAAGGTGGTGGCGCCCGCCAGCGATACTCCACATGCTCTGCCATC
GGTCCCTGTCGTCGACACAGCTCTCATATTCTCAATCTTTCGGATAACAAGCGACGAAGA
AGCTCCAATGCTCTACCACAAGATCGAACAGAAGCTCGAGCAAAAACATCAGTGGCGGGC
ATGGATACACGCGAAGTAGAATATTTAGCCACAGCCCCCGTGGGGTCTCAGGATGAATTA
AGACGGGGTCACAATGATTCAATCATGAAACGTATACCTGACGATGCGGAAGCCACAACA
GTCCTCGTTGGTGCAGTTGGATTTTTAGATCAACCAACGATTGCCTTTGTACGTCTCGCT
CAAGGCATATTAATGCCATCCATCACAGAGGTTCCCATACCAGTCCGCTTTATGTTCATA
TTACTTGGGCCAACATCAGCTGACCTTGACTATCATGAAGTGGGTAGATCCATTTCTACT
CTTATGTCAAACCCTTCCTTTCATTCTATTGCGTACAAGGCTGATGATCGACGTGAACTT
TTGTCGGCAATTAATGAATTCTTAGACGATTCGATAGTGTTACCGCCTGGTGATTGGGAG
CGGCAGGCTCTATTGCCTTTCGAAGAATTACGAGCTAAAAGTGAAATGATAAGAAAACGT
AAGCGTGATGCTTTGGAGCGTAAAAAGGGCATTGAAATTACAACAGCTTCGCCAATAGAT
GAAAAAAAGGCTTTGTTAGCTGGTGAAACTGGTGGATTGCCAGAAAAAGAACGTGATGAT
CCATTATCCAAGAGTGGTCGTCTCTTTGGTGGTGTTATAAGAGACATAAAAAGGCGTTAT
CCCCACTATATATCCGACTTTCGTGATGCATTAAATGGACAATGTGCGGCAGCTACAATA
TTCATGTACTTTGCTGCGCTTTCATCAGCCATTACTTTTGGAGGACTGTTAGCTGAAAAA
ACTGACAGACAGATTGGTATCTCGGAAACATTGGTATTTACTTGCGTAGGTGGATTATTT
TTCGCCCTAGTAGCAGGTCAACCAATGATGATTACTGGCGCTACTGGACCTTTGCTGCTT
CTCGACGAATCGCTTTTTGTATTTTGCCGCTCCTACGGTTTTGATTTTTTGGCCGCTAGA
ATGTACTGTGGTTTATGGATGATAGTGATTGCTTTGTGTGTTGCCTCTGTTGAAGGTAGT
GTCGCCGTAAAGAAAATTACGAGATTTACTGAAGACATCTTCGCATTTTTGATATCGCTT
ATTTTCATATCTGAGCCTGTGACGAATATAATAAATGTTTACCGTGCTCACCCGCTCGGT
TATGACTACTGCGGCAATTACACACTTGAAAATTCCACTGCTGGCGTTGATACGGTTAAC
TCAAATTTTACAGGAAACCTAACAGTTCCTCCAGTTTTACCGCCTACAAATATGTTACTT
ACACCGAAACCAAATACAGCTTTGTTTTGTACAATGTTGACTCTTTGTACCTTTATTCTT
GCTTACTATCTCCGCATATTCCGCAACGGAAAATTTCTTGGTCGAAGTGCTCGACGTGCA
CTTGGTGATTTCGGAGTTCCGATTGCGATTGTTTTAATGGTTGGAATATCCTGCTTAGTA
CCCGTTTGGACTGAAAAATTACAAGTACCGGATGGTCTGAGCCCAACCTCAAATCGTTCT
TGGCTTGTGCCCCTTAATAAGGGACTTGAAACAATACCACTGTGGGCAACAATTGCTATG
GTTTTACCGGCGCTCATGGTTTACATCATCGTCTTTATGGAAACCCACATCGCAGAGTTG
ATTATTGACAAACCAGAGAGAAAACTGAAGAAAGGCAGTGGATTCCACATGGACATAGTC
GTCATGTCGTTAGTGAACTCGGTGTGTGGCATGTTTGGGGCTCCGTGGCAGTGTGTAGCC
ACAGTACGATCTGTGAGCCATGTTTCCGCATTAACTGTTATGTCAACAACTCATGCCCCC
GGTGACAAACCTTATATTGTTGAAGTTAAGGAACAACGTCTTACTGGATTACTAGTTGCT
TTTCTCGTTGGCATATCTGTTTTGGCTTCCGGCTGGCTAAGATTAGTTCCAATGGCTGTA
TTATTTGGAGTTTTCCTCTATATGGGAATTTCTGCCCTCGGAGGAATTCAGTTCTGGGAT
CGATGTATTTTACTATTAAAACCTGTGAAGCATCACCCGCAAATACCTTACGTGAGACGA
GTACCGACATTTAAAATGCATCTCTACACTCTTATCCAAATAGCTGGTGTATGTGTATTG
TATGCTGTGAAGTCTTCGAAGTTTTCCCTCGCGCTTCCCTTCTTCTTGGTACTCATGGTG
CCGCTGCGAATGGCAATCAGTTACATTTTTACCCCGCTACAACTGCGTGCGTTGGATGGA
TCCCAAAAAGATATTGACGTCGATGATGAGCCAGATTTCTATGAAGAAGCGCCTTTGCCC
GGATAG

Protein sequence:

MNSGKRKLSFLGFASQGEDQDELRLDEEMERVVWHENNPPTRLQPSGEPVRDAAAPPPPY
SPDDIIRTSSSEQTRASREQLNSSSEGQMSGQISGSGSGSTPRGEDRHVQFDSERARADP
LDDASEERRHQRNTRHLHHKSRKYSLQEGARGGGADGERQVPAASTDEPLPEADLDELRS
HRIDDQRVLRRLKLQPRSPTIHVGRKDGGDKIQNIFSDLTLKKMYDHSPHSVFVQLDELL
ATEDGDTEWKETARWIKYEEDVEEGSARWGRPHVASLSFHSLLNLRRCLETGVVLLDLDE
KDLPGVAYRVVESMVNEGLIEEDDKPVVMRSLLLRHRHVHDERFRFSISGRKHSSYTSLQ
SLWLEEGGGARQRYSTCSAIGPCRRHSSHILNLSDNKRRRSSNALPQDRTEARAKTSVAG
MDTREVEYLATAPVGSQDELRRGHNDSIMKRIPDDAEATTVLVGAVGFLDQPTIAFVRLA
QGILMPSITEVPIPVRFMFILLGPTSADLDYHEVGRSISTLMSNPSFHSIAYKADDRREL
LSAINEFLDDSIVLPPGDWERQALLPFEELRAKSEMIRKRKRDALERKKGIEITTASPID
EKKALLAGETGGLPEKERDDPLSKSGRLFGGVIRDIKRRYPHYISDFRDALNGQCAAATI
FMYFAALSSAITFGGLLAEKTDRQIGISETLVFTCVGGLFFALVAGQPMMITGATGPLLL
LDESLFVFCRSYGFDFLAARMYCGLWMIVIALCVASVEGSVAVKKITRFTEDIFAFLISL
IFISEPVTNIINVYRAHPLGYDYCGNYTLENSTAGVDTVNSNFTGNLTVPPVLPPTNMLL
TPKPNTALFCTMLTLCTFILAYYLRIFRNGKFLGRSARRALGDFGVPIAIVLMVGISCLV
PVWTEKLQVPDGLSPTSNRSWLVPLNKGLETIPLWATIAMVLPALMVYIIVFMETHIAEL
IIDKPERKLKKGSGFHMDIVVMSLVNSVCGMFGAPWQCVATVRSVSHVSALTVMSTTHAP
GDKPYIVEVKEQRLTGLLVAFLVGISVLASGWLRLVPMAVLFGVFLYMGISALGGIQFWD
RCILLLKPVKHHPQIPYVRRVPTFKMHLYTLIQIAGVCVLYAVKSSKFSLALPFFLVLMV
PLRMAISYIFTPLQLRALDGSQKDIDVDDEPDFYEEAPLPG