Monarch geneset OGS2.0

DPOGS200084
TranscriptDPOGS200084-TA3189 bp
ProteinDPOGS200084-PA1062 aa
Genomic positionDPSCF300044 - 210362-220193
RNAseq coverage617x (Rank: top 21%)
Annotation
HeliconiusHMEL0150650.080.56% 
BombyxBGIBMGA002074-TA0.078.48% 
DrosophilaCG8177-PA0.051.66% 
EBI UniRef50UniRef50_E0W1T80.056.45%Anion exchange protein, putative n=2 Tax=Neoptera RepID=E0W1T8_PEDHC
NCBI RefSeqXP_001652863.10.055.90%anion exchange protein 2, slc4a2 [Aedes aegypti]
NCBI nr blastpgi|1973181000.055.75%SLC4-like anion exchanger [Aedes aegypti]
NCBI nr blastxgi|3320266060.056.56%Anion exchange protein 2 [Acromyrmex echinatior]
Group
Gene OntologyGO:00068201.4e-187anion transport
GO:00160211.4e-187integral to membrane
GO:00085091.4e-112anion transmembrane transporter activity
GO:00068102.1e-93transport
GO:00052152.1e-93transporter activity
GO:00160201.3e-34membrane
GO:00054521.3e-34inorganic anion exchanger activity
KEGG pathwaygga:3958090.0 
 K13855 (SLC4A2, AE2)maps-> Salivary secretion
    Gastric acid secretion
InterPro domain[13-1061] IPR0030200Bicarbonate transporter, eukaryotic
[502-993] IPR0115311.4e-187Bicarbonate transporter, C-terminal
[314-464] IPR0137691.4e-112Bicarbonate transporter, cytoplasmic
[112-463] IPR0161522.1e-93Phosphotransferase/anion transporter
Orthology groupMCL10090 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200084-TA
ATGCCTGTGACGTATTCCCGGGCTTCTGATGTACATCATCAACGTCGACACTTGCATCACAAATCACGAAAATATTCCCTACAAGAGGGGGCGAGAGGTGGTGGGGCAGATGGCGAGAGACAAGTGCCTGCGGCATCAACGGATGAACCATTACCCGAAGCTGATCTCGATGAACTCCGGAGCCATCGAATTGATGACCAGCGGGTTTTGCGGAGACTTAAACTTCAGCCCAGGAGTCCAACCATTCATGTAGGGCGCAAGGATGGAGGCGATAAAATACAAAACATTTTTTCTGACTTAACTCTGAAAAAAATGTACGACCACAGTCCACATTCGGTGTTTGTTCAACTGGACGAATTACTAGCCACAGAAGATGGTGATACGGAGTGGAAGGAAACTGCACGTTGGATTAAATATGAAGAAGATGTTGAAGAAGGATCTGCCAGATGGGGTCGGCCCCACGTAGCCTCTCTTTCTTTCCATTCCCTATTAAACCTACGCCGTTGTTTAGAAACTGGAGTGGTACTGCTTGACCTCGATGAAAAAGACCTTCCTGGTGTTGCATACAGGGTTGTAGAAAGTATGGTTAATGAAGGATTGATAGAAGAAGATGACAAACCAGTCGTAATGAGATCTCTACTTCTTCGCCATCGACATGTACATGATGAAAGGTTCCGATTCTCCATAAGTGGTCGAAAGCACTCTTCCTATACAAGCCTACAGTCGCTGTGGTTGGAGGAAGGTGGTGGCGCCCGCCAGCGATACTCCACATGCTCTGCCATCGGTCCCTGTCGTCGACACAGCTCTCATATTCTCAATCTTTCGGATAACAAGCGACGAAGAAGCTCCAATGCTCTACCACAAGATCGAACAGAAGCTCGAGCAAAAACATCAGTGGCGGGCATGGATACACGCGAAGTAGAATATTTAGCCACAGCCCCCGTGGGGTCTCAGGATGAATTAAGACGGGGTCACAATGATTCAATCATGAAACGTATACCTGACGATGCGGAAGCCACAACAGTCCTCGTTGGTGCAGTTGGATTTTTAGATCAACCAACGATTGCCTTTGTACGTCTCGCTCAAGGCATATTAATGCCATCCATCACAGAGGTTCCCATACCAGTCCGCTTTATGTTCATATTACTTGGGCCAACATCAGCTGACCTTGACTATCATGAAGTGGGTAGATCCATTTCTACTCTTATGTCAAACCCTTCCTTTCATTCTATTGCGTACAAGGCTGATGATCGACGTGAACTTTTGTCGGCAATTAATGAATTCTTAGACGATTCGATAGTGTTACCGCCTGGTGATTGGGAGCGGCAGGCTCTATTGCCTTTCGAAGAATTACGAGCTAAAAGTGAAATGATAAGAAAACGTAAGCGTGATGCTTTGGAGCGTAAAAAGGGCATTGAAATTACAACAGCTTCGCCAATAGATGAAAAAAAGGCTTTGTTAGCTGGTGAAACTGGTGGATTGCCAGAAAAAGAACGTGATGATCCATTATCCAAGAGTGGTCGTCTCTTTGGTGGTGTTATAAGAGACATAAAAAGGCGTTATCCCCACTATATATCCGACTTTCGTGATGCATTAAATGGACAATGTGCGGCAGCTACAATATTCATGTACTTTGCTGCGCTTTCATCAGCCATTACTTTTGGAGGACTGTTAGCTGAAAAAACTGACAGACAGATTGGTATCTCGGAAACATTGGTATTTACTTGCGTAGGTGGATTATTTTTCGCCCTAGTAGCAGGTCAACCAATGATGATTACTGGCGCTACTGGACCTTTGCTGCTTCTCGACGAATCGCTTTTTGTATTTTGCCGCTCCTACGGTTTTGATTTTTTGGCCGCTAGAATGTACTGTGGTTTATGGATGATAGTGATTGCTTTGTGTGTTGCCTCTGTTGAAGGTAGTGTCGCCGTAAAGAAAATTACGAGATTTACTGAAGACATCTTCGCATTTTTGATATCGCTTATTTTCATATCTGAGCCTGTGACGAATATAATAAATGTTTACCGTGCTCACCCGCTCGGTTATGACTACTGCGGCAATTACACACTTGAAAATTCCACTGCTGGCGTTGATACGGTTAACTCAAATTTTACAGGAAACCTAACAGTTCCTCCAGTTTTACCGCCTACAAATATGTTACTTACACCGAAACCAAATACAGCTTTGTTTTGTACAATGTTGACTCTTTGTACCTTTATTCTTGCTTACTATCTCCGCATATTCCGCAACGGAAAATTTCTTGGTCGAAGTGCTCGACGTGCACTTGGTGATTTCGGAGTTCCGATTGCGATTGTTTTAATGGTTGGAATATCCTGCTTAGTACCCGTTTGGACTGAAAAATTACAAGTACCGGATGGTCTGAGCCCAACCTCAAATCGTTCTTGGCTTGTGCCCCTTAATAAGGGACTTGAAACAATACCACTGTGGGCAACAATTGCTATGGTTTTACCGGCGCTCATGGTTTACATCATCGTCTTTATGGAAACCCACATCGCAGAGTTGATTATTGACAAACCAGAGAGAAAACTGAAGAAAGGCAGTGGATTCCACATGGACATAGTCGTCATGTCGTTAGTGAACTCGGTGTGTGGCATGTTTGGGGCTCCGTGGCAGTGTGTAGCCACAGTACGATCTGTGAGCCATGTTTCCGCATTAACTGTTATGTCAACAACTCATGCCCCCGGTGACAAACCTTATATTGTTGAAGTTAAGGAACAACGTCTTACTGGATTACTAGTTGCTTTTCTCGTTGGCATATCTGTTTTGGCTTCCGGCTGGCTAAGATTAGTTCCAATGGCTGTATTATTTGGAGTTTTCCTCTATATGGGAATTTCTGCCCTCGGAGGAATTCAGTTCTGGGATCGATGTATTTTACTATTAAAACCTGTGAAGCATCACCCGCAAATACCTTACGTGAGACGAGTACCGACATTTAAAATGCATCTCTACACTCTTATCCAAATAGCTGGTGTATGTGTATTGTATGCTGTGAAGTCTTCGAAGTTTTCCCTCGCGCTTCCCTTCTTCTTGGTACTCATGGTGCCGCTGCGAATGGCAATCAGTTACATTTTTACCCCGCTACAACTGCGTGCGTTGGATGGATCCCAAAAAGATATTGACGTCGATGATGAGCCAGATTTCTATGAAGAAGCGCCTTTGCCCGGATAG

Protein sequence:

>DPOGS200084-PA
MPVTYSRASDVHHQRRHLHHKSRKYSLQEGARGGGADGERQVPAASTDEPLPEADLDELRSHRIDDQRVLRRLKLQPRSPTIHVGRKDGGDKIQNIFSDLTLKKMYDHSPHSVFVQLDELLATEDGDTEWKETARWIKYEEDVEEGSARWGRPHVASLSFHSLLNLRRCLETGVVLLDLDEKDLPGVAYRVVESMVNEGLIEEDDKPVVMRSLLLRHRHVHDERFRFSISGRKHSSYTSLQSLWLEEGGGARQRYSTCSAIGPCRRHSSHILNLSDNKRRRSSNALPQDRTEARAKTSVAGMDTREVEYLATAPVGSQDELRRGHNDSIMKRIPDDAEATTVLVGAVGFLDQPTIAFVRLAQGILMPSITEVPIPVRFMFILLGPTSADLDYHEVGRSISTLMSNPSFHSIAYKADDRRELLSAINEFLDDSIVLPPGDWERQALLPFEELRAKSEMIRKRKRDALERKKGIEITTASPIDEKKALLAGETGGLPEKERDDPLSKSGRLFGGVIRDIKRRYPHYISDFRDALNGQCAAATIFMYFAALSSAITFGGLLAEKTDRQIGISETLVFTCVGGLFFALVAGQPMMITGATGPLLLLDESLFVFCRSYGFDFLAARMYCGLWMIVIALCVASVEGSVAVKKITRFTEDIFAFLISLIFISEPVTNIINVYRAHPLGYDYCGNYTLENSTAGVDTVNSNFTGNLTVPPVLPPTNMLLTPKPNTALFCTMLTLCTFILAYYLRIFRNGKFLGRSARRALGDFGVPIAIVLMVGISCLVPVWTEKLQVPDGLSPTSNRSWLVPLNKGLETIPLWATIAMVLPALMVYIIVFMETHIAELIIDKPERKLKKGSGFHMDIVVMSLVNSVCGMFGAPWQCVATVRSVSHVSALTVMSTTHAPGDKPYIVEVKEQRLTGLLVAFLVGISVLASGWLRLVPMAVLFGVFLYMGISALGGIQFWDRCILLLKPVKHHPQIPYVRRVPTFKMHLYTLIQIAGVCVLYAVKSSKFSLALPFFLVLMVPLRMAISYIFTPLQLRALDGSQKDIDVDDEPDFYEEAPLPG-