Monarch geneset OGS2.0

DPOGS215753
Transcript: DPOGS215753-TA (3354 bp)
Protein: DPOGS215753-PA (1117 aa)
Genomic position: DPSCF300041 + 1032973-1253219
RNAseq coverage: 511x (Rank: top 24%)
Annotation
Heliconius: HMEL014141  0.0  88.48%
Bombyx: BGIBMGA003629-TA  0.0  93.77%
Drosophila: kcc-PD  0.0  69.73%
EBI UniRef50: UniRef50_Q8MLQ5  0.0  69.73%  CG5594, isoform D n=37 Tax=Pancrustacea RepID=Q8MLQ5_DROME
NCBI RefSeq: XP_001655744.1  0.0  76.32%  potassium/chloride symporter, putative [Aedes aegypti]
NCBI nr blastp: gi|306478629  0.0  73.53%  SLC12-like K,Cl cotransporter [Aedes aegypti]
NCBI nr blastx: gi|306478629  0.0  73.53%  SLC12-like K,Cl cotransporter [Aedes aegypti]
Group
Gene Ontology:
GO:0006821  0  chloride transport
GO:0016021  0  integral to membrane
GO:0015377  0  cation:chloride symporter activity
GO:0006814  0  sodium ion transport
GO:0016020  2.9e-43  membrane
GO:0006810  2.9e-43  transport
GO:0055085  2.9e-43  transmembrane transport
GO:0006811  1.8e-06  ion transport
GO:0005215  1.8e-06  transporter activity
KEGG pathway: gga:420797  0.0
K13627 (SLC12A7, KCC4) maps-> Collecting duct acid secretion
InterPro domain:
[65-1117]  IPR004842  0  Na/K/Cl co-transporter superfamily
[406-683]  IPR004841  2.9e-43  Amino acid permease domain
[266-278]  IPR000076  1.8e-06  K-Cl co-transporter
Orthology group: MCL10330 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215753-TA
ATGTCTGAACGCTTTAAGGTGACTATGTTCGATGAGCCGTCTGGCAAAAATTACAAGAACTACGGCGCGACATCGGCTGAGAGCGATGTAGAGCTGAAAGGCAAACTTCTCCAAAACTCAGGCAAGTACGCTAGATATAAATCTCAAACGTTCTTCAGCCTGCCGTTAGGTGACACCGATGAGTATGGCTTCTCGAAGGACAAAGACAAAGGCGACACAACGCTGTACCTCTACCAAGAGGAGATCGAAGACAGGCCTCGAGCGGCTACGTTCCTCAGTTCTCTGGCGGACTATTCCAATACCATCCCCACTGCCTCAGCGGCTGATCCCGATGCCCCAAAACCGGCACCCCCCGCTCGTATGGGTACCCTCATCGGGGTGTACCTTCCTTGCATCCAAAACATTTTCGGCGTTATCCTCTTCATCCGTTTAACATGGGTTGTTGGAACCGCTGGCGCTATTCAAGGCTTTCTGATTGTGCTTGTCTGCTGTTGTACGACGATGCTCACTGCGATATCAATGTCAGCGATTGCTACGAACGGTGTGGTCCCAGCAGGGGGGTCATATTTCATGATCGGTCGGTCTCTGGGTCCAGAGTGTGGTGGCGCAGTTGGAATGTTGTTCTACACAGGCACTACCCTCGCTGCTGCCATGTATATCGTCGGAGCTGTTGAAATAGTTCTGACGTACATAGCACCCTGGATGTCAATTTTCGGCGACTTTACTAAGGATCCTGAAGCGATGTACAATAACTTCAGAGTTTACGGAACTGGCCTCTTATTGATAATGGGCATGGTTGTGTTTGTGGGAGTCAAATTTGTCAACAAGTTCGCTACACTCGCCCTCGCTTGTGTCATTCTTTCCATTAGTGCTGTCTACGCTGGCATCTTTGTAAACTTCAACGGAAACGATAAACTTCAAATGTGTGTATTGGGAAAACGTTTATTGAAGGACATTCATATTAGTAACTGCAGCAAGGATTTGGGAGGCGAACTGCATCAATTATTTTGCCCGAATAATACATGTGATCCATATTATCAACAACATGAAGTTTCAGTGGTCCAAGGCATTAAGGGTTTGGCAAGTGGAGTTTTCTTTGACAACTTACAAGACTCTTTTCTACAACTTGGACAATATATCGCTTATGGCAAAGAACCAGATGACATTGAACAGATGGAACGACCAACCTATAACCAGATCTATGCTGATCTTACCACTACCTTTACAATTCTCATAGGCATTTTCTTCCCATCGGTAACTGGTATAATGGCTGGATCGAATCGTTCTGGAGACTTGGCTGACGCCCAGAAAAGCATTCCAATCGGAACAATTTGCGCTATTCTTACAACATCGACTGTTTACCTATCTTGTGTGTTACTTTTCGCTGGAACTGTCGATAACTTATTACTGAGAGACAAATTTGGACAATCGATTGGCGGGAAACTTGTAGTTGCAAATATGGCATGGCCGAATCAATGGGTTATATTAATTGGATCGTTCCTTTCTACCCTCGGAGCTGGACTACAATCTCTGACTGGCGCACCACGTTTACTGCAGGCTATCGCTAAGGATGAAATAATACCTTTCCTTTCACCGTTCGCAGTTTCTTCTAGCAGGGGAGAACCAACAAGGGCTTTATTATTGACTATGGTTATTTGCCAATGCGGCATCCTTCTGGGCAACGTCGATATCCTAGCGCCATTGTTGTCTATGTTCTTCCTCATGTGTTACGGATTCGTCAATTTGGCTTGCGCTCTGCAAACGCTTTTGAAGACTCCCAACTGGCGACCCAGATTCAAATATTACCATTGGTCACTTTCACTAGCAGGTTTAACGCTTTGTATTTCCATTATGTTTATGACATCATGGTTTTATGCCTTAATAGCTATAGGCATGGCTGGTCTTATCTACAAGTACATCGAGTATCGAGGAGCCGAAAAAGAGTGGGGTGATGGCCTGCGCGGTCTGGCGCTTTCCGCAGCTCGTTATTCATTATTGCG
TCTGGAAGAAGGACCCCCACATACAAAGAACTGGAGACCTCAAGTTCTTGTTCTTGCTAAACTGAATGAGGATCTGAATCCGAAGTATCGTAAAATGCTTGCTTTTGCCAGTCAACTTAAAGCTGGAAAAGGTTTGACAGTTTGTGTATCCGTGTTGGGTGGTGATTTCACCCGCCGCGCAGGGGAAGCAGCTACGGCTAAACAAAACTTACGCAAATGCATGGATGAAGAGAAAGTCAAAGGATTCGTTGATGTCCTTGTATCACATAGTATTGCTGATGGCCTGGGCCACTTTGTTCAAACGACCGGTCTCGGCGGATTGAAGCCAAATACAGTTATTGTTGGATGGCCATACGGCTGGCGACAATCGGAAGATGAACGTACTTGGCAAGTGTTCCTGCACACTGTCCGAGCTGTTACCGCCGCGAGAATGGCCATGCTGGTACCTAAGGGAATCAATTTCTTCCCTGACTCTACTGAAAAGGTCTCTGGTAACATAGATATTTGGTGGATAGTTCACGATGGTGGAATGTTGATGCTCTTACCATTCCTGTTGAAGCATCATCGCACGTGGAAAAATTGCAAGATGCGAATTTTCACTGTCGCTCAAATAGAAGATAACTCCATACAAATGAAAAAAGATCTGAAAATGTTCCTGTATCAATTACGTTTGGAAGCTGAAGTTGAAGTCGTAGAAATGACTGATAATGATATCTCCGCCTACACTTACGAACGCACCTTGATGATGGAACAGCGCAATCAGATGTTACGTGAACTGAGACTTAATAAGAAGGAATCGCTTGGAATGGTTCAAGCCATAGTGGATCATCATCACGCTGATGTGAAGACTGCTAGCAAGGTCCGTTTCGCTGAACCTGGTTCGGAGCCGGCAGCCGAAGACGCACCCTCACCTCCTCTGGCAGAAAACGATGACAAGGACAAAGATGATAAGGACGAATTTCGAAGTAGTTTGTTGCACATAATCGATTCAATGTGGGACTCGCCCGATGAACGCTCGCAGTGTGAGTCGCCAACGCCTCCCATCGACGCGGACAAACACAAGGATGGCAACCTCAACGGCGACGCGCTTAAACCCCAGCCCAACATGCCGATACTGACTCCCGACGAGGGAACGGTTCGACGGATGCACACCGCCGTTAAGCTGAATGAAGTCATAGTGTCGCGTTCACACGACGCACAATTAGTCATACTGAACCTGCCGGGCCCGCCGCGCGACACTAAACTCGAGAGGGAATCCAACTACATGGAGTTCCTGGAGGTGCTGACGGAGGGACTGGAAAAGGTCCTGATGGTTCGTGGCGGAGGCCGCGAGGTCATCACTATCTACTCGTGA

Protein sequence:

>DPOGS215753-PA
MSERFKVTMFDEPSGKNYKNYGATSAESDVELKGKLLQNSGKYARYKSQTFFSLPLGDTDEYGFSKDKDKGDTTLYLYQEEIEDRPRAATFLSSLADYSNTIPTASAADPDAPKPAPPARMGTLIGVYLPCIQNIFGVILFIRLTWVVGTAGAIQGFLIVLVCCCTTMLTAISMSAIATNGVVPAGGSYFMIGRSLGPECGGAVGMLFYTGTTLAAAMYIVGAVEIVLTYIAPWMSIFGDFTKDPEAMYNNFRVYGTGLLLIMGMVVFVGVKFVNKFATLALACVILSISAVYAGIFVNFNGNDKLQMCVLGKRLLKDIHISNCSKDLGGELHQLFCPNNTCDPYYQQHEVSVVQGIKGLASGVFFDNLQDSFLQLGQYIAYGKEPDDIEQMERPTYNQIYADLTTTFTILIGIFFPSVTGIMAGSNRSGDLADAQKSIPIGTICAILTTSTVYLSCVLLFAGTVDNLLLRDKFGQSIGGKLVVANMAWPNQWVILIGSFLSTLGAGLQSLTGAPRLLQAIAKDEIIPFLSPFAVSSSRGEPTRALLLTMVICQCGILLGNVDILAPLLSMFFLMCYGFVNLACALQTLLKTPNWRPRFKYYHWSLSLAGLTLCISIMFMTSWFYALIAIGMAGLIYKYIEYRGAEKEWGDGLRGLALSAARYSLLRLEEGPPHTKNWRPQVLVLAKLNEDLNPKYRKMLAFASQLKAGKGLTVCVSVLGGDFTRRAGEAATAKQNLRKCMDEEKVKGFVDVLVSHSIADGLGHFVQTTGLGGLKPNTVIVGWPYGWRQSEDERTWQVFLHTVRAVTAARMAMLVPKGINFFPDSTEKVSGNIDIWWIVHDGGMLMLLPFLLKHHRTWKNCKMRIFTVAQIEDNSIQMKKDLKMFLYQLRLEAEVEVVEMTDNDISAYTYERTLMMEQRNQMLRELRLNKKESLGMVQAIVDHHHADVKTASKVRFAEPGSEPAAEDAPSPPLAENDDKDKDDKDEFRSSLLHIIDSMWDSPDERSQCESPTPPIDADKHKDGNLNGDALKPQPNMPILTPDEGTVRRMHTAVKLNEVIVSRSHDAQLVILNLPGPPRDTKLERESNYMEFLEVLTEGLEKVLMVRGGGREVITIYS-
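
The lengths reported for this record can be cross-checked with simple arithmetic. The sketch below is not part of the database record. Its function names are hypothetical, and it rests on two assumptions: that the 3354 bp transcript is coding sequence only (it begins with ATG and ends with the stop codon TGA), and that the genomic coordinates are 1-based and inclusive.

```python
# Illustrative consistency check; helper names are hypothetical and are
# not taken from the database or any library.

def expected_protein_length(cds_bp: int, has_stop_codon: bool = True) -> int:
    """Return the number of amino acids encoded by a CDS of cds_bp bases."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


transcript_bp = 3354   # from the Transcript line
protein_aa = 1117      # from the Protein line

# Does the transcript length agree with the reported protein length?
lengths_agree = expected_protein_length(transcript_bp) == protein_aa
print("transcript/protein lengths consistent:", lengths_agree)

# Genomic span covered by the gene model on scaffold DPSCF300041.
print("genomic span (bp):", span_length(1032973, 1253219))
```

Under these assumptions, a genomic span much larger than the transcript length would point to most of the locus being intronic sequence.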