Monarch geneset OGS2.0

DPOGS208044
Transcript: DPOGS208044-TA (3183 bp)
Protein: DPOGS208044-PA (1060 aa)
Genomic position: DPSCF300203 + 186398-201813
RNAseq coverage: 380x (Rank: top 31%)
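
The genomic position field packs a scaffold name, a strand, and a coordinate range into one string. A minimal sketch of splitting it apart follows. It assumes the coordinates are 1-based and inclusive, which this record does not state, and `parse_position` and `GenomicPosition` are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple


class GenomicPosition(NamedTuple):
    scaffold: str
    strand: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive (not stated in the record).
        return self.end - self.start + 1


def parse_position(text: str) -> GenomicPosition:
    """Parse a string of the form '<scaffold> <strand> <start>-<end>'."""
    m = re.fullmatch(r"\s*(\S+)\s+([+-])\s+(\d+)-(\d+)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized position string: {text!r}")
    scaffold, strand, start, end = m.groups()
    return GenomicPosition(scaffold, strand, int(start), int(end))


pos = parse_position("DPSCF300203 + 186398-201813")
print(pos.scaffold, pos.strand, pos.span)
```

Under the inclusive-coordinate assumption the locus spans 15416 bp, several times the 3183 bp transcript length, which would be consistent with the gene containing introns.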
Annotation
Heliconius: HMEL010896; 0.0; 70.72%
Bombyx: BGIBMGA001472-TA; 0.0; 76.33%
Drosophila: CG31547-PB; 0.0; 55.78%
EBI UniRef50: UniRef50_Q25479; 0.0; 80.39%; Bumetanide-sensitive sodium-(potassium)-chloride cotransporter n=29 Tax=Endopterygota RepID=NKCL_MANSE
NCBI RefSeq: XP_967948.2; 0.0; 60.63%; PREDICTED: similar to bumetanide sensitive NaK2Cl cotransporter isoform 1 [Tribolium castaneum]
NCBI nr blastp: gi|2500792; 0.0; 80.39%; bumetanide sensitive NaK2Cl cotransporter [Manduca sexta]
NCBI nr blastx: gi|2500792; 0.0; 80.39%; bumetanide sensitive NaK2Cl cotransporter [Manduca sexta]
Group
Gene Ontology:
  GO:0006821; 0; chloride transport
  GO:0016021; 0; integral to membrane
  GO:0015377; 0; cation:chloride symporter activity
  GO:0006814; 0; sodium ion transport
  GO:0016020; 3.3e-114; membrane
  GO:0006810; 3.3e-114; transport
  GO:0055085; 3.3e-114; transmembrane transport
  GO:0006811; 5.2e-05; ion transport
  GO:0005215; 5.2e-05; transporter activity
KEGG pathway: rno:83629; 0.0
  K10951 (SLC12A2) maps to:
    Salivary secretion
    Vibrio cholerae infection
InterPro domain:
  [92-1060] IPR004842; 0; Na/K/Cl co-transporter superfamily
  [125-630] IPR004841; 3.3e-114; Amino acid permease domain
Orthology group: MCL10131; Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208044-TA
ATGTCGGACAACAGGTTCACAGTTTCGACAGTTGAGTCGGAATGTAAAAAAAATGGAATTCACATGGGTGCCAGTATTATTGGCAGGCCGCTACGAACCTCTTTGGAGACGGTGGAACGAGGGGATCCCAACTCACAGTCTGATACATGGCTACACGATGCAGGCTGGAGGAGAAAGCGATCGTTGGCTCAGCTCACACGAGAAGCCTTACCAAGAATGGAGAACTATAGGAACTCCAAGCGAGCTCTAAAGAGACCGAGCTTGGGGGAACTACATGGAGATCATCTGATAACCGAAGAGGACGAGAAACAATGTCAAAGGGAAACAAAATCACCGACGCCGGCACACGGGATCAAACTAGGATGGATTCAAGGCGTGTTGATACCATGCCTGCTGAACATCTGGGGTGTTATGTTGTTTCTGCGTATATCGTGGGTGGTGTCGCAGGCTGGTATTGGCCTCACCCTTATAATCATCGCCATATCAGCGATCGTGTGCGTCATAACCACCCTCTCCATGAGCGCCATATGCACCAACGGGGAGGTGAAAGGAGGTGGTATATACTACATAATATCAAGATCTTTGGGTCCTGAATTCGGTGCCTCTGTTGGCATTATATTCGCGTTCGCCAATGCAGTAGCTGCGAGCATGAACACAATCGGATTCTGCGACTCTCTCAACGACCTGCTCAGAATACAGGGGGTCAAAATTATTGACAATGGAGTCAATGATGTCAGAATAGTTGGGGCTGTAGCGTTGGTTGTGATGTGTATAATCTGCGCAGTCGGCATGGACTGGGAAAGTAAGGCGCAGAATTTTCTGATCGCCATTATTGTTGGAGCGATGGTGGACTTCATAGTAGGGACTCTGATGGGCCCCAATGACGCGAGCGATGTTGCTCACGGCTTCGTAGGGTTGTCAGCGACAACCCTCTCGGAGAACTTCAATTCAGACTTTCGGTTCAGTGAGGGTCTTCACCAGGACTTCTTTAGCGTCTTCGCAATATTCTTTCCTTCCGTCACCGGTATCCAGGCCGGTGCTAACATCTCCGGAGATTTGAAGGATCCAGCTTCCGCGATTCCCAAAGGTACGCTTCTGGCGCTACTGATATCCATGGTGAGTTACGCGCTCATGGTGTTGTTTTCTGGTGGCGGAGCCCTGAGGGACGCGTCAGGGAACGTCTCCGACCTCATCATGGCCAATGGAACTGTCGTCAACTACACCGGGATATCGAATTGCGTAAATAGCTTACACGGCTGCGGTTATGGCCTCCACAATAGTTATTCCGTGATGCAATTGATGTCAGCCTGGGGACCTCTTATTTACGGTGGTTGCTGGGCGGCGACTTTATCGACCGCTCTGACAAACCTTCTATCAGTCCCTCGACTGATCCAAGCGCTCGGAGTCGACCGCATCTACCCCGGACTGATATTCTTCTCTAAGCCTTACGGCAAACACGGAGAGCCTTACAGAGGATACGTGCTTACTTTCATTGTATCGCTGATGTTCTTGCTTATAGCTGATCTTAATACGATCGCCCCTCTTATATCCAACTTCTACTTGGCGTCGTATGCTCTCATCAATTTCTGCACCTTCCACGCTGCCCTGGTGCGCCCCCTGGGCTGGCGGCCTACATTCCGGTACTACAACGTGTGGTTATCAATGAGCGGTTTCCTAATGTGTGTTGCCATCATGTTTCTCATAAGTTGGATTATGTCCCTCGTCACATTCGCTGTATTCTTCACTCTGTACCTCATAGTACACTATCGGAAACCCGACGTTAATTGGGGCAGTAGCACGCAAGCTCAAATGTACAAAACAGCGTTATCCAGCGCCCATAACCTGGCGCGGACTGGTGAACATGTCAAGAACTACTGGCCACAGCTATTAGTTCTGGCTGGGAGACCTTCTGATAGACCAGCTTTAGTCGACCTCGGCAATCTCATCACAAAATCTGGCTCGCTCATGATGATTGGTGACATATCGCAGAAAAAGTTAAG
CTACAAAGAGCGTGTACATCGCTCGCGGACTGGTGACGAGTGGTTGAGGGGACGGAAGGTCCGCGCGTTCTGTGCCTCTGTGAACGGGTTCAGCTTCGAGTCCGGTGCGAGAGCCTTGATGCAAGCGGCGGGCGTCGGGAGACTCGCCCCCAACGTGCTGCTCATGGGATATAAAGCAGACTGGGCCACCGCGCCGGCCGCTGATTTAGAGTCTTACTTCAATGTTCTGCATACTGCATTTGAGACGCGCCTCGCGGTGGCCATAGTGCGAGTTGCCGGCGGCTTGGACTACAGTGCTGTAGAGGGTCAGGGGGAGGCACCAGGGGAGGGACAGGGAGAGGGAGCCGCTCTCACCGCCACCTCCAGTGGAAGCGGAGACCTCAGGATAAAGAGACCACAAGCCCAGATAATGCACGCGGACTCAGATCTCGATATACGATCGACGGCAACCACTGGATCCCAACCTTCCAGCAGACATAATCTTAACCTATTGACACTGACAACATCGCGTTCATTCACGATATCAGAGAAAAGTGACAGCAAGGAAAAGAAGAAGGACAAGAAACTATTTGACATTCAGCGACAAATCATTTACAAGTCATCGTCTGGCGTGGAAATGACGTTGTTCAAGAGGAAGCAGGAGTCTGGCACTTTGGATGTGTGGTGGTTGTATGATGACGGTGGCCTCACCATTCTTCTGCCGTACATCGTGTCCCAGCGGGCCTCGTGGCAGAGGTGCAAGTTGAGGATCTTCGCCCTCGCCAACAGGAGACACGAGATGGAACTGGAGGAGCGGAATATGGCTAATCTGTTAGCGAAGTTCCGTATAGACTACTCGTCGCTGACCATGGTGCAGGACATCATGGAACCTCCGCAGGCGGAAACCAAGAAACTCTTTGAAGAAATCATAAAAGGTTTCACAGACGGCAAAGGAGAGTGCAGTATCGCTCAGTCGGAGCTAGCAACCCTGTGTGAGAAGACCCACCGTCAACTGCGACTCCGAGAGCTCCTCCTCGCGAACTCCTCGAACGCCCAGCTGGTTGTGATGTCCCTACCCATGCCTAGAAAGGGTTCCGTATCAGCTCCGCTGTACATGGCCTGGTTAGAGGTGATGAGTCGCGACCTCCCGCCCATGTTGTTCGTGAGAGGCAACCAGACCTCCGTGCTCACTTTCTATTCATAG
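
The nucleotide record above is wrapped over more than one line, so it has to be joined before its length or content is checked. A minimal sketch of a FASTA reader that concatenates wrapped lines under each header is shown here. `read_fasta` is a hypothetical helper and the `demo-TA` record is an invented toy input, not data from this page.

```python
from typing import Dict, List, Optional


def read_fasta(text: str) -> Dict[str, str]:
    """Return {record_id: sequence}, joining sequence lines that were wrapped."""
    chunks: Dict[str, List[str]] = {}
    name: Optional[str] = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            # The record id is the first whitespace-delimited token after '>'.
            name = line[1:].split()[0]
            chunks[name] = []
        elif name is not None:
            chunks[name].append(line)
    return {key: "".join(parts) for key, parts in chunks.items()}


# Toy input (invented for illustration): one record wrapped over two lines.
example = ">demo-TA\nATGTCG\nGACTAG\n"
seqs = read_fasta(example)
print(seqs["demo-TA"])  # ATGTCGGACTAG
```

Applied to the full text of this record, the same function would return one joined string for `DPOGS208044-TA`, whose length can then be compared with the 3183 bp stated in the header.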

Protein sequence:

>DPOGS208044-PA
MSDNRFTVSTVESECKKNGIHMGASIIGRPLRTSLETVERGDPNSQSDTWLHDAGWRRKRSLAQLTREALPRMENYRNSKRALKRPSLGELHGDHLITEEDEKQCQRETKSPTPAHGIKLGWIQGVLIPCLLNIWGVMLFLRISWVVSQAGIGLTLIIIAISAIVCVITTLSMSAICTNGEVKGGGIYYIISRSLGPEFGASVGIIFAFANAVAASMNTIGFCDSLNDLLRIQGVKIIDNGVNDVRIVGAVALVVMCIICAVGMDWESKAQNFLIAIIVGAMVDFIVGTLMGPNDASDVAHGFVGLSATTLSENFNSDFRFSEGLHQDFFSVFAIFFPSVTGIQAGANISGDLKDPASAIPKGTLLALLISMVSYALMVLFSGGGALRDASGNVSDLIMANGTVVNYTGISNCVNSLHGCGYGLHNSYSVMQLMSAWGPLIYGGCWAATLSTALTNLLSVPRLIQALGVDRIYPGLIFFSKPYGKHGEPYRGYVLTFIVSLMFLLIADLNTIAPLISNFYLASYALINFCTFHAALVRPLGWRPTFRYYNVWLSMSGFLMCVAIMFLISWIMSLVTFAVFFTLYLIVHYRKPDVNWGSSTQAQMYKTALSSAHNLARTGEHVKNYWPQLLVLAGRPSDRPALVDLGNLITKSGSLMMIGDISQKKLSYKERVHRSRTGDEWLRGRKVRAFCASVNGFSFESGARALMQAAGVGRLAPNVLLMGYKADWATAPAADLESYFNVLHTAFETRLAVAIVRVAGGLDYSAVEGQGEAPGEGQGEGAALTATSSGSGDLRIKRPQAQIMHADSDLDIRSTATTGSQPSSRHNLNLLTLTTSRSFTISEKSDSKEKKKDKKLFDIQRQIIYKSSSGVEMTLFKRKQESGTLDVWWLYDDGGLTILLPYIVSQRASWQRCKLRIFALANRRHEMELEERNMANLLAKFRIDYSSLTMVQDIMEPPQAETKKLFEEIIKGFTDGKGECSIAQSELATLCEKTHRQLRLRELLLANSSNAQLVVMSLPMPRKGSVSAPLYMAWLEVMSRDLPPMLFVRGNQTSVLTFYS-
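
The stated lengths can be cross-checked against each other. The sketch below assumes that the transcript consists of coding sequence only (it begins with ATG and ends with the stop codon TAG) and that the trailing `-` in the protein sequence marks that stop codon. Neither assumption is stated in the record, and both function names are hypothetical.

```python
def strip_stop_marker(protein: str) -> str:
    """Drop a trailing stop marker ('-' or '*') from a protein string."""
    return protein.rstrip("-*")


def cds_matches_protein(cds_bp: int, protein_aa: int, has_stop: bool = True) -> bool:
    """Check that a CDS length equals three bases per residue, plus an optional stop codon."""
    codons = protein_aa + (1 if has_stop else 0)
    return cds_bp == 3 * codons


# Lengths taken from the record header: 3183 bp transcript, 1060 aa protein.
print(cds_matches_protein(3183, 1060))   # True: 3 * (1060 + 1) = 3183
print(strip_stop_marker("MSDNRFTVS-"))   # MSDNRFTVS
```

With a stop codon counted, 1060 residues correspond to 3183 bases, so the two lengths in the header agree under these assumptions.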