Monarch geneset OGS2.0

DPOGS215409
TranscriptDPOGS215409-TA3435 bp
ProteinDPOGS215409-PA1144 aa
Genomic positionDPSCF300088 + 473616-492613
RNAseq coverage279x (Rank: top 39%)
Annotation
HeliconiusHMEL0174220.075.87% 
BombyxBGIBMGA012368-TA0.061.75% 
DrosophilaNcc69-PA0.057.29% 
EBI UniRef50UniRef50_Q9VTW80.057.29%GH27027p n=18 Tax=Pancrustacea RepID=Q9VTW8_DROME
NCBI RefSeqXP_321556.30.057.20%AGAP001557-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1187945130.057.20%AGAP001557-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1954938200.057.39%GE20127 [Drosophila yakuba]
Group
Gene OntologyGO:00068210chloride transport
GO:00160210integral to membrane
GO:00153770cation:chloride symporter activity
GO:00068140sodium ion transport
GO:00160202.6e-131membrane
GO:00068102.6e-131transport
GO:00550852.6e-131transmembrane transport
KEGG pathwaytgu:1002308710.0 
 K10951 (SLC12A2)maps-> Salivary secretion
    Vibrio cholerae infection
InterPro domain[197-1144] IPR0048420Na/K/Cl co-transporter superfamily
[225-719] IPR0048412.6e-131Amino acid permease domain
Orthology groupMCL10131 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215409-TA
ATGGAATCCAATCCAAAAGACATCGAATTAAGCCCGGTAAATTCTAGCCCAGACAATAGTAGATCGGCAAGAGAAAAGTTTATCACGCCGGCAAACGACGAAGAGGATGTTATTATCGGTCGATTTAGAAAAATTAGTTCAGCCAAATCCATGGCAGCTACCAATGGTAACATTCAATATCCTTCCGAGATTATTAGGAATGAAGCCGACGTAGAGGCCGGGTTACAGTTTAAAGTTAAAGAATGCGAAAAGGAGCGGTTGTTGCCTGAGCCTTCGGATAGCAACGTGGTTGTTCTTCCCAAGTCTCCCGTCACGAAGAGCAGCTTTCGAGACATGGAGAAACCTTCACGATTCAAGGATCAACCGTCCACAACCAGATTTCAGATGGAAAATCCTGACCCTCGTTCGGATTCTGACAGTTCAGGCATGGAGAGCGACGACCCTTTAACGACCTCTGACACGAAATACGGTAAAAGTTTCAGACACTTCACCCGCGAGGCCCTACCTCGGCTGGACAACTACAGGAATGTCCTGTCGCTGCACGCGGCTCCTAGACCCACCCTGGATGAGCTTCATAATGCATCGTTATCAAGGAAGCCAGGTCAGACCATGGAAAAGGACCAGGCCACAGTGGCCATACCAACCACATCTGTCAAGTTCGGCTGGATCAAGGGTGTCCTGATGAGATGTCTCCTCAACATCTGGGGTGTGATGCTGTTCTTGCGACTGTCCTGGGTCGTGGGTCAGGCCGGGATTGCGCAGGCATCTCTGTTGATATTGACCACCAGCGTGGTGACCACCATCACTGCCCTCTCGATGTCCGCCATCAGCACCAACGGAGTCATTAAAGGAGGTGGTACGTACTACATGATTTCTCGCTCCCTGGGCCCGGAGTTCGGTGGTTCCATCGGGCTCATCTTCTCTATGGCCAACGCAGTCGCCTGCGCCATGTACGTGGTGGGCTTCGCGGAGTCACTCATCACACTCATACCAGAAACCGCCTACATGGTCGACAAGAATTGGGACCAGGCCATCTACGGCTGCATCACGATCGTTTTACTCACCGGTATCGTGATGGTGGGTATGGAATGGGAGGCGAAGGCCCAGATTGTGTTGCTGGTGGTATTGTTGGCAGCCATAGCGGACTTCTGTGTGGGGGCGCTCGTGGGGCCCAAGAGCGAGCAGGAGGTCGCGCAGGGCTTCGTCGGCTTTAACTGGACTGTGATGCTGAGTAACCTGGGTCCGGACTATAGATATTTCGAGGGCCAGCACCACAACTTCTTCTCGGTGTTCTCGATATTCTTCCCAGCCGCTACCGGTATACTCGCAGGGGCTAACATATCCGGGGACCTGAAGGACCCGCAGAAGTCAATCCCCAAGGGCACCCTACTGGCCATTCTCCTCACGACCCTGTCGTACTTGTTGATCGCGGTGGTGGCGGGCGCGTGTGTGGTGAGGGATGCGTCGGGAAACCTCCAGGACGTGGTGGACGGCACCCTCGGCCTCTGTAGAGACAACGGCACCTGTCAATACGGCCTACATCACAGCAACGATGTAATAAGGCTGGTGTCAGGGTTCGGGCCTCTGATATACGGCGGCTGTTTCGCGGCCACACTGTCCTCGGCGCTCGCGTCCCTGGTCTCCGCGCCCAAGGTGTTCCAGGCGCTGTGTCAGGACAAGCTGTACCCGTGGCTGGAGTTCTTCGCGAAGGGTTACGGAGCTAACAACGAGCCGGTCAGGGGATACGTGCTCACCTTCGTCATAGCCGTGGCCTTCATCCTGATGGGCGGGTTGAACCAGATCGCTCCCCTGATATCTAACTTCTTCCTGGCCGCCTACGCCCTCATCAACTTCGCCACGTTCCACGCCAGCCTCGCCCGCCCCGTGGGCTGGAGACCCACCTTCAGATTATACAACATGTGGCTGTCCCTGGCGGGATCGCTGGTGTGCGCCGCCATCATGTTCGTGGTGTCGTGGTTCAACGCGCTCCTGACGCTGGCAGCCCTGCTGGCCCTCTATCTGCTGGTGTCGTATCGCAAGCCAGATGTGAACTGGGGCTCCACCACCCAGGCCCAGAGGTACAAGGCGGCCCTGTCCGGCGTACACCAGCTGAACGCGGTCAGCGAACACGTCAAGAACTACAGGCCTCAGATCCTGGTGCTGACGGGTTTCCCCGGGGAGAGGTCCATGCTCACGGACTTCACGTATCTCCTTACCAAGGGACTGTCGCTGATGCTCTGTGGACACATCCTGCAGAGCGCCATCAACCACCGCACCCGCGAGGCGCTGTCGGCGCGCGCCTACCAGTGGTTCAGCAAACGAAACATCAAGGCCTTCTACACCATCGTGGACGACGCCAGCTTCAAGGACGGAGCCGGCGCGCTCCTACAGGCGAGCGGTCTGGGCAAGTTGAAGCCGAACATTCTTCTGATGGGCTTCAAGGAGGACTGGCAGACCTGCCCGCGACAGGAACTGGCCGGCTACATCGACGTCATGCACAAAGCTCTAGACTTGCACATGGGCCTGTCCCTGCTGCGCGTGTCGGGAGGTCTGTACAGCTGCGACACGCTGGACGAGGACCTGCTGGCCTGCCTCCAGCCGCCCGCCCAGGTCGAGCCCGCCCTGGCACTCACGCGGAGCCGGTCTAATAGAATATATCTTGATCAAACCGATAGCAAATATTCTTTGAGAGTCGTGATTAGCGAACCGTTCAATAATATTCTTACTATTTGGTCCCCAGCATCAATGGGCGACGGACACAAGAAGTCCTCCGAGACGCTGAACTCTCAGTCCAGAGGTGAGGGAGTCAGCAGCGTGTCCGACGTGTCGTGTGAGGCGGGCGGGGCGAGCGGCGCGTGTCCCACCCCCGAGCGCTTCCCCCGGCTGGCGGGCGGGGTGGACGTGTGGTGGCTGTACGACGACGGCGGCCTGACGCTCCTGCTGCCCTACATCCTCTCCACGAGGCGGGCCTGGGCCTCGTGCCCGCTCCGGGTCTTCACCCTGGCCAACAACAACGCCGAAATGGAGATAGAGGAACGCAACATGGCGTCCCTCCTGTCCAAGTTCCGTATCGACTACTCGTCGCTGAAGATGATCCCGGACGTGTCCCGGCGGCCGCGGGACTCCACCCTCGCCTACTTCAACAAGCTCATAGAACCCTTCACCGCCAGGGACGACTCGGACGACAGCTTCGGCATCACCCCGTCGGAGCTGCGCGCGGCCGAGTCCCGCACTCACCGCTACTTGCGCGTGCGGGAGCTGGTGAGCAGCCAGTCGGCCTGCAGCCGCCTGGTGTGCGTGACGCAGCCCATGCCGCGGCGCCGGGGCCTGCCGCCCGCGCTGTACGCCGCCTGGCTGCACGCCCTCGCCACCGCCGCCGACCGCGTGCTGCTGGTGCGGGGGAACCACTCGTCCGTGCTCACCTTCTACTCCTAG

Protein sequence:

>DPOGS215409-PA
MESNPKDIELSPVNSSPDNSRSAREKFITPANDEEDVIIGRFRKISSAKSMAATNGNIQYPSEIIRNEADVEAGLQFKVKECEKERLLPEPSDSNVVVLPKSPVTKSSFRDMEKPSRFKDQPSTTRFQMENPDPRSDSDSSGMESDDPLTTSDTKYGKSFRHFTREALPRLDNYRNVLSLHAAPRPTLDELHNASLSRKPGQTMEKDQATVAIPTTSVKFGWIKGVLMRCLLNIWGVMLFLRLSWVVGQAGIAQASLLILTTSVVTTITALSMSAISTNGVIKGGGTYYMISRSLGPEFGGSIGLIFSMANAVACAMYVVGFAESLITLIPETAYMVDKNWDQAIYGCITIVLLTGIVMVGMEWEAKAQIVLLVVLLAAIADFCVGALVGPKSEQEVAQGFVGFNWTVMLSNLGPDYRYFEGQHHNFFSVFSIFFPAATGILAGANISGDLKDPQKSIPKGTLLAILLTTLSYLLIAVVAGACVVRDASGNLQDVVDGTLGLCRDNGTCQYGLHHSNDVIRLVSGFGPLIYGGCFAATLSSALASLVSAPKVFQALCQDKLYPWLEFFAKGYGANNEPVRGYVLTFVIAVAFILMGGLNQIAPLISNFFLAAYALINFATFHASLARPVGWRPTFRLYNMWLSLAGSLVCAAIMFVVSWFNALLTLAALLALYLLVSYRKPDVNWGSTTQAQRYKAALSGVHQLNAVSEHVKNYRPQILVLTGFPGERSMLTDFTYLLTKGLSLMLCGHILQSAINHRTREALSARAYQWFSKRNIKAFYTIVDDASFKDGAGALLQASGLGKLKPNILLMGFKEDWQTCPRQELAGYIDVMHKALDLHMGLSLLRVSGGLYSCDTLDEDLLACLQPPAQVEPALALTRSRSNRIYLDQTDSKYSLRVVISEPFNNILTIWSPASMGDGHKKSSETLNSQSRGEGVSSVSDVSCEAGGASGACPTPERFPRLAGGVDVWWLYDDGGLTLLLPYILSTRRAWASCPLRVFTLANNNAEMEIEERNMASLLSKFRIDYSSLKMIPDVSRRPRDSTLAYFNKLIEPFTARDDSDDSFGITPSELRAAESRTHRYLRVRELVSSQSACSRLVCVTQPMPRRRGLPPALYAAWLHALATAADRVLLVRGNHSSVLTFYS-