Monarch geneset OGS2.0

DPOGS206847
Transcript  DPOGS206847-TA  1944 bp
Protein  DPOGS206847-PA  647 aa
Genomic position  DPSCF300001 - 3015749-3023292
RNAseq coverage  616x (Rank: top 21%)
Annotation
Heliconius  HMEL009312  0.0  81.54%
Bombyx  BGIBMGA012800-TA  0.0  69.65%
Drosophila  Indy-PB  1e-149  49.08%
EBI UniRef50  UniRef50_Q16UM1  4e-150  47.02%  Sodium/dicarboxylate cotransporter, putative n=11 Tax=Endopterygota RepID=Q16UM1_AEDAE
NCBI RefSeq  XP_308709.4  2e-160  50.72%  AGAP007055-PA [Anopheles gambiae str. PEST]
NCBI nr blastp  gi|158286367  3e-159  50.72%  AGAP007055-PA [Anopheles gambiae str. PEST]
NCBI nr blastx  gi|158286367  2e-163  50.46%  AGAP007055-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology  GO:0016020  3e-76  membrane
               GO:0055085  3e-76  transmembrane transport
               GO:0006814  3e-76  sodium ion transport
               GO:0005215  3e-76  transporter activity
KEGG pathway
InterPro domain  [44-579]  IPR001898  3e-76  Sodium/sulphate symporter
Orthology group  MCL10209  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206847-TA
ATGTCAGCTGCGGGTGAAAGTGGTATTCCAGAAGAAGGACGTGGCACAACATACTATTTCTCAAAGAGTAGGTCTAATGCTACAATATGGCAGAGATTGTGTCTCTTCTTTTCAATTTATTGGAAGTCTGTTATCGTTGTTTTGACACCAATTGTTTTACTGCCATTGCCTATCTTAAATTCTGGATCTGAATTTGCTCGTGCATACCGCTGTATGTATGTGGTGTTAATAATGGCTACGTATTGGGTGTTGGAGTTGCTGCCTCTTCCAATTACCGCCATGTTGCCGATCGTGCTGTTCCCGACGATGGGAATACTGGACTCTGATCGCACATGTGCTGCTTACATGAGAGAAACGAACATGATGTTCATGGGTGGATTGATGATAGCTGCTGGTGTGCAGCACTCTAAACTGCCCAAGAGGGTGGCTTTGTGGACAGTGCAGGTGGTTGGCTGCTCTCACAGACGTTTAAACTTCGGTCTAACTTTCGTGACGATGTTCATATCAATGTGGGTTTCAAACGCAGCGGCCACGACAATGATGGTACCCATGGTTGAGGCTATACTGGAGGTTTTAGAACAGCAAGGCTTTGGTGAAGTGTACATAAACAAGAAGAAGGCGCTTTCTGAAAATGGAACCACTGTTAAGACTGATAAGAATACAGAAGAAGAGCCTCCTGTGCCTAGTGACACAACCATCTGCTACTACTTGAGCATCGCATATGCCTCTACCCTTGGTGGATGTGGGACTCTTGTTGGAACAGCCACCAACTCAGCTTTCAAGGGGATTTTTGACTCGGAGTTTCCAGAAATATCGGGTGCAGTAGATTTCTTTTGGTTTATGGCATACAGCACTCCTCCGATGCTGTTAATGCAGATTCTGGTGTGGTTGTCACTACAAGTGACGTACATGGGTATGTTCAGGCCAAATAGTGAGGCCGCCAAAAGGGCTTCCCAAGCATCTGGTGGCTCTGAGACCACCATGAACATTATAAGACAACAGTACAAGGGACTGGGGCCGGTCACCTTCCATGAGAAGGCTTCAGGTACATTATTCATCCTAGCTGTGTTCCTGTACATCTTCAGGAAGCCGGGCTTCATGTTGGGATGGGCTGATGTCATAACATCCATGAGAGTAAAAGATGGTGTAGTGTCAATTCTCATTGTTGTCCTGATGTTTATTCTGCCCATGTCGGTGGACTTCATCAAGTTCTTCACAACTACCGCATCATATGAAGAGTTGGCTGCCTCAAAGCCTTCCACTGGAATTGTTACATGGAATATTCTGAAGGAGAAGATTCCTTGGGGTCTGCTTTTCTTGTTGGGTGGAGGTTTCGCGCTCGCTGAAGGCAGTAAGGCGACGGGTCTCTCAGCTATGATCGGGTCGTCGTTGACCGGTCTTCATGGTCTTCCACCAGCTGTCGTTCTACTTGTAGTAGTTTTGGTCACACAGTTCATCACTGAGTTCACATCCAACGTGGCCATCGCTAATCTTATACTGCCAGTGCTAGCCAATATGGCCCGTACTCTGGAAATGGATCCTCGATACTTAATGGTCCCGGCGACTCTGGCTTGTTCTATGGCTTTCCACATGCCTGTGGGGACTCCTCCCAACGCTATAGTGGCAGGTGTCGCACACATACCTACCTCTAGAATGGCTGTGGGTGGTATCGGTCCTAAAATAATCACTACCCTCATTGTATGGGGTGCTTATCCAACCTGGGGTTCGTTGATCTTCAAGCCGGAGGATATTATCGCTTTAGACTCTGGACCGAGGGTCTCAACTAAGAATATCAGCCTAAATTGTGCCGTCACCGCCAGTAACCTTATGCCAAATCACGTTTACAACTGCAGCATGCCAGTAACTGGCATAGCGAATAAAACTCTAGCCCTACAGGTTAGTAATAGCTCTTTTGCATGCAATTATACGTTATTTAAGAAATAA

Protein sequence:

>DPOGS206847-PA
MSAAGESGIPEEGRGTTYYFSKSRSNATIWQRLCLFFSIYWKSVIVVLTPIVLLPLPILNSGSEFARAYRCMYVVLIMATYWVLELLPLPITAMLPIVLFPTMGILDSDRTCAAYMRETNMMFMGGLMIAAGVQHSKLPKRVALWTVQVVGCSHRRLNFGLTFVTMFISMWVSNAAATTMMVPMVEAILEVLEQQGFGEVYINKKKALSENGTTVKTDKNTEEEPPVPSDTTICYYLSIAYASTLGGCGTLVGTATNSAFKGIFDSEFPEISGAVDFFWFMAYSTPPMLLMQILVWLSLQVTYMGMFRPNSEAAKRASQASGGSETTMNIIRQQYKGLGPVTFHEKASGTLFILAVFLYIFRKPGFMLGWADVITSMRVKDGVVSILIVVLMFILPMSVDFIKFFTTTASYEELAASKPSTGIVTWNILKEKIPWGLLFLLGGGFALAEGSKATGLSAMIGSSLTGLHGLPPAVVLLVVVLVTQFITEFTSNVAIANLILPVLANMARTLEMDPRYLMVPATLACSMAFHMPVGTPPNAIVAGVAHIPTSRMAVGGIGPKIITTLIVWGAYPTWGSLIFKPEDIIALDSGPRVSTKNISLNCAVTASNLMPNHVYNCSMPVTGIANKTLALQVSNSSFACNYTLFKK-
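
The lengths listed for this record are consistent with each other if the transcript is read as a coding sequence only: 1944 bp is 648 codons, that is, 647 amino acids plus one stop codon. The minimal Python sketch below illustrates that check and translates the first and last few codons of DPOGS206847-TA. It assumes the standard genetic code; the function names `translate` and `expected_protein_length` are hypothetical helpers introduced here for illustration and are not part of the database. The protein record writes the stop as `-`, whereas this sketch uses `*`.

```python
from itertools import product

# Standard genetic code (assumed here), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def expected_protein_length(cds_length_bp: int) -> int:
    """Protein length implied by a CDS that ends in a single stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# First 30 and last 15 nucleotides of DPOGS206847-TA, copied from the record.
first_codons = "ATGTCAGCTGCGGGTGAAAGTGGTATTCCA"
last_codons = "TTATTTAAGAAATAA"

print(translate(first_codons))        # start of the protein sequence
print(translate(last_codons))         # end of the protein sequence plus stop
print(expected_protein_length(1944))  # length implied by the transcript size
```

The same `translate` call can be applied to the full nucleotide sequence to compare the result against the DPOGS206847-PA record.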