Monarch geneset OGS2.0

DPOGS214271
Transcript: DPOGS214271-TA  1635 bp
Protein: DPOGS214271-PA  544 aa
Genomic position: DPSCF300014 + 1736874-1745075
RNAseq coverage: 896x (Rank: top 14%)
Annotation
Heliconius: HMEL011401  0.0  72.29%
Bombyx: BGIBMGA005988-TA  0.0  74.07%
Drosophila: CG8451-PA  1e-111  39.83%
EBI UniRef50: UniRef50_D6WKE2  2e-119  43.36%  Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WKE2_TRICA
NCBI RefSeq: XP_001658302.1  7e-126  44.18%  sodium/solute symporter [Aedes aegypti]
NCBI nr blastp: gi|157115829  1e-124  44.18%  sodium/solute symporter [Aedes aegypti]
NCBI nr blastx: gi|270007530  3e-119  43.01%  hypothetical protein TcasGA2_TC014127 [Tribolium castaneum]
Group
Gene Ontology: GO:0016020  1.9e-138  membrane
               GO:0006810  1.9e-138  transport
               GO:0055085  1.9e-138  transmembrane transport
               GO:0005215  1.9e-138  transporter activity
KEGG pathway 
InterPro domain: [4-506]  IPR001734  1.9e-138  Sodium/solute symporter
                 [48-355]  IPR019900  2e-56  Sodium/solute symporter, subgroup
Orthology group: MCL10870  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214271-TA
ATGTCGTTCACTATGAAAGAAGGTGGTTTTGGCGTGGGAGACTATGTTGCATTTGGGGTGTTATGTGCGTCTTCATGCGCTGGAGGTGTTTGGTACAGCGCGGTGGGATCCAGGAGCAAGGCTGTAGTCGACTTGAAGGACTACCTTCTCGGAGGAAAGGCTATGTCCACCTTTCCCGTCGCCATGTCGCTTATCGCCAGTTACGTATCCGGCGTGACGATACTTGGTACTCCGGCCGAGATTTACAATTACGGCTCCGAGTATTGGCTGGTGGTGGTGGGTGTGACTCTTAGCTGCTTGATCGTAGCCACCGTCTACTTACCAGTGTTCTGTACTCTGAGACTGTCTTCCTCATATGAGTATCTTGAACTTCGTTTCAATCGGCACGTGCGGGCTGTGGCGTCTGTGCTTTTTTTATTGGACGAAGTATTATTTCTGCCTATGGTCGTGTACGTTCCTGCTTTAGCATTTAATCAATTGACTGGTTTTAATGTGTTCGCGGTTGGTGGTGTTATGGTCCTTATATGTGGACTTTATACAGTATTAGGTGGTCTCCGCGCGGTCGTATGGACGGACTCGGTTCAGACCGGGGTCATGTTTATCGGTGTGTTACTTGTGGCGGCGGCTGGGACGCTTGCGGTCGGGGGCGTTAGTGCTGTCGTTAGCATCGCAAACGAATCCGGGAGACTTGAAATCTCCAATTGGAGCTTTTCTCCGTACGAACGTCAAACTGGTTGGGGTGCGATATTCGGTGGTTTCCTCTACTGGACTTGCTTCAACTCCGTCAACCAGACCATGGTCCAGCGTTACATAGCTTTACCATCAAAGCGGAAGGCGATCACCGCGCTCTTCATCTTCTGTATCGGCGCGATCCTCGCTATATCTCTGTGCGTGTGGTGTGGCCTGGCGGCGTGGGGAGCCTGGGTCCAGGGCGGCTGCGACCCCAGCGGCTCTCCCCTGGTCGGTGACCAATTACTCCCAGCGTTCGTGACATACGTCGCTGAAGTCCAACACCTGCCTGGTCTCTCTGGCGTATTCCTCGCTGGAGTATTTGGAGCCGGATTAAGGTCTATACATTTTAATACAAAAAATATGGAACAAATGACTAGCGTGGCGACTGCGTTGTCAGCGATCGCAGCCAGCGCCACTTGTGGTATCTTCACGCTGGGCATGTCGTGCTGGTGGGTCGGTCCTCGCGGGGCGGCTGCGGGCGGCGCGGCGGGCGCTCTCCTGGCCGGGGCCGTGTCTCTCGGCAGCCAGGCGGCCGCGGCCTCCGGTCTGAGAGCACTCCCAAGAAACTTCACTGAAGCCTGTTCTGCTAACGCCACTTTTATCCCCGAACAGACCATCGATCCGACCACAATATTTCCTCTATTCCGCTTATCGTATCACTGGATCGCGCCGCTCGGTCTCCTGGCCACTATAGTCGTTGGCATGATAGTAGGCTGGTTGTTCGACAAGCCGGATCCAAGTAAGATGGACGCGGAACTGTTCACTCCTGTAGTGTGGAGGTTGTTGCCTAAGGAGGCCCACGAGAACGCTGGCATGACGCGCCAGGCGCTGCCCGTCAAAGCGGAGTCGCCGGCGCCCTCCGCCAGGCTTATACTGGAACAGCTGCCAGATAAGGTTGAGTGA

Protein sequence:

>DPOGS214271-PA
MSFTMKEGGFGVGDYVAFGVLCASSCAGGVWYSAVGSRSKAVVDLKDYLLGGKAMSTFPVAMSLIASYVSGVTILGTPAEIYNYGSEYWLVVVGVTLSCLIVATVYLPVFCTLRLSSSYEYLELRFNRHVRAVASVLFLLDEVLFLPMVVYVPALAFNQLTGFNVFAVGGVMVLICGLYTVLGGLRAVVWTDSVQTGVMFIGVLLVAAAGTLAVGGVSAVVSIANESGRLEISNWSFSPYERQTGWGAIFGGFLYWTCFNSVNQTMVQRYIALPSKRKAITALFIFCIGAILAISLCVWCGLAAWGAWVQGGCDPSGSPLVGDQLLPAFVTYVAEVQHLPGLSGVFLAGVFGAGLRSIHFNTKNMEQMTSVATALSAIAASATCGIFTLGMSCWWVGPRGAAAGGAAGALLAGAVSLGSQAAAASGLRALPRNFTEACSANATFIPEQTIDPTTIFPLFRLSYHWIAPLGLLATIVVGMIVGWLFDKPDPSKMDAELFTPVVWRLLPKEAHENAGMTRQALPVKAESPAPSARLILEQLPDKVE-
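
The lengths listed in this record can be cross-checked with simple arithmetic. The sketch below assumes the transcript is entirely coding sequence (it begins with ATG and ends with the stop codon TGA) and that genomic coordinates are 1-based and inclusive. The function names are hypothetical and are not part of any database tool.

```python
def expected_protein_length(cds_bp: int, has_stop_codon: bool = True) -> int:
    """Return the number of amino acids encoded by a CDS of cds_bp bases.

    Assumes the whole transcript is coding sequence; a terminal stop codon
    occupies one codon but contributes no amino acid.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if has_stop_codon else codons


def span_length(start: int, end: int) -> int:
    """Return the length of a 1-based, inclusive coordinate span (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Values taken from the record: 1635 bp transcript, DPSCF300014 1736874-1745075.
print(expected_protein_length(1635))   # 544
print(span_length(1736874, 1745075))   # 8202
```

Under these assumptions, 1635 bp corresponds to 545 codons, that is 544 residues plus the stop, which matches the listed protein length. The gene spans 8202 bp on the scaffold, so the transcript covers only part of that span.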