Monarch geneset OGS2.0

DPOGS201366
TranscriptDPOGS201366-TA1782 bp
ProteinDPOGS201366-PA593 aa
Genomic positionDPSCF300083 - 251237-258038
RNAseq coverage228x (Rank: top 44%)
Annotation
HeliconiusHMEL0250390.087.97% 
BombyxBGIBMGA000687-TA4e-10975.85% 
DrosophilaCG6723-PA6e-13245.26% 
EBI UniRef50UniRef50_E2BD540.056.09%Sodium-coupled monocarboxylate transporter 1 n=4 Tax=Formicidae RepID=E2BD54_HARSA
NCBI RefSeqXP_973939.20.063.14%PREDICTED: similar to igf2 mRNA binding protein, putative [Tribolium castaneum]
NCBI nr blastpgi|1892371540.063.14%PREDICTED: similar to igf2 mRNA binding protein, putative [Tribolium castaneum]
NCBI nr blastxgi|1892371540.063.60%PREDICTED: similar to igf2 mRNA binding protein, putative [Tribolium castaneum]
Group
Gene OntologyGO:00160201.1e-217membrane
GO:00068101.1e-217transport
GO:00550851.1e-217transmembrane transport
GO:00052151.1e-217transporter activity
KEGG pathway 
InterPro domain[1-546] IPR0017341.1e-217Sodium/solute symporter
[26-429] IPR0199006.6e-90Sodium/solute symporter, subgroup
Orthology groupMCL17731 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201366-TA
ATGTTGCTTCTATCCGCTTTGATTGGTGTCTACTTTGCTTTCTTCGCAACGAAGAAACAAAATACTACAGCTGAATATTTAATGGGTGGGAAAACGATGGGCATGTTTCCAATATCTATGTCCTTGATAGCCAGCTATGTATCTGGAATTTCGCTGTTGGGACTACCTGCTGAAATGTACACATACGGCACACAACTTTGGACAATCGTGCTGTCGGAGTGGGTGGTATCAGTAACGATCTCTATAATCTATCTTCCGGTGTTCTATAACTTGCAAATTACATCTACTTATGAGTATCTTCGTTTGAGGTTCAATCAAAACGTGCGTCTTTTAGGATCAATAATATTTATTATTAAAATGTTATTGTACATACCAATTGTCATTTATGTACCAGCATTAGCGTTTAGTCAAGTTACTGGAGTGAATTTACATATGATAACGCCTATCGTATGTCTCGTTTGTATCTTCTATACTACGCTTGGAGGCCTTAGAGCCGTCGTATGGACGGATGCTTTGCAAACAATTTTAATGTATTTTGGCGTGGTTTTCGTACTCGTATATGGAACTTGGCGGCTTGGAGGAGTATCGGAGGTGATCAGAATTAATAAAGAAGGAGACCGTCTCGACTTTTTCATAATGGACATGGATCCCACAATTCGTCATACTTTTTGGACTACCGTCTTTGGAAACTACTTCAGCTGGCTAGCGTCATGTTCTGTCAACCAAGCAATGATTCAGCGCTGCCTTGCACTATCTTCTTTAAAGAGGGCTAGAATAACAATATTTATAATGGCAGCAGGAATATTCATCATTGTATCACTGTGTTGTTACACGGGGCTGGTTATATATGCCACATTCGCCACATGCGACCCTCTAACCACTGGGGCTATACGTAAAAGTGATCAATTGCTACCATATTTCGTCATGACGATCACCGGTTCAATACCAGCACTCCCAGGGATCTTCATGAGCGGTGTCTTCAGCGCCGCTTTGAGTTCCATGTCCACGGGACTTAATTCTTTATGTGGAGTTATATTCGAAGATTTGATCAGACCAGCTTACAACAAACCGATATCCGAAAGGACTGCCAGCTTCATCATGAAAATAATAGTGGTTGTTATTGGTGGGATATGCGTTGCGCTTGTTTTTCTTGTCGAACACATGGGAGCGTTGATTCAAGCTGGCAAAAGCCTAGCTGGTATAACTGCTGGGAGTTTGCTTGGTCTATTCAGTATGGGACTATTCCTACCATGGATCAATAACACGGGGGCGCTTGTTGGTGGCATTTCATCCACTCTGTTAGTTGGATGGATCTCACTGGGCACGCAGGCGGCAATGCTGCGCGGAGAGATCGTGATAACCCCGAAGCCTGTTGACGTGTCCGGTTGTGCAACTAACTATACACTGCCGACTACATCACCGCCACACACAACTGTAGAATTTGATAGGTCGGGGACATTTTTTCTGTACAGATTAAGCTACCTGTATTACACTTTAGTTGGAATGTTAGTTTGTATCACTGTCGGATCTCTTGTGTCCTACTTCACGCGATTTAATGATCCTGCCATGGTTCACCGCGACCTCTTAACACCGGTGATACATCGGTTCTTGCCACCCCAAAGTGTATACTGTCGTCGTCCACGTCTCACGCAACACGATATTGAGTTACACGCGAGAGAAATGGAACGTCTGCGCTCAAGTGCTGCCTTAATGAAAGATGAAACGGGAATCGAAGCCAATCTGTTTCCTCGCAGGAACAGCAACAATAATTATATGTATTAA

Protein sequence:

>DPOGS201366-PA
MLLLSALIGVYFAFFATKKQNTTAEYLMGGKTMGMFPISMSLIASYVSGISLLGLPAEMYTYGTQLWTIVLSEWVVSVTISIIYLPVFYNLQITSTYEYLRLRFNQNVRLLGSIIFIIKMLLYIPIVIYVPALAFSQVTGVNLHMITPIVCLVCIFYTTLGGLRAVVWTDALQTILMYFGVVFVLVYGTWRLGGVSEVIRINKEGDRLDFFIMDMDPTIRHTFWTTVFGNYFSWLASCSVNQAMIQRCLALSSLKRARITIFIMAAGIFIIVSLCCYTGLVIYATFATCDPLTTGAIRKSDQLLPYFVMTITGSIPALPGIFMSGVFSAALSSMSTGLNSLCGVIFEDLIRPAYNKPISERTASFIMKIIVVVIGGICVALVFLVEHMGALIQAGKSLAGITAGSLLGLFSMGLFLPWINNTGALVGGISSTLLVGWISLGTQAAMLRGEIVITPKPVDVSGCATNYTLPTTSPPHTTVEFDRSGTFFLYRLSYLYYTLVGMLVCITVGSLVSYFTRFNDPAMVHRDLLTPVIHRFLPPQSVYCRRPRLTQHDIELHAREMERLRSSAALMKDETGIEANLFPRRNSNNNYMY-