Monarch geneset OGS2.0

DPOGS207268
TranscriptDPOGS207268-TA1803 bp
ProteinDPOGS207268-PA600 aa
Genomic positionDPSCF300008 - 362449-368914
RNAseq coverage35x (Rank: top 74%)
Annotation
HeliconiusHMEL0163010.086.24% 
BombyxBGIBMGA012132-TA2e-11777.87% 
DrosophilaCG9657-PA3e-7430.84% 
EBI UniRef50UniRef50_E2BD542e-8932.60%Sodium-coupled monocarboxylate transporter 1 n=4 Tax=Formicidae RepID=E2BD54_HARSA
NCBI RefSeqXP_973939.26e-9834.33%PREDICTED: similar to igf2 mRNA binding protein, putative [Tribolium castaneum]
NCBI nr blastpgi|3071696847e-9733.97%Sodium-coupled monocarboxylate transporter 1 [Camponotus floridanus]
NCBI nr blastxgi|3407230797e-10234.69%PREDICTED: sodium-coupled monocarboxylate transporter 2-like [Bombus terrestris]
Group
Gene OntologyGO:00160201.2e-106membrane
GO:00068101.2e-106transport
GO:00550851.2e-106transmembrane transport
GO:00052151.2e-106transporter activity
KEGG pathway 
InterPro domain[13-564] IPR0017341.2e-106Sodium/solute symporter
Orthology groupMCL25923 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207268-TA
ATGAGGAACTCCAGTGTTCCAGTGTTGCCTATTTATTTTAACGTGGCCGAGTATTGTTTATTCGGGATAATCGTTGGTGTCATTATAGGAATAATAGTTTATTATGTATTTGTTAACAATAAGTACAATACAGTGTCTGGATTTTTGTTTGGAGGAAAAAATATGTCTATCATATCTATATCATTGGCCCTAACAGCAAGTCACCTTACAAGTATAACATTACTCGGAGTTCCAGTCGAGATATACTTGCGTGGTACTCAGTACTGGGCGTCCGCTTTATCACTTATCTTTGTCACTTTCTTAACAGCTGTAATTTATTTGCCAGTTTTCCACAGGCTACAGCTGTCTTCGAGTTTTGAATATTTAGAAATTAGGTTTTCTACTCACATACGTACCATAGCAGCGGTTCTGTTTGTTGTTAGTAAGCTGATGTTGCTGCCGATTGTTCCCTACGTCCCCCTATTGGCTTTCAGATTGGTAACTAATGCGGAAGCGAGCCGAATCACTGCCGCTCTATGCGTTGCTTGTTCGACATTTATAGCAGTGGGAGGTTTACGGGCAATAGTTAGCATCGGAATTGTGACAACGTTTTTAGCATTTGCCGGTACAGCATTACCAAGTGGGCTGGCGTTACTGCCGATGGGATTCAAAAAAATGTGGGAGGTCGCTAACCACGGCAGTAGACTGGTTCTGTACGACCCTGATCCTGAAATTGCTCACCACACATCTTTCTTTGTTGTAACTTTGGCACTAAGTACAAACTGGCTGTGGAAGATAGCGCTAAGCCAGTCTTCATTGCAGAAGTTGTTAGCCGTTCCTACCATAAGCAAAGCCAGGATATGCCTGACGATATCCTGTGTTGGTGTAATATTAATGAAACTGTTATCCTGTTTCCTTGGATTGGTTTTATACGCTTGGTTTGCTGGTTGTGATCCATTACTCACAGGACAAATAAAAAAACACGAACAGCTGGTACCGCATTTTTTAAATAATTTGTCAACGATGTTTCCAGGAATATGTGGCATCTTTATAATATCTATCTTCAGTGCAACAGCATGTTGTATTGCGTCTATTATTAATTCAGTGTCTGGGGTTGTGTTTGAAGAATTCATAAGACCGTGGATGCCAGAAAGTACAAATGAATTGGCGTGCTGTAGATTTATGAAGTTTCTCTGCATAGTGGTTGGTCTTTACTGTGGAGCTGTTATATGGTTGGTGCTTGAATTAGACCGCCTGCAACATGTAGCTTCCGGTGTCACTGGAGTTACTGCAGGAACACTTTTGGGCATATTCACTTTAGGGATAGTCTTCTCAAGAGCAAATTGTTCCGGTGCGTTAAGCGGCTGCCTCTTAAGTTTGCTACTCTGTGGCTGGCTTCTCGTGGGGGCTGAAAATGCTTTAGCCACGGGGGCATTAACCTTTCAGGGTAAACCGCTAGTAACGTCAGGATGCGGCAAAACTAACTTTACGCATATTCTGAATGTGACAACTATTATTCCTGTAGCATTAATGCCAAAGAAGCCGCTGCAGTCATTATTCCGAATATCTTTTACCTATTGTCCATTCGCCGGAGCCATAACAGTTCTCCTCATTGGAGTGCCCATGAGCTATCTCACAGGAAAATCAAAAACCGATTCAATGAACCCAGACGTCTTTTGTCCTCTCACACAAAGCTTTTTACACAGACTTCCCGATAGAAGTAATTCCGTCTCCTTGGAACTCCGACCAGCTAACACCCAGGAAGTTTACATGGCGTTGGACGAAGCTTTGAAACATTTGAAGGAAGTTATAGATAAGAAATAA

Protein sequence:

>DPOGS207268-PA
MRNSSVPVLPIYFNVAEYCLFGIIVGVIIGIIVYYVFVNNKYNTVSGFLFGGKNMSIISISLALTASHLTSITLLGVPVEIYLRGTQYWASALSLIFVTFLTAVIYLPVFHRLQLSSSFEYLEIRFSTHIRTIAAVLFVVSKLMLLPIVPYVPLLAFRLVTNAEASRITAALCVACSTFIAVGGLRAIVSIGIVTTFLAFAGTALPSGLALLPMGFKKMWEVANHGSRLVLYDPDPEIAHHTSFFVVTLALSTNWLWKIALSQSSLQKLLAVPTISKARICLTISCVGVILMKLLSCFLGLVLYAWFAGCDPLLTGQIKKHEQLVPHFLNNLSTMFPGICGIFIISIFSATACCIASIINSVSGVVFEEFIRPWMPESTNELACCRFMKFLCIVVGLYCGAVIWLVLELDRLQHVASGVTGVTAGTLLGIFTLGIVFSRANCSGALSGCLLSLLLCGWLLVGAENALATGALTFQGKPLVTSGCGKTNFTHILNVTTIIPVALMPKKPLQSLFRISFTYCPFAGAITVLLIGVPMSYLTGKSKTDSMNPDVFCPLTQSFLHRLPDRSNSVSLELRPANTQEVYMALDEALKHLKEVIDKK-