Monarch geneset OGS2.0

DPOGS206078
TranscriptDPOGS206078-TA2091 bp
ProteinDPOGS206078-PA696 aa
Genomic positionDPSCF300028 - 188027-197450
RNAseq coverage567x (Rank: top 22%)
Annotation
HeliconiusHMEL0057720.080.43% 
BombyxBGIBMGA006860-TA6e-11859.89% 
DrosophilaCG5687-PA1e-14748.42% 
EBI UniRef50UniRef50_F4X3I90.061.62%Sodium-coupled monocarboxylate transporter 1 n=24 Tax=Arthropoda RepID=F4X3I9_ACREC
NCBI RefSeqXP_001122196.10.060.24%PREDICTED: similar to CG5687-PA [Apis mellifera]
NCBI nr blastpgi|3838511800.060.55%PREDICTED: uncharacterized protein LOC100879178 [Megachile rotundata]
NCBI nr blastxgi|3838511800.059.83%PREDICTED: uncharacterized protein LOC100879178 [Megachile rotundata]
Group
Gene OntologyGO:00160204e-225membrane
GO:00068104e-225transport
GO:00550854e-225transmembrane transport
GO:00052154e-225transporter activity
KEGG pathway 
InterPro domain[35-607] IPR0017344e-225Sodium/solute symporter
[91-496] IPR0199001.5e-74Sodium/solute symporter, subgroup
Orthology groupMCL16477 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206078-TA
ATGGATTATCTGTCGAAATTCAGGTTTTTCAGTGTTTACAAGGTTGTATTTTTATATACAATTTTCTCCACATCTTTTACAAACGCAGCGGCTAAATTACTTGTATTAAACACAAGTCAAATTGACAACTTCGAACAGATACAAAATGATACAAATATCGAACTATTTCATTGGGAAGACTACACTGTTCTCGCTGCTATGCTCATCATATCTTGTGGTATTGGGGTATTTTACGGATATTTCGGAGAAAAACAATTGACAGGAGACGACTTTCTTCTTGGTGGATCGTCTATGGGAACGTTTCCGATGGCACTCAGTTTAGCGGCGAGCTTTATTACAGCCATTGAGTTGTTAGGGAATCCGGCAGAGATGTTTATAGCCGGTGGTCAATTTTGGATGATCTGTATAGCCTTCGTCCTTACAGTGCCCGTTGCAAGTCAATTATACCTACCAGTGTTTATGAGGCTTCGACTGACATCATGTTACGAATATTTGGAGGTACGATTTTGCAAGTCAATGCGTGTGTATGCAAGCGTCCTATACATGTTGCAAATGATATTGTACACTGCAGTTGCAGTGTATGCGCCGGCGTTGGCACTTTCAGACGTGACAGGACTTAATACGTATTTAGCGGTTTCTGTGGTATACATAGTGTGTATATTTTACGCTTCCCAGGGCGGTATGAAAGCTGTTATAATGACTGATACCTTTCAATCAGCCGTACTCATCGGCTCTTTAGCTGCCATTCTAGCTCTCGGCTCAGCACAAACCGGTGGCTTCAATTATATTTGGGACTACGCACAACGCACCGAAAGATTGCACTTTTTTGACATGAACCCCGACCCTACAGTACGGCACTCCTTTTGGTCGGTGGTTGTGGGTGGTACCATGTACTGGGTCAGCATGTTCTGCGCTAACCAAGCATCGGTGCAGAAATACTTATCAGTAGAGCGCATATCTCAAGCTCGTGTCGCGTTATGGGTATCAGCTATTGGCCTAGTATCTGTGTACTCTGTTAACTTCGCTACCGGCGCCCTTCTTGCAGCGCACTACGCGGGATGCGATCCCATTGAATCTGGACAAATAAATGCTTCTGACCGACTCCTGCCACTGTACGTCGTTAAACAGCTGGGTGTATATCGCGGCGTACCAGGATTCTTTGTGGCTGGCATTTTCGCGGCTAGTCTTGGAACAGTAGCATCAGCACTAAATTCCTTGGCAGCTATTGCATGTCAAGACTTGGCGTCCGGTTTGTTTGGTATCACTCTGCCAGAGAATAAGGGCGCAGCAATTGCAAGATGGGTGTGCTTGGCATGTGGTGCGTTGTCTTTCGCGCTTGTGTTCGTGGTGGAGAGATTGGGTCCTGTGCTGCAACTGGCCCTCTCGTTTAATGGAATGACGGGAGGTGTCACCTTAGGACTCTTCAGCCTCGGAATGTTGTTCCCGTGGGCAAACGCCAAGGGTGCGGTTGCGGGCGGCGTTTGCGGGTTGCTGCTGGTGGTAGCGGCGGGTGGCGGCGCGCAGTACGCCTCAGTGCCATTACCCAGGCTACCTGCCTCAACCGATACCTGCCCCACCACCTTAAATATGACATCTAATTATGACCATACAGAAAATCATGACAAGGCAGTATTTTGGTTATTCAAAGTGTCATATCTCTGGTATTCTGGACTGGGTTGTGTTGGCACTCTGATTATTGGGTTAACGGTTTCCCTGATCACTGGTATGACTAACCCCGCTGATGTTCCCATTGACCTTATATCGCCTCCAGTAGTCAGCTTCCTAAACTCATTGCCGAATAAGTTTAAGAAAGCGTTGCGTGTTCCGACTCGTCTCGGTTCGGAGCGTCGTAGAAGCTCGAGCGTCCGACCGCCGAGTCTGTCCGACGACAAGGACACGAGACGTCGCCGGCGGTTGTCCGAGTTGGCGGGCGGCGTGTTCTACAGCGATGCACCCACACTCACTCGCGGCCACGATAACCTCGCACTCGGCCTTGACTCAGAAAAACCGCCACCTCAGGAACCCAAGCGAACGAGTTTCAAATTCGATGCACCCTTAAATCCTAATTCTCCCCAAACATCCTCATGCTGA

Protein sequence:

>DPOGS206078-PA
MDYLSKFRFFSVYKVVFLYTIFSTSFTNAAAKLLVLNTSQIDNFEQIQNDTNIELFHWEDYTVLAAMLIISCGIGVFYGYFGEKQLTGDDFLLGGSSMGTFPMALSLAASFITAIELLGNPAEMFIAGGQFWMICIAFVLTVPVASQLYLPVFMRLRLTSCYEYLEVRFCKSMRVYASVLYMLQMILYTAVAVYAPALALSDVTGLNTYLAVSVVYIVCIFYASQGGMKAVIMTDTFQSAVLIGSLAAILALGSAQTGGFNYIWDYAQRTERLHFFDMNPDPTVRHSFWSVVVGGTMYWVSMFCANQASVQKYLSVERISQARVALWVSAIGLVSVYSVNFATGALLAAHYAGCDPIESGQINASDRLLPLYVVKQLGVYRGVPGFFVAGIFAASLGTVASALNSLAAIACQDLASGLFGITLPENKGAAIARWVCLACGALSFALVFVVERLGPVLQLALSFNGMTGGVTLGLFSLGMLFPWANAKGAVAGGVCGLLLVVAAGGGAQYASVPLPRLPASTDTCPTTLNMTSNYDHTENHDKAVFWLFKVSYLWYSGLGCVGTLIIGLTVSLITGMTNPADVPIDLISPPVVSFLNSLPNKFKKALRVPTRLGSERRRSSSVRPPSLSDDKDTRRRRRLSELAGGVFYSDAPTLTRGHDNLALGLDSEKPPPQEPKRTSFKFDAPLNPNSPQTSSC-