Monarch geneset OGS2.0

DPOGS208211
Transcript: DPOGS208211-TA  1644 bp
Protein: DPOGS208211-PA  547 aa
Genomic position: DPSCF300179 + 114287-179702
RNAseq coverage: 555x (Rank: top 23%)
Annotation
Heliconius: HMEL012014  0.0  67.57%
Bombyx: BGIBMGA003739-TA  2e-76  37.19%
Drosophila: CG10960-PB  7e-83  37.32%
EBI UniRef50: UniRef50_D2XRA7  0.0  66.87%  Sugar transporter 4 n=2 Tax=Obtectomera RepID=D2XRA7_BOMMO
NCBI RefSeq: NP_001165395.1  0.0  66.87%  sugar transporter 4 [Bombyx mori]
NCBI nr blastp: gi|284813579  0.0  66.87%  sugar transporter 4 [Bombyx mori]
NCBI nr blastx: gi|284813579  0.0  66.87%  sugar transporter 4 [Bombyx mori]
Group
Gene Ontology: GO:0055085  2.8e-76  transmembrane transport
               GO:0016021  2.8e-76  integral to membrane
               GO:0022857  2.8e-76  transmembrane transporter activity
               GO:0016020  2e-15  membrane
               GO:0022891  2e-15  substrate-specific transmembrane transporter activity
KEGG pathway 
InterPro domain: [120-527]  IPR005828  2.8e-76  General substrate transporter
                 [42-524]  IPR016196  1.4e-58  Major facilitator superfamily domain, general substrate transporter
                 [186-205]  IPR003663  2e-15  Sugar/inositol transporter
Orthology group: MCL34738  Lepidoptera specific

Nucleotide sequence:

>DPOGS208211-TA
ATGTCATTTGTGCCCTACAAGTGGCATTTACGTTATCGAGATGTTCTTGGCATCAGTCAATCTGAGCTGATTCAAGAGCGCGTCCGCAGCATCACCCGTCTCTTATCACAGCGTGTTATTAGAATGGATGAAAAAACAACAGATATCGATCCAGAGGCAGGAGCTTCACAGACAGAAGATAAAAATGGCAACTCAAAGGTCTTGAAACGCCTTGAAACTGTTCCGGATATTAAAAGAGACACTTCCGGCATCACCACACAATACATCGCCACTGGTATCGTGAGGCTCTCCTTTATGATGAGAAGAGTTTACAAAAAACTAATGGAAACACCGGGCATTCCAGTCACCAACAACGGAACTCATCCCAAACTAGTCCTGACAGACAGTGAGGCTTCATGGGTCGCTTCATTACTGTGTCTAGGAGCACTATGGGGCGCAGTACCCGCTGGGCTCATCTCAGAACACTTCGGAAGGAAGAAGACATTGCTGTATCTCGCTTTACCTCTTCTTGTCTCCTGGATCCTAGTCGCTTCAAGTCCTAATGTGTACGGTATGTACGTTGGTCGGTTCGTGGGTGGGATTGCTGTTGGAGCATTCAGCGTCGGCATCCCTCCATACGTGGAAGACATCGCAGAAATACAAAACCTACCAGCCCTCGTCAACTTCTACCACGTACATTTCTCTTGCGGTGTCCTCTTTGGATATATAATTGGTATGGTCCAAAGTACGTCTTGGCTGTCGGTCTTATGTGCCATCATACCAATCGCATATTTTATTGCTTTTATCTTCCTGCCAGAATCTCCGGCGTATCTCATATCTCAAGGAAAATCTAGCCAAGCAGAAGCTGCATTGCGGTACTTTCGTGGAATTGATAATAACGTTGAAGCCGAACTGAAGGAATTAAAAAAATATACAAGGAATACTGCGAAAAACCGTGTGACATTTAAAGAACTGTTCAGTACGAGATCGACTTTGAAAGCTCTCGTTGTTTCTTTTGGATTGATGATTTTCCAACAACTAAGCGGCATTTATCCTGTATTATTTTACGCAGAAAAAATCTTCAAAAAGTTTTCCATATCGCTGTACCTACCCGGCGCTACTATTATTTTGGGCTTCTGTCTCGTATCGTCTACCTACTTCTCCACAATGTTCGTGAAAAAAGTGAGACGACGTATTTTGTTAATGGTATCGTTTTCAGTGATGTTCCTTAGTTTGGCAGGCTTGGGTGTTTATTATCACTTAAAGGCATCTAACATCATATCTGACAGTACGTGGGTGCCGGTACTCACTCTCTGCATATTTGTATCTGTATACGCGGTCGGTGCCGGACCTATACCTTGGTTGATGTTGAGAGAAATATTCCCACCGCAAGTGAGGAGACGAGCCACAGCCATCACAGCTGGATTCCATTGGTTTTTGGCATTTGGGGTAACGAAATTATATCAGAATTTCCTTGACGTAGTAAGCCTTGGGTGGACGCTTTGGAATTTCTCTATTATCTGTCTCATAGGTACAGCGTTCGTTTATTTAGTTGTACCCGAGACAAAGGGACGAACGCTAGAGGAAATTCAAAATCAATTTGAAGGTATTCACAAGACGAAAACGCATATACATGTCATAGAGGTAGAAACCATTAACGGTTAG

Protein sequence:

>DPOGS208211-PA
MSFVPYKWHLRYRDVLGISQSELIQERVRSITRLLSQRVIRMDEKTTDIDPEAGASQTEDKNGNSKVLKRLETVPDIKRDTSGITTQYIATGIVRLSFMMRRVYKKLMETPGIPVTNNGTHPKLVLTDSEASWVASLLCLGALWGAVPAGLISEHFGRKKTLLYLALPLLVSWILVASSPNVYGMYVGRFVGGIAVGAFSVGIPPYVEDIAEIQNLPALVNFYHVHFSCGVLFGYIIGMVQSTSWLSVLCAIIPIAYFIAFIFLPESPAYLISQGKSSQAEAALRYFRGIDNNVEAELKELKKYTRNTAKNRVTFKELFSTRSTLKALVVSFGLMIFQQLSGIYPVLFYAEKIFKKFSISLYLPGATIILGFCLVSSTYFSTMFVKKVRRRILLMVSFSVMFLSLAGLGVYYHLKASNIISDSTWVPVLTLCIFVSVYAVGAGPIPWLMLREIFPPQVRRRATAITAGFHWFLAFGVTKLYQNFLDVVSLGWTLWNFSIICLIGTAFVYLVVPETKGRTLEEIQNQFEGIHKTKTHIHVIEVETING-
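
The lengths reported for this record are mutually consistent: 547 residues plus one stop codon account for 3 × 548 = 1644 bp. A minimal sketch of that check follows. It assumes the transcript is a bare coding sequence with no UTRs and that a trailing `-` or `*` in the protein marks the stop; the function name `cds_matches_protein` is hypothetical and not part of any database tooling.

```python
def cds_matches_protein(cds: str, protein: str) -> bool:
    """Check that a coding sequence is consistent in length with its protein.

    Assumes `cds` has no UTRs and that `protein` may end with a stop
    marker ('-' or '*'), which is not counted as a residue.
    """
    residues = protein.rstrip("-*")  # drop trailing stop marker(s), if any
    cds = cds.upper()
    return (
        len(cds) == 3 * (len(residues) + 1)    # one codon per residue + stop
        and cds.startswith("ATG")               # standard start codon
        and cds[-3:] in {"TAA", "TAG", "TGA"}   # standard stop codons
    )


# Lengths reported in this record: 1644 bp transcript, 547 aa protein.
assert 1644 == 3 * (547 + 1)
```

Passing the full DPOGS208211-TA and DPOGS208211-PA sequences to the function applies the same check to the actual records rather than to the reported lengths alone.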