Monarch geneset OGS2.0

DPOGS213912
Transcript: DPOGS213912-TA, 1008 bp
Protein: DPOGS213912-PA, 261 aa
Genomic position: DPSCF300218 - 1226-2450
RNAseq coverage: 0x (Rank: top 99%)
Annotation
Heliconius: HMEL015557, E-value 5e-88, identity 64.76%
Bombyx: BGIBMGA002852-TA, E-value 7e-73, identity 54.43%
Drosophila: CG4797-PB, E-value 1e-20, identity 30.97%
EBI UniRef50: UniRef50_E0X906, E-value 4e-31, identity 34.82%, Sugar transporter protein 3 n=2 Tax=Obtectomera RepID=E0X906_BOMMO
NCBI RefSeq: NP_001182631.1, E-value 7e-32, identity 34.82%, sugar transporter protein 3 [Bombyx mori]
NCBI nr blastp: gi|307611929, E-value 1e-30, identity 34.82%, sugar transporter protein 3 [Bombyx mori]
NCBI nr blastx: gi|91084567, E-value 5e-33, identity 36.56%, PREDICTED: similar to sugar transporter [Tribolium castaneum]
Group
Gene Ontology:
GO:0055085, 4.1e-25, transmembrane transport
GO:0016021, 4.1e-25, integral to membrane
GO:0022857, 4.1e-25, transmembrane transporter activity
KEGG pathway:
InterPro domain:
[48-232] IPR005828, 4.1e-25, General substrate transporter
[1-203] IPR016196, 1.5e-24, Major facilitator superfamily domain, general substrate transporter
Orthology group: MCL21008 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213912-TA
ATGGTTTCGCCCTATTTTAAACAGACATGGACAGTCATGGGGGTCTTGCTGAATATGTTTGGCCAAGGCATGGTGCTGAGTTATCCTTCTATCATATTGCCAACTCTTTTGTCTCCCAACTCCATCATTAAGGCGGAATTTCACGAGGCTTCATGGGTTGCATCATGCATTGGCATCGCTTCGATACCTGGATTTCTGTCGTCTTCTTTTCTTATGGAAATGTATGGAAGAAAAAAAGCTCATGTGGCTGTGTTGATACCGGGGATAGTCGGTTGGCTGATAATTTACTTTGCTACAAGTATACCGGTTCTGTTGTGCGGTAGATGTTTATGTGGATTTGCTGTGGGAGCAACAATAAGTCTCGGAGCTATTGTGATAGGAGAATATACTACCAGTCCCAACAAGAATAGGGGAATATTTCTTAATTTGAAGACTGCAGCTGTGTGTTTGGGCAATATGGCTGTCCATATTCTTGCTCACTTTCTTAACTGGAACACAATAGCACTCATAGCTGTTATTCCCCTTATGTTAGCACTGCTTATAATCCTAACATGGCCTGAAAGCCCATCTTGGCTAGCATCCAAGCGGCGGTTTGACGAAAGTCAAAAGTCATTCTATTGGTTGAGAGGAAACGGTAAGAGAGCTATCCTAGAAATGGAAGACTTACTTAGAACACAAAAAGAAAAATTGTCGCAATATCCTGAACATGTGGATAnCTATCCAGAACTAACATCAGTTCATTCTTGGAACGTGCGTCGAAGACAGCCAGGTTCGTTTCAAATTTGATTAGGTAGAGAGAGGAGCTTGAGCGTTTACCGCCGGCGTTACATAGAGGCCCTTTAAACCTACACCTGGGTCGCAAGTCCCCGGCACTATTGAAACGTAATGCGATGGACCGGCACCCGTACGTTGAATCCATGGATTTCTGAGAGGTTTCGTCTCTTCACTGTATCTGTAACTTTATTTTACGAATCACTTAAGTCCAATGAAATGCGTGCTTTTCATTGA

Protein sequence:

>DPOGS213912-PA
MVSPYFKQTWTVMGVLLNMFGQGMVLSYPSIILPTLLSPNSIIKAEFHEASWVASCIGIASIPGFLSSSFLMEMYGRKKAHVAVLIPGIVGWLIIYFATSIPVLLCGRCLCGFAVGATISLGAIVIGEYTTSPNKNRGIFLNLKTAAVCLGNMAVHILAHFLNWNTIALIAVIPLMLALLIILTWPESPSWLASKRRFDESQKSFYWLRGNGKRAILEMEDLLRTQKEKLSQYPEHVDXYPELTSVHSWNVRRRQPGSFQI-