Monarch geneset OGS2.0

DPOGS214616
Transcript: DPOGS214616-TA, 972 bp
Protein: DPOGS214616-PA, 323 aa
Genomic position: DPSCF300050 + 38143-41404
RNAseq coverage: 113x (Rank: top 59%)
Annotation
Heliconius: HMEL014278, 2e-151, 80.12%
Bombyx: BGIBMGA001664-TA, 1e-137, 75.78%
Drosophila: Glut1-PK, 1e-76, 44.89%
EBI UniRef50: UniRef50_D4AHX5, 3e-103, 60.13%, Sugar transporter 10 n=22 Tax=Coelomata RepID=D4AHX5_NILLU
NCBI RefSeq: XP_392938.2, 1e-104, 60.57%, PREDICTED: similar to Glucose transporter 1 CG1086-PB, isoform B isoform 1 [Apis mellifera]
NCBI nr blastp: (no hit listed)
NCBI nr blastx: (no hit listed)
Group: (none listed)
Gene Ontology:
  GO:0055085, 3e-82, transmembrane transport
  GO:0016021, 3e-82, integral to membrane
  GO:0022857, 3e-82, transmembrane transporter activity
  GO:0016020, 3.3e-72, membrane
  GO:0022891, 3.3e-72, substrate-specific transmembrane transporter activity
KEGG pathway: cfa:482437, 1e-76
  K07299 (SLC2A1, GLUT1) maps to:
    Pathways in cancer
    Adipocytokine signaling pathway
    Renal cell carcinoma
InterPro domain:
  [6-319] IPR005828, 3e-82, General substrate transporter
  [7-315] IPR003663, 3.3e-72, Sugar/inositol transporter
  [10-315] IPR016196, 2.9e-29, Major facilitator superfamily domain, general substrate transporter
Orthology group: (none listed)
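
The genomic position in this record can be compared with the transcript length. The sketch below parses the position string and computes the genomic span, assuming the coordinates are 1-based and inclusive (the record does not state its coordinate convention). The function names `parse_position` and `span_length` are hypothetical and used only for illustration. Under that assumption the span is longer than the 972 bp transcript, which would be consistent with intronic sequence inside the gene model.

```python
def parse_position(text):
    """Split a 'SCAFFOLD STRAND START-END' string into its parts."""
    scaffold, strand, span = text.split()
    start, end = (int(x) for x in span.split("-"))
    return scaffold, strand, start, end


def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


scaffold, strand, start, end = parse_position("DPSCF300050 + 38143-41404")
genomic_bp = span_length(start, end)   # genomic span of the gene model
transcript_bp = 972                    # transcript length from the record
print(scaffold, strand, genomic_bp, genomic_bp - transcript_bp)
```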

Nucleotide sequence:

>DPOGS214616-TA
ATGGCGTCTCTTCTTCATCAGATCGGAACAGTCTACCAGTTGGTGATCACGATGTCAATTCTGTTGTCCCAAGTGCTGGGTCTGAACTCAGTGCTGGGGACCGATAGCTGGCCCTGGCTGCTGGCTGTGCCTCTCATCCCAGCTGTGCTTCAATGTGGCATGCTGAGAATGTGCCCTGAATCACCGAAATATCTTTTACTTAACAAGGGCGCGGAGTTGAGGGCCCAAAGAGCTCTCAACTACTTGAGAGGTGACGTCGCCGTTCACGGTGAGATGGAAGAAATGCGTCAGGAAGCGGAGAAGAACAAAGTAAGCAAGAAGGTGACTCTCCGCGAGCTGTTCCGTGACCGCAGCCTCCGCCGGCCGCTGCTGGTGGCGGTGGTGGCGATGGTGGCCCAACAGTTCTCTGGAATCAACGTCGTTATATTTTTCTCCACTGAAATCTTCACAGCCGCCAACCTGAGTCCCACTCAGAGCCAGTACGCCACTCTGGGTATGGGTGCTATGAACGTGGTGATGACCGTGGTGTCCCTGATGCTGGTGGAGATCGCGGGTCGGAAGACCTTACTCCTGGCGGGCTTCTCTGGCATGTTCCTGTGTACTGTGGGGCTCTGCGTCGCCAGTCTATATACTACCCACATCTGGGTGTCGTATCTGTGCATCGCGCTGGTGATTTTGTTCGTGGTCACGTTCGCTGCGGGTCCCGGCTCCATACCCTGGTTCCTCGTGACCGAGCTGTTCAACCAGTCGTCTCGCCCCGCGGCTTCGTCTGTCGCCGTCACTGTCAACTGGACGGCCAACTTCATCGTCGGCCTCAGTTATTTGCCGCTGGCCTCCGTTCTGAAGTCCAACACCTTCGCCATCTTCGCCGTGCTTCAGTTCATCTTCATCATCTTCATCGCGGCCAAGGTCCCGGAGACGAAGAACAAAACCATCGAAGAAATAACGGCCATGTTCAGACAACAGCTGTAG

Protein sequence:

>DPOGS214616-PA
MASLLHQIGTVYQLVITMSILLSQVLGLNSVLGTDSWPWLLAVPLIPAVLQCGMLRMCPESPKYLLLNKGAELRAQRALNYLRGDVAVHGEMEEMRQEAEKNKVSKKVTLRELFRDRSLRRPLLVAVVAMVAQQFSGINVVIFFSTEIFTAANLSPTQSQYATLGMGAMNVVMTVVSLMLVEIAGRKTLLLAGFSGMFLCTVGLCVASLYTTHIWVSYLCIALVILFVVTFAAGPGSIPWFLVTELFNQSSRPAASSVAVTVNWTANFIVGLSYLPLASVLKSNTFAIFAVLQFIFIIFIAAKVPETKNKTIEEITAMFRQQL-
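
The nucleotide and protein entries above can be checked against each other. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and assuming that the trailing `-` in the protein entry marks the stop codon. The `translate` function is a hypothetical helper written for this illustration, not part of any geneset tooling. It is applied here to the first 30 nt and the last 21 nt of DPOGS214616-TA; the full sequence can be passed in the same way.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard genetic code).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nucleotides, stop_symbol="-"):
    """Translate a coding sequence in frame 0; stop codons become stop_symbol."""
    seq = nucleotides.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 nt and last 21 nt of the transcript, copied from the record.
head = translate("ATGGCGTCTCTTCTTCATCAGATCGGAACA")
tail = translate("ATGTTCAGACAACAGCTGTAG")
print(head, tail)

# Length check: 972 nt is 324 codons, i.e. 323 residues plus one stop codon.
codons = 972 // 3
print(codons - 1)
```

The two translated fragments match the start (`MASLLHQIGT`) and end (`MFRQQL-`) of the protein entry, and the codon count agrees with the listed 323 aa.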