Monarch geneset OGS2.0

DPOGS210742
TranscriptDPOGS210742-TA1188 bp
ProteinDPOGS210742-PA395 aa
Genomic positionDPSCF300013 + 332886-334073
RNAseq coverage3234x (Rank: top 4%)
Annotation
HeliconiusHMEL0217710.082.78% 
BombyxBGIBMGA006273-TA0.078.48% 
DrosophilaGlut1-PK3e-12253.05% 
EBI UniRef50UniRef50_G3I4Q01e-9444.21%Solute carrier family 2, facilitated glucose transporter member 3 n=9 Tax=Euteleostomi RepID=G3I4Q0_CRIGR
NCBI RefSeqXP_001664054.15e-12354.01%glucose transporter [Aedes aegypti]
NCBI nr blastpgi|1571378739e-12254.01%glucose transporter [Aedes aegypti]
NCBI nr blastxgi|1571378735e-11953.55%glucose transporter [Aedes aegypti]
Group
Gene OntologyGO:00550857.8e-113transmembrane transport
GO:00160217.8e-113integral to membrane
GO:00228577.8e-113transmembrane transporter activity
GO:00160201.9e-103membrane
GO:00228911.9e-103substrate-specific transmembrane transporter activity
KEGG pathwayspu:5902741e-96 
 K07299 (SLC2A1, GLUT1)maps-> Pathways in cancer
    Adipocytokine signaling pathway
    Renal cell carcinoma
InterPro domain[1-393] IPR0058287.8e-113General substrate transporter
[1-389] IPR0036631.9e-103Sugar/inositol transporter
[1-388] IPR0161961e-59Major facilitator superfamily domain, general substrate transporter
Orthology groupMCL25222 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210742-TA
ATGATTGGCTGTCCTCTGGCCAGTTGGGTGGCTGATACCCGGGGGCGTAAATTCGCGTTAATGGTGAATGCCGCATTTGGAGTCGTTGGCGCTGTTCTGATGGGCTTCAGTAAGATGTCTACATCACTTGCAATGCTGATCATTGGCAGATTTCTGATTGGTATCAATTGTGGATTTGCTACGACCGCTTCGCCGACTTACGTGTCAGAGATCGCGCCAATTAGATTGCGAGGTGCCTTTGGTACTGTTAACCAATTGGCTGTGGCGCTCGGTTTGACCTTAGGACAGGTTTTGGGTATTGATGTGATTTTGGGGAGCGATGAAGGTTGGCCCTGGCTTCTTGGACTCGCGATCGTCCCATCAACGATACAATTCTTCATGCTGATCCTGGCTCCGGAGTCGCCTCGCTATCTTCTTCTCGTTCAAAGAGATGAAGAACAAACCAGAAAAGTGTTGTCCAATCTCCGTGGGACTTCCGACATAAATGATGAAATCAAGGACATGCACGACGAAGACCACGCCGAGAAACAGGAACAGAAATTCTCGATCGCTGATTTGATACGCATCAAGTTCTTACGTACACCTATGATCATTGGTATAGTGATGCATCTCTCTCAGCAGCTGGGAGGCATAAATGCGGTGTTGTACTATTCGTCGTCTATCTTTATCAAAACGGGATTGAGCGACGGGGATGCAAGATTGGCGTCGATAGGTGTTGGGAGTATGTTGTTCATAATGGCCTTAGTGTCTATACCTTTGATGGACCGCCTCGGCAGACGGACGCTCCAGCTGGTAGGTCTGGGTGGCATGACAGTTTTCTCTGTGCTGATGACGATAGCTTTCTTTACGTACGAGAACAATACAACGATGAGTATATTTGCCGTTATATTTACGTTGTTGTACGTCGGGTTCTTCGGCGTCGGACCGAGCTCTATACCTTGGATGATCCTGTCCGAGCTGTTCAGCCAGGGCGCTAGGAGTGCCGCTGTTAGTGTTGGCGCCCTTGTCAACTGGTTAGCGAACTTCATAGTGGGCCTGACCTTTATACCTCTATCAGATGCTTTAGGCAACTTCGTTTTCTTGCCGTTCACTGTATTATTGATATTCTTCTTTGCGTTCACGTATTTCAAGTTACCGGAGACCAAAAACAGGACGATCGAAGAGGTGACAGCTATATTTAAGAAGTAA

Protein sequence:

>DPOGS210742-PA
MIGCPLASWVADTRGRKFALMVNAAFGVVGAVLMGFSKMSTSLAMLIIGRFLIGINCGFATTASPTYVSEIAPIRLRGAFGTVNQLAVALGLTLGQVLGIDVILGSDEGWPWLLGLAIVPSTIQFFMLILAPESPRYLLLVQRDEEQTRKVLSNLRGTSDINDEIKDMHDEDHAEKQEQKFSIADLIRIKFLRTPMIIGIVMHLSQQLGGINAVLYYSSSIFIKTGLSDGDARLASIGVGSMLFIMALVSIPLMDRLGRRTLQLVGLGGMTVFSVLMTIAFFTYENNTTMSIFAVIFTLLYVGFFGVGPSSIPWMILSELFSQGARSAAVSVGALVNWLANFIVGLTFIPLSDALGNFVFLPFTVLLIFFFAFTYFKLPETKNRTIEEVTAIFKK-