Monarch geneset OGS2.0

DPOGS201101
TranscriptDPOGS201101-TA1461 bp
ProteinDPOGS201101-PA486 aa
Genomic positionDPSCF300137 - 455721-466995
RNAseq coverage40x (Rank: top 72%)
Annotation
HeliconiusHMEL0053194e-16559.78% 
BombyxBGIBMGA013656-TA2e-14955.90% 
DrosophilaCG10960-PB2e-4627.93% 
EBI UniRef50UniRef50_E2BLV71e-5429.82%Solute carrier family 2, facilitated glucose transporter member 8 n=7 Tax=Formicidae RepID=E2BLV7_HARSA
NCBI RefSeqXP_396250.22e-5730.98%PREDICTED: similar to CG1213-PA, isoform A isoform 1, partial [Apis mellifera]
NCBI nr blastpgi|3800242261e-5630.77%PREDICTED: facilitated trehalose transporter Tret1-like [Apis florea]
NCBI nr blastxgi|3504039861e-5931.48%PREDICTED: facilitated trehalose transporter Tret1-like [Bombus impatiens]
Group
Gene OntologyGO:00550851.1e-60transmembrane transport
GO:00160211.1e-60integral to membrane
GO:00228571.1e-60transmembrane transporter activity
KEGG pathway 
InterPro domain[55-458] IPR0058281.1e-60General substrate transporter
[9-466] IPR0161961.3e-51Major facilitator superfamily domain, general substrate transporter
Orthology groupMCL19868 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201101-TA
ATGTTACAAAGAAAATATTTTCAAGATGGAGGACAAATAAATCAAATAATTTGTGCGCTCTTGATAAATTTGCCCGTACTATCTTATGGATGTAGCGTTGGGTGGATGTCCCCTATGACACTTCTTCTGCAGTCAAAAGATTCACCCAGAGGGACCCCTTTAACCGATTTAGAGGTGTCATGGATGGCATCAGTTCCATACTTGGTTTGCATCCCATGTGATCTTCTCATGGCTGTCATAACAGATAAGTGGGGGAGGAAAACAGCTTTGATACTTATATCGATATCATCAGCGATAAGTTGGATTCTTCTTCTCTCGTCCTTCAACATTTGGGTCTTGATTCTGGGCCGAGCGCTAGTTGGGATCAGTATGGCAGGTTCCTACGTTACGTGCCCTATTTACACCAAGGAAATAAGTGATGACAACATCCGAGGCGCCTTGGGATGCTTGGTTATTCTTTTCCAAACAACCGGCAACCTCTTTTTGTATATTATAGGGGATATTTTAAGTTATAACTCTATACTCTGGATATGTCTAGCTATTCCTGGGATACATATACTGTTGTTCATACTAATGCCTGATTCCCCTTCCTACCTGCTCAAGAAAGGAAGAATTGAGGATACCACCAGAGCCTTATCATGGCTGAGATGTAGACCAGCTGGTGATCCCAAAATCGAACAAGAACTAGATTTGATCAGGGCTGAACAGGACAAAGATGAATCCAAGAATTTTTTACTGAAGGATATATATCAAGACAAAATTCTGTTCAGGGCTTTTATAATAGCCATGGTGACGACACTGTCCAGAGAAGCTTGTGGTGCGGTGCCAGTTCTCAACTTCGCAGGGGAAATCTTCAGTCTAGCATCCAGTGACAATAATCTACGTCTCAGTCCAAATCAACAAGCCATGCTGTTGGGGGGTGTTCAAGTACTCGGTTCAGCGTTGGCTTCCAGTTTGGTCGAGAAATCTGGGCGAAAGCCGCTGCTCTTCACAACGTCCCTTCTATCTGGTATCAGTATGTGCACACTGGCGTCTTGGTTCCTTCTCCGTGATAATGGTATCCTAGCACCTTCCTGGTTGCCACTGGTTACGCTGTGTGTTTGCATCTTCTGCGATTCCTCCGGTCTACAACCCATGTCCGTGGTCATAACGGGAGAAATATTCTCTTTCAGATACCGTGGAACGATATTAGCAATAACGATGGCATGTGCGTCATTATTTGACTTTGTGCAACTGTTATTTTTCAAGTCTCTAGCCAATGCTGTTGGGATTCACGTCTCATTTTACTTTTTTGGTATTCTTTGTCTCCTGATGGCTCTATACGTGATATTGGCGATACCAGAAACAAGAGCCAGAAGTCTAGAAGATATTTACAAAGATCTCGTAAAGAAGAAAGATTTGAAGGGGATTGTTAATGAAAGATATGTTGAAACAAGAGACAGAGAAGTGTCACGAATTTGA

Protein sequence:

>DPOGS201101-PA
MLQRKYFQDGGQINQIICALLINLPVLSYGCSVGWMSPMTLLLQSKDSPRGTPLTDLEVSWMASVPYLVCIPCDLLMAVITDKWGRKTALILISISSAISWILLLSSFNIWVLILGRALVGISMAGSYVTCPIYTKEISDDNIRGALGCLVILFQTTGNLFLYIIGDILSYNSILWICLAIPGIHILLFILMPDSPSYLLKKGRIEDTTRALSWLRCRPAGDPKIEQELDLIRAEQDKDESKNFLLKDIYQDKILFRAFIIAMVTTLSREACGAVPVLNFAGEIFSLASSDNNLRLSPNQQAMLLGGVQVLGSALASSLVEKSGRKPLLFTTSLLSGISMCTLASWFLLRDNGILAPSWLPLVTLCVCIFCDSSGLQPMSVVITGEIFSFRYRGTILAITMACASLFDFVQLLFFKSLANAVGIHVSFYFFGILCLLMALYVILAIPETRARSLEDIYKDLVKKKDLKGIVNERYVETRDREVSRI-