Monarch geneset OGS2.0

DPOGS212673
Transcript: DPOGS212673-TA (1242 bp)
Protein: DPOGS212673-PA (413 aa)
Genomic position: DPSCF300198 + 177438-181257
RNAseq coverage: 22x (Rank: top 79%)
Annotation
Heliconius  HMEL007510  2e-128  49.18%
Bombyx  BGIBMGA014054-TA  3e-90  42.72%
Drosophila  CG10960-PB  2e-20  23.19%
EBI UniRef50  UniRef50_D6X0Y1  6e-27  24.35%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X0Y1_TRICA
NCBI RefSeq  XP_624322.1  3e-29  25.93%  PREDICTED: similar to CG10960-PB, isoform B [Apis mellifera]
NCBI nr blastp  gi|66558353  7e-28  25.93%  PREDICTED: facilitated trehalose transporter Tret1-like isoform 3 [Apis mellifera]
NCBI nr blastx  gi|66558353  2e-32  25.62%  PREDICTED: facilitated trehalose transporter Tret1-like isoform 3 [Apis mellifera]
Group
Gene Ontology
GO:0055085  1.4e-25  transmembrane transport
GO:0016021  1.4e-25  integral to membrane
GO:0022857  1.4e-25  transmembrane transporter activity
KEGG pathway 
InterPro domain
[148-401]  IPR005828  1.4e-25  General substrate transporter
[1-146]  IPR016196  1.1e-16  Major facilitator superfamily domain, general substrate transporter
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212673-TA
ATGAGTGAAAAAGTAAAGCCAAAAGCATTTTTAATGCAAGGATGTGCGACTTTGATTATTTGTTTTCTGACATCGTTGACTGGTTTCGTTTTCGCCTGGCCCTCTTATACCTTCCAAATATATTTGTCCAATGAGACGTATCTAGAAGCTCCTATTAGTACCAGTCAAATGTCAATGTTGGGAAGTATTATAAATGTTGGAGCTTTGTTAGGGACGCCCTTGACAGTTTATATGGCTGACAAACTTGGAAGGAAGTATTCTGCTATGCTCGTTGGACTACCCTACGTCATAACATGGGCGTTAGTCTCCGTCACCAGGTCGTATTACGTAGTCCTTTTCTCGCTTGGCTTGTCTGGTCTCAGTGCCGCTGGCCAATCAATTTCTTCTATTTACATATCAGAAATATCTCAAGACTCGATCAGAGGGTCATTGACCTCATCGGAAGCTCGGGATTCCATCGCGTTCTATCGACGGGTGGATGTAGATTCCAAAGAAGTGGAAGGAGAAATTAAGAGTCTGAGAGTTCAATTAGATTTAAATTCGGATATTATTATAGAAGACAGCACGACAAATCACGTGCCTGAAACTGTAAACGAAAAATCAGAATCATCCAAGAGAGCTTTGTTGACGGTGATAATAACTATGTCAGTCATGGTTCTTATGGGTTCATTGGTATTGCAAATGTTTGCAGACTCCATATTTAAGGAGGCCATTCCTAGTATGCGGCCGAATACGTGTGCTATATTACTTGTTGTGGATTATCTGATGGCATCATTGGTTGGTGCTTCCACGTTGGATAAATTAGGACGGAAGAATCTTATGACGATAACGTCCTTCATTGCCGGTATATTTACGATCCTCATAGGGACACAGCTGCACATGCATTGGGCTCCATATTGGTTCACCGCTGTCATTATATACTTACACAGTTTCATTTTCAACTTAGGAGTCGCGCAAGTACCCCTGGTGCTGGCCGCAGAAGTGTTTTTACCGGAGGTACGAGCTCTTGGGAATAGCATTGCCTTGGCGTTTTTGTGGATCACAAACTGGATTGTTGTGTCAACATTCTTGCCTTTGGTTGAATTCATTGGTTTAGGGCAAACATTTTATATATTTTCTGTTATATGTTTCATTGGTTCGGTCTACAGTCATTTATGTCTTCCGGAGACGAAAGGATTATCAGCGGATGCTATTCAACTTCTGTTTATAAAGAAAGAGAGAAACAGTAATCTAAAAGTATAG

Protein sequence:

>DPOGS212673-PA
MSEKVKPKAFLMQGCATLIICFLTSLTGFVFAWPSYTFQIYLSNETYLEAPISTSQMSMLGSIINVGALLGTPLTVYMADKLGRKYSAMLVGLPYVITWALVSVTRSYYVVLFSLGLSGLSAAGQSISSIYISEISQDSIRGSLTSSEARDSIAFYRRVDVDSKEVEGEIKSLRVQLDLNSDIIIEDSTTNHVPETVNEKSESSKRALLTVIITMSVMVLMGSLVLQMFADSIFKEAIPSMRPNTCAILLVVDYLMASLVGASTLDKLGRKNLMTITSFIAGIFTILIGTQLHMHWAPYWFTAVIIYLHSFIFNLGVAQVPLVLAAEVFLPEVRALGNSIALAFLWITNWIVVSTFLPLVEFIGLGQTFYIFSVICFIGSVYSHLCLPETKGLSADAIQLLFIKKERNSNLKV-
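
Length check:

The lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the geneset pipeline. The function names are hypothetical. It assumes that the transcript is coding sequence only, that the trailing "-" in the protein sequence marks a stop codon, and that the genomic coordinates are inclusive.

```python
def cds_matches_protein(transcript_bp: int, protein_aa: int, has_stop: bool = True) -> bool:
    """True if the transcript length equals 3 nt per residue, plus one stop codon if has_stop."""
    codons = protein_aa + (1 if has_stop else 0)
    return transcript_bp == 3 * codons


def genomic_span(start: int, end: int) -> int:
    """Length of an inclusive coordinate range (assumption: both ends are counted)."""
    return end - start + 1


# Values copied from the record above.
print(cds_matches_protein(1242, 413))   # 3 * (413 + 1) == 1242
print(genomic_span(177438, 181257))     # span on scaffold DPSCF300198
```

Under these assumptions the transcript and protein lengths agree, and the genomic span (3820 bp) is longer than the transcript, which would be consistent with the gene containing introns.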