Monarch geneset OGS2.0

DPOGS204795
TranscriptDPOGS204795-TA1245 bp
ProteinDPOGS204795-PA414 aa
Genomic positionDPSCF300460 - 109092-113742
RNAseq coverage20x (Rank: top 79%)
Annotation
HeliconiusHMEL0131850.082.56% 
BombyxBGIBMGA010722-TA3e-13865.04% 
DrosophilaCG1213-PC1e-7636.84% 
EBI UniRef50UniRef50_Q7JVN62e-7436.84%CG1213, isoform A n=19 Tax=Diptera RepID=Q7JVN6_DROME
NCBI RefSeqXP_397016.23e-7638.18%PREDICTED: similar to CG1213-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3838566254e-8241.10%PREDICTED: facilitated trehalose transporter Tret1-like [Megachile rotundata]
NCBI nr blastxgi|3838566252e-8141.10%PREDICTED: facilitated trehalose transporter Tret1-like [Megachile rotundata]
Group
Gene OntologyGO:00550851.2e-66transmembrane transport
GO:00160211.2e-66integral to membrane
GO:00228571.2e-66transmembrane transporter activity
GO:00160203e-11membrane
GO:00228913e-11substrate-specific transmembrane transporter activity
KEGG pathway 
InterPro domain[55-407] IPR0058281.2e-66General substrate transporter
[1-407] IPR0161963e-47Major facilitator superfamily domain, general substrate transporter
[114-133] IPR0036633e-11Sugar/inositol transporter
Orthology groupMCL25765 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204795-TA
ATGTTCGGTTTTGACATTCAGGAAGTCAAATTAAAATGGCGGCAATATTTGGCAGCGTCGTTGGCGAGTTACGGCAGTCTCTGTACTGGCATGTCTATGGGTTGGACCTCGCCGGTGTTCCCCCATCTACGATCTGTGAACTCACCGCTGGCGGAACCCCCGACCTTACAACAAGAATCTTGGATTGGATCGCTTTTGGTACTAGGAGGATTATTAGGTCCGTTGATAACAGTTCCGCTGTCAAATAGGATCGGCAGACGTTACGTTATCATGATCTCCAATATTCCTCTGCTCCTCGGCTGGCTGTTGGCCGGAGTCGCCTCTGACTTGCCCACGCTATACGCGGCGCGCATCATGTGGGGCTGCGCCACGGGCATGCAGTTCGCCACAGTGCCTTTATACATCGGTGAGATCGCTGAGGATAAAATCCGCGGTTCGCTTAGCGCTCTCTTCCTACTTTTCATCAACATCGGCTTCCTGCTCGCGTACGCGATCGGTCCCTATTCCTCGTACTGGGGTCTCACAGCATCAGGCGGTATACTGTCGCTGTTCTACGTACCGTTCACTTGGCTTATACCGGAGACGCCATTCTTCCTTGTCTACAAAGACAAAACTGAAGAGGCTATACAAGTATTGCAACAACTCCGCGGCAGCTCCAAGGAAGCTGTTCAAGACGAGCTCGACGGTCTGCGAGCGATGGTTCAGAGGGAGTTCAAAACGGAACCCAGCGTTAGAGACCTCTGGGCGAGTTCCGGCAATTTGAAGGCTTTGGGTATATGCGTGTTCCTGGCCATGCTGCTCCAGCTGTCCGGGATCGATGTGTTACTGTTTTACATGGAGGAGCTCTTAGAGAAGGTCGGCACGAAAATATCAGCCGCCGACGGCACCGTCATCATGGGAGTCGTTCAAGTCGTGACGAGTTGTATAACGCCGCTCGTAGTGGACAGGCTCGGCAGAAAACTGCTCATGTGGACGACCTCCCTCGGCCTGGCTGTATTTCTAAGCGTAATAGGCGTCTACGCGTTGTTGGACTCTCATTTTAAGTATAACGTGGAACCTTACGCCTTCCTGCCTCTGCTGTGTCTCGTCGTCTACATGGTGCTGTTTACTTTGGGTGTGGGTCCGGTCCCGTGGATACTGGTCGCGGAAATGTTCCCTCCTCGTAGTAAATGCCTCGCCAGCGGCGTCGCCTCCTTCATGTGCTGGCTCGCGGGCTTCGTGTGGACGAGGCGAGTGAACAAATAA

Protein sequence:

>DPOGS204795-PA
MFGFDIQEVKLKWRQYLAASLASYGSLCTGMSMGWTSPVFPHLRSVNSPLAEPPTLQQESWIGSLLVLGGLLGPLITVPLSNRIGRRYVIMISNIPLLLGWLLAGVASDLPTLYAARIMWGCATGMQFATVPLYIGEIAEDKIRGSLSALFLLFINIGFLLAYAIGPYSSYWGLTASGGILSLFYVPFTWLIPETPFFLVYKDKTEEAIQVLQQLRGSSKEAVQDELDGLRAMVQREFKTEPSVRDLWASSGNLKALGICVFLAMLLQLSGIDVLLFYMEELLEKVGTKISAADGTVIMGVVQVVTSCITPLVVDRLGRKLLMWTTSLGLAVFLSVIGVYALLDSHFKYNVEPYAFLPLLCLVVYMVLFTLGVGPVPWILVAEMFPPRSKCLASGVASFMCWLAGFVWTRRVNK-