Monarch geneset OGS2.0

DPOGS210150
Transcript         DPOGS210150-TA  2748 bp
Protein            DPOGS210150-PA  915 aa
Genomic position   DPSCF300465 - 11166-29644
RNAseq coverage    188x (Rank: top 48%)
Annotation
Heliconius      HMEL007475        0.0     73.31%
Bombyx          BGIBMGA010161-TA  4e-167  60.81%
Drosophila      CG4797-PB         5e-56   29.74%
EBI UniRef50    UniRef50_Q7PX65   1e-95   40.16%  AGAP001236-PA n=1 Tax=Anopheles gambiae RepID=Q7PX65_ANOGA
NCBI RefSeq     XP_974017.1       2e-110  31.41%  PREDICTED: similar to sugar transporter [Tribolium castaneum]
NCBI nr blastp  gi|91082977       3e-109  31.41%  PREDICTED: similar to sugar transporter [Tribolium castaneum]
NCBI nr blastx  gi|91082977       5e-115  31.51%  PREDICTED: similar to sugar transporter [Tribolium castaneum]
Group
Gene Ontology    GO:0055085  8.7e-50  transmembrane transport
                 GO:0016021  8.7e-50  integral to membrane
                 GO:0022857  8.7e-50  transmembrane transporter activity
KEGG pathway
InterPro domain  [467-891]  IPR016196  5.4e-52  Major facilitator superfamily domain, general substrate transporter
                 [469-892]  IPR005828  8.7e-50  General substrate transporter
                 [32-443]   IPR011701  2.9e-24  Major facilitator superfamily
Orthology group  MCL19581  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210150-TA
ATGGTGATAAAATTGCAAATAAACGATACAAATCAGAATGTGAGTCGGTTGCGTTCTATAACGTCTCAGCTAATAGCATGTTCTTCTTCATTTTTATTATTATTTGACTTGGGAATGGCGATAAATTTTTCAACCATTATAATACCAGCATTGCTTGACTCAAAGGGGGAAATTTCATTTGATGAGAGCCAAGCTTCTTGGTTCGGGAGCATATCATTTTTGGCCCAACCAATTGGAGCAATAGTTTCAGGTCCCTTAGTAGATTATGTAGGACGCAAGAAGGCCAACTTTTTGGTTAACATACCGATGATAGCGGCGTGGCTCCTTATGTACTTTGCTTGGAATCTACCCTCGCTTTTCACTGCTAATGCTCTGTTAGGAATTAGTTCGGGAATCATGGAAGCCCCTATTAATTCTTATGTTGGTGAAATAAGCGAACCGTCAATCCGTGGAGCGCTGTGTACCTTGACACAATTCTTCTCATCGTTCGGTATACTGGTTATGTATTTCCTGGGAACTTTTATGCAGTGGCGAAATGCTGCCCTCATGTGTCTCATAGCGCCCATTGCCTCCATGATTACTGTTGCATTTTCACCGGAAACTCCAGTATGGCTATTAACAAGGAATCGTGAAAAGGAAGCGCTTAAGTCACTTTGCACGCTTCGAGGATGGACTACTCCAGATAATGTCAAGGAAGAATTTACCGACCTTCTTGATTATAGCAAGAAATTACAGCAATGTGTGATATGTTGCAACACGAACCAGGACTGTAAAAGTTGTCCTCATGAGTCCATGAATTGGTTTATTAGGCGTGTTCTCAAAATCAGATATGTTATAATGTGCAAAGAAACCTTAAGGCCATTGACTCTCGTGGTAATGTACTTTTTATTTTTTGTGATGAGCGGACTTACGCCAATCAGACCGAATCTGGTCAATGTATGTGGAGCTTTTGGAATGGCACAGGACAGCAAGCAAGTTGTGCTGTTCGTCGGAGTAATCACGTTTTTGGTTTGTTTCCTTATTATTGGTCTGATAAAAATCTTGGGTAAACGAAAATTGGTAATATCCTCAATGCTCGGAAGCGCAATTTCTTGTTTACTTCTGAGTACGTACGCGGCTAAGGTCTTGGACGAATCTGTGTCATCTTATCACCCTGAGACATTTCCTGAGAAAACAAGTTTGACACCATTGATATTATTTTATTTCATGACTATATTCACTGGATTGGGGATACCGTGGGTGCTGCTTGGAGAGCTGTTTCCATTTAGAAGCCGCGCCACAGCCCAGGGTTTATCAGCTGCTAGTTTCTACGTTTTTTCATTTCTTGGTTCTAAAACCTTCATTAATCTTGAGAACAGTGTGAAATTATGGGGAACTTTTGCTACATACGCAGCTTTCGGTTTTGCCGGGAGCATATCATTTTTGGCCCAACCAATTGGAGCAATAGTTTCAGGTCCCTTAGTAGATTATGTAGGACGCAAGAAGGCCAACTTTTTGGTTAACATACCGATGATAGCGGCGTGGCTCCTTATGTACTTTGCTTGGAATCTACCCTCGCTTTTCACTGCTAATGCTCTGTTAGGAATTAGTTCGGGAATCATGGAAGCCCCTATTAATTCTTATGTTGGTGAAATAAGCGAACCGTCAATCCGTGGAGCGCTGTGTACCTTGACACAATTCTTCTCATCGTTCGGTATACTGGTTATGTATTTCCTGGGAACTTTTATGCAGTGGCGAAATGCTGCCCTCATGTGTCTCATAGCGCCCATTGCTTCCATGATCACTGTTGCATTTTCACCGGAAACTCCAGTATGGCTATTAACAAGGAATCGTGAAAAGGAAGCGCTTAAGTCACTTTGCACGCTTCGAGGATGGACTACTCCAGATAATGTCAAGGAAGAATTTACCGACCTTCTTGATTATAGCAAGAAATTACAGCAATGTGTGATATGTTGCAACACGAACCAGGACTGTAAAAGTTGTCCTCATGAGTCCATGAA
TTGGTTTATTAGGCGTGTTCTCAAAATCAGATATGTTATAATGTGCAAAGAAACCTTAAGGCCATTGACTCTCGTGGTAATGTACTTTTTATTTTTTGTGATGAGCGGACTTACGCCAATCAGACCGAATCTGGTCAATGTATGTGGAGCTTTTGGAATGGCACAGGACAGCAAGCAAGTTGTGCTGTTCGTCGGAGTAATCACGTTTTTGGTTTGTTTCCTTATTATTGGTCTGATAAAAATCTTGGGTAAACGAAAATTGGTAATATCCTCAATGCTCGGAAGCGCAATTTCTTGTTTACTTCTGAGTACGTACGCGGCTAAGGTCTTGGACGAATCTGTGTCATCTTATCACCCTGAGACATTTCCTGAGAAAACAAGTTTGACACCATTGATATTATTTTATTTCATGACTATATTCACTGGATTGGGGATACCGTGGGTGCTGCTTGGAGAGCTGTTTCCATTTAGAAGCCGCGCCACAGCCCAGGGTTTATCAGCTGCTAGTTTCTACGTTTTTTCATTTCTTGGTTCTAAAACCTTCATTAATCTTGAGAACAGTGTGAAATTATGGGGAACTTTTGCTACATACGCAGCTTTCGGTTTTGCCGGTACGATATATTTGTACTTTTTCTTACCAGAAACCGAAGGCAAATCATTGCAAGAAATTGAAAACTATTACAATGGACAATTTAGGACATTTGCTGACGATCCCGTTATAAACAAATTAAAAAGACTAAAGAGATAA

Protein sequence:

>DPOGS210150-PA
MVIKLQINDTNQNVSRLRSITSQLIACSSSFLLLFDLGMAINFSTIIIPALLDSKGEISFDESQASWFGSISFLAQPIGAIVSGPLVDYVGRKKANFLVNIPMIAAWLLMYFAWNLPSLFTANALLGISSGIMEAPINSYVGEISEPSIRGALCTLTQFFSSFGILVMYFLGTFMQWRNAALMCLIAPIASMITVAFSPETPVWLLTRNREKEALKSLCTLRGWTTPDNVKEEFTDLLDYSKKLQQCVICCNTNQDCKSCPHESMNWFIRRVLKIRYVIMCKETLRPLTLVVMYFLFFVMSGLTPIRPNLVNVCGAFGMAQDSKQVVLFVGVITFLVCFLIIGLIKILGKRKLVISSMLGSAISCLLLSTYAAKVLDESVSSYHPETFPEKTSLTPLILFYFMTIFTGLGIPWVLLGELFPFRSRATAQGLSAASFYVFSFLGSKTFINLENSVKLWGTFATYAAFGFAGSISFLAQPIGAIVSGPLVDYVGRKKANFLVNIPMIAAWLLMYFAWNLPSLFTANALLGISSGIMEAPINSYVGEISEPSIRGALCTLTQFFSSFGILVMYFLGTFMQWRNAALMCLIAPIASMITVAFSPETPVWLLTRNREKEALKSLCTLRGWTTPDNVKEEFTDLLDYSKKLQQCVICCNTNQDCKSCPHESMNWFIRRVLKIRYVIMCKETLRPLTLVVMYFLFFVMSGLTPIRPNLVNVCGAFGMAQDSKQVVLFVGVITFLVCFLIIGLIKILGKRKLVISSMLGSAISCLLLSTYAAKVLDESVSSYHPETFPEKTSLTPLILFYFMTIFTGLGIPWVLLGELFPFRSRATAQGLSAASFYVFSFLGSKTFINLENSVKLWGTFATYAAFGFAGTIYLYFFLPETEGKSLQEIENYYNGQFRTFADDPVINKLKRLKR-
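
The reported lengths agree with each other: 2748 bp is 916 codons, which is 915 residues plus one stop codon (the trailing "-" in the protein record). The Python sketch below shows one way such a check could be run on FASTA text. The helper names `read_fasta` and `lengths_consistent` are hypothetical, the toy records are made up for illustration, and the check assumes the transcript record holds only the coding sequence.

```python
def read_fasta(text):
    """Parse FASTA text into {record_id: sequence}, joining wrapped lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first whitespace-delimited token of the header as the ID.
            name = line[1:].split()[0]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {key: "".join(parts) for key, parts in records.items()}


def lengths_consistent(cds, protein):
    """True if the CDS length equals 3 * (residue count + 1 stop codon)."""
    # A trailing '-' or '*' marks the stop and is not counted as a residue.
    residues = protein.rstrip("-*")
    return len(cds) % 3 == 0 and len(cds) == 3 * (len(residues) + 1)


# Lengths reported in the record header: 2748 bp and 915 aa.
assert 2748 == 3 * (915 + 1)

# Toy records (not from this entry) with the sequence wrapped over two lines.
toy = read_fasta(">toy-TA\nATGGTG\nATATAA\n>toy-PA\nMVI-\n")
print(lengths_consistent(toy["toy-TA"], toy["toy-PA"]))
```

To apply the same check to this entry, the two FASTA records above would be passed to `read_fasta` and the `DPOGS210150-TA` and `DPOGS210150-PA` sequences compared with `lengths_consistent`.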