Monarch geneset OGS2.0

DPOGS210116
TranscriptDPOGS210116-TA2772 bp
ProteinDPOGS210116-PA923 aa
Genomic positionDPSCF300017 + 1360543-1366720
RNAseq coverage6x (Rank: top 87%)
Annotation
HeliconiusHMEL0098964e-9166.81% 
BombyxBGIBMGA000224-TA4e-8045.59% 
DrosophilaCG4797-PB7e-5129.95% 
EBI UniRef50UniRef50_E2C3F33e-8136.03%Sugar transporter ERD6-like 5 n=5 Tax=Endopterygota RepID=E2C3F3_HARSA
NCBI RefSeqXP_308203.41e-7734.48%AGAP007667-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3071956491e-8036.03%Sugar transporter ERD6-like 5 [Harpegnathos saltator]
NCBI nr blastxgi|3071806042e-8526.66%Probable polyol transporter 4 [Camponotus floridanus]
Group
Gene OntologyGO:00550851.4e-53transmembrane transport
GO:00160211.4e-53integral to membrane
GO:00228571.4e-53transmembrane transporter activity
KEGG pathway 
InterPro domain[457-878] IPR0058281.4e-53General substrate transporter
[449-878] IPR0161963.7e-51Major facilitator superfamily domain, general substrate transporter
Orthology groupMCL26712 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210116-TA
ATGTCTGATAATGAAAATATTGTAACCAATCACCCCGGATTTTGGACAAAAACATCATTGATTCAGTTAGGTTACGGCGTGTGGGTGAATCTGTCGTTTATTGCAATTGGACTGGCTTTTGGATTTTCTGCTGTCGCTGTGCCTCAGTTGCTTCGTCCAGAATCTCGCATAAAAGTCTCTGTAGCTGATGCGTCTTGGATAGCCAGCGTGTTAACATTGATTTCGCCGATTGGCTGCGTTTTTTCCGGCTATTTGATGGATAAGTTGGGCCGTCGAATAATGCTGTCCTTCAGTCAACTGCCCGTTTGTCTTGGTTGGTTATATACTGGATATTCGGGTTCAGCGAGAGATATCATCATTGGCAGAATATTAACTGGCTTTGGGATCGGTATGGTAACCAGTGCTCCCAGGGTTTTCATGACCGAAATATCTCTACCAAATATGCGTGGTGTGGTCGCAACATTTCCGAACATTGCGATATCTATTGGGATAACTGTACAGGCAGGTTTGGGAGCCAGATATAACTGGACTACGTTGTGTTTCATCAGTAGTGCTTACGCATTATTTTTGTTTTTCTTTAATTTCGGTCTCCCGGAGACTCCGTACTTTTTATTGCAAAGAGAGTGTACTGATGACGCGAGAGATTCACTCAAGAAATTTAGATCGAAAAGCCATGACATCGAGAAAGAAATGGTTTATCTTATAGAATATAAGAGACAGAATGATCTTCACAGATTGTCTTTCAAAGAGCAAATGTTGATAGTATTTATGAAGTCAACATGCAAATCATTTTGGATGATATTGATTTATTTTATTATCACACAATGTTCTGGAGTCACGATTATAGCTATGTGGACGGTGGATATAATACGGAAATCAAATTCGTCCGTCGATGCTAATTCTGGTAACGTTGTACTCGGAATTACTAGGTTAATTGGAGGGGTGGTTACAGCAGTGCTTATATTCAGAATTAGAAGACGACCAATGGCCCTGGTATCGGGTGCAGGCGTCGGTGTAATGTGTTTAGCTGTGACATTGCTAATAAATAATTTAAAGGCACCGACACCGCTTCCACTGCTGTGTTACGCGGGTTACATTCTGTTTGCGACTCTCGGACATTACAATTTGCCGATTTTAATTATGTACGAATTATATCCTCTGCAGGTGAGAGGACTGATGGGAGGAATTTCTCTATGCTGTCTAAATATTTTCATTTTCTTCGCTATCAAATCGTACCCTTACTTAAGAGATGACATCGGCTTCGCTAACACAATACTCGCTTTTGGTATATGTTCGTTAATTGGCTTTTGGACGGAAACCTCACTAATCCAGCTCGGTTACGGTGTTTGGGTGAATCTCTCCCAGGTCATAGTAGGCTTGGCTTTTGGTTTTTCATCAGTAGCTCTTCCACAAATAGCATCGGCTACATCACCAATAAAAGTTACAATATCCGATCAGTCTTGGATAGCTAGTGTTTTGCCTTTATTTTGCCCGCCTGGCTGTATTTTGGGTGGATATTTGATAGATAAATTCGGTCGTCGTACTATGCTTATATGCAGTCAGTTGCCAGTTATGGCTGGCTGGTTTTACACTGGCGTTGCCCAATCCGCTGTGAATCTTATCATAGGCAGGATGTTGACGGGTTTCGGCGTAGGAATGGCAATGAGTGTCCCTCGTGTGTATATGACTGAAATGTCCTTACCAAATATGAGAGGTATTATTGGATCTTTTCCAAACATTGCCATGTCTATAGGCATCGCTTCCCAGGCTGGTTTGGGATCAATTCTGAAATGGAATATTTTATGTTTTATTAGTTTTAGCTGTTCTTTAAGCTTATTCTTTTTAAACTTCAAACTTCCTGAATCACCATATTACTTGCTACAAAAGGCTTCGATAGATGATGCGAGGAATTCCCTCAAACATTTCAGAGGTAAACAATATAATATTGAAAACGAAATTAACGACCTCATCGATTTTAAGAGAGACAACGACATCCATAAGTTGAATGCTAAGGAGAGAATGCAGGCTTTATTTAAACGATCCGCTTGTAAGCCATTCTGGACAATGATGGTATATGTTGTCATTATGGAACTGTCTGGCGCTTCGATTGTCTTTATGTGGGGTGTTCAGATACTGCAGAGATCAAAGTCTTCCGTAAATCCTGAAATAGGTAACTTTATCTTGGGACTGATGAGAATTATTAGCGGTGTAATAACAGCAGTTTTCGTATTCCACATAGGCAGGAGACCGTTGGCACTTACGTCAGGTATAGGTGTTGGTGTGACTTGCTTGTTTCTTGGTTCTATCATGCATTATTTAAGTACACCATCAATATTCCCCCAACTAGGATACGTGGCTTACATATTCTTCGCTACATTGGGTTACTACACACTACCTCCTCTTATAATGTTTGAATTGTATCCACTCCAGGTAAGAGGAATACTGGGGGGCTTGTCATTGTCGAACATAAGTTTATGTATATTCGTAGCAAACAAGAGTTTTCCATTCGTCAGAGATTCCCTTGGGTTTGCGAACACTATTTTGGCGTTTGGTATATGGTCTTTTCTTGGGTCAGTGTTCCTGTACTTCTTCTTGCCAGAAACGAAAGATTTGACCTTGCAAGAAATAGAGGAATATTATAATGACATACGGCCAACTCTGACGTCACAGAGGAAAATACTTTCAATGCAACGGATACAGAGCATGGAAAACACAAGCACTTCCAAAGGAATAATGAAGAAAACGAGCAATACCGGCAAACCAGTGTCGTAG

Protein sequence:

>DPOGS210116-PA
MSDNENIVTNHPGFWTKTSLIQLGYGVWVNLSFIAIGLAFGFSAVAVPQLLRPESRIKVSVADASWIASVLTLISPIGCVFSGYLMDKLGRRIMLSFSQLPVCLGWLYTGYSGSARDIIIGRILTGFGIGMVTSAPRVFMTEISLPNMRGVVATFPNIAISIGITVQAGLGARYNWTTLCFISSAYALFLFFFNFGLPETPYFLLQRECTDDARDSLKKFRSKSHDIEKEMVYLIEYKRQNDLHRLSFKEQMLIVFMKSTCKSFWMILIYFIITQCSGVTIIAMWTVDIIRKSNSSVDANSGNVVLGITRLIGGVVTAVLIFRIRRRPMALVSGAGVGVMCLAVTLLINNLKAPTPLPLLCYAGYILFATLGHYNLPILIMYELYPLQVRGLMGGISLCCLNIFIFFAIKSYPYLRDDIGFANTILAFGICSLIGFWTETSLIQLGYGVWVNLSQVIVGLAFGFSSVALPQIASATSPIKVTISDQSWIASVLPLFCPPGCILGGYLIDKFGRRTMLICSQLPVMAGWFYTGVAQSAVNLIIGRMLTGFGVGMAMSVPRVYMTEMSLPNMRGIIGSFPNIAMSIGIASQAGLGSILKWNILCFISFSCSLSLFFLNFKLPESPYYLLQKASIDDARNSLKHFRGKQYNIENEINDLIDFKRDNDIHKLNAKERMQALFKRSACKPFWTMMVYVVIMELSGASIVFMWGVQILQRSKSSVNPEIGNFILGLMRIISGVITAVFVFHIGRRPLALTSGIGVGVTCLFLGSIMHYLSTPSIFPQLGYVAYIFFATLGYYTLPPLIMFELYPLQVRGILGGLSLSNISLCIFVANKSFPFVRDSLGFANTILAFGIWSFLGSVFLYFFLPETKDLTLQEIEEYYNDIRPTLTSQRKILSMQRIQSMENTSTSKGIMKKTSNTGKPVS-