Monarch geneset OGS2.0

DPOGS210115
Transcript        DPOGS210115-TA  2568 bp
Protein           DPOGS210115-PA  855 aa
Genomic position  DPSCF300017 + 1329452-1354161
RNAseq coverage   118x (Rank: top 58%)
Annotation
Heliconius       HMEL009895        2e-98   84.18%
Bombyx           BGIBMGA000223-TA  2e-128  50.31%
Drosophila       CG4607-PB         2e-47   27.31%
EBI UniRef50     UniRef50_Q7QJU9   4e-120  50.11%  AGAP007667-PA n=12 Tax=Neoptera RepID=Q7QJU9_ANOGA
NCBI RefSeq      NP_001182385.1    4e-162  59.06%  putative sugar transporter protein 5 [Bombyx mori]
NCBI nr blastp   gi|306518646      8e-161  59.06%  putative sugar transporter protein 5 [Bombyx mori]
NCBI nr blastx   gi|306518646      3e-160  58.87%  putative sugar transporter protein 5 [Bombyx mori]
Group
Gene Ontology    GO:0055085  4.3e-46  transmembrane transport
                 GO:0016021  4.3e-46  integral to membrane
                 GO:0022857  4.3e-46  transmembrane transporter activity
                 GO:0016020  3.8e-07  membrane
                 GO:0022891  3.8e-07  substrate-specific transmembrane transporter activity
KEGG pathway
InterPro domain  [272-696]  IPR005828  4.3e-46  General substrate transporter
                 [246-697]  IPR016196  7e-41    Major facilitator superfamily domain, general substrate transporter
                 [52-211]   IPR011701  9.9e-24  Major facilitator superfamily
                 [58-68]    IPR003663  3.8e-07  Sugar/inositol transporter
Orthology group  MCL17066  Insect specific

Nucleotide sequence:

>DPOGS210115-TA
ATGGCAGACGAAAAGAGTGAATCTAAGCCGTTCATACAAACCAGCCCTCCGCCGCTGATTACTAAGACACCTATACCAGTGAAAAAAGTCAAACCCGGAAAGGAAGGTCGAGGGAAGGCTTTCAAACAGATCGTAGCAGCGTTTGTAGCCAACCTGGGTACAATTAATACGGGTATGGCCTTCGGGTTTTCAGCGACAGCACTACCTCAGTTAAAGAGTGAAACCTCAAGTTTACATGTAACAGAAAATGAAGCTAGTTGGATTGCCAGTTTAAGTTCTGCGGGCACTCCTATCGGCTGTATACTGAGCGGTTATCTCATGGATGCTATCGGTAGACGACGAACACTCATCGTATCGGAGGTGCCCCTCATCATTGGATGGATTCTGGTCGCATCGGCTGTAAACGTGCCAATGATGTATGTCGGTAGACTACTAATAGGCCTGGGATCTGGAATGGTGGGTGCCCCGGCCCGGGTGTACACGTGTGAAGTGTCACAGCCTCACCTTCGAGGAATGCTGGGAGCGCTGGCCTCTGTCGGCGTTTCTACTGGGGTGCTCATACAGTACGTTATAGGCAGTATAACGACATGGAATGTATTGGCCGGTGTGAGCGCTATAGTTCCTATAGTGTCTCTGTGCAGCGACGATGTCAGGCTGTATGAAGATACTTTTCAAAATTACGGCTCTATGAGTGAATCTAAGCCGTTCATACAAACCAGCCCTCCGCCGCTGATTACTAAGACACCGATACCAGTGAAAAAAGTCAAACCCGGAAAGGAAGGTCGAGGGAAGGCTTTCAAACAGATCGTAGCAGCGTTTGTAGCCAACCTGGGTACCATTAATACGGGTATGGCCTTCGGGTTTTCAGCGACAGCACTACCTCAGTTAAAGAGTGAAACCTCAAGTTTACATGTAACAGAAAATGAAGCTAGTTGGATTGCCAGTTTAAGTTCTGCGGGCACTCCTATCGGCTGTATACTGAGCGGTTATCTCATGGATGCTATCGGTAGACGACGAACACTCATCGTATCGGAGGTGCCCCTCATCATTGGATGGATTCTGGTCGCATCGGCTGTAAACGTGCCAATGATGTATGTCGGTAGACTACTAATAGGCCTGGGATCTGGAATGGTGGGTGCCCCGGCCCGGGTGTACACGTGTGAAGTGTCACAGCCTCACCTTCGAGGAATGCTGGGAGCGCTGGCCTCTGTCGGCGTCTCTACTGGGGTGCTCATACAGGTGGCATTTCAAATCCGTATTCGCGCCGTTTTCGCCGTTTACATCGACCGGTGGACAGATCTTTCGCCCCCCCTCTCGGACTGTCCTAGGGGTATTTTTCTTAATAGGACTTTTAGCTTCGTAAATCCCGGGCTGGGCTTGCTAGGCCGTTGCCCTCGGCTGTCTTTCCAGCAGTCATCTGTCTTTCCAGCGCCATCTGTCTTTCCAGCGTCAACTGTCTTTCCAGCGTCAGCCCGACTCTCCGGGCATTTAAAAACGTCCAAGGAGATAATAAAGGCCCTGCTGTCGCCGTCAGCTCTCAAGCCGTTCGGTATCCTCGCCTTGTATTTCTTCATTTACCAATGGTGTGGTGTCAACACCATCACTTTCTACGCCGTTGAAGTTTTCGAGGCCTCGGGCGCGTCTTTGGACAAGTATTATCTAACGATATCAATGGGCGTACTGCGTGTGGTGTTCACTGTAGTCGGTTGTATTCTGTGCAGGCGATGTGGCAGACGTCCACTCACATTCGTCTCTGCCTTCGGTTGCGGATCCACTATGATAATTCTGTCGGTGTACATGTACTACGTCCAGTACTGGAACAACAACAACATCCCCCCCCAACACTCTTGGATACCGATCGCTGCTATTTACCTCTTCACGGTCTTCTGTACCCTGGGGTACTTGATCGTACCCTGGATCATGATCGGCGAGGTTTACCCCACACAGGTCCGCGGCATCATCGGCGGTATGACCACTTGTGCCGCTCACTTGTCGATATT
CACTGTGGTCAAAACCTTCCCGTACCTTAAGCACGCGCTCAACGACTACGGAACCTTCGGACTGTATGGAGCCATGTCCATAGCTGCTCTCAAGCCGTTCGGTATCCTCGCCTTGTATTTCTTCATATACCAATGGTGTGGTGTCAACACCATCACTTTCTACGCCGTTGAAGTTTTTGAGGCCTCGGGCGCGTCTTTGGACAAGTATTATCTAACGATATCAATGGGCGTACTGCGTGTGGTGTTCACTGTAGTCGGTTGTATTCTGTGCAGGCGATGTGGCAGACGTCCACTCACATTCGTCTCTGTACTGGAACAACAACAACATCCCCCCCCAACACTCTTGGATACCGATCGCTGCTATTTACCTCTTCACGGTCTTCTGTACCCTGGGGTACTTGATCGTACCCTGGATCATGATCGGCGAGGTTTACCCCACACAGGTATGGTGTTCTTCTACATATTCTTACCAGAAACGAAAGGAAGGACGCTGCAGGAAATAGAAGATTACTTCAGCGGCCGGACGAAGACACTCAAAAAAGTTAACGCGCAAACTGAAACCGCGTGA

Protein sequence:

>DPOGS210115-PA
MADEKSESKPFIQTSPPPLITKTPIPVKKVKPGKEGRGKAFKQIVAAFVANLGTINTGMAFGFSATALPQLKSETSSLHVTENEASWIASLSSAGTPIGCILSGYLMDAIGRRRTLIVSEVPLIIGWILVASAVNVPMMYVGRLLIGLGSGMVGAPARVYTCEVSQPHLRGMLGALASVGVSTGVLIQYVIGSITTWNVLAGVSAIVPIVSLCSDDVRLYEDTFQNYGSMSESKPFIQTSPPPLITKTPIPVKKVKPGKEGRGKAFKQIVAAFVANLGTINTGMAFGFSATALPQLKSETSSLHVTENEASWIASLSSAGTPIGCILSGYLMDAIGRRRTLIVSEVPLIIGWILVASAVNVPMMYVGRLLIGLGSGMVGAPARVYTCEVSQPHLRGMLGALASVGVSTGVLIQVAFQIRIRAVFAVYIDRWTDLSPPLSDCPRGIFLNRTFSFVNPGLGLLGRCPRLSFQQSSVFPAPSVFPASTVFPASARLSGHLKTSKEIIKALLSPSALKPFGILALYFFIYQWCGVNTITFYAVEVFEASGASLDKYYLTISMGVLRVVFTVVGCILCRRCGRRPLTFVSAFGCGSTMIILSVYMYYVQYWNNNNIPPQHSWIPIAAIYLFTVFCTLGYLIVPWIMIGEVYPTQVRGIIGGMTTCAAHLSIFTVVKTFPYLKHALNDYGTFGLYGAMSIAALKPFGILALYFFIYQWCGVNTITFYAVEVFEASGASLDKYYLTISMGVLRVVFTVVGCILCRRCGRRPLTFVSVLEQQQHPPPTLLDTDRCYLPLHGLLYPGVLDRTLDHDRRGLPHTGMVFFYIFLPETKGRTLQEIEDYFSGRTKTLKKVNAQTETA-
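
The reported lengths agree with one another if the transcript is assumed to contain only the coding sequence, with no untranslated regions: 855 codons plus one stop codon give 3 × 856 = 2568 bp. A minimal sketch of that check follows. The function name `expected_transcript_length` is hypothetical and is not part of any OGS2.0 tooling.

```python
def expected_transcript_length(protein_aa: int, has_stop: bool = True) -> int:
    """Return the coding length in bp implied by a protein length in residues.

    Assumes the transcript is coding sequence only (no UTRs), which is an
    assumption made for this record, not a documented property of OGS2.0.
    """
    codons = protein_aa + (1 if has_stop else 0)  # one extra codon for the stop
    return 3 * codons


# DPOGS210115-PA is listed as 855 aa and DPOGS210115-TA as 2568 bp.
print(expected_transcript_length(855))  # 2568
```

A mismatch between this value and the listed transcript length would suggest untranslated regions, a partial gene model, or a truncated sequence.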