Monarch geneset OGS2.0

DPOGS209687
TranscriptDPOGS209687-TA1674 bp
ProteinDPOGS209687-PA557 aa
Genomic positionDPSCF300134 + 294440-305021
RNAseq coverage276x (Rank: top 39%)
Annotation
HeliconiusHMEL0084592e-12163.22% 
BombyxBGIBMGA000527-TA6e-7458.13% 
DrosophilaCG31100-PA2e-9950.67% 
EBI UniRef50UniRef50_D4AHX84e-14048.75%Sugar transporter 13 n=1 Tax=Nilaparvata lugens RepID=D4AHX8_NILLU
NCBI RefSeqXP_974017.14e-14248.53%PREDICTED: similar to sugar transporter [Tribolium castaneum]
NCBI nr blastpgi|910829779e-14148.53%PREDICTED: similar to sugar transporter [Tribolium castaneum]
NCBI nr blastxgi|2700070373e-13949.13%hypothetical protein TcasGA2_TC013484 [Tribolium castaneum]
Group
Gene OntologyGO:00550853.2e-55transmembrane transport
GO:00160213.2e-55integral to membrane
GO:00228573.2e-55transmembrane transporter activity
KEGG pathway 
InterPro domain[82-512] IPR0058283.2e-55General substrate transporter
[18-510] IPR0161962.6e-52Major facilitator superfamily domain, general substrate transporter
Orthology groupMCL15584 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209687-TA
ATGGCTGCCCTTACAAAACCGATCATAAAATTGTCTCAGATTACGGCAAATATGAAGGATCCCACAGTAAAAGACGACGCTCATCATTACGGCTACAGGAAAAATTTTCGTGTCGCCCTACCCCAGTTCCTGGCAGTCAGTGTGAAGAATCTTGTACTTTTAAGTTACGGAATGACACTCGGATTCCCCACGATCCTTATTCCTGCCGTGAAAGATCCCATTGATGTAGAAGTATTGAAACTCAATAACTCCGAGATATCATGGATCAGTTCAATAAATTTGATTATAGTACCACTTGGTTGTGCGCTGTCCGGAATAGTCACAACTCCGATGGGCCGCCGACGAGCTATGCAGATGGTTAACATACCGTTTTTCATAGCCTGGCTGATTTTCCATTACTCTTCCACCGCTAACCATTTATACGGAGCATTGTTTCTCACTGGCCTGGCTGGAGGTCTACTTGAGGCACCTGTACTAACATATGTAGCGGAAATCACCCAGCCCCATCTGCGCGGGGCCTTGACTGCAACTAGCTCAATGTGCATCATTATTGGGGTCTTCACTCAATTTCTGTTCGGACTTTTGATGTATTGGAGGACCGTAGCCTTGGTCAATATTTTCTTTGCGCTCATCGCTATACTGGCGTTGTTCTTTATACCAGAATCTCCTCATTGGCTTGTGATGAAAAAAAGGCACGATGACGCTAGAAAGAGCTTACAGTGGCTGAGAGGTTGGACGACAGCACAGGACGTTGAACTAGAACTAAAGGATATTCAAGCTTTGTTTAAACGAAAAAAAGCTGAAACGGGTCAAGAAGAAACTTTTATGGAAAAGTTGTCATATTATCTTGACAAGAGCTTTCTGGTGCCATTCTTCTTAGTGTCTTACGCATTCTTCGTTGGTCATTTTAGCGGCATGACCACCTTGCAGACGTATGCGGTGTCAATATTTCAGACGTTAGAGGCGCCCATTGATAAATATTACGCGACGCTTATACTTGGTCTACTACAAATCATCGGCTGTGGTACCTGTGTGATGCTGGTTCACTATACCGGAAAGAGGATCTTGACTTTCTTCTCCACCTTCAGTGCTGGTATATGTTGCTTGCTAGTTGCTGGCTATGAAGGCTATATCAAAACTCAAGATGTATTTGGCAACTCGTCGCTTCCGATGAATACTAGCAACACTACATCTGGAATCATAAATGGTGATTTGCAAAATGGGTATTCGTGGATACCAACAACACTTCTGATGTTGCTGGCTCTATTAACGCACACAGGAATAAGGCTTTTGCCATGGATTCTTATTGGCGAGGTATTCAGCGCCAAGACGAGGTCTGGTGCAGCAGGAATTGCAAGTGCTGTGGGATATATATTTGGTTTCCTAACTAACAAGACGTATATAAGCATGGTGGATGTTTTATCTTTTTGGGGGACATACGGCTTCTATGGCATTATTTGTCTCACTGGATGCGTTGTATTTTACTTTATATTACCGGAGACGGAGGGCAAAAAGTTATATGATATTGAGAATCACTTTGCTGGAATAAAAAAATTGACAAATGAAGTCTATCGCTCAAAGAAGAATATAAATAAAGAGTCATCAAAGTTACGGGACTTACAAGGTAATACAAATCCTACATTCGAAGGAGACAACATTACGATTCGCCAGTGA

Protein sequence:

>DPOGS209687-PA
MAALTKPIIKLSQITANMKDPTVKDDAHHYGYRKNFRVALPQFLAVSVKNLVLLSYGMTLGFPTILIPAVKDPIDVEVLKLNNSEISWISSINLIIVPLGCALSGIVTTPMGRRRAMQMVNIPFFIAWLIFHYSSTANHLYGALFLTGLAGGLLEAPVLTYVAEITQPHLRGALTATSSMCIIIGVFTQFLFGLLMYWRTVALVNIFFALIAILALFFIPESPHWLVMKKRHDDARKSLQWLRGWTTAQDVELELKDIQALFKRKKAETGQEETFMEKLSYYLDKSFLVPFFLVSYAFFVGHFSGMTTLQTYAVSIFQTLEAPIDKYYATLILGLLQIIGCGTCVMLVHYTGKRILTFFSTFSAGICCLLVAGYEGYIKTQDVFGNSSLPMNTSNTTSGIINGDLQNGYSWIPTTLLMLLALLTHTGIRLLPWILIGEVFSAKTRSGAAGIASAVGYIFGFLTNKTYISMVDVLSFWGTYGFYGIICLTGCVVFYFILPETEGKKLYDIENHFAGIKKLTNEVYRSKKNINKESSKLRDLQGNTNPTFEGDNITIRQ-