Monarch geneset OGS2.0

DPOGS205791
Transcript: DPOGS205791-TA, 1050 bp
Protein: DPOGS205791-PA, 349 aa
Genomic position: DPSCF300144 - 85905-87340
RNAseq coverage: 147x (Rank: top 54%)
Annotation
Heliconius: HMEL012047, 5e-157, 80.98%
Bombyx: BGIBMGA001220-TA, 2e-23, 26.60%
Drosophila: CG14040-PA, 7e-84, 45.57%
EBI UniRef50: UniRef50_Q9VMU8, 1e-81, 45.57%, CG14040 n=12 Tax=cellular organisms RepID=Q9VMU8_DROME
NCBI RefSeq: XP_002078201.1, 6e-88, 46.52%, GD23318 [Drosophila simulans]
NCBI nr blastp: gi|195576676, 1e-86, 46.52%, GD23318 [Drosophila simulans]
NCBI nr blastx: gi|170028385, 2e-87, 47.51%, CMP-sialic acid transporter [Culex quinquefasciatus]
Group
Gene Ontology: GO:0008643, 1.2e-88, carbohydrate transport
               GO:0016021, 1.2e-88, integral to membrane
               GO:0005351, 1.2e-88, sugar:hydrogen symporter activity
               GO:0000139, 1.2e-88, Golgi membrane
KEGG pathway:
InterPro domain: [27-334] IPR007271, 1.2e-88, Nucleotide-sugar transporter
Orthology group: MCL16344, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205791-TA
ATGAAAGAATGGAATAAATTATTTCCGAATAAAGAAGGATTCATAGTATTTTCGTTGTATATAATCCTTTTTGTATTTCAAGGAGTTTTTATAACAGCATCTAAAACAGAAAATGGAGTTTATGACTATAATACAACACTTGTTGTATTTTTATCTGAATTACTAAAATTGTTGATATCCGGATTTTTATATACTTGCAAACAGGGGAATAAGCCAAATTTATTCAAAGCTATTGCTTTGAATTATAGGCTGCTAATATATTACTTTATACCATCACTTTTATACTGTTTCTATAATAACCTGGCATTTATAAACTTGTCCCATTATGATCCCACTTCATATTATATTTTACTTCAGTTTCGAGTAGTTTTAACAGCATTAATATTTCAGTTCCTGTTCAAAAGAAAGCTTACATTTTTCCAATGGATTTCGCTCGGGATACTTACATTAGGTTGTATGATAAAAAACTTTGACACTGAAACAGCACAGACAAAGGAAGATTCTGAATTTTTGTCTCAAATTTTTAATATATATTTCTTATCAATAAATTTTCAAAACTTTTGTTCATGTTTGGCCGGAACTTACAATGAATATTTGTTGAAAACCGTCGGCTCCGATGTAGATATATTCTTACAAAATGTTTTCATGTATCTCGATTCAGTTTTATGCAATTTCTTTATTTTACTGTACATGGGAGAATTAGGTGGCATCTTTAATGACTTTAAATATCTCGGTGATATATTTGTTATTCTTATAACAGTGAATAGCGCTGTTGTCGGTATTGTTACCAGTTTTTTTCTGAAGAATTTGAATTCAATTTTAAAGACATATGCCAGTGCTTTAGAACTAGTTATAACCGCAATTGTATGTTACATGCTATTTAATATCCTTATCACGAAATACACTGTGTTATCAATATGCCTAGTCAGTATTGCAGTTGCAATGTACGTTAGGAATCCCGTTAATAATGTTAACTCGAATAAAACCAATTCCATTGATAAGAAACCTTTATTACCAGTAACAGAAAATAAACATAGAAATGATAATTGA

Protein sequence:

>DPOGS205791-PA
MKEWNKLFPNKEGFIVFSLYIILFVFQGVFITASKTENGVYDYNTTLVVFLSELLKLLISGFLYTCKQGNKPNLFKAIALNYRLLIYYFIPSLLYCFYNNLAFINLSHYDPTSYYILLQFRVVLTALIFQFLFKRKLTFFQWISLGILTLGCMIKNFDTETAQTKEDSEFLSQIFNIYFLSINFQNFCSCLAGTYNEYLLKTVGSDVDIFLQNVFMYLDSVLCNFFILLYMGELGGIFNDFKYLGDIFVILITVNSAVVGIVTSFFLKNLNSILKTYASALELVITAIVCYMLFNILITKYTVLSICLVSIAVAMYVRNPVNNVNSNKTNSIDKKPLLPVTENKHRNDN-
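
The transcript and protein records above can be cross-checked against each other: 1050 bp is 350 codons, which matches 349 residues plus the terminal stop that the protein record writes as "-". The following is a minimal Python sketch of such a check, assuming the standard genetic code and that DPOGS205791-TA is a complete coding sequence in frame 1. The names `CODON_TABLE` and `translate` are illustrative only and are not part of the OGS2.0 release or any published tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# with bases T, C, A, G at each position; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1.

    Stop codons are written as `stop_symbol` ("-" by default, following
    the convention of the protein record above). Codons containing
    characters other than A, C, G, T are translated as "X".
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First six and last five codons of DPOGS205791-TA, copied from the record.
print(translate("ATGAAAGAATGGAATAAA"))
print(translate("AGAAATGATAATTGA"))
```

Passing the full DPOGS205791-TA sequence to `translate` and comparing the result with the DPOGS205791-PA string gives a quick consistency check between the two records.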