Monarch geneset OGS2.0

DPOGS200280
TranscriptDPOGS200280-TA1263 bp
ProteinDPOGS200280-PA420 aa
Genomic positionDPSCF300026 - 830675-831937
RNAseq coverage3x (Rank: top 89%)
Annotation
HeliconiusHMEL0225120.067.86% 
BombyxBGIBMGA005555-TA9e-15859.52% 
Drosophila% 
EBI UniRef50UniRef50_B2DD582e-15559.52%Putative beta-fructofuranosidase n=123 Tax=Obtectomera RepID=B2DD58_BOMMO
NCBI RefSeqNP_001119721.12e-7938.72%beta-fructofuranosidase [Bombyx mori]
NCBI nr blastpgi|2613359714e-18067.62%putative BmSuc2 [Heliconius melpomene]
NCBI nr blastxgi|2613359710.067.62%putative BmSuc2 [Heliconius melpomene]
Group
Gene OntologyGO:00045535.2e-71hydrolase activity, hydrolyzing O-glycosyl compounds
GO:00059755.2e-71carbohydrate metabolic process
KEGG pathwaybay:RBAM_0318203e-84 
 K01193 (E3.2.1.26, sacA)maps-> Starch and sucrose metabolism
    Galactose metabolism
InterPro domain[1-270] IPR0232961.5e-77Glycosyl hydrolase family 43, five-bladed beta-propellor domain
[1-373] IPR0013625.2e-71Glycoside hydrolase, family 32
[1-269] IPR0131482.8e-69Glycosyl hydrolases family 32, N-terminal
[270-385] IPR0089859.6e-19Concanavalin A-like lectin/glucanase
Orthology groupMCL25146 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200280-TA
ATGAGTTGGGGTCATGTGGTCAGCAAGAATCTCATAGATTGGACCCACTATCCATCAGCAGTGATGCCGAAGGACGTATACGACCGAAATGGCTGCTTATCCGGATCAGCCCTCGTTATTAATAACTTCCTAACTCTGTTTTACACCGGTCATCTTGCCTCTTCGAACGAAGTGTTCCAAACACAAAATATAGCTACAAGTGGAGACGGGATTATCTTTAAGAAATATATATACAATCCTATTATAAGACAGTCACCGAATGGAGTTGGAGACTTTAGAAATCCGAAGGTCTGGAGGTTTCGTAACCTCTGGTACATGGTCGTTGGAACTTCATCTAGGGAAAGATTTGGAGAATTGCTTTTATATTCATCGACGGATATCTTTAATTGGAAACTCAATGGGACATTTGTGAAATCATATGGAGATATGGGTCATATGTGGGAGAATCCCGATATATTTGAATTGGATGGTCAACATGTACTTATCATATCAGTTCAAGGAATAGAAGCTGATGGATTCAGGTTCAGGAATTTGTACCAGACAGGATACGTTGTGGGGACTTTTAACTATATGAAAGGAAAGTTTGAGGATCTGGAAGTATCTATAGCGACTTTTTATCAGTTAGATTTCGGCCATGACTTTTATGGAGCAAAGACTTTGCTTGCCAGTGACGGGCGGAGAATCTTGATCGCGTGGCTGGGAATGTGGGAGAGTGACTTCATTGAATCGACATCAGGATGGGCGAGCATGTTGACGATCATAAGAGAAGTCCGTTTAAACAAAAGGGGACGCATACTAATGACGCCTATCAGTGAAATGGAGGAATTGAGAGTGGAAATAATGGAAAACGCGTGGTATTATCCTGAAGAATCTTTTCAGGCCGGAGCGAAATCTTTTGAGTTGTTAGTTAATTCGTCTTCCATGTTATACGATACCGGACTAGTATTTGAGTGGAATTTTGGTACTTTTACGATAGGCTATTCAGCAGAACACGAGTACATAAGCATTGATCGCGGAGGACCAGACGGAGTGAGAAGAGCATATTGGTCACCAACTAATCATATTTTTTTGCGGATATTCGTAGATGTCAGTTCCATTGAGATATTTTGTGGCGAAGGCGAGGTAGTATTTTCGAGTCGCTTTTATCCGAAATCAATGCGTATAAAAGTAATTGGAAAATCTCAACTCCACATCACTCAGCACAGATTAAGACGTACTATCGGTTACGACAAAGAACTAGTGAATCGTTTGAAAGTCAGTTAG

Protein sequence:

>DPOGS200280-PA
MSWGHVVSKNLIDWTHYPSAVMPKDVYDRNGCLSGSALVINNFLTLFYTGHLASSNEVFQTQNIATSGDGIIFKKYIYNPIIRQSPNGVGDFRNPKVWRFRNLWYMVVGTSSRERFGELLLYSSTDIFNWKLNGTFVKSYGDMGHMWENPDIFELDGQHVLIISVQGIEADGFRFRNLYQTGYVVGTFNYMKGKFEDLEVSIATFYQLDFGHDFYGAKTLLASDGRRILIAWLGMWESDFIESTSGWASMLTIIREVRLNKRGRILMTPISEMEELRVEIMENAWYYPEESFQAGAKSFELLVNSSSMLYDTGLVFEWNFGTFTIGYSAEHEYISIDRGGPDGVRRAYWSPTNHIFLRIFVDVSSIEIFCGEGEVVFSSRFYPKSMRIKVIGKSQLHITQHRLRRTIGYDKELVNRLKVS-