Monarch geneset OGS2.0

DPOGS202673
TranscriptDPOGS202673-TA1188 bp
ProteinDPOGS202673-PA395 aa
Genomic positionDPSCF300039 + 612352-613539
RNAseq coverage938x (Rank: top 14%)
Annotation
HeliconiusHMEL0056115e-13055.75% 
BombyxBGIBMGA005555-TA4e-6938.02% 
Drosophila% 
EBI UniRef50UniRef50_D1KRK78e-11650.26%Beta-fructofuranosidase 1 n=1 Tax=Manduca sexta RepID=D1KRK7_MANSE
NCBI RefSeqNP_001119721.12e-6739.56%beta-fructofuranosidase [Bombyx mori]
NCBI nr blastpgi|2607654493e-11550.26%beta-fructofuranosidase 1 [Manduca sexta]
NCBI nr blastxgi|2607654494e-11850.38%beta-fructofuranosidase 1 [Manduca sexta]
Group
Gene OntologyGO:00045532e-48hydrolase activity, hydrolyzing O-glycosyl compounds
GO:00059752e-48carbohydrate metabolic process
KEGG pathwaybld:BLi041789e-74 
 K01193 (E3.2.1.26, sacA)maps-> Starch and sucrose metabolism
    Galactose metabolism
InterPro domain[1-252] IPR0232962.1e-67Glycosyl hydrolase family 43, five-bladed beta-propellor domain
[1-251] IPR0131486.7e-60Glycosyl hydrolases family 32, N-terminal
[1-356] IPR0013622e-48Glycoside hydrolase, family 32
[252-393] IPR0089853.4e-17Concanavalin A-like lectin/glucanase
Orthology groupMCL23648 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202673-TA
ATGTACCCAGACAAAAGCTACGATAAAACAGGCGTATTCTCTGGAAGTGCTTTAGTAGAGGACAACATAATGTTTCTTTTTTATACAGGTAATGTAAATCTTCCGGGAAGCACTCCTGACCATGAACAGCAGCAAGCTTTAGCAATATCCAAAGATGGAGTAAACGTTACTAAATATGCAAGAAATCCTATTTTGAAAGGATTGGAACATCAACCAAATATAAGAGATCCTAAGGTTTGGAAACACAAAAGTTCATATTATATGGTACTGGGAAATTCTTTTGTAAATGGATCTAACCAAACTCTTGGTCGTGCTTTACTATATAAATCAGACAATAAAATAATATGGGAACAAGTCTCAGTTCTTCATGAATCGAATGGAGATCTGGGATATATGTTTGAGTGTCCTGATTTCTTTGAACTAGATGGAAAATATGTATTGTTATTTTCACCACAAGGAGTAAAAGCGGATGGAGACGACTACAAAAACTTATTTCAAACTGGATATATTATAGGAGATTTTAATTACGAGACCTATGATTTTAAACCACTAACGCAATTTCGTGAACTAGATCACGGCCATGACTTTTATGCGACTCAAACAATCTTGGATAAAAGTAATAACAGAATAGTTATAGCCTGGAACGATATGTGGGAAACCAACTATCCTGAACAGAAAGAAGGTTTTACGGGGCAAATGACAATACCTAGAATACTTTTGCTATCACAAAATTTGACTTTAATTCAGAAGCCTGTAGGTGAAGTTGCGAAAATCCTGGATGGACTTGTTTTCACGGGTAGAGGCATAGGAGGTCAGAATTTCATATTGAAAGATAATATTGGCCTAATAAGTATAAGTGCTTCAGTAGAAAGAGACTTTGTATTGCATTTTGAGTCAGAGGATGACTCTCGAGCATTAATAATAAAGTATGATTCTCAAAATCATACTGTTTCCTTAGACCGTGGTGGTGAAGATGGACTTCGTCGTACACGATGGACACCAGTTAGTGCCATGAATATGAATATATACGTTGATAAAAGTTCGATTGAACTGTTTTGTGGAGAGGGAGAAATAACTTTCTCTAGTAGATACTTTCCCGTTGGTCAATCCATAATCCGCGTAGGAACTCAGAGTATTGCAGATATTCTCACGTTGAATAGTATAAAACCAACTGTGTACCTTAACTAA

Protein sequence:

>DPOGS202673-PA
MYPDKSYDKTGVFSGSALVEDNIMFLFYTGNVNLPGSTPDHEQQQALAISKDGVNVTKYARNPILKGLEHQPNIRDPKVWKHKSSYYMVLGNSFVNGSNQTLGRALLYKSDNKIIWEQVSVLHESNGDLGYMFECPDFFELDGKYVLLFSPQGVKADGDDYKNLFQTGYIIGDFNYETYDFKPLTQFRELDHGHDFYATQTILDKSNNRIVIAWNDMWETNYPEQKEGFTGQMTIPRILLLSQNLTLIQKPVGEVAKILDGLVFTGRGIGGQNFILKDNIGLISISASVERDFVLHFESEDDSRALIIKYDSQNHTVSLDRGGEDGLRRTRWTPVSAMNMNIYVDKSSIELFCGEGEITFSSRYFPVGQSIIRVGTQSIADILTLNSIKPTVYLN-