Monarch geneset OGS2.0

DPOGS201328
TranscriptDPOGS201328-TA1413 bp
ProteinDPOGS201328-PA470 aa
Genomic positionDPSCF300176 + 587779-591146
RNAseq coverage51x (Rank: top 70%)
Annotation
HeliconiusHMEL0123993e-16256.83% 
BombyxBGIBMGA010537-TA2e-15856.48% 
DrosophilaCG9701-PA5e-11144.74% 
EBI UniRef50UniRef50_G6DAN39e-13956.76%Glycoside hydrolase n=10 Tax=Obtectomera RepID=G6DAN3_DANPL
NCBI RefSeqXP_001237813.15e-12747.38%AGAP006424-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3640235858e-15255.09%seminal fluid protein CSSFP001 [Chilo suppressalis]
NCBI nr blastxgi|3640235858e-15356.74%seminal fluid protein CSSFP001 [Chilo suppressalis]
Group
Gene OntologyGO:00045538.2e-192hydrolase activity, hydrolyzing O-glycosyl compounds
GO:00059758.2e-192carbohydrate metabolic process
GO:00431692.9e-143cation binding
GO:00038242.9e-143catalytic activity
KEGG pathwaynvi:1001166645e-107 
 K05350 (bglB)maps-> Starch and sucrose metabolism
    Phenylpropanoid biosynthesis
    Cyanoamino acid metabolism
InterPro domain[29-460] IPR0013608.2e-192Glycoside hydrolase, family 1
[28-460] IPR0137812.9e-143Glycoside hydrolase, subgroup, catalytic core
[28-461] IPR0178531.7e-135Glycoside hydrolase, superfamily
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201328-TA
ATGAATGAATGCTGTGGAATAATTTTATTTGTCTTATCGTTGGCCATTGGAGTACAGTCTTCAAATTTACAATGTTACGAAACAGAATTTCCAGAAGGGTTCTTATTCAGCGCATCATCGTCTGCTTATCAGATAGAAGGTGCTTGGAACAAAGATGGTCGAACTGACAGCATTTGGGATGATTTAGTACACCAACGTCCCTACCTCGTCAGAGACAACGCGACCGGAGACATTGCTGATAATTCTTATTACATTTATAAAAGGGATATAGAAATTTTGAGAGAGATAGGACTACAAGTATATAGGTTCTCTATATCATGGAATAGAATTTTACCTACTGGTTTTCCCAACAAAATTAATTATGAAGGTGTTGCATATTACGATAATTTAATTAACGAATTATTGAAATACAACATTATCCCGGTGGTTACCATTTACCACTTTGATTTGCCTCAAAGACTCCAGGAATTGGGTGGCTGGGTTAATCCTTATGTCGTTGATTGGTTGGGAGACTACGCAAGGGTTGTTTTCAATTTATTCGGTGATAGAGTTAAATATTGGATAACAGTGAATGAACCACAGCAGATTTGCTACTACGGCTATGGTGATGTAATGAATGCACCAGCATTAAATTATAAAGGAATTGCTGAATACTATTGTGCGAAAAATGTACTATTAGCACATGCAAGGGCATACCACATTTACGACGAAGAGTTTCGAGACTTTCAGCAAGGCATTATATTTATAGCTATAAGTGCTGAATGGTACGAACCTGCTTCGTCGGACAAGAATGATATTTTGGCCGCTTACGACTCGAACATGTTTACATATGGACAATACGCTCATCCAATTTTCTCTGAGACTGGTGATTTTCCCCAAAAGATGAAGGATCGCATTGCAGAAAGAAGTGTCATGCAAGGTTTCGTTAGGTCCCGACTACCACAGCTTTCGGAACAGGAAATTGATTATATACGTGGCAGTTCTGACGTGTTCGGTTTAAATCACTATTCTACTTTCTATGCAAGCAGAAATCAATCTGTTTACACAAATTATGAATCCCCATCATTTTTTGACGATATGGCAGCATACACGTTTCAGCCGCCTGAATGGAGATTGAGCCCAGATGCTGGTGTTGCGACTGTTCCTTGGGGTTTCTACAAATTGCTGCAATTCATCAAGAGAGAGTACAATAATCCTCCCGTTTTCGTAACCGAGAACGGTTTTGGCGATAATGGCGGTTTAAAAGATAACGATCGTGTTACACATTTGAAGGGTTACTTATGTGCTCTTCTGAAAGCTATCAATCACGGCTCAGATATTATAGGATATTCTGTTTGGAGTCTCCTGGATTCGTTTGAATGGATGTGTGGATACAATAACAATATGAAGAAAAATTCCGGTGCGGATTGTTGA

Protein sequence:

>DPOGS201328-PA
MNECCGIILFVLSLAIGVQSSNLQCYETEFPEGFLFSASSSAYQIEGAWNKDGRTDSIWDDLVHQRPYLVRDNATGDIADNSYYIYKRDIEILREIGLQVYRFSISWNRILPTGFPNKINYEGVAYYDNLINELLKYNIIPVVTIYHFDLPQRLQELGGWVNPYVVDWLGDYARVVFNLFGDRVKYWITVNEPQQICYYGYGDVMNAPALNYKGIAEYYCAKNVLLAHARAYHIYDEEFRDFQQGIIFIAISAEWYEPASSDKNDILAAYDSNMFTYGQYAHPIFSETGDFPQKMKDRIAERSVMQGFVRSRLPQLSEQEIDYIRGSSDVFGLNHYSTFYASRNQSVYTNYESPSFFDDMAAYTFQPPEWRLSPDAGVATVPWGFYKLLQFIKREYNNPPVFVTENGFGDNGGLKDNDRVTHLKGYLCALLKAINHGSDIIGYSVWSLLDSFEWMCGYNNNMKKNSGADC-