Monarch geneset OGS2.0

DPOGS201329
TranscriptDPOGS201329-TA1080 bp
ProteinDPOGS201329-PA359 aa
Genomic positionDPSCF300176 + 595828-597934
RNAseq coverage0x (Rank: top 95%)
Annotation
HeliconiusHMEL0123995e-13258.99% 
BombyxBGIBMGA010811-TA2e-10348.53% 
DrosophilaCG9701-PA2e-7747.39% 
EBI UniRef50UniRef50_G6DAN32e-10953.54%Glycoside hydrolase n=10 Tax=Obtectomera RepID=G6DAN3_DANPL
NCBI RefSeqXP_001850317.13e-8052.00%glycoside hydrolase [Culex quinquefasciatus]
NCBI nr blastpgi|3640236131e-9667.78%seminal fluid protein CSSFP031 [Chilo suppressalis]
NCBI nr blastxgi|3640236133e-9867.78%seminal fluid protein CSSFP031 [Chilo suppressalis]
Group
Gene OntologyGO:00045534.3e-177hydrolase activity, hydrolyzing O-glycosyl compounds
GO:00059754.3e-177carbohydrate metabolic process
GO:00431696.7e-92cation binding
GO:00038246.7e-92catalytic activity
KEGG pathwaynvi:1001166645e-69 
 K05350 (bglB)maps-> Starch and sucrose metabolism
    Phenylpropanoid biosynthesis
    Cyanoamino acid metabolism
InterPro domain[1-356] IPR0013604.3e-177Glycoside hydrolase, family 1
[1-358] IPR0178536.7e-135Glycoside hydrolase, superfamily
[2-240] IPR0137816.7e-92Glycoside hydrolase, subgroup, catalytic core
Orthology groupMCL27793 Specific divergent
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201329-TA
ATGATTGGAACGGCGACAGCCTCTTATCAAATTGAAGGAGCCTGGGACGTGGATGGAAAATCGGAAAATATCTGGGACCACTTAACTCATACCGATCCTTGCAAAGTGCTGGACTGCTCTAATGGTGATGTCGCTGACAACTCTTATTTTCTCTATAAAAGAGATGTGGAAATGATGCGCGAGTTAGGACTCGACACTTACAGGTTTTCTATCTCTTGGACCAGAATCCTTCCTACTGGTTTTCCAGATTACATCAATAAAGCTGGAGTAGCATATTACAACAACTTAATTGATGAAATGCTAAAATATAACATTCAGCCGATAGTAACTTTATACCATTGGGACCTACCACAGAAAATACAAGAGATGGGAGGCTGGACGAATAGTGAAATTGTTAATTGGTTTGGGGACTACGCACGAGTTATATTTAATTTTTTTGGTGATAGAGTAAAATATTTTATCACTATTAATGAACCTCATCAAATTTGTCTGTTTGGTTATGGAGAAGATATATTGGCACCAGCATTAAACATACAAGGTATAGCTGACTATTTATGCATGAAGAATGTACTATTAGGTCACGCTAGAGCTTATCACATTTATGATAAGGAATTTCGGGTGAAACAAAATGGAAAAATATTCATTACAATAAACGCCGAATGGCATCAACCCAAAACAGTAAATGACGAGGAAGCAGCCCGGGATGCTAGACAATTTTATTACGTTCCATGGGGCTTTCGGTCATTATTTAACTACATCAGCCATCAATACGGAAATCCACCTATCTTGGTGACTGAGAACGGATTTGCGACAAATGGTGGTATTAACGACGAAGATCGAGTGACATATTTCAGAGGCTACTTGAACGCTGTCTTAGATGCCATCGACGATGGTGTTGATATAAGAGGTTATATTGCCTGGAGTCTCATGGATAATTTCGAGTGGTCAAAAGGATACACGGAACGCTTCGGTCTGTATGAAGTCGACTACAACGACCCAAACCGTACTCGCACGCCTCGCAAGTCCGCTTATGTACTGAAGGAGATTATAAGGACACGATCTATTGATCCCCAACTATGA

Protein sequence:

>DPOGS201329-PA
MIGTATASYQIEGAWDVDGKSENIWDHLTHTDPCKVLDCSNGDVADNSYFLYKRDVEMMRELGLDTYRFSISWTRILPTGFPDYINKAGVAYYNNLIDEMLKYNIQPIVTLYHWDLPQKIQEMGGWTNSEIVNWFGDYARVIFNFFGDRVKYFITINEPHQICLFGYGEDILAPALNIQGIADYLCMKNVLLGHARAYHIYDKEFRVKQNGKIFITINAEWHQPKTVNDEEAARDARQFYYVPWGFRSLFNYISHQYGNPPILVTENGFATNGGINDEDRVTYFRGYLNAVLDAIDDGVDIRGYIAWSLMDNFEWSKGYTERFGLYEVDYNDPNRTRTPRKSAYVLKEIIRTRSIDPQL-