Monarch geneset OGS2.0

DPOGS201331
Transcript: DPOGS201331-TA (1122 bp)
Protein: DPOGS201331-PA (373 aa)
Genomic position: DPSCF300176 + 605046-607195
RNAseq coverage: 6x (Rank: top 87%)
Annotation
Heliconius       HMEL012399         8e-135   58.54%
Bombyx           BGIBMGA010811-TA   1e-105   46.82%
Drosophila       CG9701-PA          7e-75    49.06%
EBI UniRef50     UniRef50_G6DAN3    2e-115   54.80%   Glycoside hydrolase n=10 Tax=Obtectomera RepID=G6DAN3_DANPL
NCBI RefSeq      XP_001850317.1     2e-77    50.55%   glycoside hydrolase [Culex quinquefasciatus]
NCBI nr blastp   gi|364023613       3e-94    67.36%   seminal fluid protein CSSFP031 [Chilo suppressalis]
NCBI nr blastx   gi|364023613       1e-95    67.36%   seminal fluid protein CSSFP031 [Chilo suppressalis]
Group
Gene Ontology
  GO:0004553   5.3e-176   hydrolase activity, hydrolyzing O-glycosyl compounds
  GO:0005975   5.3e-176   carbohydrate metabolic process
  GO:0043169   2e-92      cation binding
  GO:0003824   2e-92      catalytic activity
KEGG pathway
  nvi:100116664   5e-68
  K05350 (bglB) maps to:
    Starch and sucrose metabolism
    Phenylpropanoid biosynthesis
    Cyanoamino acid metabolism
InterPro domain
  [1-363]   IPR001360   5.3e-176   Glycoside hydrolase, family 1
  [1-366]   IPR017853   1.6e-138   Glycoside hydrolase, superfamily
  [2-240]   IPR013781   2e-92      Glycoside hydrolase, subgroup, catalytic core
Orthology group: MCL27793 Specific divergent
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201331-TA
ATGATTGGTACGGCGACAGCCTCTTATCAAATTGAAGGAGCCTGGGACGTGGATGGAAAATCGGAAAATATCTGGGACCACTTAACTCATACCGATCCTTGCAAAGTGCTGGACTGCTCTAATGGTGATGTCGCTGACAACTCTTATTATCTCTATAAAAGAGATGTGGAAATGATGCGCGAGTTAGGACTCGACACTTACAGGTTTTCTATCTCTTGGACCAGAATCCTTCCTACTGGTTTTCCAGATTACATCAATAAAGCTGGAGTAGCATATTACAACAACTTAATTGATGAAATGCTAAAATATAACATTCAGCCGATAGTAACTTTATACCATTGGGACTTACCACAGAAAATACAAGAGATGGGAGGGTGGACGAATAGTGAAATTGTGAATTGGTTTGGGGACTACGCACGAGTTATATTTAATTTTTTTGGTGACAGAGTAAAATATTTTATCACTTTTAATGAACCTTATCCAATTTGTCTGTTTGGTTATGGAGAAGGTATATTTGCACCAGCATTAACCATACGAGGTATAGCTGACTATTTATGCATGAAGAATGTACTATTAGGTCACGCTAGAGCTTATCACATTTATGATAAAGAATTTCGGGTGAATCAAAATGGAAAAATATTCATTACAATAAACGCCGAATTGTTTGAACCCAAAACGGCAAAAGACGAGGAAGCAGCCCGGGATGCTAGACAATTTTATTACGTTCCATGGGGCTTTCGGTCATTATTTAACTACATCAGCCATCAATACGGAAATCCACCCATCTTGGTTACTGAGAACGGATTTGCGACAAATGGTGGTATTAACGACGAAGATCGAGTGACATATTTCAGAGGCTACTTGAACGCTGTCTTAGATGCCATCGACGATGGTGTTGATATAAGAGGTTATATTGCCTGGAGTCTCATGGATAATTTCGAGTGGTCAAAAGGATACACGGAACGCTTCGGTCTGTATGAAGTCGACTACAACGACCCAAACCGTACTCGCACGCCTCGCAAGTCCGCTTATGTACTGAAGGAGATTATAAGGACACGATCTATTGATCCCAACTATGAACCTGACATGAGCCAACCCCTGACCATTGATGATAATAACTGA

Protein sequence:

>DPOGS201331-PA
MIGTATASYQIEGAWDVDGKSENIWDHLTHTDPCKVLDCSNGDVADNSYYLYKRDVEMMRELGLDTYRFSISWTRILPTGFPDYINKAGVAYYNNLIDEMLKYNIQPIVTLYHWDLPQKIQEMGGWTNSEIVNWFGDYARVIFNFFGDRVKYFITFNEPYPICLFGYGEGIFAPALTIRGIADYLCMKNVLLGHARAYHIYDKEFRVNQNGKIFITINAELFEPKTAKDEEAARDARQFYYVPWGFRSLFNYISHQYGNPPILVTENGFATNGGINDEDRVTYFRGYLNAVLDAIDDGVDIRGYIAWSLMDNFEWSKGYTERFGLYEVDYNDPNRTRTPRKSAYVLKEIIRTRSIDPNYEPDMSQPLTIDDNN-
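
Consistency check (illustrative):

The record lists a 1122 bp transcript and a 373 aa protein, and the protein sequence ends in "-", which appears to mark the stop codon. These figures agree if the transcript is a pure coding sequence: 1122 / 3 = 374 codons, one of which is the stop. The Python sketch below is not part of the database record. It assumes the standard genetic code (NCBI translation table 1), and the names `translate` and `expected_protein_length` are hypothetical helpers introduced here for illustration. It translates short excerpts from the start and end of the nucleotide sequence above and checks the length arithmetic.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# Assumption: the record uses the standard code; "*" marks a stop codon here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` ("-" by default, matching
    the trailing character of the protein sequence in this record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


def expected_protein_length(cds_bp):
    """Residue count for a CDS of `cds_bp` bases that ends in one stop codon."""
    return cds_bp // 3 - 1


if __name__ == "__main__":
    # First 21 and last 15 bases of DPOGS201331-TA, copied from the record.
    print(translate("ATGATTGGTACGGCGACAGCC"))  # MIGTATA
    print(translate("GATGATAATAACTGA"))        # DDNN-
    print(expected_protein_length(1122))       # 373
```

Passing the full DPOGS201331-TA sequence to `translate` should, under the same assumption, reproduce the DPOGS201331-PA sequence including the trailing "-".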