Monarch geneset OGS2.0

DPOGS201334
TranscriptDPOGS201334-TA1533 bp
ProteinDPOGS201334-PA510 aa
Genomic positionDPSCF300176 + 630843-638131
RNAseq coverage2x (Rank: top 91%)
Annotation
HeliconiusHMEL0123990.067.98% 
BombyxBGIBMGA010536-TA0.059.60% 
DrosophilaCG9701-PA3e-14950.70% 
EBI UniRef50UniRef50_G6DAN30.084.11%Glycoside hydrolase n=10 Tax=Obtectomera RepID=G6DAN3_DANPL
NCBI RefSeqXP_001237813.12e-16056.66%AGAP006424-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3640235850.064.88%seminal fluid protein CSSFP001 [Chilo suppressalis]
NCBI nr blastxgi|3640236130.062.18%seminal fluid protein CSSFP031 [Chilo suppressalis]
Group
Gene OntologyGO:00045531.3e-244hydrolase activity, hydrolyzing O-glycosyl compounds
GO:00059751.3e-244carbohydrate metabolic process
GO:00431691.3e-176cation binding
GO:00038241.3e-176catalytic activity
KEGG pathwaytca:6645777e-141 
 K05350 (bglB)maps-> Starch and sucrose metabolism
    Phenylpropanoid biosynthesis
    Cyanoamino acid metabolism
InterPro domain[29-503] IPR0013601.3e-244Glycoside hydrolase, family 1
[30-491] IPR0137811.3e-176Glycoside hydrolase, subgroup, catalytic core
[29-504] IPR0178535.3e-166Glycoside hydrolase, superfamily
Orthology groupMCL10040 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201334-TA
ATGTTCCGTGCTGTAAGATTAATTTTCCTGGCTTTCGTTTTGACTGTGCTTGTCGGTAGCAATGAAATCTCTCGACATGAAGCCAGAAAAATACCTGACGACTTACTTTTTGGAGCTGCTACGGCATCCTACCAAATAGAAGGAGCTTGGAATGAAGATGGTAAATCTGAAAATATTTGGGATCGATTGACACACCTAAAACCTTGTTATATACACAACTGTGACACGGGAGATATCGCTGCTGATTCCTATCACCAATATAAGCGCGATGTGGAAATGATGCGGGAACTAGGTCTCGACTTTTATAGGTTCTCTCTCTCCTGGACGAGAATATTACCAACGAGTTTTCCAGATCAAATTAATGAAAAAGGAGTACAATATTACAATAATTTGATAAATGAGATGCTCAAATACAACATACAACCCATGGTGACTCTTTATCATTGGGATTTACCTCAGAAGCTGCAAGATCTGGGAGGATGGGCAAATCCCCATATAGTTGATTGGTTTACCGACTATGCCAAAGTAGTTTTCGAGTTATTTGGAGACAGGGTTAAGTACTGGATAACTGTCAATGAACCTAAACATGTTTGTCATCAAACAACCCCACAACTATCACTAGATCCATCTTATAGTGTTTCTTCACATTTTCATTACATGTGTGCCAAAAATCTGCTAGTAGCACATGCTAACGTCTACCATTTGTATAATAATAAATTTCGTGAAGTCCAAGGTGGTCAAGTCGGTATAACAATAAGTTCCGCGTGGGCTGAACCTGAGTCTGAAAATGACATGAAAGCTGCTGAAGATGCCATGCAATTTGAGATGGGTCTTTTTGCAAATCCAATATTTTCGGAGTCTGGAGATTATCCATCAGTCATGAAAGAAAGAATAGCAGCAAAGAGTAAGGAACAAGGATTTCCGAGATCACGATTACCACAATTCACTCCGGAGGAAGTAGATTTAATAAAAGGAAGCTCAGACTTCATTGGATTAAATCATTATACTACTAACATTGTTTATAGAAACGAATCTGTCTATGGAAGTTACAGTTCTCCATCACTTGAAGATGATGTGGAAGTTTTAAGTTATCAAGATAGTTCATGGGACTCAGGTGCTTCATCGTGGTTGAAGCGTGTACCCTGGGGATTTTATAAATTATTAACAAAAATACGAGAGGACTACAACAACCCACCAGTTTTCATCACTGAAAATGGATTCTCATCTCGGGGTGGTCTAATTGACGACGACCGCGTAAAGTATTACAGAACATACATTGATGCTATGCTCGATGCTATTGAAGATGGATCAGATATAAGAGTTTATACAGCGTGGAGTTTGATGGACAATTTCGAATGGATGGAGGGATACAGCGAACGTTTTGGCCTGTACGAGGTGGACTACGAGAGTCCTGAACGCACCCGCACTCCTCGCAAGTCTGCTTACGTGTACAAAGAGATGCTGCGCACACGCACACTGGACTATCATTATGAACCTGACATGAGCTTGGGAATGAATGTCGATGAAAACTAA

Protein sequence:

>DPOGS201334-PA
MFRAVRLIFLAFVLTVLVGSNEISRHEARKIPDDLLFGAATASYQIEGAWNEDGKSENIWDRLTHLKPCYIHNCDTGDIAADSYHQYKRDVEMMRELGLDFYRFSLSWTRILPTSFPDQINEKGVQYYNNLINEMLKYNIQPMVTLYHWDLPQKLQDLGGWANPHIVDWFTDYAKVVFELFGDRVKYWITVNEPKHVCHQTTPQLSLDPSYSVSSHFHYMCAKNLLVAHANVYHLYNNKFREVQGGQVGITISSAWAEPESENDMKAAEDAMQFEMGLFANPIFSESGDYPSVMKERIAAKSKEQGFPRSRLPQFTPEEVDLIKGSSDFIGLNHYTTNIVYRNESVYGSYSSPSLEDDVEVLSYQDSSWDSGASSWLKRVPWGFYKLLTKIREDYNNPPVFITENGFSSRGGLIDDDRVKYYRTYIDAMLDAIEDGSDIRVYTAWSLMDNFEWMEGYSERFGLYEVDYESPERTRTPRKSAYVYKEMLRTRTLDYHYEPDMSLGMNVDEN-