Monarch geneset OGS2.0

DPOGS215795
Transcript: DPOGS215795-TA, 1506 bp
Protein: DPOGS215795-PA, 501 aa
Genomic position: DPSCF300041 + 2063720-2068593
RNAseq coverage: 305x (Rank: top 37%)
Annotation
Heliconius      HMEL005931         0.0      83.67%
Bombyx          BGIBMGA003512-TA   0.0      81.17%
Drosophila      CG9701-PA          2e-133   46.87%
EBI UniRef50    UniRef50_G6D5V2    0.0      100.00%   Glucosidase n=2 Tax=Obtectomera RepID=G6D5V2_DANPL
NCBI RefSeq     NP_001037073.1     0.0      81.57%    glucosidase [Bombyx mori]
NCBI nr blastp  gi|112983036       0.0      81.57%    glucosidase precursor [Bombyx mori]
NCBI nr blastx  gi|112983036       0.0      81.57%    glucosidase precursor [Bombyx mori]
Group
Gene Ontology    GO:0004553   2.1e-229   hydrolase activity, hydrolyzing O-glycosyl compounds
                 GO:0005975   2.1e-229   carbohydrate metabolic process
                 GO:0043169   2e-165     cation binding
                 GO:0003824   2e-165     catalytic activity
KEGG pathway     tca:664577   7e-131
                 K05350 (bglB) maps-> Starch and sucrose metabolism
                                      Phenylpropanoid biosynthesis
                                      Cyanoamino acid metabolism
InterPro domain  [22-501]   IPR001360   2.1e-229   Glycoside hydrolase, family 1
                 [23-487]   IPR013781   2e-165     Glycoside hydrolase, subgroup, catalytic core
                 [23-492]   IPR017853   2.8e-158   Glycoside hydrolase, superfamily
Orthology group  MCL24915   Lepidoptera specific

Nucleotide sequence:

>DPOGS215795-TA
ATGGCGATCAAAGAGGCAACTTTTATCGCCCTCCTGGCGTTAGTACATTCGGGACATGCGCAGTACACTAAGTTTCCGGAAGGATTCACCTTTGGAGTAGCGACTGCTGCTCATCAAATAGAAGGGTCATGGAACGTCAGTGGAAAAACTGAAAACGTATGGGACCATCTGTCTCACAATCGGCCATGGATGATAGCTGACGGAACCAATGGCGATGTAGCCTGTGACTCCTACAACCGTTACCAAGAAGATGTAGATGAGCTGGCATACATGGGTGTAGATTTCTACAGACTGTCTCTGTCTTGGGCCAGAATTCTGCCAACTGGACGCATGGATGTCATAAATCCTGATGGTATTAGATACTACAACGCACTCTTTGATGCTTTAGCCGAAAAAAAAATTGAACCACTGGTTACTCTGTTCCATTGGGATTTACCACAATCACTCCAAGACCTAGGTGGATGGGCGAATCCGAAAATGATTGATTACTTCCGCGATTACGCAGACGTATGCTTCAGAGAGTTCGGTGATAAAGTCAAATCCTGGATTACACTTAATGAGCCCTATGAAATTTGTGAAGATGCTTATGGGGATGACAAGAAAGCTCCTGCCATCGATAGCCACGGTGTAGGAAACTACTTGTGCAGCGACACTCTGTTGAAAGCTCACGCCGAAGTTTACCATCTCTACAACGACACCTACAGACCTATACAAAACGGAAGAATAATGATTTCAATAAATTCAATTTGGTACGAACCAAGTGATCCCGAAAACGCGGAACAAGTTGCTCTGGCTGAAGTTGCTAACCAATTTAAATTCGGGTGGTTCGCAAATCCTATTTTCACCGAAGAAGGTGGCTATCCCGTCGTAATGGTAGAAAATATTGCTGAGCAAAGTAAAGCTGAAGGATTAAATAAACCTAGATTAGAACAATTCGATGAGTACTGGATTGAAAGAATTAAGGGTACATCAGACTTCCTTGGTATCAATCACTACACCACGCATTTGATAACCGGCCCGGGAGTGGACTCTCTCGCCAAACACCCGTCTTGGCTAAAAGATATTGGAGCGGTAGTAAGTTTGGACGTGGGTAGAGATTCAGCCTCAGAGTGGCTAAGAGTAGTGCCAACGGGTTTTGCAAACTTATTACGCTGGTGCAAGAGTACGTACAATGATGTTCCAATTTACATCACCGAGAACGGATTTTCTGATCGTGGCGCCATAGAAGATTACGACCGTATTAGATATTACAACGACTACCTCTCCGAAATTTTGAATGTCATTTATGACGATGATGTCAAAGTCCTTGGTTACACTGCATGGACCCTAATGGACAACTTCGAATGGCGAGCTGGATTTTCTGAACGCTTCGGTCTTTACCACGTGGACATAACGGATCCGAATCTCCCAAGAACACCGAAACTCTCTGCGGAATACTACAAGCAATTATGTGAAACGAAGGAAATACCTCAAGATGAACGGTTCAAGGATCCAGCTGTAAGTTGA

Protein sequence:

>DPOGS215795-PA
MAIKEATFIALLALVHSGHAQYTKFPEGFTFGVATAAHQIEGSWNVSGKTENVWDHLSHNRPWMIADGTNGDVACDSYNRYQEDVDELAYMGVDFYRLSLSWARILPTGRMDVINPDGIRYYNALFDALAEKKIEPLVTLFHWDLPQSLQDLGGWANPKMIDYFRDYADVCFREFGDKVKSWITLNEPYEICEDAYGDDKKAPAIDSHGVGNYLCSDTLLKAHAEVYHLYNDTYRPIQNGRIMISINSIWYEPSDPENAEQVALAEVANQFKFGWFANPIFTEEGGYPVVMVENIAEQSKAEGLNKPRLEQFDEYWIERIKGTSDFLGINHYTTHLITGPGVDSLAKHPSWLKDIGAVVSLDVGRDSASEWLRVVPTGFANLLRWCKSTYNDVPIYITENGFSDRGAIEDYDRIRYYNDYLSEILNVIYDDDVKVLGYTAWTLMDNFEWRAGFSERFGLYHVDITDPNLPRTPKLSAEYYKQLCETKEIPQDERFKDPAVS-
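
The lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable and function names are hypothetical, the values are copied from the record, and it assumes the genomic coordinates are 1-based and inclusive, which the record does not state. It checks that a 501 aa protein plus one stop codon accounts for the 1506 bp transcript, and computes the genomic span for comparison.

```python
# Values copied from the record; names are illustrative, not part of any database API.
TRANSCRIPT_BP = 1506
PROTEIN_AA = 501
GENOMIC_START, GENOMIC_END = 2063720, 2068593


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    # Does the transcript length match the protein length plus a stop codon?
    print(coding_length(PROTEIN_AA) == TRANSCRIPT_BP)
    # Genomic span covered by the gene model, in base pairs.
    print(span_length(GENOMIC_START, GENOMIC_END))
```

Under that coordinate assumption the genomic span is longer than the transcript, which would be consistent with the gene model containing introns.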