Monarch geneset OGS2.0

DPOGS201336
Transcript        DPOGS201336-TA  1524 bp
Protein           DPOGS201336-PA  507 aa
Genomic position  DPSCF300176 + 653181-655922
RNAseq coverage   71x (Rank: top 66%)
Annotation
Heliconius      HMEL012399        0.0     65.64%
Bombyx          BGIBMGA010536-TA  0.0     61.86%
Drosophila      CG9701-PA         4e-140  49.07%
EBI UniRef50    UniRef50_G6DAN3   0.0     76.87%  Glycoside hydrolase n=10 Tax=Obtectomera RepID=G6DAN3_DANPL
NCBI RefSeq     XP_001850321.1    1e-146  51.67%  glycoside hydrolase [Culex quinquefasciatus]
NCBI nr blastp  gi|364023585      0.0     63.12%  seminal fluid protein CSSFP001 [Chilo suppressalis]
NCBI nr blastx  gi|364023585      0.0     63.12%  seminal fluid protein CSSFP001 [Chilo suppressalis]
Group
Gene Ontology  GO:0004553  1.5e-237  hydrolase activity, hydrolyzing O-glycosyl compounds
               GO:0005975  1.5e-237  carbohydrate metabolic process
               GO:0043169  2.5e-175  cation binding
               GO:0003824  2.5e-175  catalytic activity
KEGG pathway  tca:664577  2e-133
              K05350 (bglB) maps-> Starch and sucrose metabolism
                                   Phenylpropanoid biosynthesis
                                   Cyanoamino acid metabolism
InterPro domain  [36-500]  IPR001360  1.5e-237  Glycoside hydrolase, family 1
                 [30-488]  IPR013781  2.5e-175  Glycoside hydrolase, subgroup, catalytic core
                 [28-501]  IPR017853  2.3e-164  Glycoside hydrolase, superfamily
Orthology group  MCL10040  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201336-TA
ATGTTCGGCGCGGTGATATTAGCCTGTCTGGCATCCATTTTAGTAGGACGTATAGATAGCAGAGAATTTCGTAATGAATCAAGGATATTACCGCCTCACTTACTTTTCGGAGCTGCTACAGCGGCGTACCAGATAGAAGGGGCTTGGAACGAAGACGGCAAATCTGAAAGTATATGGGACCGCGTCACACACATGGTACCTTGTGTTATAGCTAATTGTGACACAGGCGACGTGGCCGATGATTCTTATCACCAATATAAGCGTGATGTGGAAATGATGCGGGAGCTAGGTCTCGACTTTTACAGGTTCTCTCTCTCCTGGACGAGAATATTACCAACGAGTTTTCCAGACCACATTAATGAAAAAGGAGTACAATATTATAATAATTTGATAAATGAGATGCTCAAATACAACATACAACCCATGGTGACTATTTATCATTGGGATTTACCTCAGAAGCTGCAAGATCTGGGAGGATGGACCAATCCCCATATCGTTGATTGGTTTACCGATTACTCCAGAGTAGTGTTCCAGTTATTTGGAGATAGGGTTAAGTATTGGGTAACTATTAATGAACCTCGAGAAGTTTGTGGTCAGGGATACGGAATGCAAACTATGGCTCCACTTCTAAATTATTCTGGATATGCTGATTACATATGTGCCAAAAACATACTTTTAGCTCATGCAAATGCCTATCATTTGTATAATGATGAATTTCGTGAAGCCCAGGGTGGTCAGATAGGTATAAGTCTCAGTGCTCATTGGTACGAACCTGAGTCAGAAAACGAAGTTGAATCCGCAGAAGTCTTTCGACAATTTGAGGTGGGTATTTATGCCAATCCAATATTTTCAAAATTGGGCGATTTCCCGTCCGTCGTTAAGGAAAAAGTAGCGGCAAGGAGTCAGATGCAAGGCTTTCCACGATCGCGACTACCAGAATTAACTCCTGAGGAAATTGAATTCGTTAAAGGAAGTTCCGACTTCTTTGGTTTAAACCATTATACTACATTTTTAACTTATGTATCAAAATTTCCAATACAATATCCTACATTTTATTATGCTGATATCGAAGTCTTACCTTATCAACCCGATGAATGGAATTCAAGTTATTCGAAATGGATGAAGGTAGTACCCTGGGGATTTTATAAGGTGCTAACTAAAATACGAGAGGAATATAATAATCCACCGGTTTTTATTACTGAAAATGGTTACGCATCTCCTCGTGGTCTCATAGACGACGACCGCATCGACTTTTATAGAAAATATATTAATGCTATGCTCGATGCTATAGAAGATGGAAGTGACGTTAGGGCTTACACTGCATGGAGTTTGATGGATAATTTGGAATGGATGAGTGGATACACTGAACGTTTCGGACTGTACGAGGTGGACTACGAGAGTCCTGAACGCACCCGCACTCCTCGTAAGTCTGCGTACGTCTATAAGGAGATGTTGCGCACACGAGTACTGGACTATCACTATGAACCTGATATGAGCTTGGGAATGCATATCGATGATAACTAA

Protein sequence:

>DPOGS201336-PA
MFGAVILACLASILVGRIDSREFRNESRILPPHLLFGAATAAYQIEGAWNEDGKSESIWDRVTHMVPCVIANCDTGDVADDSYHQYKRDVEMMRELGLDFYRFSLSWTRILPTSFPDHINEKGVQYYNNLINEMLKYNIQPMVTIYHWDLPQKLQDLGGWTNPHIVDWFTDYSRVVFQLFGDRVKYWVTINEPREVCGQGYGMQTMAPLLNYSGYADYICAKNILLAHANAYHLYNDEFREAQGGQIGISLSAHWYEPESENEVESAEVFRQFEVGIYANPIFSKLGDFPSVVKEKVAARSQMQGFPRSRLPELTPEEIEFVKGSSDFFGLNHYTTFLTYVSKFPIQYPTFYYADIEVLPYQPDEWNSSYSKWMKVVPWGFYKVLTKIREEYNNPPVFITENGYASPRGLIDDDRIDFYRKYINAMLDAIEDGSDVRAYTAWSLMDNLEWMSGYTERFGLYEVDYESPERTRTPRKSAYVYKEMLRTRVLDYHYEPDMSLGMHIDDN-
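
The lengths and coordinates in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the original record. It assumes the transcript model is coding sequence only (no UTRs), which fits a nucleotide sequence that starts with ATG and ends with TAA, and it assumes the genomic coordinates are 1-based and inclusive. The function names are hypothetical.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 1524
PROTEIN_AA = 507
GENOMIC_START, GENOMIC_END = 653181, 655922


def coding_length_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """Return True if the transcript is exactly 3 nt per residue plus one stop codon.

    Assumption: the transcript contains no untranslated regions.
    """
    return transcript_bp == 3 * protein_aa + 3


def genomic_span(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


span_bp = genomic_span(GENOMIC_START, GENOMIC_END)

# Bases inside the genomic span that are absent from the transcript.
# Under the assumptions above these would be intronic.
non_transcript_bp = span_bp - TRANSCRIPT_BP

print(coding_length_consistent(TRANSCRIPT_BP, PROTEIN_AA))
print(span_bp, non_transcript_bp)
```

Under these assumptions, 507 codons plus a stop codon account for all 1524 bp of the transcript, and the genomic span is longer than the transcript, as expected for a gene model with introns.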