Monarch geneset OGS2.0

DPOGS201337
TranscriptDPOGS201337-TA1527 bp
ProteinDPOGS201337-PA508 aa
Genomic positionDPSCF300176 + 657434-660459
RNAseq coverage22x (Rank: top 78%)
Annotation
HeliconiusHMEL0123990.066.21% 
BombyxBGIBMGA010536-TA0.061.65% 
DrosophilaCG9701-PA1e-14048.97% 
EBI UniRef50UniRef50_G6DAN30.076.65%Glycoside hydrolase n=10 Tax=Obtectomera RepID=G6DAN3_DANPL
NCBI RefSeqXP_001850321.16e-14751.25%glycoside hydrolase [Culex quinquefasciatus]
NCBI nr blastpgi|3640235850.065.08%seminal fluid protein CSSFP001 [Chilo suppressalis]
NCBI nr blastxgi|3640235850.065.62%seminal fluid protein CSSFP001 [Chilo suppressalis]
Group
Gene OntologyGO:00045532.6e-238hydrolase activity, hydrolyzing O-glycosyl compounds
GO:00059752.6e-238carbohydrate metabolic process
GO:00431695.4e-175cation binding
GO:00038245.4e-175catalytic activity
KEGG pathwaytca:6645776e-134 
 K05350 (bglB)maps-> Starch and sucrose metabolism
    Phenylpropanoid biosynthesis
    Cyanoamino acid metabolism
InterPro domain[29-501] IPR0013602.6e-238Glycoside hydrolase, family 1
[30-487] IPR0137815.4e-175Glycoside hydrolase, subgroup, catalytic core
[29-502] IPR0178534.5e-164Glycoside hydrolase, superfamily
Orthology groupMCL10040 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201337-TA
ATGTTCGGAGCGGTGAGATTGGCTTTTCTGGCCTCAGTTTTAGCTGGACATGCTGATAGCACAAAAATCTTTCGACACGAATCAAGGAAATTCCCTGACCACTTACTGTTTGGTGCTGCTACCGCGGCGTACCAAATAGAAGGGGCTTGGAATGAAGACGGTAAATCTGAAAGTATATGGGACCGCGTCACACACATGGTACCTTGTGTTATAGCTAATTGTGACACAGGCGACGTGGCCGATGATTCTTATCACCAATATAAGCGTGATGTGGAAATGATGCGGGAGCTAGGTCTCGACTTTTACAGGTTCTCTCTCTCCTGGACGAGAATATTACCAACGAGTTTTCCAGACCACATTAATGAAAAAGGAGTACAATATTATAATAATTTGATAAATGAGATGCTCAAATACAACATACAACCCATGGTGACTATTTATCATTGGGATTTACCTCAGAAGCTGCAAGATCTGGGAGGATGGACCAATCCCCATATCGTTGATTGGTTTACCGATTACTCCAGAGTAGTGTTCCAGTTATTTGGAGATAGGGTTAAGTATTGGGTAACTATTAATGAACCTCGAGAAGTTTGTGGTCAGGGATACGGAATGCAAACTATGGCTCCACTTCTAAATTATTCTGGATATGCTGATTACATATGTGCCAAAAACATACTTTTAGCTCATGCAAATGCCTATCATTTGTATAATGATGAATTTCGTGAAGCCCAGGGTGGTCAGATAGGTATAAGTCTCAGTGCTCATTGGTACGAACCTGAGTCAGAAAACGAAGTTGAATCCGCAGAAGTCTTTCGACAATTTGAGGTGGGTATTTATGCCAATCCAATATTTTCAAAATTGGGCGATTTCCCGTCCGTCGTTAAGGAAAAAGTAGCGGCAAGGAGTCAGATGCAAGGCTTTCCACGATCGCGACTACCAGAATTAACTCCTGAGGAAATTGAATTCGTTAAAGGAAGTTCCGACTTCTTTGGTTTAAACCATTATACTACATTTTTAACTTATGTATCAAAATTTCCAATACAATATCCTACATTTTATTATGCTGATATCGAAGTCTTACCTTATCAACCCGATGAATGGAATTCAAGTTATTCGAAATGGATGAAGGTAGTACCCTGGGGATTTTATAAGGTGCTAACTAAAATACGAGAGGAATATAATAATCCACCGGTTTTTATTACTGAAAATGGTTACGCATCTCCTCGTGGTCTCATAGACGACGACCGCATCGACTTTTATAGAAAATATATTAATGCTATGCTCGATGCTATAGAAGATGGAAGTGACGTTAGGGCTTACACTGCATGGAGTTTGATGGATAATTTGGAATGGATGAGTGGATACACTGAACGTTTCGGACTGTACGAGGTGGACTACGAGAGTCCTGAACGCATCCGCACTCCTCGCAAGTCTGCTTACGTGTACAAGGAGATGCTGCGCATCCGAGTACTGGACTATCATTATGAACCTGATATGAGCTTGGGAATGAATGTCGATGATAATTAA

Protein sequence:

>DPOGS201337-PA
MFGAVRLAFLASVLAGHADSTKIFRHESRKFPDHLLFGAATAAYQIEGAWNEDGKSESIWDRVTHMVPCVIANCDTGDVADDSYHQYKRDVEMMRELGLDFYRFSLSWTRILPTSFPDHINEKGVQYYNNLINEMLKYNIQPMVTIYHWDLPQKLQDLGGWTNPHIVDWFTDYSRVVFQLFGDRVKYWVTINEPREVCGQGYGMQTMAPLLNYSGYADYICAKNILLAHANAYHLYNDEFREAQGGQIGISLSAHWYEPESENEVESAEVFRQFEVGIYANPIFSKLGDFPSVVKEKVAARSQMQGFPRSRLPELTPEEIEFVKGSSDFFGLNHYTTFLTYVSKFPIQYPTFYYADIEVLPYQPDEWNSSYSKWMKVVPWGFYKVLTKIREEYNNPPVFITENGYASPRGLIDDDRIDFYRKYINAMLDAIEDGSDVRAYTAWSLMDNLEWMSGYTERFGLYEVDYESPERIRTPRKSAYVYKEMLRIRVLDYHYEPDMSLGMNVDDN-