Monarch geneset OGS2.0

DPOGS201065
TranscriptDPOGS201065-TA2019 bp
ProteinDPOGS201065-PA672 aa
Genomic positionDPSCF300185 - 365159-369733
RNAseq coverage42x (Rank: top 72%)
Annotation
HeliconiusHMEL0080000.070.72% 
BombyxBGIBMGA007197-TA0.067.64% 
Drosophilatobi-PA4e-16050.19% 
EBI UniRef50UniRef50_Q19P003e-17855.51%Glycosyl hydrolase family 31 protein (Fragment) n=2 Tax=Obtectomera RepID=Q19P00_BOMMO
NCBI RefSeqXP_002053192.14e-16552.15%GJ23750 [Drosophila virilis]
NCBI nr blastpgi|1030581581e-17755.51%glycosyl hydrolase family 31 protein [Bombyx mori]
NCBI nr blastxgi|1030581582e-18055.41%glycosyl hydrolase family 31 protein [Bombyx mori]
Group
Gene OntologyGO:00045531.8e-184hydrolase activity, hydrolyzing O-glycosyl compounds
GO:00059751.8e-184carbohydrate metabolic process
GO:00081522.2e-21metabolic process
GO:00038242.2e-21catalytic activity
KEGG pathwaydme:Dmel_CG119093e-158 
 K01187 (E3.2.1.20, malZ)maps-> Starch and sucrose metabolism
    Galactose metabolism
InterPro domain[98-643] IPR0003221.8e-184Glycoside hydrolase, family 31
[233-579] IPR0178538.1e-75Glycoside hydrolase, superfamily
[359-428] IPR0137852.2e-21Aldolase-type TIM barrel
Orthology groupMCL10426 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201065-TA
ATGGCCGCCTTGAGGATAGTATTGGGAGTGCTGACGTTGGGTGCTCGTGTGTTCACTGCGGACCTCTCGCTCCGAGCTGATGATGTTGAGATCACTTTGGAACCTCAAATCACAGGAGGATTCTTATTAGTTCTCACACAAGATGAAGAGAGAGTAGTATACGGCCACATCGGACGCACTCTGTACTCAGCATTCACAACTGAGAACGTGGAGGGAGGAGTGTTAGTACAAATGGGCAACGTCGAGCTCACCATCCACACCGTCGCTGATGAACGAGCCAAAGCCAGGGGAATAAAAATCAGGTGGAACACTGAAGATACCTTACGGTTCGAGGACTGTTTTGATTTTGGTTCCAAGCACTGGTATGGAGGAAAAATGCAAATTAACCAGATTTACCCGCTTGAGGCTGGTCAGCAGAATTATAGTGCTTATATTTCACAAGAATACGACAATGGAGCGATAGTTGAGAGATACTGGCTGAACTCAGCGGGAGAATATGTTTATGTCCATCCCCAAGTTCCTCTGTTTGTTGATTATCACCAGATTTATCCTAATCATATCTGCTTCGGTGCTCAAATAGCTGATCCCTATTCCACGAAAAGGAACCACACTGAACTGACCTACGATATCTGGTTCCTACCCAATGTAAAAGAAGCCCACAAGCACGCAGTTGCAAATTACTTGGGTAAACCTTCTGGTGTACCAGATTTCAGAATGGTAGAGCATCCGATTTGGTCCACATGGGCTCAATACTCCCAAGATATTACGCAGGAGAAGGTTATGGATTTTGCTCAACAAATACTTGCCAATGGTTTTAATAACTCCCAAATAGAAATTGATGATCTATGGGAGACATGTTACGGATCCTTGGAAGTAGATCCTGTCAAGTTTCCCAACATGACAAATTTAGTGAAGAATCTAAAAGATTTAGGGTTCCGCGTCACTATCTGGATCCACCCCTTCATTAATAGTAATTGCAATCCTTGGTACTCTGAAGCTCTTGACAATGGCTACTTAGTACTCGATGAGGATGGCACTGCCAAAGCAAACTGGTGGAATACCAACGGTTCAGAGCCAGGTCTGGTAGACTTCACGAATCCCGCGGCAGCGGAGTGGTGGTACACACGCGTCAGAAACCTCCTCGACACTTACGACATAGACAGCATCAAATTTGACGCGGGAGAGAGCAGTTTTTCACCACAGATTCCCGTCCAACACGGCGACATTAATCTTCACCCTCACAACATCGTCGACGCCTACGTTCGGACCTGTGCGAGGTTCGGAGATATGATCGAAGTCCGATCTGCCTTCCGGACCCAAGACCTTCCTATATTTGTTCGCATGGTAGACAGAGACTCCATCTGGGGGATGAACAACGGTCTCCCAACCATCGTGACCGCGACTCTTCAGATGAATCTGAACGGATATCCGTTTGTTCTGCCGGATATGATCGGTGGTAACGGATACAACCTGAACCACGCGCAAGCCGACATACCCACCAAGGAGTTGTTCATTCGATGGGTGCAAGCGAATACTTTCTTGCCAGCCATGCAGTACTCGTATGTGCCCTGGAACTTTGATAATGAGACTTTAGATATAAGTCGGAAGTATACGGAGCTTCACGCTGCGCACGCGCACGATGTGTATGAGGCGATGCAGGCCGCCGTGGAGACCGGGGCTCCGGTCAACGCACCGCTCTGGTGGATCGACCCTGACAACGAGCAGACTTACACTATTTGGGATCAGTACCTATTAGGTGAGAATATCATAGTGGCTCCAATATTCAGTGAAGGGGCGACATCCCGCGACATCTATCTTCCTTCTGGTCGCTGGTTGGAGGAAGGAGACCCTGGCAAGGTGTTTGAAGGTCCGATATGGATCAGGAACTTCCCCGCACCCATCGACGTGCTGCCGTACTTCGTGAGAGAAGCCGAAGAGTCTAACGACTCGAGCATCCTACTCGCCTCCGTTTTCTTAATCCTTGCAGCCATTATCGCCAACCCCATCTTATCTTTTTAG

Protein sequence:

>DPOGS201065-PA
MAALRIVLGVLTLGARVFTADLSLRADDVEITLEPQITGGFLLVLTQDEERVVYGHIGRTLYSAFTTENVEGGVLVQMGNVELTIHTVADERAKARGIKIRWNTEDTLRFEDCFDFGSKHWYGGKMQINQIYPLEAGQQNYSAYISQEYDNGAIVERYWLNSAGEYVYVHPQVPLFVDYHQIYPNHICFGAQIADPYSTKRNHTELTYDIWFLPNVKEAHKHAVANYLGKPSGVPDFRMVEHPIWSTWAQYSQDITQEKVMDFAQQILANGFNNSQIEIDDLWETCYGSLEVDPVKFPNMTNLVKNLKDLGFRVTIWIHPFINSNCNPWYSEALDNGYLVLDEDGTAKANWWNTNGSEPGLVDFTNPAAAEWWYTRVRNLLDTYDIDSIKFDAGESSFSPQIPVQHGDINLHPHNIVDAYVRTCARFGDMIEVRSAFRTQDLPIFVRMVDRDSIWGMNNGLPTIVTATLQMNLNGYPFVLPDMIGGNGYNLNHAQADIPTKELFIRWVQANTFLPAMQYSYVPWNFDNETLDISRKYTELHAAHAHDVYEAMQAAVETGAPVNAPLWWIDPDNEQTYTIWDQYLLGENIIVAPIFSEGATSRDIYLPSGRWLEEGDPGKVFEGPIWIRNFPAPIDVLPYFVREAEESNDSSILLASVFLILAAIIANPILSF-