Monarch geneset OGS2.0

DPOGS205602
Transcript: DPOGS205602-TA (2760 bp)
Protein: DPOGS205602-PA (919 aa)
Genomic position: DPSCF300167 - 102767-115621
RNAseq coverage: 212x (Rank: top 46%)
Annotation
Heliconius: HMEL005005; 0.0; 83.75%
Bombyx: BGIBMGA007151-TA; 0.0; 80.14%
Drosophila: CG33080-PA; 0.0; 47.88%
EBI UniRef50: UniRef50_E2AQK9; 0.0; 51.98%; Uncharacterized family 31 glucosidase KIAA1161 n=7 Tax=Neoptera RepID=E2AQK9_CAMFO
NCBI RefSeq: XP_001607147.1; 0.0; 52.53%; PREDICTED: similar to ENSANGP00000011992 [Nasonia vitripennis]
NCBI nr blastp: gi|332029283; 0.0; 52.27%; Uncharacterized family 31 glucosidase [Acromyrmex echinatior]
NCBI nr blastx: gi|383856284; 0.0; 52.24%; PREDICTED: uncharacterized protein LOC100882776 [Megachile rotundata]
Group
Gene Ontology: GO:0004553; 8.5e-132; hydrolase activity, hydrolyzing O-glycosyl compounds
    GO:0005975; 8.5e-132; carbohydrate metabolic process
KEGG pathway: dme:Dmel_CG11909; 4e-50
    K01187 (E3.2.1.20, malZ) maps-> Starch and sucrose metabolism
    Galactose metabolism
InterPro domain: [331-913] IPR000322; 8.5e-132; Glycoside hydrolase, family 31
    [488-866] IPR017853; 3.3e-39; Glycoside hydrolase, superfamily
Orthology group: MCL15737; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205602-TA
ATGAGGATTCTGGAAACGTGCGGAACTCCTAACGATGTACTGACACCAGATCAAGCAGACAGAAATGATTTTCAAGTTAACCTTGACATCAATCGGTCAGTCACACCGGAGATAAAGATATTCGGTCCAGAGGATGATACGGCGGATCATAGGGATGAGGACTCCGCTCCGACTGAAAGAGAACATGATAGAGACGTTGAAGGAAAAAGTACAGTTGACATAAAAAGAGACAACACGAGTTCAGTCGCACACACCTTTAAATACCGGTCTCCTCTGTGGTTCGAGTCGGATTCTGACATCAGTAAGAGTGGCAGCAACTCAAGCGGGGACAGGGTCCAAGGTGGGGGGCTCGGCTGCGGAGACGAGAGCCTGGAATGCAGCTCAAACAGTTCAGACAGCGTCAACAACTTCCTAGATAAAAAGATTCCTGAACATCACCAGAGCTCAGTTTCGGTGTTCAGTGACGACAACAATGATGTTCATGATGATCAGGCTAAAGTACCGCTTAGATCGCCTCGCCGTAAGTCCACAGCCCCTCGACGGTTCAGATCTGAGTTCAGCATGGACGAAGATGAGTACTCCCCAAGCAATTCAGTCACGAGTGTGAACTCGCTCGCTAGTCTACTTAAGGAGAAATTACAAAGCATTCCTCAGAAAATAAGAAAAAAGCCTACAGATTACAAGCTGCGCGCGTTCGTCGGACTAATGTTTCTAGCCGTGGTGTTCTTTGTGGGGTTCGCCTACGTCCTCTACCACCGCCAGGCACTCACGACCGCCTACTTCGAAAGAGTGCAGTTCAATGAACCCAAACGACTTATCAGGGTTTACAATCAGGATGATGTGGAAATACTGAAAGCCAGATTGGGAGTGGATCTTCACGGGCAGCACAAATCCTTCCCGTGTCTACCGCAGCACCGCCGGCGAGGCTCTGAATGCCGAGAGTGGCTTCACGCGTTGCGACTTTACCTCACGAGTCTACCTCCAGAGCATGACAATACCACGTGCTACTCTGTCACCTGGCAAGCACTGTCAAATGACGTTACACCAAACGATTGCTTCGACTGGGGCGACACGAAAGTTAACTGGTTCGGAGCTGGGCAGTCTCTTAACCTAACCTGGCCACTCAACAGCGGTGCCATAGATTATACGCCTTTCATCACAGGCGACATGCAAAAATCTCAGTTCGGTAACGTTGTTACGAGATATTTGATTAACTCGAAAGGAGGAGCGATCACTGTGGATGAAGACACTCCGTTGCATATTTCTGTTAACAGAGGTAGAAAGGAAATATGTTTGAAAGCTAAGTACGACGACTTCGCATTCGCTAATAAAATTACAGAGTTCCCTGAACTGAAATACAATATATGCACTGCAAGGGATATAAAGTCCTTGCATTCTTCTATTCATAACCACAGAAGAGCCCCTCTGTGGGACGGCCTGAAGCCAGGTGACATTAAAACATTAGACTCTCTCATTTCTGAACCAGTTTGGCAAATTGCTCCTCGATTCAAACACGAATTACAAGACGAAACAATAGCCAAATATACAGAAGATGTTATAAGCCTAGGGTTTTTGAAACAAGGGCATGTATTGATTAACGAGTTTTGGCAGAATGAAATTGGTGACTTGACAGTCGATACGAGTCGCTTTGCAACATTAAATGTAACCGTAAACAAACTACACAGACGGGGCTTTAAGGTAGCTTTTACGATACAACCCTTTATAAGTACTGAAAGTAAAAACTTCGCTGAGACTGTACAAAAAAGATTGTTGATCAGCGAAAGAAACAGCGACAGAAGAATCCCAGCGTTGACAAGGTTTAAGTCTCTAGCGAGTGCGGGCGTGTTGGATATAACTAATAATAGATCTGTGCCTTGGATTATGGATAAACTACAGACTGTGATCTCCGCATATCATATAGACTCGTTTTATTTCGACCTCGGCACCGCTTACGACATGCCGCACTATTATCGGTGCGAACAAAAGTTGATAAATCCAGA
TCAATACAAGACAATATTTACCAAAACATTTGAGAAGGCTTTGAACATTATTGGCGTGTCCTCTGCCATACATCTCCCGCGTCCACCGATTTTCGTATCTTTGCCGCCGTTTGAATCTACTTGGGATGCTTTAAGACTGGTGATACCAACGATGTTGACGTACGGCATAAACGGTTTCCCGTTTACAATGCCGGGCGCAGTGGGGGGAGACATATACTGGCCTGGAAGCGAACAGTTCTTACCATCAGCTAAAGGAGCCGTAGAATCGATCGTGAACAGCACAACCCAGGAGAATGGTATCGAGTTACCGGAAAGAGAATTGTACATGAGGTGGCTGCAACTAGCAACCTTCTTACCTGTCATGAAGTTTACTCACCTGCCAAGCAAGTACAACGACGTCACTGTCCTAGAAATGGCTAAAAATCTAACTCTTCTACGACAAATGTATGTGACGCCTTTGTTATTAAAATACAAGCGTGAAGCTCTAGAGGAAGGCCTGCCGTTGGTGCGGCCTCTGTGGCTGGTGGCGGACGCTGACGTCACCCCCGCTCTGGACGAGTTCGTCATTGGAGACGAGATTGTAGTCGCGCCTGTCGTTCACCAAGGACACACCACGAGAGAAGTGTATTTGCCGGCTGGTCTGTGGCAGGACGGTATAGACGGTTCATTGAGGAAAGGCAATCGTTGGATGCACGACTACCGCGTGCCCGCTACCAAGGTCGCGTACTTCCTCAGGAAACCGGACGACTTAAGGTTTTAA

Protein sequence:

>DPOGS205602-PA
MRILETCGTPNDVLTPDQADRNDFQVNLDINRSVTPEIKIFGPEDDTADHRDEDSAPTEREHDRDVEGKSTVDIKRDNTSSVAHTFKYRSPLWFESDSDISKSGSNSSGDRVQGGGLGCGDESLECSSNSSDSVNNFLDKKIPEHHQSSVSVFSDDNNDVHDDQAKVPLRSPRRKSTAPRRFRSEFSMDEDEYSPSNSVTSVNSLASLLKEKLQSIPQKIRKKPTDYKLRAFVGLMFLAVVFFVGFAYVLYHRQALTTAYFERVQFNEPKRLIRVYNQDDVEILKARLGVDLHGQHKSFPCLPQHRRRGSECREWLHALRLYLTSLPPEHDNTTCYSVTWQALSNDVTPNDCFDWGDTKVNWFGAGQSLNLTWPLNSGAIDYTPFITGDMQKSQFGNVVTRYLINSKGGAITVDEDTPLHISVNRGRKEICLKAKYDDFAFANKITEFPELKYNICTARDIKSLHSSIHNHRRAPLWDGLKPGDIKTLDSLISEPVWQIAPRFKHELQDETIAKYTEDVISLGFLKQGHVLINEFWQNEIGDLTVDTSRFATLNVTVNKLHRRGFKVAFTIQPFISTESKNFAETVQKRLLISERNSDRRIPALTRFKSLASAGVLDITNNRSVPWIMDKLQTVISAYHIDSFYFDLGTAYDMPHYYRCEQKLINPDQYKTIFTKTFEKALNIIGVSSAIHLPRPPIFVSLPPFESTWDALRLVIPTMLTYGINGFPFTMPGAVGGDIYWPGSEQFLPSAKGAVESIVNSTTQENGIELPERELYMRWLQLATFLPVMKFTHLPSKYNDVTVLEMAKNLTLLRQMYVTPLLLKYKREALEEGLPLVRPLWLVADADVTPALDEFVIGDEIVVAPVVHQGHTTREVYLPAGLWQDGIDGSLRKGNRWMHDYRVPATKVAYFLRKPDDLRF-
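
The transcript and protein entries above can be cross-checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not part of the original record: it assumes the standard genetic code, assumes the transcript begins at the start codon, and uses "-" for the stop codon as the protein entry does. The helper names `read_fasta` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# the record uses the standard nuclear code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def read_fasta(text):
    """Parse FASTA text into {record id: sequence}, joining wrapped lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {key: "".join(parts) for key, parts in records.items()}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon from position 0.

    Stop codons are written as stop_symbol; codons containing characters
    other than T, C, A, G are written as "X". A trailing partial codon
    is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# The first 18 bases of the transcript, wrapped over two lines.
example = ">DPOGS205602-TA\nATGAGGATT\nCTGGAAACG\n"
seqs = read_fasta(example)
print(translate(seqs["DPOGS205602-TA"]))  # MRILET
```

The lengths stated in the record are consistent with this reading: 919 codons plus one stop codon is 920 × 3 = 2760 bp.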