Monarch geneset OGS2.0

DPOGS202913
Transcript        DPOGS202913-TA  1611 bp
Protein           DPOGS202913-PA  536 aa
Genomic position  DPSCF300126 + 371149-376097
RNAseq coverage   760x (Rank: top 17%)
Annotation
Heliconius      HMEL014593        8e-171  65.80%
Bombyx          BGIBMGA004194-TA  2e-175  53.46%
Drosophila      Hexo1-PB          2e-58   30.09%
EBI UniRef50    UniRef50_A4LAF9   1e-177  52.79%  Beta-hexosaminidase n=8 Tax=Obtectomera RepID=A4LAF9_OSTFU
NCBI RefSeq     NP_001093291.1    6e-174  53.79%  beta-N-acetylglucosaminidase 2 [Bombyx mori]
NCBI nr blastp  gi|134252572      4e-177  52.79%  beta-hexosaminidase [Ostrinia furnacalis]
NCBI nr blastx  gi|134252572      2e-175  52.79%  beta-hexosaminidase [Ostrinia furnacalis]
Group
Gene Ontology
GO:0043169  3.2e-107  cation binding
GO:0005975  3.2e-107  carbohydrate metabolic process
GO:0003824  3.2e-107  catalytic activity
GO:0004553  6.1e-86   hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0004563  8.8e-61   beta-N-acetylhexosaminidase activity
KEGG pathway
aga:AgaP_AGAP010056  5e-129
K12373 (HEX) maps->
    Lysosome
    Glycosaminoglycan degradation
    Amino sugar and nucleotide sugar metabolism
    Glycosphingolipid biosynthesis - globo series
    Other glycan degradation
    Glycosphingolipid biosynthesis - ganglio series
InterPro domain
[180-535]  IPR013781  3.2e-107  Glycoside hydrolase, subgroup, catalytic core
[182-535]  IPR017853  4.6e-96   Glycoside hydrolase, superfamily
[182-495]  IPR015883  6.1e-86   Glycoside hydrolase, family 20, catalytic core
[139-159]  IPR001540  8.8e-61   Glycoside hydrolase, family 20
[63-180]   IPR015882  2.2e-18   Acetylhexosaminidase, subunit a/b
Orthology group  MCL10643  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202913-TA
ATGCTGCGTTATGCCCTCGTCTTCGTATGTATTCGTTCGATATGGGGTTTCGGGTATCATGTGAATATTGTAAAACCTGGACCCCTGTATCCTCCGACAAAGGGTGAAGTTTGGCCAAAGCCACAGAATGAGAGAAAGGAGCCTATTTATTATTCATTTGATCCTGGACATTTTAAAGTCAAGGTTCAACAGGAAACTTGCGATATATTAACAAATGCTGTGGAACGATATATATATATTATTAAAAATAAGAGCGGTCTACACGCGCGAGACAGGAAGCTTCGTGCTCACAGACGCACCGACGATGTTTACAAGGGGAAGATAAACCAGCTCATGATAACTCTGACGTCTCCCTGCGAGGAGTATCCGCATTTTGATATGATTGAAAGCTACAATCTAAGTGTTGCTGATACATCGCAGCTCACGAGTACCTCAATATGGGGAGTGTTGCGAGGCCTTGAGACATTTTCACAACTATTTTATCTCTCCAATGATCGAAATGAGCTGTATATCAATAAAACGGATATAATTGATTTCCCTCGTTACAAACACCGGGGTATCCTGTTGGATACTTCCCGACATTACGCTACCACATCAACAATTCTTAAACTGTTAGAATCTATATCCATTAATAAAATGAATGTTTTTCACTGGCATATTGTTGATGATCAAAGTTTTCCTTATCAGAGTGAGAAGTTTCCAGAAATAAGTGAACGTGGTGCTTATGATTCTTCTATGGTGTACACGAAAGAAGACATCCTTATGATCATAGATTTTGCAAGAAATCGTGGAATACGGGTCATCCCGGAATTTGATGTTCCCGGACACACTGCGTCCTGGGGCCTCGCATACCCCGGTGTTCTAACGGAGTGTTACAACCAGCAACAGATGGTGGGCCTCGGCCCCATGGATCCCACCAAAAATATAACTTACAAACTCCTCGCTGATCTGTTCGCTGAAGTACAGGATCTGTTCCCCGAGAGATACTTCCACGTTGGCGGTGATGAAGTCGAGTTAAACTGCTGGAGTTCAAATCCACATTTGAGAGACTATATGAATAAGAACAAATTAAAAGTATCGGATCTTCATTCTCTGTTCATGAGGAACGTCATTCCTCTGCTCTCCAACAGCTCCAAAGTGATTGTATGGCAGGAAGTTTTTGACGAAAAAGTTCCTCTATCAATGGACACACTGGTGCAAGTGTGGAAGAACGGTTGGGTGACTGAGATGATCTCGGTGTTGAAATCCGGACACAGCGTGTTGTTCTCGGCGGCGTGGTATCTGGATTCGTTGAACCAAAAGTGGACGGATCTCTACAAACAGGATCCTCGAGGGATGGTGCTGGATGCCACCGACAACAGCTCCCTGGCCGAGGGAGTGGTGGGCGGGGAGGCCTGTATGTGGGGCGAGATGATTAATGTCAGGAGTGTCATGGCTAGGGTATGGCCACGAGCCTGCGCCGTGGCTGAGCGTCTGTGGAGTTCTGTGGAAGGATCGTACTACATAGTGCCAGCGGAGGCCTACCACCGCATCGAAGAACACACCTGCCGCATGATAAGACGAGGCATCGACTCCGGCCCACCATCCGGACCAGGGTTCTGTGTCGTATAG

Protein sequence:

>DPOGS202913-PA
MLRYALVFVCIRSIWGFGYHVNIVKPGPLYPPTKGEVWPKPQNERKEPIYYSFDPGHFKVKVQQETCDILTNAVERYIYIIKNKSGLHARDRKLRAHRRTDDVYKGKINQLMITLTSPCEEYPHFDMIESYNLSVADTSQLTSTSIWGVLRGLETFSQLFYLSNDRNELYINKTDIIDFPRYKHRGILLDTSRHYATTSTILKLLESISINKMNVFHWHIVDDQSFPYQSEKFPEISERGAYDSSMVYTKEDILMIIDFARNRGIRVIPEFDVPGHTASWGLAYPGVLTECYNQQQMVGLGPMDPTKNITYKLLADLFAEVQDLFPERYFHVGGDEVELNCWSSNPHLRDYMNKNKLKVSDLHSLFMRNVIPLLSNSSKVIVWQEVFDEKVPLSMDTLVQVWKNGWVTEMISVLKSGHSVLFSAAWYLDSLNQKWTDLYKQDPRGMVLDATDNSSLAEGVVGGEACMWGEMINVRSVMARVWPRACAVAERLWSSVEGSYYIVPAEAYHRIEEHTCRMIRRGIDSGPPSGPGFCVV-
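
Consistency check (illustrative):

The lengths in this record can be cross-checked against the two sequences. The Python sketch below is not part of the original record. It assumes the standard genetic code (NCBI translation table 1) and assumes the trailing "-" in the protein sequence marks the stop codon. The function name `translate` and the constant names are hypothetical, chosen for illustration only.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: this record was translated with the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol; codons that are not in the
    table (for example, ones containing N) are written as X.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Values copied from the record above.
TRANSCRIPT_BP = 1611
PROTEIN_AA = 536
GENOMIC_START, GENOMIC_END = 371149, 376097

codons = TRANSCRIPT_BP // 3                      # 536 residues plus one stop codon
genomic_span = GENOMIC_END - GENOMIC_START + 1   # inclusive coordinates

print(translate("ATGCTGCGTTATGCCCTCGTCTTCGTATGT"))  # first 30 nt -> MLRYALVFVC
print(translate("GTCGTATAG"))                       # last 9 nt  -> VV-
print(codons, genomic_span)                         # -> 537 4949
```

The 1611 bp transcript corresponds to 537 codons, that is, 536 amino acids plus the stop codon. The genomic interval spans 4949 bp, so the transcript presumably covers only part of it, with the remainder being intronic sequence.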