Monarch geneset OGS2.0

DPOGS213576
TranscriptDPOGS213576-TA1824 bp
ProteinDPOGS213576-PA607 aa
Genomic positionDPSCF300033 + 142270-150471
RNAseq coverage382x (Rank: top 31%)
Annotation
HeliconiusHMEL0045843e-13450.59% 
BombyxBGIBMGA011646-TA0.063.71% 
DrosophilaHexo2-PA4e-15044.40% 
EBI UniRef50UniRef50_A4PHN60.066.83%Beta-N-acetylglucosaminidase 1 n=3 Tax=Obtectomera RepID=A4PHN6_BOMMO
NCBI RefSeqNP_001078833.10.066.83%beta-N-acetylglucosaminidase 1 [Bombyx mori]
NCBI nr blastpgi|1456518160.066.83%beta-N-acetylglucosaminidase 1 precursor [Bombyx mori]
NCBI nr blastxgi|1578045740.066.28%hexosaminidase [Ostrinia furnacalis]
Group
Gene OntologyGO:00431694.8e-106cation binding
GO:00059754.8e-106carbohydrate metabolic process
GO:00038244.8e-106catalytic activity
GO:00045534.1e-82hydrolase activity, hydrolyzing O-glycosyl compounds
GO:00045635e-40beta-N-acetylhexosaminidase activity
KEGG pathwayapi:1001693533e-151 
 K12373 (HEX)maps-> Lysosome
    Glycosaminoglycan degradation
    Amino sugar and nucleotide sugar metabolism
    Glycosphingolipid biosynthesis - globo series
    Other glycan degradation
    Glycosphingolipid biosynthesis - ganglio series
InterPro domain[223-606] IPR0137814.8e-106Glycoside hydrolase, subgroup, catalytic core
[225-577] IPR0178531e-97Glycoside hydrolase, superfamily
[225-565] IPR0158834.1e-82Glycoside hydrolase, family 20, catalytic core
[175-195] IPR0015405e-40Glycoside hydrolase, family 20
[136-202] IPR0158822.5e-09Acetylhexosaminidase, subunit a/b
Orthology groupMCL16732 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213576-TA
ATGTGTCGTCTGCTGGTGCTGGCGATAGTATTGGTGCATTTAGTCAACTGTCAGGACAAAGAAAATGACATAGAGGAATCAACACCGCTTGTCTATGAACCATCCTGGACATATGAATGTGTGCCGGATGAGGGCTGTCAACGCTCCGATTTTCCGCAGCCGACCCTGAACAGGAGTATGATCTTCGATAGCATCGACCTATGCAGGACCGTCTGCGGCCGCTTCGGAGGAATCTGGCCCAAACCGGTCACGGCGGCCTTGAGCATGCAGACTATTAAGATACATCCGAATTACTTGAGATTCGATCTTAGCAACGCGCCGGCAGAAACCAGAAAGATATTGGCCGAGATGTCTCAAGTGGCCACTCAGAATATCATAAGTGAATGCGAGGGTAATGTCACGGAGGTCGTTGAGATGCCTGTCATTGTACACATCACGGTCAAAACAGACAACATGAACCTAACCTGGCAGACTGACGAACAATACCGCTTGGATGTCCAGAGCAAAGATACCAGCGTCGTGGTCCAAGTTATCGCGGAAACGGTTTTCGGAGCACGCCATGGTTTAGAGACTTTGACGCACCTGATTTCGGCTGATAAGCCAGATTTATCGGAACAATCTAAATGTGGGCTCCGTATGGTAGCGGGTGCCAAAATATGGGATAAACCTGTGTACCCACATAGAGGATTTCTTCTGGACACGTCAAGAAACTTCATCCCTATGGACGATATCAAAAGAATGATCGATGGTTTGGCTACACTCAAGATGAACGTCTTCCACTGGCACGTAACAGATTCACACAGCTTCCCGTTGGAGTCAAGACGAGTGCCGCAGTTTACTAAGTACGGTGCTTATTCCGCCTCAGAGATCTACAGTTCGGAGGAAGTACGTGGTCTGGTGGAATACGCTCTCGTTCGCGGAGTCCGTATCTTAATTGAAATAGACTCACCAGCGCACGCCGGCAATGGCTGGCAGTGGGGGAATGAGTACGGTTTGGGTGATCTGGCGGTTTGTGTGAACGAGAAACCCTGGCGGCAGCTTTGTATTCAGCCACCCTGCGGGCAACTGAACCCCGCCAACCCGGCCGTATATAGAGTATTGAGAGACTTGTACAGAGACATCGCTGAAACTCTCACCAAACCTCCTCTATTTCACATCGGCGGTGATGAGGTATTCTTTGAATGCTGGAACTCGTCAAATACGATCCTAGAATACATGCAGACTAAAGGCTACAGCAGGAATGTTGAAGGTTTCATCAATTTATGGTCGGAATTTCATGAGAAGGCTCTCAATATTTGGGATGAAGAACTAGCAGCGATCGGAGAAACGGAGAAGCAGCCAGTCTTAATCTGGTCTTCCGAACTCACACAAGCCCACAGAATACAAAAACATTTGGACAAAAAAAGATACACAATTGAAGTTTGGGAACCTCTGTCCAGTCCTTTGTTAATTCAACTGATACGTCTGGGCTACAACGTTATATCAGTCCCCAAAGACGTGTGGTATCTCGATCATGGGTTTTGGGGTCAGACGAAATACTCTAACTGGAGAAGAATGTACGCACACACACTACCAAGAGATCCAAATGTTTTGGGGGGTGAAGTCGCTATGTGGACTGAATATGTGGATAAAGAGGCTTTGGATCCGAGAGTGTTCCCTCGAGTGGCTAGCGTGGCGGAACGTCTTTGGTCGGATCCCACGACGGGAGCGAGTGGCGCTCAGCCTCGCCTGCAGCGTGTGAGGACGAGACTCGTCCAACGAGGACTGAGAGCGGACGTGCTAGCTCCAGGCTGGTGCGCTCAACACGACACGCGCTGTTTATAG

Protein sequence:

>DPOGS213576-PA
MCRLLVLAIVLVHLVNCQDKENDIEESTPLVYEPSWTYECVPDEGCQRSDFPQPTLNRSMIFDSIDLCRTVCGRFGGIWPKPVTAALSMQTIKIHPNYLRFDLSNAPAETRKILAEMSQVATQNIISECEGNVTEVVEMPVIVHITVKTDNMNLTWQTDEQYRLDVQSKDTSVVVQVIAETVFGARHGLETLTHLISADKPDLSEQSKCGLRMVAGAKIWDKPVYPHRGFLLDTSRNFIPMDDIKRMIDGLATLKMNVFHWHVTDSHSFPLESRRVPQFTKYGAYSASEIYSSEEVRGLVEYALVRGVRILIEIDSPAHAGNGWQWGNEYGLGDLAVCVNEKPWRQLCIQPPCGQLNPANPAVYRVLRDLYRDIAETLTKPPLFHIGGDEVFFECWNSSNTILEYMQTKGYSRNVEGFINLWSEFHEKALNIWDEELAAIGETEKQPVLIWSSELTQAHRIQKHLDKKRYTIEVWEPLSSPLLIQLIRLGYNVISVPKDVWYLDHGFWGQTKYSNWRRMYAHTLPRDPNVLGGEVAMWTEYVDKEALDPRVFPRVASVAERLWSDPTTGASGAQPRLQRVRTRLVQRGLRADVLAPGWCAQHDTRCL-