Monarch geneset OGS2.0

DPOGS212532
Transcript: DPOGS212532-TA, 1317 bp
Protein: DPOGS212532-PA, 438 aa
Genomic position: DPSCF300315 - 206919-208826
RNAseq coverage: 9x (Rank: top 85%)
Annotation
Heliconius       HMEL014535        2e-123  54.05%
Bombyx           BGIBMGA008188-TA  6e-147  64.77%
Drosophila       fdl-PB            2e-64   31.40%
EBI UniRef50     UniRef50_B1P868   6e-89   38.20%  Beta-N-acetylglucosaminidase n=1 Tax=Spodoptera frugiperda RepID=B1P868_SPOFR
NCBI RefSeq      NP_001165928.1    1e-87   37.77%  fused lobes [Bombyx mori]
NCBI nr blastp   gi|295311568      4e-90   37.98%  hexosaminidase [Ostrinia furnacalis]
NCBI nr blastx   gi|295311568      2e-87   38.36%  hexosaminidase [Ostrinia furnacalis]
Group
Gene Ontology
  GO:0043169  3.6e-54  cation binding
  GO:0005975  3.6e-54  carbohydrate metabolic process
  GO:0003824  3.6e-54  catalytic activity
  GO:0004563  1.6e-36  beta-N-acetylhexosaminidase activity
  GO:0004553  1.1e-35  hydrolase activity, hydrolyzing O-glycosyl compounds
KEGG pathway: tca:656027, 3e-73
  K12373 (HEX) maps to:
    Lysosome
    Glycosaminoglycan degradation
    Amino sugar and nucleotide sugar metabolism
    Glycosphingolipid biosynthesis - globo series
    Other glycan degradation
    Glycosphingolipid biosynthesis - ganglio series
InterPro domain
  [78-427]  IPR013781  3.6e-54  Glycoside hydrolase, subgroup, catalytic core
  [80-426]  IPR017853  1.8e-47  Glycoside hydrolase, superfamily
  [36-56]   IPR001540  1.6e-36  Glycoside hydrolase, family 20
  [80-233]  IPR015883  1.1e-35  Glycoside hydrolase, family 20, catalytic core
  [5-78]    IPR015882  6.3e-11  Acetylhexosaminidase, subunit a/b
Orthology group: MCL25443, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212532-TA
ATGGTAATACTTATAGATGTAGAAAGTGATTCTGACCCACGGCTTCGTATAAATACCGACGAGGGATATATGTTGAAGGTAGAAACTAAAAATAATCAAGTTATTATAAAAGTAAGTGGACTATCATTTTGTGGAGCTCGTCACGGGTTTGAAACCCTAAGTCAGTTGATCTTGTTAGATCAAAGCACGGGTTACCTTATCATGCTATCCAGTGCGATAATAAAGGATGCCCCAACTTACAAATACAGAGGATTGATGGTTGACACAGGAAGAAATTATATACCTGTTGTGGACTTATTAAGAACTGTTGACGCAATGTCTACCTGTAAATTGAACACATTTCACTGGAGAATATCTGACGCTACAAGCTTTCCAATGAGCTTGTCAAAAATACCAGAATTAGAGGAATATGGACCCTATGATAGATCAATGGTGTATACAAAAAAAGATATCAGGATGATTGTGAATAGAGCCGGTATTCGTGGGATAAGAGTTTTAATAGAAATAGCTGCACCAGGTCCAGTTGGCAGACCCTTTTCCTGGTTGTCCTCCACGACTTGTTCCCGAAAAAATAACAGCCTTACTTGCGACAATGATCTTTGTAGGCGTCTGACAATGCACGACTCTACATTTGATGTGCTTCAAAAAATATATTCTGAAATCCTTGAAATGACGAACGTCGATGACGTCTTCCATTTGAGTGATAGCGTTTTCTCGATGACCAATTGTTATTATTTATTCGATGATCGCGAGGGATTTTTAGATAAAGCTCTATTCCGTCTGAAGATGGCTAATAAAGGATTTCTGCCACAACTGCCTATTATTTGGTACACGTCACATTTAATGAAACATTTTGAAGCTAAAACTTGGGAGAGATTAGGAGTGCAAATCGATGAATGGGATGCAAACCCTTATGAGTCATATTTAAATAAATTTAGGGTTATACATTCTACCAAGTGGGATTTGTCTTGCGAAATGAGAAAGCAGAGATGCATAAGATACAGAACCTGGCAACAAATGTATTTATGGAAATCTTGGAGAAATGTTAATGTGTTTACTACCGAAGGAGGAGAATCTATTTTATGGACAGACTTGGTTGATTCAAGTAACCTTGATTACCATCTATGGCCTCGTGCTGCAGTTGTTGCAGAACGTTTGTGGTCAGATGTGGTGGCCAATGGAAGTGCCAACAAATACGTTTACATGAGGCTAGATACCCATAGATGGAGAATGATGCAACGTGGCATCCAGGTGCAACCGATTTGGCCACCTTGGTGCAGTTTCAGCCCTAGCTCATGCCTTGAGAGAGTGCATTAA

Protein sequence:

>DPOGS212532-PA
MVILIDVESDSDPRLRINTDEGYMLKVETKNNQVIIKVSGLSFCGARHGFETLSQLILLDQSTGYLIMLSSAIIKDAPTYKYRGLMVDTGRNYIPVVDLLRTVDAMSTCKLNTFHWRISDATSFPMSLSKIPELEEYGPYDRSMVYTKKDIRMIVNRAGIRGIRVLIEIAAPGPVGRPFSWLSSTTCSRKNNSLTCDNDLCRRLTMHDSTFDVLQKIYSEILEMTNVDDVFHLSDSVFSMTNCYYLFDDREGFLDKALFRLKMANKGFLPQLPIIWYTSHLMKHFEAKTWERLGVQIDEWDANPYESYLNKFRVIHSTKWDLSCEMRKQRCIRYRTWQQMYLWKSWRNVNVFTTEGGESILWTDLVDSSNLDYHLWPRAAVVAERLWSDVVANGSANKYVYMRLDTHRWRMMQRGIQVQPIWPPWCSFSPSSCLERVH-
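
Consistency check (illustrative): the reported lengths agree with each other, since 438 codons plus one stop codon give 3 x 439 = 1317 bp. The minimal Python sketch below shows one way to verify that a coding sequence such as DPOGS212532-TA translates to its listed protein. It assumes the standard genetic code, which this record does not state explicitly, and writes the stop codon as "-" to match the trailing character of the protein sequence above. The names `translate` and `CODON_TABLE` are hypothetical helpers introduced for this example, not part of any MonarchBase tool.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` so the result can be
    compared directly with a protein record that ends in "-".
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT characters
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First six codons of DPOGS212532-TA followed by its TAA stop codon.
print(translate("ATGGTAATACTTATAGATTAA"))  # MVILID-

# Length check: 438 residues plus one stop codon.
print(3 * (438 + 1))  # 1317
```

Passing the full nucleotide sequence to `translate` and comparing the result with the DPOGS212532-PA string would complete the check for the whole record.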