Monarch geneset OGS2.0

DPOGS214324
TranscriptDPOGS214324-TA2082 bp
ProteinDPOGS214324-PA693 aa
Genomic positionDPSCF300020 - 647769-651943
RNAseq coverage9x (Rank: top 85%)
Annotation
HeliconiusHMEL0045400.055.17% 
BombyxBGIBMGA003996-TA0.059.39% 
Drosophilafdl-PB6e-10835.47% 
EBI UniRef50UniRef50_B1P8680.055.57%Beta-N-acetylglucosaminidase n=1 Tax=Spodoptera frugiperda RepID=B1P868_SPOFR
NCBI RefSeqNP_001165928.12e-17953.37%fused lobes [Bombyx mori]
NCBI nr blastpgi|2953115680.056.52%hexosaminidase [Ostrinia furnacalis]
NCBI nr blastxgi|2953115680.056.17%hexosaminidase [Ostrinia furnacalis]
Group
Gene OntologyGO:00431691.8e-81cation binding
GO:00059751.8e-81carbohydrate metabolic process
GO:00038241.8e-81catalytic activity
GO:00045533.5e-66hydrolase activity, hydrolyzing O-glycosyl compounds
GO:00045631.2e-47beta-N-acetylhexosaminidase activity
KEGG pathwaytca:6560274e-127 
 K12373 (HEX)maps-> Lysosome
    Glycosaminoglycan degradation
    Amino sugar and nucleotide sugar metabolism
    Glycosphingolipid biosynthesis - globo series
    Other glycan degradation
    Glycosphingolipid biosynthesis - ganglio series
InterPro domain[321-690] IPR0137811.8e-81Glycoside hydrolase, subgroup, catalytic core
[323-664] IPR0178537.9e-71Glycoside hydrolase, superfamily
[323-651] IPR0158833.5e-66Glycoside hydrolase, family 20, catalytic core
[279-299] IPR0015401.2e-47Glycoside hydrolase, family 20
[25-143] IPR0130577.5e-14Amino acid transporter, transmembrane
[229-321] IPR0158827.9e-08Acetylhexosaminidase, subunit a/b
Orthology groupMCL14551 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214324-TA
ATGTGCCCGAATACTGCGTTTGAACTTAGAATAAAACAGTCCAACTATGTCCCTAAAGCTTTAGATATAAAAGTCCTGTCACTTGATAAAGCTTTAAAAGACCCGACCGTTTTGACGAAACCTTTCGGGGTCATTAGATTCGAGATGGTCATCGTAACCATAATATTAACGATTTTTGGAGCGTTGGGTTACTGGGCCTTCGGTACTATGGAGGAGAACGTGCTGAGGTCCCTCCCGTTTGACGATGATACAGCTATGGTCGCGATAGGAATATATTTAGTGTCGATAGCATTCGCTTATCCCATACAATGCTATCCAGCGATACAAATCGTAATTGAGATAATAAAGAACAGGGATGTTCCCGACCCTCCATCGAACACGACATTGAAGAAAATTGAGACGAAATTACCATTATGGAGTTGGGACTGTATCGATGATAAATGCGTTCCAAGCAAGGCAACCATGAACGGAAAACTGCAGTCATTAATGACCTGTAACATGCTATGTTCGTCCATGCAGCTCTGGCCGCAGCCAACTGGTGCTGTAAGTTTAGCAACCACAGCTGTACCAGTCAGAGCTGACCTCTTCAAACTTAAAATTATGTCGTTCACATCAAAACCCGTAAGAGATTACTTACGAAAGGCTTTTACACTTTTCCGTAGAGAGTTACGTACGAATGAAAGGAACATTCGCGCATTTGAAGATTGGCGATCAGTTATTGTCCGTATCGTCATTAACGAAAATGGAAGTACTGATCCTCGCATGCTTATCAATACTGACGAGAGCTATCAGTTGAGGCTTTACCCAAAATTAGGTTCAGCTGAGATTTTTCTTGTTGATATATTTTCTCACTCATTTTGCGGAGCTCGACATGCCATGGAAACATTATCTCAGTTAATATGGCTCGATCCCTACGCAGGTTCTTTGTTAATGATTGAAGCAGCAACCGTTGACGATGCTCCACGATTTCGCTATCGAGGTCTTTTATTGGATACTGCCCGTAACTACTTTCCTGTCAATGACATAATTAAGACAATCGATGCTATGGCCGCTTCGAAGCTCAATACATTTCATTGGCATGCAACAGATTCTCAAGCGTTTTCGTTACTATTCGATAGTGTGCCTCAACTGGCTAAGTATGGTGCTTATGGTCACAGTACAATATATTCTTCTGCAGATGTGAGAGCTGTTGTAAACCGCGCAAGATTACGCGGTATTCGTGTGCTTATAGAAGTAGATTTACCTGCGCATGTTGGTAGTGCTTGGGATTGGGGCCAACAAATGGACGTTAAAGAGTTAGCTTATTGCATTACGTCCGAACCCTGGGTCGCCTATTGTCAGGAACCTCCTTGTGGACAGATAAATCCACGCAATGATCATGTATACGATCTCATAGAACGAATTTATACTGAAATTATTAATCTTACAGGTGTTGATGATATGTTTCATATCGGAGGTGATGACATTTCTGAACGATGTTGGCTCGACAATTTTGATGATACGGATCCTGTGGTCTTATGGTCTCATTTCACTCAAAACATATTAAAACGCCTAGAGGCCGTTAATGGACAGTTACCAAATTTAACAATATTGTGGTCGTCACAATTTTCAGAACGTATGAAAACAGATCTAAAATCTTTCGTTCATAAGCTAGGTCTCCAGGTGCGCAGCGTCGCTTGGTCGCCAAGATACGTTTCTGGAATTCGGACTATCGTTTCTCATGAGGATGTGTGGGACTTGAACAATGGTTACGGTACCTGGCATGGAGATACCGAAGGCCCACCCTATAACTCGTGGCAGAGAATATATGAACATCGGCCCTGGGCTCGAAAGCCTATTAGTTGTATGGAAGGAGGTGAAGCAACAGTTTGGTCTTCGACATTAAGCACAGGTTGTTTGGATGCACAAATATGGCCTAGAGCTGCTGCATTAGCAGAAAGATTATGGTCAGACCGCGCTGAAGCTGCCACAAGGTTAGTTCATGCTAGGCTTGATGTTCATCGTTCACGTTTGGTCGAACGAGGTATACGTGCAGCTCCCATGTGGTCTATGTGGTGTACTCATAACACAAATACATGTTGA

Protein sequence:

>DPOGS214324-PA
MCPNTAFELRIKQSNYVPKALDIKVLSLDKALKDPTVLTKPFGVIRFEMVIVTIILTIFGALGYWAFGTMEENVLRSLPFDDDTAMVAIGIYLVSIAFAYPIQCYPAIQIVIEIIKNRDVPDPPSNTTLKKIETKLPLWSWDCIDDKCVPSKATMNGKLQSLMTCNMLCSSMQLWPQPTGAVSLATTAVPVRADLFKLKIMSFTSKPVRDYLRKAFTLFRRELRTNERNIRAFEDWRSVIVRIVINENGSTDPRMLINTDESYQLRLYPKLGSAEIFLVDIFSHSFCGARHAMETLSQLIWLDPYAGSLLMIEAATVDDAPRFRYRGLLLDTARNYFPVNDIIKTIDAMAASKLNTFHWHATDSQAFSLLFDSVPQLAKYGAYGHSTIYSSADVRAVVNRARLRGIRVLIEVDLPAHVGSAWDWGQQMDVKELAYCITSEPWVAYCQEPPCGQINPRNDHVYDLIERIYTEIINLTGVDDMFHIGGDDISERCWLDNFDDTDPVVLWSHFTQNILKRLEAVNGQLPNLTILWSSQFSERMKTDLKSFVHKLGLQVRSVAWSPRYVSGIRTIVSHEDVWDLNNGYGTWHGDTEGPPYNSWQRIYEHRPWARKPISCMEGGEATVWSSTLSTGCLDAQIWPRAAALAERLWSDRAEAATRLVHARLDVHRSRLVERGIRAAPMWSMWCTHNTNTC-