Monarch geneset OGS2.0

DPOGS213790
TranscriptDPOGS213790-TA1869 bp
ProteinDPOGS213790-PA622 aa
Genomic positionDPSCF300212 + 774663-776531
RNAseq coverage228x (Rank: top 44%)
Annotation
HeliconiusHMEL0139110.069.13% 
BombyxBGIBMGA009235-TA0.065.97% 
DrosophilaCG13397-PA2e-13842.54% 
EBI UniRef50UniRef50_E2C9307e-15245.06%Alpha-N-acetylglucosaminidase n=10 Tax=Endopterygota RepID=E2C930_HARSA
NCBI RefSeqXP_001606979.14e-14845.42%PREDICTED: similar to alpha-n-acetylglucosaminidase [Nasonia vitripennis]
NCBI nr blastpgi|3071922542e-15145.06%Alpha-N-acetylglucosaminidase [Harpegnathos saltator]
NCBI nr blastxgi|3071683127e-15047.38%Alpha-N-acetylglucosaminidase [Camponotus floridanus]
Group
KEGG pathwaynvi:1001233511e-147 
 K01205 (NAGLU)maps-> Lysosome
    Glycosaminoglycan degradation
InterPro domain[1-596] IPR0077812.4e-199Alpha-N-acetylglucosaminidase
Orthology groupMCL13146 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213790-TA
ATGGCATTAAATGGCATTAATATGGCATTGGCTCCGGTAGCACAGGAGGCCGCCTGGACGAGGGTTTACAAACAGCTAGGAATGACCGATGACGAGATTAAGGAACACTTCACGGGGCCTGGTTTCCTCGCATGGCTTCGGATGGGAAATGTTCATGGTTGGGGAGGGCCACTTCCACAATCTTGGCATGACAGGCAGAAACAAATCCAAGAAGTTGTCACCGATTTGATGTTCAAGTTAGGTATGATACCGGTATTTCCAGCTTTCAATGGACACGTCCCGAAAGCATTTGAAAAAATATTCCCAAACACAACCTTCCATCCGGTGGAGACGTGGAACAAATTTGACGAAGACTACTGTTGCAATCTATTCGTAGACCCCAGGGAGCCGGATTTTAAGATGATATCTAAAATGTTCATGAGAGAAATAACCGCAGGATTGGGCAGCAGTCACATATACACGGCGGATCCGTTCAACGAGATAAAAATACAACCCTGGTCCACATCATTGGTGGTAGAAACAGCTAAAGCTATATTTTCAAGTATCTCCGAGTATGACAAGGATGCCGTGTGGCTTGTACAGAACTGGATGTTCGTTCACAATCCCTTACTTTGGCCATTGAAGAGAGTTAACAGCTTCTTAACATCTGTACCAAACGGTAGAATGCTTGTATTGGACCTTCAATCGGAACAGTGGCCACAATATGACTTATATCAAATGTATTACGGACAGCCGTTTATTTGGAGTATGCTACATAATTTCGGGGGTACTTTAGGTATGTTCGGTAATACTAAAACCATAAACAAGGACGTGTATGAAGTAAGGAAAAGGGAGAACAGCACTATGGTAGGGATAGGACTGACCCCGGAAGGTATAAACCAGAATTATGTTATCTACGATTTAATGTTAGAATCAGCTTGGCGCAAGGGACCCGTACCGGATCTCGAAGAATGGGTATCAGACTACGCAGAAAGGAGATACGGCTGCAATGCAACTTCCATAGGATGGAAATATCTGCTTAGGAGCGTCTATAATTTTACAGGTCTCAACAGAATTAGAGGTAAATATGTAATGACTAGACGTCCCAGCTTTAACATCAGACCATGGGCGTGGTACAAGGGGCATGATTTATTCGAAGCTTTAAAGAACTTCGTCTATGTACAAAACCCAGCCTGCTCTACATCAGGTTTCTTACACGACTTGGTTGATGTCACCCGTCAAGCGTTGCAATACAAAATTGAACAGATCTATATGAACTTACAAAACGACAGATATTCAAACTACATGGTGTTCAACTACACCATATCTAGCTTCATAGATGCCATGACTGATATGCAAAATATATTAGCAACGAGCAGTGATTTCAAAATTACATCGTGGTTATCCAGCGCAAGGGCAATCTCAAATCTACCTTTGGAATCATCACTGTATGATTTCAACGCGCGCAATCAAATAACCCTATGGGGTCCCAATGGGGAAATCAGTGATTATGCATGTAAACAATGGGCCGAACTTTTTAAGTACTACTACATACCAAGATGGTCGATATTTTTATCCATGGCATTAGATGCCAAGACAAGAAACGAACCTTTTGATGAAAAAGGAGCTCAGAGAGTAGTGAGGTCTTCGGTGGAAGAGAAATTCGCGAGCATCAATATAGACTACATACCGTCTGATAATCCACAACAACTCGCCCTAAATCTGTACCAAAAATGGTTCAGTGTATCAGGACACGCGGATTTACCTATGAGGATAATTAAACAGGATCCAAAGAAAAAGACAACATTGCCTGATACGGACACAGACGGCGAAGATTACAATGAAAATACCCCAACAGTTATTTTCTTGCACTCTACGACACCTAATTAG

Protein sequence:

>DPOGS213790-PA
MALNGINMALAPVAQEAAWTRVYKQLGMTDDEIKEHFTGPGFLAWLRMGNVHGWGGPLPQSWHDRQKQIQEVVTDLMFKLGMIPVFPAFNGHVPKAFEKIFPNTTFHPVETWNKFDEDYCCNLFVDPREPDFKMISKMFMREITAGLGSSHIYTADPFNEIKIQPWSTSLVVETAKAIFSSISEYDKDAVWLVQNWMFVHNPLLWPLKRVNSFLTSVPNGRMLVLDLQSEQWPQYDLYQMYYGQPFIWSMLHNFGGTLGMFGNTKTINKDVYEVRKRENSTMVGIGLTPEGINQNYVIYDLMLESAWRKGPVPDLEEWVSDYAERRYGCNATSIGWKYLLRSVYNFTGLNRIRGKYVMTRRPSFNIRPWAWYKGHDLFEALKNFVYVQNPACSTSGFLHDLVDVTRQALQYKIEQIYMNLQNDRYSNYMVFNYTISSFIDAMTDMQNILATSSDFKITSWLSSARAISNLPLESSLYDFNARNQITLWGPNGEISDYACKQWAELFKYYYIPRWSIFLSMALDAKTRNEPFDEKGAQRVVRSSVEEKFASINIDYIPSDNPQQLALNLYQKWFSVSGHADLPMRIIKQDPKKKTTLPDTDTDGEDYNENTPTVIFLHSTTPN-