Monarch geneset OGS2.0

DPOGS209365
TranscriptDPOGS209365-TA2004 bp
ProteinDPOGS209365-PA667 aa
Genomic positionDPSCF300118 - 242364-246836
RNAseq coverage337x (Rank: top 34%)
Annotation
HeliconiusHMEL0131150.054.71% 
BombyxBGIBMGA005691-TA3e-10043.15% 
DrosophilaCG5613-PA1e-5535.93% 
EBI UniRef50UniRef50_UPI0001791C181e-7342.64%UPI0001791C18 related cluster n=1 Tax=unknown RepID=UPI0001791C18
NCBI RefSeqXP_001949945.12e-7442.64%PREDICTED: similar to endo-beta-N-acetylglucosaminidase [Acyrthosiphon pisum]
NCBI nr blastpgi|1936108294e-7342.64%PREDICTED: cytosolic endo-beta-N-acetylglucosaminidase-like isoform 1 [Acyrthosiphon pisum]
NCBI nr blastxgi|1936108292e-7342.64%PREDICTED: cytosolic endo-beta-N-acetylglucosaminidase-like isoform 1 [Acyrthosiphon pisum]
Group
Gene OntologyGO:00339254.8e-85mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase activity
GO:00057374.8e-85cytoplasm
KEGG pathwayecb:1000572806e-73 
 K01227 (E3.2.1.96)maps-> Other glycan degradation
InterPro domain[1-269] IPR0052014.8e-85Glycoside hydrolase, family 85
[28-135] IPR0178539.5e-06Glycoside hydrolase, superfamily
Orthology groupMCL11461 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209365-TA
ATGGCTAATGGCTACCACGACGACAGTTTCATCGATGGCACCGGTAACTACACCGCCTATACGTTCTACAACTGGGGCGGCATCGACATATTCTGTTATTTCAGCCACCACTTCATCACCATCCCTCCCCTGGGCTGGATCAACGTCGCCCACGCGCACGGGGTTCAAATTATCGGTACCGTTATAACGGAGTGGGCGGACGGGGTGGCGATGTGGGAGAAGCTGCTGTCGTCGGAGTCCGAGTGGCAGGACTTCGCCAGCATGCTGGTCGCCATCGCTAAGACCTTGAAGTTTGACGGATGGCTGCTAAATATAGAGAACAAGGTGTCGGACCCGCGCTCGTTGGTCCAGTTCGTCCATCACCTGCACACCGCCCTCCACCAGGAGTTGCCGCACGCGGTGCTTCTGTGGTACGACAGTGTCACTGTCGACGGTCATCTCTACTGGCAGAACGCTCTCAACGAGAAGAACAGGGCGTTCTTCGATGTGACGGACGGCCTGTTCACGAACTACTCGTGGAGCGCGTCCGACGTGTCGTCCAGCGTGCTGGAGGCCGGCGACCGCCTCACCGACCTCTACATAGGGATCGACGTCTGGGGACGGAACTTCTACGGCGGCGGGCAGTTCAACACGCAGCAGGCGATCCAGGTGGCGTTCCATCAAGGTTGTTCGTTAGCGATATTCGCCCCGGCCTGGACGTACGAGGCCCTCACCAACGACAAGGACAACCTCAACTGGGTGACGGACGGCGAGGAGCTGGACGGCTACGACAGTTTCCTGCTCCGAGACCGCGCGCTCTGGACCAGCTTGTGGCCCTTCCTGAACACGAAGCTGCCGTGCCGGTTGCCATTCCAGACGTCCTGGTGTCGCGGCCAGGGGACGAGGAGAAGAATCTATGGAGAAGTCATATGTCCAGTTCCTTGGTACAACCTGCGACACATGCACTATCAGCCCAACTCGACCCTGGGACCCCACGGGTACTTACTGTCGACACAGGACAACATCAATCGCCTCTCGAACTTGGGGTTGCTGAAGAACAGGGAGGGCATCCTCAGATACAGGAAGTCCTTGGAACAGAGCAAGATGGAGCTGGAGACGAACCCCGGCGACGTCGCCATGAACATAAAGGAAGACAGCTTGACGCATCTGGACTCGGGTTCACACAGGGAGGTGGTGGTCGACAAGGATAGTGACACGAGCACTGTGAGGACTACGGTCAGGACGAAGGTGAAGAACGCACTGAGGAACCTGTTCAAGATCAGAACGAGCAGAGTCGACAAAAACGGAAGCGACCACGTGAGCCAGGCGAGCGACAGGACGGACGAGGCGGCGCGGGAGGCGGAGGAATACCAGTCCTCGGGGAAGTCCATGGTGAGGATGTCGCTGAACCTGAGTCTGGGCCGGACCACCAAGACCAGGTACTGTCTGGGCTACGTGTCCATGGAGCGCGAGTGTTTCGAGACCTACTACGAGGACAGCTTCATAGGCGGCTCGTGTCTGATGGTGCACCCGGCTGACGACGAGTACGAGGCACAGCGCACGTCCCGGCTCTTACACTGCGACTTCCGGTGTGACGACACGCTGGTCGTGTGCGTGGTCACCAAGACCCTGGACGAGCACGACGACCAGTTCCTCAACATCAGACTGAGCGTGTCGGACTGCGGAGGCTGCGAGAAGGTGGTGTTGGTGGGGAGGAGCCTCCCCAGCGGAGGGGAGGAGCCATCCGGGACCGAGCTGGAGCAGGTGTTCCCCGTCAACGACGAAGACGACTTCCCCGAGCTACAGAAGTATCTGGTGCTGAACGAGCCCGGGTTCTACGTACCCGTCGTCAACCCGTACGGTTGGCAGGTCAGATATTATCGCGTCCGTGTCCCGGGATGCCGCGTGCTGGCCGTCAGCTGTCGGACCGGCCTGCCCCTGGGGCCGGTGCTGCTGGGACACCTGGGACTCTGTAGCATCAGAGATACACACGCCGACGATGCACAAACCAACGTCGCCAGCTAG

Protein sequence:

>DPOGS209365-PA
MANGYHDDSFIDGTGNYTAYTFYNWGGIDIFCYFSHHFITIPPLGWINVAHAHGVQIIGTVITEWADGVAMWEKLLSSESEWQDFASMLVAIAKTLKFDGWLLNIENKVSDPRSLVQFVHHLHTALHQELPHAVLLWYDSVTVDGHLYWQNALNEKNRAFFDVTDGLFTNYSWSASDVSSSVLEAGDRLTDLYIGIDVWGRNFYGGGQFNTQQAIQVAFHQGCSLAIFAPAWTYEALTNDKDNLNWVTDGEELDGYDSFLLRDRALWTSLWPFLNTKLPCRLPFQTSWCRGQGTRRRIYGEVICPVPWYNLRHMHYQPNSTLGPHGYLLSTQDNINRLSNLGLLKNREGILRYRKSLEQSKMELETNPGDVAMNIKEDSLTHLDSGSHREVVVDKDSDTSTVRTTVRTKVKNALRNLFKIRTSRVDKNGSDHVSQASDRTDEAAREAEEYQSSGKSMVRMSLNLSLGRTTKTRYCLGYVSMERECFETYYEDSFIGGSCLMVHPADDEYEAQRTSRLLHCDFRCDDTLVVCVVTKTLDEHDDQFLNIRLSVSDCGGCEKVVLVGRSLPSGGEEPSGTELEQVFPVNDEDDFPELQKYLVLNEPGFYVPVVNPYGWQVRYYRVRVPGCRVLAVSCRTGLPLGPVLLGHLGLCSIRDTHADDAQTNVAS-