DPGLEAN12799 in OGS1.0

New model in OGS2.0  DPOGS209365
Genomic Position  scaffold151:+ 64249-69003
CDS Length  2286
Paired RNAseq reads  1148
Single RNAseq reads  2681
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA005690 (8e-108)
Best Drosophila hit  CG5613, isoform A (8e-48)
Best Human hit  cytosolic endo-beta-N-acetylglucosaminidase (5e-66)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC030739 [Tribolium castaneum] (3e-81)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC030739 [Tribolium castaneum] (7e-77)
GeneOntology terms
GO:0005622 intracellular
GO:0005737 cytoplasm
GO:0033925 mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase activity
InterPro families
IPR017853 Glycoside hydrolase, superfamily
IPR013781 Glycoside hydrolase, subgroup, catalytic core
IPR005201 Glycoside hydrolase, family 85
Orthology group  MCL12225

Nucleotide sequence:

ATGGCGCACAAGAAAATGGGAACCCGAAACGAATTAACCTGTAGACCGCTCGATTCATTG
GAAGAAATTCAGTCATTTTTACAAAATCCACCTGCGTGGAGGACTTTATGCACAGATTTA
GTTCCTCACAGTGAGAGTGTAGTGAGAAACATAAAAATAAATCCGTTTTTCGGTGAAAAT
ATACTCGAAACTCCACAAACGTTCTGTCATTATGATCCGGAAGCGCAGGAAGTCGAGCGC
GAGGAACGTCGACGACTACCAAAGACACTGGTTTGCCATGATATGGCTAATGGCTACCAC
GACGACAGTTTCATCGATGGCACCGGTAACTACACCGCCTATACGTTCTACAACTGGGGC
GGCATCGACATATTCTGTTATTTCAGCCACCACTTCATCACCATCCCTCCCCTGGGCTGG
ATCAACGTCGCCCACGCGCACGGGGTTCAAATTATCGGTACCGTTATAACGGAGTGGGCG
GACGGGGTGGCGATGTGGGAGAAGCTGCTGTCGTCGGAGTCCGAGTGGCAGGACTTCGCC
AGCATGCTGGTCGCCATCGCTAAGACCTTGAAGTTTGACGGATGGCTGCTAAATATAGAG
AACAAGGTGTCGGACCCGCGCTCGTTGGTCCAGTTCGTCCATCACCTGCACACCGCCCTC
CACCAGGAGTTGCCGCACGCGGTGCTTCTGTGGTACGACAGTGTCACTGTCGACGGTCAT
CTCTACTGGCAGAACGCTCTCAACGAGAAGAACAGGGCGTTCTTCGATGTGACGGACGGC
CTGTTCACGAACTACTCGTGGAGCGCGTCCGACGTGTCGTCCAGCGTGCTGGAGGCCGGC
GACCGCCTCACCGACCTCTACATAGGGATCGACGTCTGGGGACGGAACTTCTACGGCGGC
GGGCAGTTCAACACGCAGCAGGCGATCCAGGTGGCGTTCCATCAAGGTTGTTCGTTAGCG
ATATTCGCCCCGGCCTGGACGTACGAGGCCCTCACCAACGACAAGGACAACCTCAACTGG
GTGACGGACGGCGAGGAGCTGGACGGCTACGACAGTTTCCTGCTCCGAGACCGCGCGCTC
TGGACCAGCTTGTGGCCCTTCCTGAACACGAAGCTGCCGTGCCGGTTGCCATTCCAGACG
TCCTGGTGTCGCGGCCAGGGGACGAGGAGAAGAATCTATGGAGAAGTCATATGTCCAGTT
CCTTGGTACAACCTGCGACACATGCACTATCAGCCCAACTCGACCCTGGGACCCCACGGG
TACTTACTGTCGACACAGGACAACATCAATCGCCTCTCGAACTTGGGGTTGCTGAAGAAC
AGGGAGGGCATCCTCAGATACAGGAAGTCCTTGGAACAGAGCAAGATGGAGCTGGAGACG
AACCCCGGCGACGTCGCCATGAACATAAAGGAAGACAGCTTGACGCATCTGGACTCGGGT
TCACACAGGGAGGTGGTGGTCGACAAGGATAGTGACACGAGCACTGTGAGGACTACGGTC
AGGACGAAGGTGAAGAACGCACTGAGGAACCTGTTCAAGATCAGAACGAGCAGAGTCGAC
AAAAACGGAAGCGACCACGTGAGCCAGGCGAGCGACAGGACGGACGAGGCGGCGCGGGAG
GCGGAGGAATACCAGTCCTCGGGGAAGTCCATGGTGAGGATGTCGCTGAACCTGAGTCTG
GGCCGGACCACCAAGACCAGGTACTGTCTGGGCTACGTGTCCATGGAGCGCGAGTGTTTC
GAGACCTACTACGAGGACAGCTTCATAGGCGGCTCGTGTCTGATGGTGCACCCGGCTGAC
GACGAGTACGAGGCACAGCGCACGTCCCGGCTCTTACACTGCGACTTCCGGTGTGACGAC
ACGCTGGTCGTGTGCGTGGTCACCAAGACCCTGGACGAGCACGACGACCAGTTCCTCAAC
ATCAGACTGAGCGTGTCGGACTGCGGAGGCTGCGAGAAGGTGGTGTTGGTGGGGAGGAGC
CTCCCCAGCGGAGGGGAGGAGCCATCCGGGACCGAGCTGGAGCAGGTGTTCCCCGTCAAC
GACGAAGACGACTTCCCCGAGCTACAGAAGTATCTGGTGCTGAACGAGCCCGGGTTCTAC
GTACCCGTCGTCAACCCGTACGGTTGGCAGGTCAGATATTATCGCGTCCGTGTCCCGGGA
TGCCGCGTGCTGGCCGTCAGCTGTCGGACCGGCCTGCCCCTGGGGCCGGTGCTGCTGGGA
CACCTGGGACTCTGTAGCATCAGAGATACACACGCCGACGATGCACAAACCAACGTCGCC
AGCTAG

Protein sequence:

MAHKKMGTRNELTCRPLDSLEEIQSFLQNPPAWRTLCTDLVPHSESVVRNIKINPFFGEN
ILETPQTFCHYDPEAQEVEREERRRLPKTLVCHDMANGYHDDSFIDGTGNYTAYTFYNWG
GIDIFCYFSHHFITIPPLGWINVAHAHGVQIIGTVITEWADGVAMWEKLLSSESEWQDFA
SMLVAIAKTLKFDGWLLNIENKVSDPRSLVQFVHHLHTALHQELPHAVLLWYDSVTVDGH
LYWQNALNEKNRAFFDVTDGLFTNYSWSASDVSSSVLEAGDRLTDLYIGIDVWGRNFYGG
GQFNTQQAIQVAFHQGCSLAIFAPAWTYEALTNDKDNLNWVTDGEELDGYDSFLLRDRAL
WTSLWPFLNTKLPCRLPFQTSWCRGQGTRRRIYGEVICPVPWYNLRHMHYQPNSTLGPHG
YLLSTQDNINRLSNLGLLKNREGILRYRKSLEQSKMELETNPGDVAMNIKEDSLTHLDSG
SHREVVVDKDSDTSTVRTTVRTKVKNALRNLFKIRTSRVDKNGSDHVSQASDRTDEAARE
AEEYQSSGKSMVRMSLNLSLGRTTKTRYCLGYVSMERECFETYYEDSFIGGSCLMVHPAD
DEYEAQRTSRLLHCDFRCDDTLVVCVVTKTLDEHDDQFLNIRLSVSDCGGCEKVVLVGRS
LPSGGEEPSGTELEQVFPVNDEDDFPELQKYLVLNEPGFYVPVVNPYGWQVRYYRVRVPG
CRVLAVSCRTGLPLGPVLLGHLGLCSIRDTHADDAQTNVAS