DPGLEAN06610 in OGS1.0

New model in OGS2.0  DPOGS213232
Genomic Position  scaffold636:+ 50886-63203
CDS Length  2940
Paired RNAseq reads  1146
Single RNAseq reads  2830
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA002235 (0.0)
Best Drosophila hit  CG5871 (0.0)
Best Human hit  bifunctional protein NCOAT isoform a (1e-109)
Best NR hit (blastp)  PREDICTED: similar to CG5871 CG5871-PA [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to CG5871 CG5871-PA [Tribolium castaneum] (0.0)
GeneOntology terms  GO:0004415 hyalurononglucosaminidase activity
InterPro families
IPR017853 Glycoside hydrolase, superfamily
IPR016181 Acyl-CoA N-acyltransferase
IPR011496 Beta-N-acetylglucosaminidase
Orthology group  MCL14222

Nucleotide sequence:

ATGCGCAAGGATTTCATTTGCGGCGTCGTCGAAGGCTTTTACGGCCGGCCTTGGACGACG
GAGCAGAGGAAAGATTTATTTCAAAAATTGAAAAAATGGGGACTAGACATGTACGTTTAC
GCTCCGAAAGATGACTACAAGCACAGAGCCTACTGGAGGGAGCTGTACACAGTGGAAGAA
GCAGAACATCTGACGTCACTAATATCGGAAGCTAAGTCTCACGGTATTACTTTCTGTTAT
GCCCTGTCTCCCGGACTGGACATCACATACAGCAGCCAAAAGGAAATAACAACATTGAAA
CGGAAACTAGAGCAGGTATCTCAATTTGGGTGTACATGTTTTGCTCTGCTGTTTGACGAC
ATCGAGCCGGAAATGAGTGAAGCTGACAAGCAAATATTCCAGAGTTTTGCACATGCTCAG
GTATCTGTTACTAATGAGATCCACCAGCATCTCGGCAGCCCCAAGTTCCTCCTATGCCCG
ACTCAGTACTGCTCCACGAGGGCTGTACCAACTGTGCACACATCGGAGTATCTCAATACA
TTGGGCACTAAACTCTCCCAGGAAATTGACATCATGTGGACGGGACCCAAAGTTATAAGT
AAGACTCTAACCACAGAGTGCATTGGAGAGATAACGCAAGTGCTACGGAGACCTCCTGTT
ATTTGGGACAATTTACACGCCAATGACTACGACCAGAAGAGGATATTCTTAGGTCCATAC
TGCGGCCGCTCCCCCGAGCTGATCCCTCTCCTCCGGGGAGTGTTATCTAACCCCAACTGC
GAGTACAACGCCAACATGATACCGATATGGACCCTCGCACACTGGGCCAGGTGCAGCCTG
GATGCACCAGCACATATGGAAGCTGTGTCGTGGGACATTAAGCTGGAACGTGAGAGCGAG
CAAGGTATATGTGAAGATGAAGTGCCTCTCACTCTCGGTAAACACGTATACCACCACAGA
CAAGCGCTGAGACAAGCCATAAACGAGTGGCTCCCAGAGTTTTCTATACCCAAAACAGCC
CAAGGTCCAGTCATTAAACCTCAACCGCAGGTCGCCGCTCCTCCGGTACCTATCCTGCCG
ATCCTGCCGTCGGTGAACACGTGTATGTCGCTGACGGCCACCACCACGACCAGCTCGCGC
GCCCCGGACCTGCCCATACCCACCGTCACCACCAGTCAGCTGCAGGCGCTGGCTGACCGG
CCAGCACTCGCCACCGCTGTGACGTCAATCGAGCCATTCAATCCTGTCCCCAACCCTGTG
ATGAATTCCTTAGTATCACCAACTAAGGTGATCCTTAACGAGTCGATCCCAAACCCCATC
ATACCCATGGCCAGTTCCATCGCTCTGCCGCCCGAGCTGCCGGTGTCCACGCTGCCGGTA
CCCATAATGGGCATCAAGGCGATCGATGGTGACAAGATCGATTCGGAAATGGATAAAATC
GATATCAACGAATCGAATGATAGTCTACTGACGCAGAGCTTCATAGACGATATGAAGAAG
GACAAAGAGGACGACGACACTATCATAGTTGATGATCTGGAGCAGTCTGAACAGCAGCGG
AACGGTGACATGAGTGTTGGAGATACGCCCCAGACGTTGAGTCCCAGCCGCGTCCCCGAG
GGTGTGGAACCCCTGGACGTGGACCCTCCATCAACCGCAGCTGATTCTGATGTTGTCATG
AACGATCAGCTCAGCGAGAACGGTTCTATGCAAGTTGAGCCTAGCAGCAGTCCGTTGAGC
GGGGACATGATAGTTGAACAAGCCGAGGCCATCGATGATGACTCTCGGCTGTCTCAAGAT
GACCTGCTGTTGCTCTGCGAGCTGTTCTACCTACCGTTCAGCCACGGTGGTAGAGGTCTC
AGACTGTTGCACGACTACCACTGGCTCACCACCCACGCCACCAGCTGCCTAGCAAGAGGG
AATAAACCGGAACCCAGCGAGTGGCGCCGTCGTCTGCGTCGCTTCTCGTGGTGGTCGTGT
CGCGCGCGCCGCTTGTCCCGCCGCCTGTCCTTGTGCGCCAACCGAGAGCTACACGCGGAG
CTGCACCCCTACCTATGGGACCTGTGCGCTGTGCTCGCTCTACTACAGGCCTTCCTGCGG
TGGCTGGGGTTCTCCAAAGGCTGGAGGGAAGCCTTCGAAAGCGGCACCCAAGAGCCGTGG
GTGTTCCGAGGAGGTCTGACCGCGGACCTGCGGAGACTACTGCCCGTGGAGTGCAGCGGG
GACGCTCTCAGACCCCAGTGTATACCCAACAGCCTGCCGCTCACCGTGAGACCCTACACA
CTCGCAGACGAAGACGCGGTGTGTAATTTGTGCCAGAAAACCTGCCGGGACGGTTTGGAC
TGCAGTCACTTGTTCCCTGGGGAATTAATGTCGCTCCCCGTGGACAGACTGATCGCTCCA
TACTTGACGCTGTCCCCCGAGCTGTGTATGGTGATAGAGGATGATGGTGACATCATCAAT
GATGATGATGATGACAAGCCGGGCATCAATAACAACGACGCTAAACCGGAAATAGTCGGT
TACGTGTGCGCGGCCGTCAACAGCGTGGACTTCTATAGGAAACAGGAAATAGCCTGGATA
CCGGAAATGTGTCTCAAATATCCCAAGGAGTTACTCGACAAAGACGACTTGAGTGATGCG
GCCAAGGACTGCATCCGCTACTTCCACTCGTACTCCGCGGAGTCCATCATCACTTCCTCT
TCCGGCGTCTACTCCTCTCACCCGTCCTTGATATCGATGGCTGCTGTGCCACGGTCCGAC
CCGCTGGCAACCTCGCGCCTGCTCACGTGCCTACTGGCTGCTCTCAGGGCTTACGGTGTT
AACGGCGTCCACACCTGTGTGGCCATCAACGACCAGCACCTGCTGCAGTTCTACAGCAAG
TTCGGCTTCACGGAGCACTCGCGTAACGAGGTCCACGTGTTCATGGCTAAACTGTTCTAG

Protein sequence:

MRKDFICGVVEGFYGRPWTTEQRKDLFQKLKKWGLDMYVYAPKDDYKHRAYWRELYTVEE
AEHLTSLISEAKSHGITFCYALSPGLDITYSSQKEITTLKRKLEQVSQFGCTCFALLFDD
IEPEMSEADKQIFQSFAHAQVSVTNEIHQHLGSPKFLLCPTQYCSTRAVPTVHTSEYLNT
LGTKLSQEIDIMWTGPKVISKTLTTECIGEITQVLRRPPVIWDNLHANDYDQKRIFLGPY
CGRSPELIPLLRGVLSNPNCEYNANMIPIWTLAHWARCSLDAPAHMEAVSWDIKLERESE
QGICEDEVPLTLGKHVYHHRQALRQAINEWLPEFSIPKTAQGPVIKPQPQVAAPPVPILP
ILPSVNTCMSLTATTTTSSRAPDLPIPTVTTSQLQALADRPALATAVTSIEPFNPVPNPV
MNSLVSPTKVILNESIPNPIIPMASSIALPPELPVSTLPVPIMGIKAIDGDKIDSEMDKI
DINESNDSLLTQSFIDDMKKDKEDDDTIIVDDLEQSEQQRNGDMSVGDTPQTLSPSRVPE
GVEPLDVDPPSTAADSDVVMNDQLSENGSMQVEPSSSPLSGDMIVEQAEAIDDDSRLSQD
DLLLLCELFYLPFSHGGRGLRLLHDYHWLTTHATSCLARGNKPEPSEWRRRLRRFSWWSC
RARRLSRRLSLCANRELHAELHPYLWDLCAVLALLQAFLRWLGFSKGWREAFESGTQEPW
VFRGGLTADLRRLLPVECSGDALRPQCIPNSLPLTVRPYTLADEDAVCNLCQKTCRDGLD
CSHLFPGELMSLPVDRLIAPYLTLSPELCMVIEDDGDIINDDDDDKPGINNNDAKPEIVG
YVCAAVNSVDFYRKQEIAWIPEMCLKYPKELLDKDDLSDAAKDCIRYFHSYSAESIITSS
SGVYSSHPSLISMAAVPRSDPLATSRLLTCLLAALRAYGVNGVHTCVAINDQHLLQFYSK
FGFTEHSRNEVHVFMAKLF
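
Consistency check (illustrative sketch):

The record lists a CDS length of 2940 nt, a genomic span of 50886-63203, and a protein that ends in ...AKLF. The Python sketch below shows one way these figures could be cross-checked. It assumes the standard genetic code (NCBI translation table 1) and a CDS that ends in a single stop codon; the record itself does not state which table was used. The function names `translate`, `expected_protein_length`, and `span_length` are hypothetical helpers, not part of any database API.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with bases cycling in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; stop codons become '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS that ends in exactly one stop codon."""
    return cds_length // 3 - 1


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive genomic interval."""
    return end - start + 1


# First and last 60-nt lines of the nucleotide sequence in this record.
first_line = "ATGCGCAAGGATTTCATTTGCGGCGTCGTCGAAGGCTTTTACGGCCGGCCTTGGACGACG"
last_line = "TTCGGCTTCACGGAGCACTCGCGTAACGAGGTCCACGTGTTCATGGCTAAACTGTTCTAG"

print(translate(first_line))          # start of the protein
print(translate(last_line))           # end of the protein plus the stop '*'
print(expected_protein_length(2940))  # residues implied by the CDS length
print(span_length(50886, 63203))      # genomic span covered by the model
```

Under these assumptions, the 2940 nt CDS implies 979 residues, which matches the protein listed above (sixteen full 60-residue lines plus a final line of 19). The genomic span of 12318 bp is considerably longer than the CDS itself.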