New model in OGS2.0 | DPOGS213232  |
---|---|
Genomic Position | scaffold636:+ 50886-63203 |
See gene structure | |
CDS Length | 2940 |
Paired RNAseq reads   | 1146 |
Single RNAseq reads   | 2830 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA002235 (0.0) |
Best Drosophila hit   | CG5871 (0.0) |
Best Human hit | bifunctional protein NCOAT isoform a (1e-109) |
Best NR hit (blastp)   | PREDICTED: similar to CG5871 CG5871-PA [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to CG5871 CG5871-PA [Tribolium castaneum] (0.0) |
GeneOntology terms   | GO:0004415 hyalurononglucosaminidase activity |
InterPro families    | IPR017853 Glycoside hydrolase, superfamily IPR016181 Acyl-CoA N-acyltransferase IPR011496 Beta-N-acetylglucosaminidase |
Orthology group | MCL14222 |
Nucleotide sequence:
ATGCGCAAGGATTTCATTTGCGGCGTCGTCGAAGGCTTTTACGGCCGGCCTTGGACGACG
GAGCAGAGGAAAGATTTATTTCAAAAATTGAAAAAATGGGGACTAGACATGTACGTTTAC
GCTCCGAAAGATGACTACAAGCACAGAGCCTACTGGAGGGAGCTGTACACAGTGGAAGAA
GCAGAACATCTGACGTCACTAATATCGGAAGCTAAGTCTCACGGTATTACTTTCTGTTAT
GCCCTGTCTCCCGGACTGGACATCACATACAGCAGCCAAAAGGAAATAACAACATTGAAA
CGGAAACTAGAGCAGGTATCTCAATTTGGGTGTACATGTTTTGCTCTGCTGTTTGACGAC
ATCGAGCCGGAAATGAGTGAAGCTGACAAGCAAATATTCCAGAGTTTTGCACATGCTCAG
GTATCTGTTACTAATGAGATCCACCAGCATCTCGGCAGCCCCAAGTTCCTCCTATGCCCG
ACTCAGTACTGCTCCACGAGGGCTGTACCAACTGTGCACACATCGGAGTATCTCAATACA
TTGGGCACTAAACTCTCCCAGGAAATTGACATCATGTGGACGGGACCCAAAGTTATAAGT
AAGACTCTAACCACAGAGTGCATTGGAGAGATAACGCAAGTGCTACGGAGACCTCCTGTT
ATTTGGGACAATTTACACGCCAATGACTACGACCAGAAGAGGATATTCTTAGGTCCATAC
TGCGGCCGCTCCCCCGAGCTGATCCCTCTCCTCCGGGGAGTGTTATCTAACCCCAACTGC
GAGTACAACGCCAACATGATACCGATATGGACCCTCGCACACTGGGCCAGGTGCAGCCTG
GATGCACCAGCACATATGGAAGCTGTGTCGTGGGACATTAAGCTGGAACGTGAGAGCGAG
CAAGGTATATGTGAAGATGAAGTGCCTCTCACTCTCGGTAAACACGTATACCACCACAGA
CAAGCGCTGAGACAAGCCATAAACGAGTGGCTCCCAGAGTTTTCTATACCCAAAACAGCC
CAAGGTCCAGTCATTAAACCTCAACCGCAGGTCGCCGCTCCTCCGGTACCTATCCTGCCG
ATCCTGCCGTCGGTGAACACGTGTATGTCGCTGACGGCCACCACCACGACCAGCTCGCGC
GCCCCGGACCTGCCCATACCCACCGTCACCACCAGTCAGCTGCAGGCGCTGGCTGACCGG
CCAGCACTCGCCACCGCTGTGACGTCAATCGAGCCATTCAATCCTGTCCCCAACCCTGTG
ATGAATTCCTTAGTATCACCAACTAAGGTGATCCTTAACGAGTCGATCCCAAACCCCATC
ATACCCATGGCCAGTTCCATCGCTCTGCCGCCCGAGCTGCCGGTGTCCACGCTGCCGGTA
CCCATAATGGGCATCAAGGCGATCGATGGTGACAAGATCGATTCGGAAATGGATAAAATC
GATATCAACGAATCGAATGATAGTCTACTGACGCAGAGCTTCATAGACGATATGAAGAAG
GACAAAGAGGACGACGACACTATCATAGTTGATGATCTGGAGCAGTCTGAACAGCAGCGG
AACGGTGACATGAGTGTTGGAGATACGCCCCAGACGTTGAGTCCCAGCCGCGTCCCCGAG
GGTGTGGAACCCCTGGACGTGGACCCTCCATCAACCGCAGCTGATTCTGATGTTGTCATG
AACGATCAGCTCAGCGAGAACGGTTCTATGCAAGTTGAGCCTAGCAGCAGTCCGTTGAGC
GGGGACATGATAGTTGAACAAGCCGAGGCCATCGATGATGACTCTCGGCTGTCTCAAGAT
GACCTGCTGTTGCTCTGCGAGCTGTTCTACCTACCGTTCAGCCACGGTGGTAGAGGTCTC
AGACTGTTGCACGACTACCACTGGCTCACCACCCACGCCACCAGCTGCCTAGCAAGAGGG
AATAAACCGGAACCCAGCGAGTGGCGCCGTCGTCTGCGTCGCTTCTCGTGGTGGTCGTGT
CGCGCGCGCCGCTTGTCCCGCCGCCTGTCCTTGTGCGCCAACCGAGAGCTACACGCGGAG
CTGCACCCCTACCTATGGGACCTGTGCGCTGTGCTCGCTCTACTACAGGCCTTCCTGCGG
TGGCTGGGGTTCTCCAAAGGCTGGAGGGAAGCCTTCGAAAGCGGCACCCAAGAGCCGTGG
GTGTTCCGAGGAGGTCTGACCGCGGACCTGCGGAGACTACTGCCCGTGGAGTGCAGCGGG
GACGCTCTCAGACCCCAGTGTATACCCAACAGCCTGCCGCTCACCGTGAGACCCTACACA
CTCGCAGACGAAGACGCGGTGTGTAATTTGTGCCAGAAAACCTGCCGGGACGGTTTGGAC
TGCAGTCACTTGTTCCCTGGGGAATTAATGTCGCTCCCCGTGGACAGACTGATCGCTCCA
TACTTGACGCTGTCCCCCGAGCTGTGTATGGTGATAGAGGATGATGGTGACATCATCAAT
GATGATGATGATGACAAGCCGGGCATCAATAACAACGACGCTAAACCGGAAATAGTCGGT
TACGTGTGCGCGGCCGTCAACAGCGTGGACTTCTATAGGAAACAGGAAATAGCCTGGATA
CCGGAAATGTGTCTCAAATATCCCAAGGAGTTACTCGACAAAGACGACTTGAGTGATGCG
GCCAAGGACTGCATCCGCTACTTCCACTCGTACTCCGCGGAGTCCATCATCACTTCCTCT
TCCGGCGTCTACTCCTCTCACCCGTCCTTGATATCGATGGCTGCTGTGCCACGGTCCGAC
CCGCTGGCAACCTCGCGCCTGCTCACGTGCCTACTGGCTGCTCTCAGGGCTTACGGTGTT
AACGGCGTCCACACCTGTGTGGCCATCAACGACCAGCACCTGCTGCAGTTCTACAGCAAG
TTCGGCTTCACGGAGCACTCGCGTAACGAGGTCCACGTGTTCATGGCTAAACTGTTCTAG
Protein sequence:
MRKDFICGVVEGFYGRPWTTEQRKDLFQKLKKWGLDMYVYAPKDDYKHRAYWRELYTVEE
AEHLTSLISEAKSHGITFCYALSPGLDITYSSQKEITTLKRKLEQVSQFGCTCFALLFDD
IEPEMSEADKQIFQSFAHAQVSVTNEIHQHLGSPKFLLCPTQYCSTRAVPTVHTSEYLNT
LGTKLSQEIDIMWTGPKVISKTLTTECIGEITQVLRRPPVIWDNLHANDYDQKRIFLGPY
CGRSPELIPLLRGVLSNPNCEYNANMIPIWTLAHWARCSLDAPAHMEAVSWDIKLERESE
QGICEDEVPLTLGKHVYHHRQALRQAINEWLPEFSIPKTAQGPVIKPQPQVAAPPVPILP
ILPSVNTCMSLTATTTTSSRAPDLPIPTVTTSQLQALADRPALATAVTSIEPFNPVPNPV
MNSLVSPTKVILNESIPNPIIPMASSIALPPELPVSTLPVPIMGIKAIDGDKIDSEMDKI
DINESNDSLLTQSFIDDMKKDKEDDDTIIVDDLEQSEQQRNGDMSVGDTPQTLSPSRVPE
GVEPLDVDPPSTAADSDVVMNDQLSENGSMQVEPSSSPLSGDMIVEQAEAIDDDSRLSQD
DLLLLCELFYLPFSHGGRGLRLLHDYHWLTTHATSCLARGNKPEPSEWRRRLRRFSWWSC
RARRLSRRLSLCANRELHAELHPYLWDLCAVLALLQAFLRWLGFSKGWREAFESGTQEPW
VFRGGLTADLRRLLPVECSGDALRPQCIPNSLPLTVRPYTLADEDAVCNLCQKTCRDGLD
CSHLFPGELMSLPVDRLIAPYLTLSPELCMVIEDDGDIINDDDDDKPGINNNDAKPEIVG
YVCAAVNSVDFYRKQEIAWIPEMCLKYPKELLDKDDLSDAAKDCIRYFHSYSAESIITSS
SGVYSSHPSLISMAAVPRSDPLATSRLLTCLLAALRAYGVNGVHTCVAINDQHLLQFYSK
FGFTEHSRNEVHVFMAKLF