New model in OGS2.0 | DPOGS201447  |
---|---|
Genomic Position | scaffold171:+ 3908-14874 |
CDS Length | 5265 |
Paired RNAseq reads   | 3488 |
Single RNAseq reads   | 11086 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA002696 (0.0) |
Best Drosophila hit   | chitin deacetylase-like 5, isoform B (0.0) |
Best Human hit | ND |
Best NR hit (blastp)   | GJ16215 [Drosophila virilis] (0.0) |
Best NR hit (blastx)   | GE16505 [Drosophila yakuba] (0.0) |
GeneOntology terms    | GO:0005576 extracellular region; GO:0016810 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds; GO:0006030 chitin metabolic process; GO:0008061 chitin binding |
InterPro families    | IPR011330 Glycoside hydrolase/deacetylase, beta/alpha-barrel; IPR002557 Chitin binding domain; IPR002509 Polysaccharide deacetylase |
Orthology group | MCL16546 |
Nucleotide sequence:
ATGATTTCGAGTTTACTTGTTTATGCAATAAATTTGTTGAAGCCTTTTAGATTTTTATTG
CCTTCCACTTTTAATATAATTTTCTACTTTACTCATTCTTTGATTATAGCAGAATGTCTT
GGAGCGGAGCGCTCGCGGCAGCGCTCTCCTATCAGGAGCGTTCCCGTGGCGGCCAGCGTG
AAGAGAGAAGTTGACTTCGACTGCCCCGAAGAATTTGGGTATTATCCACATCCCACCGAC
TGCACGTTGTACTATGTTTGCGTTTTCGGCGGTGCTTTACTAGAATCTTGCACTGGTGGC
CTCATGTACAGTCACGAGCTCCAAACATGTGATTGGCCGCGTAACGTAGGCTGCGACGCC
ACCGGTGCCGTCATAGCTGACGATTACGAACGGCTAAACGAAAGACAGCCTCCACCTCCT
ACATCTCGAAGAAATCCACCTCCACCTCCTCCGTCCAGAGCACAGCCCCATCCAGTTATT
ACTTCAAGAGGGCAACCAAAATTCAACCGACAAGAATATGAAAAACAACAACAGTTATAT
GCAGAAGTAGATGACTTACCTCCTGTGGAAGAAATTGAGAATGATAGGCAGCAAAGGGTA
TACAGAGGACAACCATCAACTATTGGACAAGTTCAAAAAGATAGGGACGGTTACCATGGA
TCGCAGGGTGTTAGTGCTGGGAGAACACTTAACTCTAATATTATTCCTTCCTCAATTGCG
CAAAATAGTAAAATTGGATCGTTTTCTTTTGGGACGCAACTAGAAGACAGAAGAACAGCC
ACTGCCACCCAAACTCCTCAGTCTTATAGAGAAGATATATATGACGTTTTGACTGACTCT
TCTGACTTAACTAAAGACTTTACGACTGGTGAAAGGACTATTAAAGATATAAGTGATAAC
ACTTTATCGAGGAAAAGAAGAGATGTTGAATCCCATTTAAATGCATCTTCTGTACCTGAT
AAAATTTCTAGTAGGAACGATAATGACCAAGAAATGGAATATATAGAGTTTGACCCAGAA
TCTGATGAAGATGAATACCAAGATGATGAATTGGATGATAGTGAAAGAGGGAAGAGACAA
ATTAGGTTTTATATAAAAGAGGGACATAAGGTTCCTCTTAAGTTTACCAGTTCTAAACCT
GTGAAATTTGTAAACTTGCGGCATAATAATCAAAATCCCAACATTCATAATCCACGCTAC
CATACAAATCATAATAGTGGATTTAATGATAATACTTATACTTCATTTGTTGTGACTAAT
AATAACAACCATTATCCAGTAAGTGGCCACGTCAATTATCAAACTGTTGATCATTATCAA
AATCAAAGGCCATTTAAAGCAAGTTTACCTGATTTATCAAAACCTAAATATTTACCCAAT
AATGCAGGAACTCAAATTATTACAAAAAGTCCTCCTATTTCTTCTTTAAATAACAATCAA
AATCCATTTGCTTCTTTAGCTGGTGGGTTTTATAACAATGCGTTGAAAAATGATCATAAT
ACAATATCGCAAGGACAAATATCCTCAGTGAAACCAAATTTATCTTCACCACATGATTTA
ACGCATTTCCCCAGTACAATTGTTACTGGAAGACCTTTATCATCTTCTACTCAGAGTTTA
AATTATAATGTAAAAAATGATGAAAATAAAAAACTAAATAATGTTTACAAGAACAATAAT
AACAAATTTACAAAAGACGAAGACTATAATGAAGATGAAGACTATTCTGATGAAGAAGCA
GAATCTTCAGAAGAAGATGAGGAAGATCACAAACCAAATTTCTCACCACCTATTACTGTC
CCTTATAGTTTTAATCATCCCAGAAATAAATATGCTAACATTGATAATCCATTTGCCCGA
CCGAATTTTAATTTTGATGAATTTTTGGCTAAATTAAGAGATGATCAATATTCGGTTATC
GGACTATCAACTCAAAAACCAAAAGCTTTACAAAATAATGATGTACAAACAGATTCACCT
ATAAATACAATTCCATCTATTAATAGTCACAAAATATCATCTTTTAAAGGTATAAGTACT
CCAAAGCCATTTACAATGTCTGACGTACCGCAAAATTCAACATATGCTATAAATGAAAAC
ATAAAAAATTTCGCTCCCCAACATGGCTCTGATTACGTTTTGAGGTCACAAATAACTAAT
AACCCATACTTTCAACACAATCCAAAAAATGTTAATAGACTTCCACAAAAGGATGCTGGT
ATACCTTTAGAGACTTTAAAACCGAAATTAAAGCTACCTAACTTCCAGGATAATAGACCA
CTTTCTATAAATTACAATTTCAATACTCCAGCAGAAGGTAGTAATCAACCGTCAAATACA
ATAAGGCCAATTGTCACTCCGACATCTTATTATAGCACTCCAAACAACAACAATAAATTA
CCACTACAATCTCATCATATAAATAATGCAAAACCGTTTTTGGTATCAACATCGCCACCT
TTCAATAGATATGTGTTAAGCATTTCGCAGTCTACTTCGAGACCTGCTACATTTGTAAAT
CAGCAAGTTTCTCCTGTACAAAATTACTGGAAAAAACCATCTATTGCTTTCACACCTACA
ACACCATCTTCGATTAGTCCAAATACAGTCACAGAAATTGCAAAATGGACAAAATTGTAT
TCACAAGCAACACAATCATCTACAATAATACCATTGAGTGGTAAAAATATAGTTGCTGAT
GTAAGTACTAAAGCTCCGCCAAAGCGTAAACCTATACCGAAACCTTCGCCGGAAATGAAT
GATTACTATTATGATGACGAAGATGAACAGTATTATTATGAACCAATCGTTAAGCCTAAA
TATATGCCAAGCTCCGAAGTTATGCCTCAAAGACCGCCTATGGCACAAAACTATGAAGAA
TACGACGATTCCAATGAACAACTTGAAATTCATACAGATTCAAAGATTCAAGAACATCAA
AAAATACCTAGTAGTCAAAATAATTTTAAAGTAGAAAGTGCAACAAAAAACCACAATGAT
GTATCCGTTGTCACCAAGTCACCATATAAACAATCAAACAAAATTATTAACGGCAAAATT
CCCGTACCAGTAATGGTTGATTATGACGATTCAACAAATTCCATGTCTCATAATAGTCGC
AATCGAACGTATTACTTAAGGAAGCCAAATAAGCCTGAGAATAATCCAAATACATTAAAA
CCTCCTAAGTATTTGAATCAGACAACCTTGCGGCCTTATACTGTCAGACATAGATTGGCA
ATGCCAACGACTGAAAAAAATCAGGTTAATCAAGATGTAGAAAATAAACAAATGAGAGGA
AGAATACGACATCACAATATAGTTGCGGAAATGAAATTGACTACTCCTCATGACAGCTTT
AAACAAGAGACTCGAATTACTAAGACTGGTCATGACGATAAAACGAACAGCCTGGAACCC
ACAGAGAGCGTTACACCTTCCTCGTATTCTCCAAGTCCGCGACCAAAAATGCTTTATAAC
GGTTCTCAGACTTACAGTCCCGATCAGTATGATCCTTATTACGCTGTATATGATGAAGAC
GGTGAACTGTACAAGGATACAGACTATGTGCAGCAATATAACTCAGCTTCACTCCGACCA
GCAGTTCAGCAAACGTACAGAGGCACTCCGCCCTCACGTCGGCCAGTAGAGACCTATTCA
GCAAGACCTGTCGCAGATGATTACGACGATGCTCTTATTCAAGGACCTATTATAAATCAA
AACCAATACCAGACATCTGTTCGTCAACCGGCAAGGGGTGAAGGTAACGAATTGGGTTAT
GATCCTATACCAAGCAGCGTGAGGACGACTATTTATGAAGCCACTTTCCCAAGCACGAAT
CCAACAACAACTAGCACCAGCACTACTACCACTACCACTACCACTACCACAACCACAAGA
CGACCAACAACCGCACCTTACACGGAAGCAATGACCCCGTCTCGTTATTCCCCAAGGCCC
ACAAGCACAAGAGGTCGAGGTTCGGCACATTTTTCAACTTCTGGTGGATCTGAGGCTCCA
CAGCAGACTCCTAACAGAGGAACACCTCCAACTCGCAGTCGTCCTACGTTAAAACCCTCA
ACAGCCATAGTTACAAAGACTGTGGATATCAATATTTACGCTCATCCACCATCGCGCCCC
GCTCCTGTTTACCCACAACCGACACCTGACAAGACAGCTGCCAAATGTAGAAAAGATGTA
TGTCTTCTACCAGATTGTTTCTGCGGCGGAAAAGACATTCCTGGCGAATTGCCGGTGGAT
AAGGTGCCTCAAATTGTTTTGCTGACTTTCGATGATTCCGTAAATGATTTGAACAAGGGC
TTGTACACGGATCTATTTGAAAAAGGACGGGTTAACCCAAATGGTTGCCCTATAACAGCT
ACCTTTTATGTATCTCACGAATGGACGGATTACAGTCAAGTTCAAAACTTATACTCGGCT
GGACATGAAATGGCATCTCACACAGTATCTCATAGTTTTGGAGAGCAATTCTCTCAGAAA
AAATGGAACAGAGAAGTCGGAGGTCAAAGAGAGATTTTGGCAGCGTACGGTGGTGTTAAA
CTCGATGATGTTAGAGGAATGCGTGCACCTTTCTTATCTGTAGGAGGAAATAAAATGTTC
AAAATGTTGTACGACTCCAACTTTACATACGATTCATCATTGCCAGTATATGAAAACAGA
CCACCGAGTTGGCCTTATACTTTGGACTATAAACTTTTCCACGATTGCATGATACCACCT
TGTCCCACCAAATCTTATCCAGGAGTTTGGGAAGTTCCTATGGTCATGTGGCAAGATTTG
AATGGTGGCCGTTGTTCTATGGGCGATGCTTGTGCCAATCCGCCGGATGCAGAAGGTGTT
TACAAAATGATTTTGAAAAATTTCGACAGACATTATACCAGTAACAGGGCTCCTTTTGGT
CTCTTCTATCATGCAGCTTGGTTCACTCAACCTCACCACAAAGAAGGTTTCATCATGTTC
CTAGACTTCATTAATAAAATGAATGATGTTTGGATTATCACAAACTGGCAAGCCTTGCAG
TGGGTGCGAGACCCCACCCCAATATCCAGATTAAACAATTTCCAACCGTTCCAGTGCAAT
TATGCGGATCGGCCGAAAAAATGCAACAATCCTAAGGTTTGCAACTTGTGGCATAAATCC
GGAGTAAGGTATATGAGGACATGTCAACCCTGTCCTCCAATTTATCCTTGGACTGGAAAA
ACTGGCATCTCATCATCGCGCATTGACAACGAAATTGAAGAATAG
Protein sequence:
MISSLLVYAINLLKPFRFLLPSTFNIIFYFTHSLIIAECLGAERSRQRSPIRSVPVAASV
KREVDFDCPEEFGYYPHPTDCTLYYVCVFGGALLESCTGGLMYSHELQTCDWPRNVGCDA
TGAVIADDYERLNERQPPPPTSRRNPPPPPPSRAQPHPVITSRGQPKFNRQEYEKQQQLY
AEVDDLPPVEEIENDRQQRVYRGQPSTIGQVQKDRDGYHGSQGVSAGRTLNSNIIPSSIA
QNSKIGSFSFGTQLEDRRTATATQTPQSYREDIYDVLTDSSDLTKDFTTGERTIKDISDN
TLSRKRRDVESHLNASSVPDKISSRNDNDQEMEYIEFDPESDEDEYQDDELDDSERGKRQ
IRFYIKEGHKVPLKFTSSKPVKFVNLRHNNQNPNIHNPRYHTNHNSGFNDNTYTSFVVTN
NNNHYPVSGHVNYQTVDHYQNQRPFKASLPDLSKPKYLPNNAGTQIITKSPPISSLNNNQ
NPFASLAGGFYNNALKNDHNTISQGQISSVKPNLSSPHDLTHFPSTIVTGRPLSSSTQSL
NYNVKNDENKKLNNVYKNNNNKFTKDEDYNEDEDYSDEEAESSEEDEEDHKPNFSPPITV
PYSFNHPRNKYANIDNPFARPNFNFDEFLAKLRDDQYSVIGLSTQKPKALQNNDVQTDSP
INTIPSINSHKISSFKGISTPKPFTMSDVPQNSTYAINENIKNFAPQHGSDYVLRSQITN
NPYFQHNPKNVNRLPQKDAGIPLETLKPKLKLPNFQDNRPLSINYNFNTPAEGSNQPSNT
IRPIVTPTSYYSTPNNNNKLPLQSHHINNAKPFLVSTSPPFNRYVLSISQSTSRPATFVN
QQVSPVQNYWKKPSIAFTPTTPSSISPNTVTEIAKWTKLYSQATQSSTIIPLSGKNIVAD
VSTKAPPKRKPIPKPSPEMNDYYYDDEDEQYYYEPIVKPKYMPSSEVMPQRPPMAQNYEE
YDDSNEQLEIHTDSKIQEHQKIPSSQNNFKVESATKNHNDVSVVTKSPYKQSNKIINGKI
PVPVMVDYDDSTNSMSHNSRNRTYYLRKPNKPENNPNTLKPPKYLNQTTLRPYTVRHRLA
MPTTEKNQVNQDVENKQMRGRIRHHNIVAEMKLTTPHDSFKQETRITKTGHDDKTNSLEP
TESVTPSSYSPSPRPKMLYNGSQTYSPDQYDPYYAVYDEDGELYKDTDYVQQYNSASLRP
AVQQTYRGTPPSRRPVETYSARPVADDYDDALIQGPIINQNQYQTSVRQPARGEGNELGY
DPIPSSVRTTIYEATFPSTNPTTTSTSTTTTTTTTTTTTRRPTTAPYTEAMTPSRYSPRP
TSTRGRGSAHFSTSGGSEAPQQTPNRGTPPTRSRPTLKPSTAIVTKTVDINIYAHPPSRP
APVYPQPTPDKTAAKCRKDVCLLPDCFCGGKDIPGELPVDKVPQIVLLTFDDSVNDLNKG
LYTDLFEKGRVNPNGCPITATFYVSHEWTDYSQVQNLYSAGHEMASHTVSHSFGEQFSQK
KWNREVGGQREILAAYGGVKLDDVRGMRAPFLSVGGNKMFKMLYDSNFTYDSSLPVYENR
PPSWPYTLDYKLFHDCMIPPCPTKSYPGVWEVPMVMWQDLNGGRCSMGDACANPPDAEGV
YKMILKNFDRHYTSNRAPFGLFYHAAWFTQPHHKEGFIMFLDFINKMNDVWIITNWQALQ
WVRDPTPISRLNNFQPFQCNYADRPKKCNNPKVCNLWHKSGVRYMRTCQPCPPIYPWTGK
TGISSSRIDNEIEE
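
The record can be sanity-checked by confirming that the nucleotide sequence translates to the listed protein and that the CDS length (5265) matches the protein length plus one stop codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The helper names `translate` and `expected_protein_length` are illustrative and not part of any database tooling. Only the first and last lines of the nucleotide sequence are used here to keep the example short.

```python
# Standard genetic code, with codons enumerated in TCAG order
# (assumption: NCBI translation table 1 is appropriate for this gene).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate in frame 0; '*' marks a stop codon, 'X' an unrecognized codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))


def expected_protein_length(cds_length: int) -> int:
    """Residue count implied by a CDS that ends with exactly one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# First and last lines of the nucleotide sequence in this record.
# The last line starts in frame because the preceding 87 lines hold 5220 bases.
first_line = "ATGATTTCGAGTTTACTTGTTTATGCAATAAATTTGTTGAAGCCTTTTAGATTTTTATTG"
last_line = "ACTGGCATCTCATCATCGCGCATTGACAACGAAATTGAAGAATAG"

print(translate(first_line))
print(translate(last_line))
print(expected_protein_length(5265))
```

To check the whole record, the same `translate` call can be applied to the full nucleotide sequence with line breaks removed, and the result (minus the trailing `*`) compared with the protein sequence.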