DPGLEAN08277 in OGS1.0

New model in OGS2.0DPOGS200411 
Genomic Positionscaffold451:+ 22821-26438
See gene structure
CDS Length1422
Paired RNAseq reads  47
Single RNAseq reads  123
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008988 (0.0)
Best Drosophila hit  ChLD3 (3e-178)
Best Human hitND
Best NR hit (blastp)  GA14716 [Drosophila pseudoobscura pseudoobscura] (0.0)
Best NR hit (blastx)  hypothetical protein AaeL_AAEL005685 [Aedes aegypti] (0.0)
GeneOntology terms


  
GO:0005576 extracellular region
GO:0016810 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds
GO:0008061 chitin binding
GO:0006030 chitin metabolic process
InterPro families



  
IPR002557 Chitin binding domain
IPR002172 Low-density lipoprotein (LDL) receptor class A repeat
IPR002509 Polysaccharide deacetylase
IPR011330 Glycoside hydrolase/deacetylase, beta/alpha-barrel
IPR023415 Low-density lipoprotein (LDL) receptor class A, conserved site
Orthology groupMCL15872

Nucleotide sequence:

ATGTGGACTAGCAACGAATGTTCTAGATACTTACTCTGCTTGGAGGGGGAGGTATTTGAA
TTCAAGTGCTCTAAAGGTCTATTGTTTGATGTCAATCGGCAGTTATGTGATATGCCGCAA
AATGTTCACAACTGTGACGTAACAACAGAGACGCTTATACCAAAACCGCAGTTAGAAAAT
GCGAAATGCGCGAACGAAACCCATCTGGGATGCGCTAACGACATGTGCATGCCCGCAGAA
TATTTCTGCGACGGTGCCTTCGACTGCGAAGATAATTCTGATGAGGGTTGGTGTGACGTA
ACCTACGACCCCAACGCTGCTCTCCCGTGCGATCCCGGATTATGTCTTTTACCGGAATGT
TTTTGCACAAAACACGGCAACGAAACGCCGAACCACATAGTTCCGAGTCAGACTCCCCAA
ATGATAACATTGACTTTCAACGGTGCGGTAAACCATGAAAACTGGGATATATACACTAGA
CAGCTGTTCACTTTGGATAGAACTAATCCCAACGGATGTCCTATAAAGGCAACGTTCTTC
GTATCACATCCGTACACCAATTATAGGCACGTGCAGAAACTGTGGAACGACGGTCACGAA
ATCGCTGTTCATTCAATCACCCATCGTGGCCCAGAGGAGTGGTGGTCCAAAAACGCTACA
GTCGAAGAATGGTTTGATGAAATGGTTGGACAAGCAAATATTATAAACAGATTTAGCAAA
GTTTGGATGGAAGACTTCAGGGGTCTAAGGGTTCCGTATCTGTCTGTGGGTTGGAATAGG
CAGTTTCTAATGATGCAAGAATTCGGGTTTGTTTACGACGCTACAGTTGTAGCACCAGCG
GTAGACCCACCTTACTGGCCGTATACTCTGGACTACAAAATGCCTCACTCTTGTACTGGA
AATAATCAGTACTGTCCAACAAGAAGCTATGCAGGCCTTTGGGAGATGGTCATTAACCCG
CTAATTTACGGAAAACATGTTTGTGCCACATTAGAATACTGTCCAACCAACCTCAACGGG
GACGACATATATCAGATCCTGATGAATAACTTCAAAAGACATTATTTAAAAAATAGAGCT
CCGTTTGGAATACATCTGAACGCGACTTGGCTTAAAAATAATGAATATCTGGCAGCTTTC
AGGAAATTCACAGATGAGTTGCTAAAACTTAATGACGTTTACTTTGTGACATATCGCGAA
GTCATTGATTGGATAAGGAGACCAACGCCAGTGTTGCAACTAAAGAAATTTCAACCATGG
CAGTGTAATAATAAACAATTTCAGGAATCTGATATTGCTTGCGGCAAACCCAAGACTTGC
AAACTACCCTCGAAAGTTCTAGAACATGATAAATATATGATAACTTGCATGGATTGTCCA
AAGAGTTATCCATGGATAAGAAATGAGTTTGGCTTAGAATAG

Protein sequence:

MWTSNECSRYLLCLEGEVFEFKCSKGLLFDVNRQLCDMPQNVHNCDVTTETLIPKPQLEN
AKCANETHLGCANDMCMPAEYFCDGAFDCEDNSDEGWCDVTYDPNAALPCDPGLCLLPEC
FCTKHGNETPNHIVPSQTPQMITLTFNGAVNHENWDIYTRQLFTLDRTNPNGCPIKATFF
VSHPYTNYRHVQKLWNDGHEIAVHSITHRGPEEWWSKNATVEEWFDEMVGQANIINRFSK
VWMEDFRGLRVPYLSVGWNRQFLMMQEFGFVYDATVVAPAVDPPYWPYTLDYKMPHSCTG
NNQYCPTRSYAGLWEMVINPLIYGKHVCATLEYCPTNLNGDDIYQILMNNFKRHYLKNRA
PFGIHLNATWLKNNEYLAAFRKFTDELLKLNDVYFVTYREVIDWIRRPTPVLQLKKFQPW
QCNNKQFQESDIACGKPKTCKLPSKVLEHDKYMITCMDCPKSYPWIRNEFGLE