DPGLEAN19361 in OGS1.0

New model in OGS2.0  DPOGS210955
Genomic Position  scaffold711:- 39123-41392
CDS Length  2022
Paired RNAseq reads  389
Single RNAseq reads  1025
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA006382 (9e-154)
Best Drosophila hit  CG14304, isoform B (6e-63)
Best Human hit  ND
Best NR hit (blastp)  AGAP009479-PA [Anopheles gambiae str. PEST] (1e-69)
Best NR hit (blastx)  hypothetical protein AaeL_AAEL011586 [Aedes aegypti] (2e-76)
GeneOntology terms
GO:0008061 chitin binding
GO:0005576 extracellular region
GO:0006030 chitin metabolic process
InterPro families  IPR002557 Chitin binding domain
Orthology group  MCL21742

Nucleotide sequence:

ATGAATGGAAGAATGGAAACAGCAGCCTCTAATTCTGTTAATACTTTAAATACTGAAATC
GGTATAACGCAAAAACTTGGTACAAATAAGGGTCTTAAATTAAATTCAACTCAGTTGTCC
ATTAATGAAAGGTACAGAAGATTAATACCCTATATGACATTTTACTATGCTAATGATTTA
TTGCCACCCACAACAGAATCTTACACTAATAATGTAGAAGTTGAAAAGGCTGAAATTATT
GAAGCTGGAACTTCGAGGCCTTTAGGGCGAGAACCAAAAATTATTTATTCACAAAGACAA
AAAGATATTCCCAGGTACCAAGGAAACCGGTTAACCCCATTTAATATAGCCTCATCAAGT
CCGAGTAGATTGTTCTATAAAGACGTTTTCCCTGTTGTGTACCAACATATACCAAAAGCA
AATCACAACTACAGTCCTGCCTCGGAGAATAGGGATCATTTATATGACAGCTACTTCCCG
AAACCCATTAAAGCACCGAGTGTTCCATTTACGAAGCCACAAACCAGAAAGCCATACTAT
AACTATAACCATGAAAATATACCCAACATTCAGTATATTCAATCCAATCAAGGAGAATCT
CCAAAATATAAACTTGTGCCTTATGACCAGGCTCCACCAGTAAACGTTGAAAAAAATGGC
AATTACAATAAACAATACAATGTTCCTGTTCTAATACCTGAGGAACCTGTTTATATCAAA
CCCAGACCGCATGTTTATCAGCCCGCGCAACATTTCTATGAAAATAGCTACCAACCAAAG
CCAAGAAAACCTCCAACGACAATTTCAGAAGTCTATTACGAACGAAGGCCATATGAACCA
GTATTATCGGAACCAGTGATAGAAAGTGGTTTTAAGCCTATTATTAAATCTCAAATAACA
TCTACTGAAAATCCTGTGTATACATCGACTGCGCAAGATATACCATATGACGATTACTAT
CAGGAAAAGCAAGAACAAGGCCAGTTGATTCAACCAGCTCAAATAGAGTCTGAAGTAACA
AAATATAGACCTCAATATGTTGTAGAACAGCCGCCTACCCAGAACGAACACTACAGTTCT
TCAAAATCGGTGGCTTTAGCGGATTTACTTAATTCATTACAGATAAATAAATCTATACCG
AAACCAATAACTAGAGAAAACGTCGGAGCTTCAATTAAGACATTATTACAAGTTTTGAAT
GCGTTAAGAGCAATACCTCAGGAGAATGACGTAGAAACATCCGTATTAAGCACACCTAAG
CCGTTTGAAGCGATTGAAACACCTGTCCGATCGACTCCGCATACCGTTGTTGCTACAACC
GCAAGACCACAAAATTCTGATATCCATGAACCTTTGCTTGCTACCATTCACACGCCCTCG
CAACATATTGATGAATATCCAACTGGCGGCAGTAGCTCTCAGCGTTTTCCTCTTCCAGTT
ACATCTGAGGAGGAGGGTGGGACTCCCGGTAAACCAGATGTCGACTATCCAATTTTAACC
GTTATACCTGAAACCAGTTTTAATTGTAAAACGCAACGTTATAAAGGATTTTTTGCTGAT
CCCGAAACAAGATGTCAGGTATGGCATTATTGTGATTTGAATGGTGGTCAAGCGTCATTC
CTGTGTCCTAACGGGACGATATTTTCTCAAGCGGCACTAACGTGTGATTGGTGGTTTAAT
GTACGCTGTTCACAAACCGCTCAACTGTACGTGCTAAATGAAAGTCTATACAAATATATT
TTGCCACATTCACCTAAGTTCCCCGAAGACTACAGCGGACCCTTAGTAGATAAGTACCTG
TCGTTAAAGTTTAAAGAAATGGAAGAACAGTTCAGGAAGAATAAAAATAAAAAAGCCGAA
AAAATGCAAGATGACGATTCAAATGACTCAAAAGAAACTGATGATTCTGTGATTGAAAGT
CGAAGACAAGAAAACAGTCAGAATGACTCTGTAAACCAACCTCACGTAATTGTCGAATCG
CCTGGCAGTAGTGGCAACGTTCAGAGATTACAAGATGAATAA

Protein sequence:

MNGRMETAASNSVNTLNTEIGITQKLGTNKGLKLNSTQLSINERYRRLIPYMTFYYANDL
LPPTTESYTNNVEVEKAEIIEAGTSRPLGREPKIIYSQRQKDIPRYQGNRLTPFNIASSS
PSRLFYKDVFPVVYQHIPKANHNYSPASENRDHLYDSYFPKPIKAPSVPFTKPQTRKPYY
NYNHENIPNIQYIQSNQGESPKYKLVPYDQAPPVNVEKNGNYNKQYNVPVLIPEEPVYIK
PRPHVYQPAQHFYENSYQPKPRKPPTTISEVYYERRPYEPVLSEPVIESGFKPIIKSQIT
STENPVYTSTAQDIPYDDYYQEKQEQGQLIQPAQIESEVTKYRPQYVVEQPPTQNEHYSS
SKSVALADLLNSLQINKSIPKPITRENVGASIKTLLQVLNALRAIPQENDVETSVLSTPK
PFEAIETPVRSTPHTVVATTARPQNSDIHEPLLATIHTPSQHIDEYPTGGSSSQRFPLPV
TSEEEGGTPGKPDVDYPILTVIPETSFNCKTQRYKGFFADPETRCQVWHYCDLNGGQASF
LCPNGTIFSQAALTCDWWFNVRCSQTAQLYVLNESLYKYILPHSPKFPEDYSGPLVDKYL
SLKFKEMEEQFRKNKNKKAEKMQDDDSNDSKETDDSVIESRRQENSQNDSVNQPHVIVES
PGSSGNVQRLQDE
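
The nucleotide and protein sequences above can be cross-checked against each other and against the CDS length of 2022. The following is a minimal sketch in Python, assuming the standard genetic code (NCBI translation table 1). The names `translate`, `first_60_nt`, and `last_42_nt` are illustrative and are not part of the database; the two fragments are copied from the first and last lines of the nucleotide sequence in this record.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_60_nt = "ATGAATGGAAGAATGGAAACAGCAGCCTCTAATTCTGTTAATACTTTAAATACTGAAATC"
last_42_nt = "CCTGGCAGTAGTGGCAACGTTCAGAGATTACAAGATGAATAA"

print(translate(first_60_nt))  # MNGRMETAASNSVNTLNTEI
print(translate(last_42_nt))   # PGSSGNVQRLQDE*

# A 2022 nt CDS holds 674 codons: 673 residues plus one stop codon.
cds_length = 2022
print(cds_length // 3 - 1)     # 673
```

The translated fragments agree with the start and end of the protein sequence listed above, and the residue count of 673 matches the length of that protein sequence.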