DPGLEAN08455 in OGS1.0

New model in OGS2.0  DPOGS214168
Genomic Position  scaffold49:+ 105153-108906
CDS Length  1698
Paired RNAseq reads  20974
Single RNAseq reads  72636
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA006214 (0.0)
Best Drosophila hit  vermiform, isoform B (0.0)
Best Human hit  low-density lipoprotein receptor-related protein 6 precursor (6e-06)
Best NR hit (blastp)  AGAP011937-PA [Anopheles gambiae str. PEST] (0.0)
Best NR hit (blastx)  AGAP011937-PA [Anopheles gambiae str. PEST] (0.0)
GeneOntology terms
GO:0005041 low-density lipoprotein receptor activity
GO:0006030 chitin metabolic process
GO:0005576 extracellular region
GO:0008061 chitin binding
GO:0004099 chitin deacetylase activity
GO:0007424 open tracheal system development
GO:0035159 regulation of tube length, open tracheal system
GO:0035001 dorsal trunk growth, open tracheal system
GO:0060439 trachea morphogenesis
GO:0007632 visual behavior
InterPro families
IPR023415 Low-density lipoprotein (LDL) receptor class A, conserved site
IPR002557 Chitin binding domain
IPR002172 Low-density lipoprotein (LDL) receptor class A repeat
IPR002509 Polysaccharide deacetylase
IPR011330 Glycoside hydrolase/deacetylase, beta/alpha-barrel
Orthology group  MCL10363

Nucleotide sequence:

ATGCAACAATTTGTAATGAAGTATAATAAAAAATACATACTTAAGGCACAAGACGATGAC
GGTGATGATGAACAGAATCCTGAACAGCTTTGTGACGGACGTCCTGGAGATGAATACTTT
AGATTGTCTACCGAAGGCGATTGCCGAGAAGTAGTTCGGTGTGACAAAGGAGGTGAAAAT
GGCGTCACAAGGCTCGCTTCAGTGCGCTGCCCTGGTGGACTAGCATTCGATATCGACCGC
CAGACATGTGATTGGAAAACACATGTTAAGAATTGTGATAAACTAGAAAGTCTTAAACAG
ATCACATGCCCGTCTGGACTGTCTTTCGATCTTGATAAACAAACCTGCGACTGGAAGGGT
AAAGTGACCAACTGTGACAAAATTGAGAAACCAAGAAAAATTCTGCCCATTCTGAAGACT
GATGAACCAATTTGCTCCGAAGGCAAGCTTGCTTGCGGAAGTGGTGACTGCATCGAGAAA
GAATTATTCTGTAACGGAAAACCAGACTGCAAAGATGAATCTGATGAAAATGCTTGCACC
GTCGATTTGGACCCTAATAGAGCACCAGACTGCGATACCAGCCAATGCAAACTTCCTGAT
TGCTTCTGCTCAGCTGATGGTACTCGTATCCCCGGAGGCTTGGAGCCTAGTCAAGTCCCT
CAGATGATCACAATCACCTTCAACGGTGCTGTAAACGTTGACAACATTGACTTGTACGAC
CAGATCTTCAATGGAAACCACCAAAATCCTAATGGTTGTCAGATCCGTGGTACATTCTTT
GTCTCCCACAAATATAGTAACTACGCTGCTATTCAGGAATTACACCGCAGGGGACACGAA
ATCGCAGTTTTCTCAATCACACATAAAGATGATCCTAACTATTGGACCAGTGGAAGCTAT
GACGATTGGTTAGCCGAAATGGCTGGAGCGCGTCTTATAATTGAACGTTTTGCGAACATT
AGCGATGCTTCCATTATTGGAGTAAGAGCCCCATACCTGAGAGTTGGAGGAAATAAACAA
TTTGAAATGATGACTGACCAATACTTTGTATATGATGCTTCTATAACCGCACCTCTAGGT
CGTGTCCCTATCTGGCCTTACACATTATTCTTCCGCATGCCACATAAGTGTAATGGAAAC
GCCCATAACTGTCCCTCAAGGAGTCACCCAGTCTGGGAAATGGTTATGAATGAACTTGAC
AGAAGAGATGACCCAACCTTTGATGAATCTCTTCCTGGTTGTCACGTGGTGGACTCTTGT
TCAAACATTCAAACTGGAGAACAATTCGCACGTCTTCTTCGTCACAACTTCAACCGTCAC
TACACGACCAACCGTGCCCCTCTTGGTTTCCATTTCCATGCTTCTTGGCTCAAGTCAAAG
AAAGAATTCAGAGATGAACTTATCAAATTTATCCAAGAAATGAATGAAAAGAACGATGTC
TACTTCACTTCTCTCATTCAGGTGATACAATGGATGCAGAACCCCACAGAACTGTCCCAA
CTCAGAGATTTTGCGGAATGGAAACAAGACAAATGTGACGTAAAAGGTCAACCATTCTGC
TCTCTACCAAATGCGTGTCCCTTAACGACCCGGGAACTGCCAGGCGAGACACTGCGTCTT
TTCACCTGTATGGAATGCCCTAATAACTACCCCTGGATTTTAGATCCCACGGGAGAGGGC
TTCAGCGTTAGGAAGTGA

Protein sequence:

MQQFVMKYNKKYILKAQDDDGDDEQNPEQLCDGRPGDEYFRLSTEGDCREVVRCDKGGEN
GVTRLASVRCPGGLAFDIDRQTCDWKTHVKNCDKLESLKQITCPSGLSFDLDKQTCDWKG
KVTNCDKIEKPRKILPILKTDEPICSEGKLACGSGDCIEKELFCNGKPDCKDESDENACT
VDLDPNRAPDCDTSQCKLPDCFCSADGTRIPGGLEPSQVPQMITITFNGAVNVDNIDLYD
QIFNGNHQNPNGCQIRGTFFVSHKYSNYAAIQELHRRGHEIAVFSITHKDDPNYWTSGSY
DDWLAEMAGARLIIERFANISDASIIGVRAPYLRVGGNKQFEMMTDQYFVYDASITAPLG
RVPIWPYTLFFRMPHKCNGNAHNCPSRSHPVWEMVMNELDRRDDPTFDESLPGCHVVDSC
SNIQTGEQFARLLRHNFNRHYTTNRAPLGFHFHASWLKSKKEFRDELIKFIQEMNEKNDV
YFTSLIQVIQWMQNPTELSQLRDFAEWKQDKCDVKGQPFCSLPNACPLTTRELPGETLRL
FTCMECPNNYPWILDPTGEGFSVRK