DPGLEAN18660 in OGS1.0

New model in OGS2.0DPOGS203993 
Genomic Positionscaffold2:+ 1210525-1230550
See gene structure
CDS Length3360
Paired RNAseq reads  1174
Single RNAseq reads  2905
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA002138 (0.0)
Best Drosophila hit  furrowed, isoform A (0.0)
Best Human hitCUB and sushi domain-containing protein 3 isoform 3 (5e-77)
Best NR hit (blastp)  furrowed, putative [Pediculus humanus corporis] (0.0)
Best NR hit (blastx)  PREDICTED: similar to C-type lectin, selectin-like (AGAP000929-PA) [Tribolium castaneum] (0.0)
GeneOntology terms


  
GO:0007423 sensory organ development
GO:0007476 imaginal disc-derived wing morphogenesis
GO:0007155 cell adhesion
GO:0005488 binding
InterPro families







  
IPR000436 Sushi/SCR/CCP
IPR006585 Fucolectin tachylectin-4 pentraxin-1
IPR001304 C-type lectin
IPR008979 Galactose-binding domain-like
IPR016187 C-type lectin fold
IPR016060 Complement control module
IPR000421 Coagulation factor 5/8 type, C-terminal
IPR016186 C-type lectin-like
IPR018378 C-type lectin, conserved site
Orthology groupMCL10713

Nucleotide sequence:

ATGTATTATTATCTATATCTGTACATTATATGTATTGAGTTTGTTTTGTTATGTTTCATT
GGGTTTTTTGTTTTAGGCGACAGACGGTGTGGCCATCCAGCTGTGCCGCCGAATGCAAAG
GTTTCTTTGGCTTCAGACACTGACATAGTACCGGGAACAGTGGCCACCTATGAATGCGAT
GACGGCTACGAGCTTTTCGGTGCACATCAAAGAGAATGTACATTAAGGGGTGACTGGACT
TCCGAACCGCCATTTTGTGGAACCAACGTTGCCTTCAGAAAACCAGCAAATCAATCAACC
ACTGTACGTGGTGGAAGCGCTAGCAACGGCAATGATGGTGAAAAGACCACCGAACATGAC
GGCAAAAGATGCACAGAAACACAAAGAGAGGCTTCACCCTGGTGGCAGGTCGACCTGCTA
CGTCACTACGCCGTCAAAGTGGTTAGAGTCACTACTAGGGGCTGTTGCGGTCACCAACCG
CTTCAAGATCTGGAGATCAGAGTAGGCAACAGCAGCAGTGATTTACAAAGAAACCCACTA
TGTGCTTGGTTTCCTGGCACCATTGACGAAGGTATAACGAAGACGTTCACTTGCGCCCGC
CCCCTCATAGGTCAGCACGTTTTCCTCCAGCTGGTTGGAGTAGAGAGCTCCCTTTCTTTA
TGTGAAGTAGAAGTATTCACCACAGAAGAGTTCTCAAACGATCGATGCGCCCCGATTGGA
GCCTCAGCAGATATTGAGTTGGCCGCTTTTTCTCGAAACTGTTACGAATTTAATGGAGCT
AAGGGAGCATCTTTTGAGGAAGCGAGAAAACAATGTCAGGAACACGGAGGAGACTTAATA
CATGGGTTTCAGGGCGCTACGTCAAGTTACCTTATTCAGGAGTTAGAAAGGCGCAGGCCA
AATTTAAAGACTCTGTTGGTCTGGATCGGAGCTCAAAAGGAACCTGGTCTTACTTCCAGA
ACTTGGAAATGGGTTGACGGAGAAACTGTTACAAAGCCAACGTGGGGAAAGGACCAGCCG
AATAATTATAACGGCGAACAAAACTGCGTGGTGCTTGATGGTGGGCGCTCCTGGCTGTGG
AACGACGTCGGCTGCAACCTGGACTACCTACACTGGATCTGCCAGTACCTGCCCCCTACT
TGTGGAAGTCCAGATAAGCTGTTGAACACGACTATTGAAGATAATGACTACCATGTAGGG
TCTTCGATAAGGTATAAATGTCCACAAGGTCACATGCTGATTGGAGATAAAACTAGAGAA
TGCAAAAAAGATGGATTCTGGTCTGGAGCAGCACCAAGTTGTAAATACATTAACTGCGGT
GGATTGACTCCAATTCAAGATGGTAGCGTCGATTTAGTGGATGGGCACACCACTTACGGA
GCGAAAGCTATTTATTCGTGTAAAGAGAATTACACGCTAGTGGGTAACGCTGAACGAATG
TGTAAGGATCAAGGAATTTGGGACGGAGAGGCACCCAAGTGTCTGTTTGACTGGTGTCCG
GAGCCACCACCAGTCTCAGGAGCTACGGTCACCACTAGTGGTCACAAAGCAGGATCGTTG
GCCACCTACACTTGCCAGAATGGATTTATACTTTTTGGTTCACCAAGTATCACTTGTAAT
CTCGGTGGAACATGGGGCGGCACCCCACCTTCATGTAAATATGTAGACTGCGGCACTCCA
GCACAAGTCCATAAAGGGTCCTTCAGACTATTGAACGGCACGACTACGTACGGCTCGATA
GCCCAATTCACCTGCGAACCCGATTACTGGTTGGCGGGAGCTGAAGTACTCACATGCTAT
CGTGATGGCAAATGGTCACATGATATACCTTCCTGTGAATTGATAAGTTGTTCGGACCCG
GAGGTGCCGACTGGTGGCTATATGGAAGCATACGATTACAACGTCCACTCAACCATTGAC
TTTCATTGTGAGAAGGGACATAAACTCATTGGTGAACCAAGTCTCACGTGTCAGCCTGAC
GGAGAGTGGTCAGGAGAATCGCCTAAGTGTGAATATGTGGACTGCGGTAAATTGCCACCT
CTGCCTTACGGTTCGGCAGAACTTTTAAATGGTACTACGCATTTAGGAAGTATCATCCAA
TACTCATGTACTACCAACTACAGACTGGTCGGACCAGTAAGAAGGATATGTACTGAAGAT
TTCCAATGGAGTGATTCATCACCAAGATGTGAAGAAATAAGATGTCCAGAGCCAATCGTA
GCGGAAAACAGCATCGTATCCGTAACTGGTAACGATCGCATGCACGGACGCACGCTCATT
CGTACACGATCAAGCACTCAGGGCAATACGTACAGAATCGGTGCCTTGGTAAAATACCGC
TGTGAGCGTGGGTACAAGGTGGTGGGCGAGAGTCTATCAACTTGTGAAGATAATGGACAA
TGGAGTGGTGTCAGACCTAAATGTCAATACGTTGACTGTGGAAATCCGGGTCGCATACAA
AATGGCAAAGTCACATTGGCTACAAACGCGACGTACTATGGGGCAGCAGCCTTGTACGAA
TGCGACGAACATTGGCAACTAGATGGTGTCTCAAGGCGATTGTGTCAAGATAACGAAACC
TGGAGTTCTGAAGCACCTGTATGTAAAGAAATAACCTGTGTGGATCCTTCAATCCAAATA
AAGGGCAGTATTGGTTTATTGGTTGTGACGTCAACTCTCAGCATTGGAGGCGAAGCACAC
TACCGCTGTGAACGGGGATACAGCCTAAAAGGAAATGAAACTAGAACTTGTCTACCGAAA
GGACAGTGGGCCGGAGCACCTCCTGTTTGCATACCGATAGACTGCAAGTCACCCGGCACT
GTAGACAACGGCAGAGTGATTATTTCAAATAGTTCGACAATCTTCGGCAGCTCTATAGAG
TATCATTGCTTACCGCAATATCAGCGAGTTGGACCATTCCTTCGCAAATGTTTAGACGAT
GGCAAATGGTCAGGAGAAGAACCCAAGTGCGAATTGATCACGAACGAAGCTGCTGAAAAT
GGCGCTCTACCACTCAGTGTTGGAGTTGGTTGCGGTATCGTTCTATTTTTGCTTATGTTG
CTCGGAGTCATCTATTTAAGACTACGTAAAGCAACGCCAGTCAAGAACACTGAAAATATA
GAAGGAGCTGAACGGAAAGAAGACCAAAACGCAGCCGTAATGAGCTACGCAACCCTCCAC
GATACTAACGGACGGCATATTTACGACCACGTAACGGACAATCTGTACGATTCACCGTAC
GGCGAGAGTTTGGCCGAGAACTCCGCGTACGGTAGACGCAGTGACACCGAATCCGCATAC
GAACCAGAACCCACCGGCCCCAACGCTGTAGTCACCATCAATGGAGTGGCCGTTCGTTGA

Protein sequence:

MYYYLYLYIICIEFVLLCFIGFFVLGDRRCGHPAVPPNAKVSLASDTDIVPGTVATYECD
DGYELFGAHQRECTLRGDWTSEPPFCGTNVAFRKPANQSTTVRGGSASNGNDGEKTTEHD
GKRCTETQREASPWWQVDLLRHYAVKVVRVTTRGCCGHQPLQDLEIRVGNSSSDLQRNPL
CAWFPGTIDEGITKTFTCARPLIGQHVFLQLVGVESSLSLCEVEVFTTEEFSNDRCAPIG
ASADIELAAFSRNCYEFNGAKGASFEEARKQCQEHGGDLIHGFQGATSSYLIQELERRRP
NLKTLLVWIGAQKEPGLTSRTWKWVDGETVTKPTWGKDQPNNYNGEQNCVVLDGGRSWLW
NDVGCNLDYLHWICQYLPPTCGSPDKLLNTTIEDNDYHVGSSIRYKCPQGHMLIGDKTRE
CKKDGFWSGAAPSCKYINCGGLTPIQDGSVDLVDGHTTYGAKAIYSCKENYTLVGNAERM
CKDQGIWDGEAPKCLFDWCPEPPPVSGATVTTSGHKAGSLATYTCQNGFILFGSPSITCN
LGGTWGGTPPSCKYVDCGTPAQVHKGSFRLLNGTTTYGSIAQFTCEPDYWLAGAEVLTCY
RDGKWSHDIPSCELISCSDPEVPTGGYMEAYDYNVHSTIDFHCEKGHKLIGEPSLTCQPD
GEWSGESPKCEYVDCGKLPPLPYGSAELLNGTTHLGSIIQYSCTTNYRLVGPVRRICTED
FQWSDSSPRCEEIRCPEPIVAENSIVSVTGNDRMHGRTLIRTRSSTQGNTYRIGALVKYR
CERGYKVVGESLSTCEDNGQWSGVRPKCQYVDCGNPGRIQNGKVTLATNATYYGAAALYE
CDEHWQLDGVSRRLCQDNETWSSEAPVCKEITCVDPSIQIKGSIGLLVVTSTLSIGGEAH
YRCERGYSLKGNETRTCLPKGQWAGAPPVCIPIDCKSPGTVDNGRVIISNSSTIFGSSIE
YHCLPQYQRVGPFLRKCLDDGKWSGEEPKCELITNEAAENGALPLSVGVGCGIVLFLLML
LGVIYLRLRKATPVKNTENIEGAERKEDQNAAVMSYATLHDTNGRHIYDHVTDNLYDSPY
GESLAENSAYGRRSDTESAYEPEPTGPNAVVTINGVAVR