DPGLEAN02177 in OGS1.0

New model in OGS2.0DPOGS201045 
Genomic Positionscaffold1470:+ 17897-28466
See gene structure
CDS Length1800
Paired RNAseq reads  1486
Single RNAseq reads  3450
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012488 (0.0)
Best Drosophila hit  SP1070, isoform C (0.0)
Best Human hitcubilin precursor (3e-27)
Best NR hit (blastp)  PREDICTED: similar to GA21569-PA [Nasonia vitripennis] (0.0)
Best NR hit (blastx)  PREDICTED: similar to GA21569-PA [Nasonia vitripennis] (0.0)
GeneOntology terms




  
GO:0005112 Notch binding
GO:0007155 cell adhesion
GO:0005509 calcium ion binding
GO:0004872 receptor activity
GO:0016324 apical plasma membrane
GO:0035152 regulation of tube architecture, open tracheal system
InterPro families






  
IPR001304 C-type lectin
IPR002172 Low-density lipoprotein (LDL) receptor class A repeat
IPR000859 CUB
IPR000436 Sushi/SCR/CCP
IPR016186 C-type lectin-like
IPR016060 Complement control module
IPR023415 Low-density lipoprotein (LDL) receptor class A, conserved site
IPR016187 C-type lectin fold
Orthology groupMCL14175

Nucleotide sequence:

ATGTTGATAAGGTGTCGTGCGGCGCTGAGTGCCGTTGTGTTGCTCCTAGCTGTGTCTCTT
ACGAAATCCGAGGGAAACCTCTTCACGTGTCCTAATGGCTGGGAACTCAAGGGGCTTAAT
TGTTACAAATTTTTCAATATCAGACATTCTTGGGAAAAAGCGGCTGAATTATGTCGAAGG
TATGGAAGCGAACTAATGGTGATAGATAGTTATACAGAAAATAACATGACTGCAAATATG
GTGCCCCCCAGTCCGAGCAATAGTAACTATTGGCTGGGACTTGCGAGTGTTGACGATTTG
AGAACGAATACTTTAGAGTCCGCAGCTGGAGCCTTAGTCTCTCAGTACGCTGGTTATTGG
GACCTTAAACAGCCTAATCCCAAAAACGGCGAATGCGTCGATGTCCATGTTACAAGCGAA
ACACAATCCTGGGAACTAACAACATGCGAAACACTTCTGCCTTTTATGTGCAAGGCTAAC
GCATGCCCGGCAGGAACCTTCCATTGTTCAAACGGAAGATGTATCAATGCAGCCTTCAAA
TGTGACAAGCAAGATGATTGTGGCGACGGATCGGATGAAATGGATTGTGCTGGCGACTGT
CATTTTTATATGGCCAGCAGTGGTGATGTCGTAGAAAGTCCCAACTATCCTCATAAATAT
CTTCCCTTCAGCGACTGTAAATGGACACTTGAAGGACCTCAAGGACAAAATATAGTTTTG
CAGTTCCAAGATTTTGAAACTGAAAAGTCTTTCGATACTGTTCAAATCTTAGTTGGTGGT
CGAACCGAAGATAAATCTGTGAACCTAGCAACTTTATCAGGAAAACAAGATTTATCTAGC
AAACTTTATGTATCTGCTTCTAATTTTATGATAATTAAGTTTACTTCTGACGGATCCGTA
GAGCGTAAAGGTTTCCGTGCATCATGGAAAACTGAATCTTCAAACTGTGGAGGAATTTTA
AGAGCAACACCTCAAGGCCAAGTACTCAATTCTCCTGGTTATCCAAATAGTTATCCTGGG
GGATTAGAATGCATGTATATTATTGAAGCGCAACCGGGCAGAATTGTTTCCTTAGAGATA
GAAGATTTAGATTTGGAAATGAACAGAGATTATATAGTAATAAAAGATGGAAACACACCA
TCAAGTCCTGTTTTAGCGAGATTAACTGGTGCTGGTGAAGATAACGAAAAGGTGGTCATA
TCCACTACTAATCACTTATACATGTACTTCCGAACAAGTCTAGGTGATTCTAAGAAAGGA
TTCAACATGAGATATTCGCAAGGATGTAGAGCAACCATAATAGCAGCTAACGGAACATTT
AACTCTCCAGCATTTGGTCTAAGCAATTATCCTAATAATCAGGAATGTTTATATCGTCTT
AAAAATCCGAACGGTGGTCCTCTTTCTTTGAAGTTTGATGAATTCAGCATACATCCCTCC
GATATAGTACAAGTATTTGACGGGGCTAGTACAGGTGGACTACGTTTACATTCAGATAAT
GGATTCACAGTAAAGCCAAGAATAACTTTGACTGCCTCAAGCGGTGAAATGTTAATAAGA
TTTATATCCGACGCCGTCCATAATGGTGTTGGTTGGAAAGCAACATTTTCTGCAGACTGC
CCACCTCTTAAACCTGGAATTGGTGCATTGGCATCTAATAGAGATACAGCTTTTGGAACA
ACTATAACATTTTCGTGTCCTATTGGCCAGGAATTTGCAACTGGTAAAGCAAGAATTACA
ACCAAATGTTTAGATGGTGGTAAATGGTCCACAACGTATATTCCCAGTTGTCAAGGTTAG

Protein sequence:

MLIRCRAALSAVVLLLAVSLTKSEGNLFTCPNGWELKGLNCYKFFNIRHSWEKAAELCRR
YGSELMVIDSYTENNMTANMVPPSPSNSNYWLGLASVDDLRTNTLESAAGALVSQYAGYW
DLKQPNPKNGECVDVHVTSETQSWELTTCETLLPFMCKANACPAGTFHCSNGRCINAAFK
CDKQDDCGDGSDEMDCAGDCHFYMASSGDVVESPNYPHKYLPFSDCKWTLEGPQGQNIVL
QFQDFETEKSFDTVQILVGGRTEDKSVNLATLSGKQDLSSKLYVSASNFMIIKFTSDGSV
ERKGFRASWKTESSNCGGILRATPQGQVLNSPGYPNSYPGGLECMYIIEAQPGRIVSLEI
EDLDLEMNRDYIVIKDGNTPSSPVLARLTGAGEDNEKVVISTTNHLYMYFRTSLGDSKKG
FNMRYSQGCRATIIAANGTFNSPAFGLSNYPNNQECLYRLKNPNGGPLSLKFDEFSIHPS
DIVQVFDGASTGGLRLHSDNGFTVKPRITLTASSGEMLIRFISDAVHNGVGWKATFSADC
PPLKPGIGALASNRDTAFGTTITFSCPIGQEFATGKARITTKCLDGGKWSTTYIPSCQG