DPGLEAN18634 in OGS1.0

New model in OGS2.0DPOGS203969 
Genomic Positionscaffold2:+ 531282-546358
See gene structure
CDS Length1650
Paired RNAseq reads  628
Single RNAseq reads  1648
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000730 (6e-13)
Best Drosophila hit  CG2264, isoform F (3e-71)
Best Human hitSPARC-related modular calcium-binding protein 1 isoform 1 (6e-23)
Best NR hit (blastp)  AGAP007489-PB [Anopheles gambiae str. PEST] (9e-123)
Best NR hit (blastx)  AGAP007489-PB [Anopheles gambiae str. PEST] (1e-108)
GeneOntology terms



  
GO:0005509 calcium ion binding
GO:0005604 basement membrane
GO:0010811 positive regulation of cell-substrate adhesion
GO:0030198 extracellular matrix organization
GO:0050840 extracellular matrix binding
InterPro families





  
IPR002350 Proteinase inhibitor I1, Kazal
IPR000716 Thyroglobulin type-1
IPR018249 EF-HAND 2
IPR019577 SPARC/Testican, calcium-binding domain
IPR011497 Protease inhibitor, Kazal-type
IPR011992 EF-hand-like domain
IPR018247 EF-Hand 1, calcium-binding site
Orthology groupMCL11341

Nucleotide sequence:

ATGAATGACTTAGTGTTCCTCATTTTTTGTTTAAATTATATTTGTTACGTTAGTGGTGCT
GATTCTGGCGAGAAGCCCAATGCGCAAAGCGAGACCTGTTACCATCGCGTGGCGGCGTGT
GAAGCAAACACGGGTGCCGTGAATCGTCCAGTCTGCGGTTCCGACGGACATAACTACCCT
TCAAAATGTCACTTAATGAAGGCACAGTGCTCAGGAGAACCTATTGTAATGGCCCACAGA
GGGCCCTGTACAGACAGTCAAACTTCATGTATGGCGGTGCTGCGTTATGCATTGAAGCAA
GGCGGTCGTCGTGCCACATTTGTGCCAAGGTGCCGCGCGGACGGCACTTATGCCGCCGTG
CAATGTGCCGCTGCAGGTGCCGCAGCTGGCTGTTGGTGTGTCACCGCCGACGGGAAACCT
CTGCCCGATACAGCTGTGAGGAATGGAAGGCCAGATTGTACGAGAACTGGTAAATCTCAA
ACAAAGCGGCGCTCTTCCGTTCGAGGTCAACGTAATAAGAAAAGTTGTACCAGAGTAGAC
AGAGCACAGTTCAATGGAAATCTTATCAAAATATTCAGTGGAGAATACGACCGAGCCCGA
GCTGATGATGGAGGGGCCTCGGATCCTCGAGGAGTCGCTGATTGGAAATTCAGGGAACTG
GATCGTGATAGAAGTGGGACGCTGCAGAAGTCTGAGTATCGCGGCTTGCGGCGGCTCATC
AAAAAGGTGGTGAAACCAAAACGATGCGCTCGCGCATGGGCCCGCGGTTGTGACGGCGAC
GGGGACGGGGAGATCGCGCGCTCGGAGTGGGCCGCATGTCTCTTGGCCAGCCCGGACCCA
CCCGCTCCGGACTACGACGATAGTGTTCCAGAGCCCGAACCGGACTACGAAGAGGAACCA
CCTCCAGACCCCAGTTCAGTATTGCCTGGCATAATGCGGAATTCCTTCGCTCCAGACGGT
TCTGTCGTTAGAGAAGATGAAACAAACGACTGTCTCACAGACCGACAGGCCGTGCTAGAT
GAACAGAAAGCTGGCAGTGCTGTTTTATACGTGCCAGAGTGTACTGGTGACGGTCGGTAT
GCGCGCGCGCAGTGTTACCGCTCCACCGGCTACTGCTGGTGCGTCCATCAAGACACTGGC
AAACCGATACCGGGATCGTCGGTCAAAGACGCTAAGCCGGACTGCGACGCCGCTCCACAA
CACGCCAGCCCAATGAGAGGTTGCCCAGAACCAATGAAGAGTCATTTTCTCCATGACCTG
ATAAGTTTCTTCATATCAAAGATGACTACTTCTATCAACGGCACGGGTCCAGGAGATGTG
GTGAAATGGGGGGCGTCGAAGGAGGAGCAGGCAGCTACTTGGACCTATGTTATGTTAGAT
AAAGACAAAAACAAAGCCTTGGAAAGACGGGAGTGGAAAGCTTTCCACCAGCTGATATCA
AACATGGAGCCATTGAGAAGATGTGGAAGAAAACTCCCTCGTTACTGTGACGTAAACCAT
GATTCCAAGATTAGTATTACAGAATGGATGGCCTGCTTGGAGGTCACACAGGCAGCGCAC
GGGCATACCACTGAAACAACAAAAGTTCCATCTAATCCAAGAAGAAAAGGACCCAATCCT
CTCGAATCGATTCTAAAGGCCGACGACTAG

Protein sequence:

MNDLVFLIFCLNYICYVSGADSGEKPNAQSETCYHRVAACEANTGAVNRPVCGSDGHNYP
SKCHLMKAQCSGEPIVMAHRGPCTDSQTSCMAVLRYALKQGGRRATFVPRCRADGTYAAV
QCAAAGAAAGCWCVTADGKPLPDTAVRNGRPDCTRTGKSQTKRRSSVRGQRNKKSCTRVD
RAQFNGNLIKIFSGEYDRARADDGGASDPRGVADWKFRELDRDRSGTLQKSEYRGLRRLI
KKVVKPKRCARAWARGCDGDGDGEIARSEWAACLLASPDPPAPDYDDSVPEPEPDYEEEP
PPDPSSVLPGIMRNSFAPDGSVVREDETNDCLTDRQAVLDEQKAGSAVLYVPECTGDGRY
ARAQCYRSTGYCWCVHQDTGKPIPGSSVKDAKPDCDAAPQHASPMRGCPEPMKSHFLHDL
ISFFISKMTTSINGTGPGDVVKWGASKEEQAATWTYVMLDKDKNKALERREWKAFHQLIS
NMEPLRRCGRKLPRYCDVNHDSKISITEWMACLEVTQAAHGHTTETTKVPSNPRRKGPNP
LESILKADD