New model in OGS2.0 | DPOGS203969  |
---|---|
Genomic Position | scaffold2:+ 531282-546358 |
See gene structure | |
CDS Length | 1650 |
Paired RNAseq reads   | 628 |
Single RNAseq reads   | 1648 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA000730 (6e-13) |
Best Drosophila hit   | CG2264, isoform F (3e-71) |
Best Human hit | SPARC-related modular calcium-binding protein 1 isoform 1 (6e-23) |
Best NR hit (blastp)   | AGAP007489-PB [Anopheles gambiae str. PEST] (9e-123) |
Best NR hit (blastx)   | AGAP007489-PB [Anopheles gambiae str. PEST] (1e-108) |
GeneOntology terms    | GO:0005509 calcium ion binding GO:0005604 basement membrane GO:0010811 positive regulation of cell-substrate adhesion GO:0030198 extracellular matrix organization GO:0050840 extracellular matrix binding |
InterPro families    | IPR002350 Proteinase inhibitor I1, Kazal IPR000716 Thyroglobulin type-1 IPR018249 EF-HAND 2 IPR019577 SPARC/Testican, calcium-binding domain IPR011497 Protease inhibitor, Kazal-type IPR011992 EF-hand-like domain IPR018247 EF-Hand 1, calcium-binding site |
Orthology group | MCL11341 |
Nucleotide sequence:
ATGAATGACTTAGTGTTCCTCATTTTTTGTTTAAATTATATTTGTTACGTTAGTGGTGCT
GATTCTGGCGAGAAGCCCAATGCGCAAAGCGAGACCTGTTACCATCGCGTGGCGGCGTGT
GAAGCAAACACGGGTGCCGTGAATCGTCCAGTCTGCGGTTCCGACGGACATAACTACCCT
TCAAAATGTCACTTAATGAAGGCACAGTGCTCAGGAGAACCTATTGTAATGGCCCACAGA
GGGCCCTGTACAGACAGTCAAACTTCATGTATGGCGGTGCTGCGTTATGCATTGAAGCAA
GGCGGTCGTCGTGCCACATTTGTGCCAAGGTGCCGCGCGGACGGCACTTATGCCGCCGTG
CAATGTGCCGCTGCAGGTGCCGCAGCTGGCTGTTGGTGTGTCACCGCCGACGGGAAACCT
CTGCCCGATACAGCTGTGAGGAATGGAAGGCCAGATTGTACGAGAACTGGTAAATCTCAA
ACAAAGCGGCGCTCTTCCGTTCGAGGTCAACGTAATAAGAAAAGTTGTACCAGAGTAGAC
AGAGCACAGTTCAATGGAAATCTTATCAAAATATTCAGTGGAGAATACGACCGAGCCCGA
GCTGATGATGGAGGGGCCTCGGATCCTCGAGGAGTCGCTGATTGGAAATTCAGGGAACTG
GATCGTGATAGAAGTGGGACGCTGCAGAAGTCTGAGTATCGCGGCTTGCGGCGGCTCATC
AAAAAGGTGGTGAAACCAAAACGATGCGCTCGCGCATGGGCCCGCGGTTGTGACGGCGAC
GGGGACGGGGAGATCGCGCGCTCGGAGTGGGCCGCATGTCTCTTGGCCAGCCCGGACCCA
CCCGCTCCGGACTACGACGATAGTGTTCCAGAGCCCGAACCGGACTACGAAGAGGAACCA
CCTCCAGACCCCAGTTCAGTATTGCCTGGCATAATGCGGAATTCCTTCGCTCCAGACGGT
TCTGTCGTTAGAGAAGATGAAACAAACGACTGTCTCACAGACCGACAGGCCGTGCTAGAT
GAACAGAAAGCTGGCAGTGCTGTTTTATACGTGCCAGAGTGTACTGGTGACGGTCGGTAT
GCGCGCGCGCAGTGTTACCGCTCCACCGGCTACTGCTGGTGCGTCCATCAAGACACTGGC
AAACCGATACCGGGATCGTCGGTCAAAGACGCTAAGCCGGACTGCGACGCCGCTCCACAA
CACGCCAGCCCAATGAGAGGTTGCCCAGAACCAATGAAGAGTCATTTTCTCCATGACCTG
ATAAGTTTCTTCATATCAAAGATGACTACTTCTATCAACGGCACGGGTCCAGGAGATGTG
GTGAAATGGGGGGCGTCGAAGGAGGAGCAGGCAGCTACTTGGACCTATGTTATGTTAGAT
AAAGACAAAAACAAAGCCTTGGAAAGACGGGAGTGGAAAGCTTTCCACCAGCTGATATCA
AACATGGAGCCATTGAGAAGATGTGGAAGAAAACTCCCTCGTTACTGTGACGTAAACCAT
GATTCCAAGATTAGTATTACAGAATGGATGGCCTGCTTGGAGGTCACACAGGCAGCGCAC
GGGCATACCACTGAAACAACAAAAGTTCCATCTAATCCAAGAAGAAAAGGACCCAATCCT
CTCGAATCGATTCTAAAGGCCGACGACTAG
Protein sequence:
MNDLVFLIFCLNYICYVSGADSGEKPNAQSETCYHRVAACEANTGAVNRPVCGSDGHNYP
SKCHLMKAQCSGEPIVMAHRGPCTDSQTSCMAVLRYALKQGGRRATFVPRCRADGTYAAV
QCAAAGAAAGCWCVTADGKPLPDTAVRNGRPDCTRTGKSQTKRRSSVRGQRNKKSCTRVD
RAQFNGNLIKIFSGEYDRARADDGGASDPRGVADWKFRELDRDRSGTLQKSEYRGLRRLI
KKVVKPKRCARAWARGCDGDGDGEIARSEWAACLLASPDPPAPDYDDSVPEPEPDYEEEP
PPDPSSVLPGIMRNSFAPDGSVVREDETNDCLTDRQAVLDEQKAGSAVLYVPECTGDGRY
ARAQCYRSTGYCWCVHQDTGKPIPGSSVKDAKPDCDAAPQHASPMRGCPEPMKSHFLHDL
ISFFISKMTTSINGTGPGDVVKWGASKEEQAATWTYVMLDKDKNKALERREWKAFHQLIS
NMEPLRRCGRKLPRYCDVNHDSKISITEWMACLEVTQAAHGHTTETTKVPSNPRRKGPNP
LESILKADD