Genomic Position | scaffold4275:- 19404-21747 |
---|---|
See gene structure | |
CDS Length | 1326 |
Paired RNAseq reads   | 179 |
Single RNAseq reads   | 1036 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA013969 (3e-06) |
Best Drosophila hit   | ND |
Best Human hit | A disintegrin and metalloproteinase with thrombospondin motifs 3 preproprotein (1e-07) |
Best NR hit (blastp)   | A disintegrin and metalloproteinase with thrombospondin motifs 1 [Bombyx mori] (3e-41) |
Best NR hit (blastx)   | A disintegrin and metalloproteinase with thrombospondin motifs 1 [Bombyx mori] (2e-38) |
GeneOntology terms    | GO:0004222 metalloendopeptidase activity GO:0008233 peptidase activity GO:0051605 protein maturation by peptide bond cleavage GO:0005578 proteinaceous extracellular matrix GO:0030574 collagen catabolic process GO:0046872 metal ion binding GO:0005576 extracellular region GO:0008270 zinc ion binding GO:0008201 heparin binding GO:0030199 collagen fibril organization GO:0006508 proteolysis |
InterPro families   | IPR000884 Thrombospondin, type 1 repeat |
Orthology group | MCL10002 |
Nucleotide sequence:
ATGCAGAACGAGTGGTTAGACATAGGAGATTTTTGCATCCCATTAGCACTAAAATGGCGC
ACCTTAATCCATGATTGGTCGCCAGCATTGCTAAAATTCTATCTCAATGCGTTCCAGATG
ACTCTCCCAGATCAGAGTAATTTAGTAAGATGGGGTAAAGGTACCGAAAAGACTTGCTAT
ATCTGTGGGAAGGCAGTTGGAACTGCTAGGCACTTGTTAGTGGGATGTAAGGTACTCCTC
GATAGCGGTCAATACTCGCGTCGTCACGATAGGGTTCTGGAAATCATACGTGAAGCGGTT
AGTCTTTCGGTAGCCAGAGCGCAAAAAGGAATAACCACAAACGAGCGATCAGTAGGTTTT
GTGAGAGAGGGCATTAGGACTATAAAAACAAATGTCAAGCCTTACTCCATCCTTAAAGCG
GCTACGGATTGGACTATAATGATGGATACGTGTGAAAAACAATACAAAATCCCCGAGGAT
ATTTGTGCGTCGGCCTCCAGACCAGACATATTTATGTATTCGCGAATCTTAAAGCGCGTT
GTGATGATAGAGCTTACGGTTCCTTGGGAAACCAACATCCCCAAAGACCATACCATCAAG
GTCAACAAATATTACGAGCTCACAAACGAACTCACTCGAAATAGGTTCGTCGTGGATTTA
TATGTGGTAGAAGTGGGAGCGAGAGGTATAACGGCTAAATCCCTCTACAACCTACTAAAA
GACTTGGGCCTGTCCAGAACTAACATCAATTCATTCTTGGAAAGTACTTCGAAGGCAGCT
CTAGTAGATTCAATTCAAATTTGGTTAGGTAGAGAGAGGAGCTTAGAGGGTGGAGGTCGT
AATTCCCAGGATCTGTTAAAGCTCCTCTCCCTGCAACGCGATCATCCCGGCACACGCACG
ATGAATCCACGGAATGCTGAAAATGTAACAACAAAACATGAACTCATCAAACACAGTGTA
TCTCCCGCTCTCTGCCCAGCGTCAAAACCAGCCAAGACACAACCCTGCAATAGGATACCC
TGTCCTGTTTATTGGCAAGAAATGCCTTGGACACCGTGTTCCACAACTTGTGGCCGCGGA
GTTTCCCATCGGCCTCTTTCCTGCCCCGCTTCAGACCCTGCTCTCTGTGGACCGAAGCCC
CGGGAGCGACGTCGACGCTGTCGTCTCAGAAAGTGCCCCAAACCGCCCGCCCCCTGCCCA
GAAACTGACGCAACCCAATACTGCGAGCTTTTCACCAGCGATCAACTGGAACGAAACTGT
GTAGTACCGCCCTTTAGAAAATACTGTTGCAACGCCTGTCAGTACATCAGGAAAAGGGGA
GAGTAG
Protein sequence:
MQNEWLDIGDFCIPLALKWRTLIHDWSPALLKFYLNAFQMTLPDQSNLVRWGKGTEKTCY
ICGKAVGTARHLLVGCKVLLDSGQYSRRHDRVLEIIREAVSLSVARAQKGITTNERSVGF
VREGIRTIKTNVKPYSILKAATDWTIMMDTCEKQYKIPEDICASASRPDIFMYSRILKRV
VMIELTVPWETNIPKDHTIKVNKYYELTNELTRNRFVVDLYVVEVGARGITAKSLYNLLK
DLGLSRTNINSFLESTSKAALVDSIQIWLGRERSLEGGGRNSQDLLKLLSLQRDHPGTRT
MNPRNAENVTTKHELIKHSVSPALCPASKPAKTQPCNRIPCPVYWQEMPWTPCSTTCGRG
VSHRPLSCPASDPALCGPKPRERRRRCRLRKCPKPPAPCPETDATQYCELFTSDQLERNC
VVPPFRKYCCNACQYIRKRGE