DPGLEAN05984 in OGS1.0

Genomic Positionscaffold4275:- 19404-21747
See gene structure
CDS Length1326
Paired RNAseq reads  179
Single RNAseq reads  1036
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA013969 (3e-06)
Best Drosophila hit  ND
Best Human hitA disintegrin and metalloproteinase with thrombospondin motifs 3 preproprotein (1e-07)
Best NR hit (blastp)  A disintegrin and metalloproteinase with thrombospondin motifs 1 [Bombyx mori] (3e-41)
Best NR hit (blastx)  A disintegrin and metalloproteinase with thrombospondin motifs 1 [Bombyx mori] (2e-38)
GeneOntology terms









  
GO:0004222 metalloendopeptidase activity
GO:0008233 peptidase activity
GO:0051605 protein maturation by peptide bond cleavage
GO:0005578 proteinaceous extracellular matrix
GO:0030574 collagen catabolic process
GO:0046872 metal ion binding
GO:0005576 extracellular region
GO:0008270 zinc ion binding
GO:0008201 heparin binding
GO:0030199 collagen fibril organization
GO:0006508 proteolysis
InterPro families  IPR000884 Thrombospondin, type 1 repeat
Orthology groupMCL10002

Nucleotide sequence:

ATGCAGAACGAGTGGTTAGACATAGGAGATTTTTGCATCCCATTAGCACTAAAATGGCGC
ACCTTAATCCATGATTGGTCGCCAGCATTGCTAAAATTCTATCTCAATGCGTTCCAGATG
ACTCTCCCAGATCAGAGTAATTTAGTAAGATGGGGTAAAGGTACCGAAAAGACTTGCTAT
ATCTGTGGGAAGGCAGTTGGAACTGCTAGGCACTTGTTAGTGGGATGTAAGGTACTCCTC
GATAGCGGTCAATACTCGCGTCGTCACGATAGGGTTCTGGAAATCATACGTGAAGCGGTT
AGTCTTTCGGTAGCCAGAGCGCAAAAAGGAATAACCACAAACGAGCGATCAGTAGGTTTT
GTGAGAGAGGGCATTAGGACTATAAAAACAAATGTCAAGCCTTACTCCATCCTTAAAGCG
GCTACGGATTGGACTATAATGATGGATACGTGTGAAAAACAATACAAAATCCCCGAGGAT
ATTTGTGCGTCGGCCTCCAGACCAGACATATTTATGTATTCGCGAATCTTAAAGCGCGTT
GTGATGATAGAGCTTACGGTTCCTTGGGAAACCAACATCCCCAAAGACCATACCATCAAG
GTCAACAAATATTACGAGCTCACAAACGAACTCACTCGAAATAGGTTCGTCGTGGATTTA
TATGTGGTAGAAGTGGGAGCGAGAGGTATAACGGCTAAATCCCTCTACAACCTACTAAAA
GACTTGGGCCTGTCCAGAACTAACATCAATTCATTCTTGGAAAGTACTTCGAAGGCAGCT
CTAGTAGATTCAATTCAAATTTGGTTAGGTAGAGAGAGGAGCTTAGAGGGTGGAGGTCGT
AATTCCCAGGATCTGTTAAAGCTCCTCTCCCTGCAACGCGATCATCCCGGCACACGCACG
ATGAATCCACGGAATGCTGAAAATGTAACAACAAAACATGAACTCATCAAACACAGTGTA
TCTCCCGCTCTCTGCCCAGCGTCAAAACCAGCCAAGACACAACCCTGCAATAGGATACCC
TGTCCTGTTTATTGGCAAGAAATGCCTTGGACACCGTGTTCCACAACTTGTGGCCGCGGA
GTTTCCCATCGGCCTCTTTCCTGCCCCGCTTCAGACCCTGCTCTCTGTGGACCGAAGCCC
CGGGAGCGACGTCGACGCTGTCGTCTCAGAAAGTGCCCCAAACCGCCCGCCCCCTGCCCA
GAAACTGACGCAACCCAATACTGCGAGCTTTTCACCAGCGATCAACTGGAACGAAACTGT
GTAGTACCGCCCTTTAGAAAATACTGTTGCAACGCCTGTCAGTACATCAGGAAAAGGGGA
GAGTAG

Protein sequence:

MQNEWLDIGDFCIPLALKWRTLIHDWSPALLKFYLNAFQMTLPDQSNLVRWGKGTEKTCY
ICGKAVGTARHLLVGCKVLLDSGQYSRRHDRVLEIIREAVSLSVARAQKGITTNERSVGF
VREGIRTIKTNVKPYSILKAATDWTIMMDTCEKQYKIPEDICASASRPDIFMYSRILKRV
VMIELTVPWETNIPKDHTIKVNKYYELTNELTRNRFVVDLYVVEVGARGITAKSLYNLLK
DLGLSRTNINSFLESTSKAALVDSIQIWLGRERSLEGGGRNSQDLLKLLSLQRDHPGTRT
MNPRNAENVTTKHELIKHSVSPALCPASKPAKTQPCNRIPCPVYWQEMPWTPCSTTCGRG
VSHRPLSCPASDPALCGPKPRERRRRCRLRKCPKPPAPCPETDATQYCELFTSDQLERNC
VVPPFRKYCCNACQYIRKRGE