New model in OGS2.0 | DPOGS207163  |
---|---|
Genomic Position | scaffold7:- 993909-995177 |
See gene structure | |
CDS Length | 1269 |
Paired RNAseq reads   | 252 |
Single RNAseq reads   | 667 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA000620 (1e-170) |
Best Drosophila hit   | ND |
Best Human hit | heparanase isoform 1 precursor (6e-56) |
Best NR hit (blastp)   | heparanase-like protein [Bombyx mori] (0.0) |
Best NR hit (blastx)   | heparanase-like protein [Bombyx mori] (4e-180) |
GeneOntology terms    | GO:0005576 extracellular region GO:0016020 membrane GO:0005765 lysosomal membrane GO:0016798 hydrolase activity, acting on glycosyl bonds |
InterPro families    | IPR005199 Glycoside hydrolase, family 79 IPR017853 Glycoside hydrolase, superfamily IPR013781 Glycoside hydrolase, subgroup, catalytic core |
Orthology group | MCL17325 |
Nucleotide sequence:
ATGTCTGATAGATTAATTTTCAGTGCTGAAGACTTACCTTCAGTCTCGTGTGATCACTGT
CTGGCATCAAGTCACAATGAAACAGCCTGCGTTGCTCTTAAAAAATTATGTAAGAATAAG
TTTTTGCCATTCTTTCTAATGACCGGCCGTAAATGGACAGAAATAAATGAATTTTGTCAA
GCAACAAACACAAAGTTATTATTTACATTAAATTTACTGCTTCGCGATAGTCATGAATGG
AATTCTCAAAATGCTGTTGAATTGATAAAATATTCCAAACAGAAGAAGTTTGATATTGAC
TGGCAACTTGGAAATGAGCCTAACTCTTTCAGACATGTATTCAATTTGACTGTTACCCCT
CAAGAATTAGCTCATGACTTCAAAAAGCTTCGGAATCTTCTAAATCATCATGGATATAAA
AAATCATTATTAGTAGGGCCTGACACCACTAGGCCCCAAGAACATCAACCAAACTGTCTG
AAATATATGGTGGAATTCCTAGGCAATGGTTCACATTTTGTAAATGCTAGATCATGGCAT
CAGTACTACCTGAATAGTAGAACTGCTAAGTTACAAGATTTTTGGAATCCTGAAACACTT
GACTTGCTTAAAGAACAAATCGAAACTATGCAAAATCACACCAAGAAATATCACAATATA
CCCATGTGGCTCAGTGAAACTAGTACTTCTTATGGCGGTGGGGCCCCTGGTTTGTCCAAC
ACATATGCTGGTACTCCTCTATGGGTAGATAAGCTGGGCCTGTCTGCTAAATATAACATT
TCCACTGTCATAAGGCAAAGCTTTTATGGAGGAAACTACAGCCTTGTAAATGAAGAACTC
GAACCTCTTCCTGATTGGTGGGTGAGTGTGTTATATAAGAAGTTAGTGGGCAATAAAGTT
CTTCATTTGATGTGTAAGTGTTCTCCACATCAGAGAGTTTATGTTCATTGTGCTAATAAG
AATTATACAAATGATTCAAGTGCAATAACAGTTTACGCCATTAATTTAGAAATGGAAAAA
GTCCAATTTCTTCTCAATGGCACTGCCTTACATGGTGATAATATAATAATTGATGAATTC
ATAATAAGTGCTCCTTCAAATAACAGGCGAACAAAAACCATACTTTTAAATGGCTGGCCA
CTGCATTATGAGTCAGCTAGTCTTGACCTGCAACCCAATCACAAGAAATATAATAACCGT
ATATCTATGCCGCCATATTCCATAGGATTTTGGGTTATTAAAAATACGTCAATTAAAATA
TGTAAATGA
Protein sequence:
MSDRLIFSAEDLPSVSCDHCLASSHNETACVALKKLCKNKFLPFFLMTGRKWTEINEFCQ
ATNTKLLFTLNLLLRDSHEWNSQNAVELIKYSKQKKFDIDWQLGNEPNSFRHVFNLTVTP
QELAHDFKKLRNLLNHHGYKKSLLVGPDTTRPQEHQPNCLKYMVEFLGNGSHFVNARSWH
QYYLNSRTAKLQDFWNPETLDLLKEQIETMQNHTKKYHNIPMWLSETSTSYGGGAPGLSN
TYAGTPLWVDKLGLSAKYNISTVIRQSFYGGNYSLVNEELEPLPDWWVSVLYKKLVGNKV
LHLMCKCSPHQRVYVHCANKNYTNDSSAITVYAINLEMEKVQFLLNGTALHGDNIIIDEF
IISAPSNNRRTKTILLNGWPLHYESASLDLQPNHKKYNNRISMPPYSIGFWVIKNTSIKI
CK