New model in OGS2.0 | DPOGS215393  |
---|---|
Genomic Position | scaffold670:+ 50443-85562 |
See gene structure | |
CDS Length | 3903 |
Paired RNAseq reads   | 7422 |
Single RNAseq reads   | 16364 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA002365 (0.0) |
Best Drosophila hit   | furin 2, isoform I (0.0) |
Best Human hit | proprotein convertase subtilisin/kexin type 5 isoform 1 preproprotein (0.0) |
Best NR hit (blastp)   | Endoprotease FURIN [Spodoptera frugiperda] (0.0) |
Best NR hit (blastx)   | Endoprotease FURIN [Spodoptera frugiperda] (0.0) |
GeneOntology terms    | GO:0004252 serine-type endopeptidase activity GO:0005886 plasma membrane GO:0006508 proteolysis GO:0006468 protein amino acid phosphorylation GO:0004714 transmembrane receptor protein tyrosine kinase activity GO:0007169 transmembrane receptor protein tyrosine kinase signaling pathway GO:0005524 ATP binding |
InterPro families    | IPR015500 Peptidase S8, subtilisin-related IPR000209 Peptidase S8/S53, subtilisin/kexin/sedolisin IPR002884 Proprotein convertase, P IPR006211 Furin-like cysteine-rich domain IPR006212 Furin-like repeat IPR008979 Galactose-binding domain-like IPR009030 Growth factor, receptor IPR009020 Proteinase inhibitor, propeptide IPR022398 Peptidase S8/S53, subtilisin, active site |
Orthology group | MCL10159 |
Nucleotide sequence:
ATGGTGCTGTCATGGCGTGCGCTGGCGCTGCTAGCCGCGCTTCAAATGTGCTCATGCTTA
CCGAAAGCGCTATATCACAATCACTTCGCTATCCATGTACCAGCTGGCGAAAAACACGCG
GATGACATCGCGACAAGACATGGTTTCATCAACCATGGCCAGATTGGAGCCCTCAAGGGA
TACTACTTGCTGTCACACCACGGGGTTCACAAACGGTCTACAGAACCAAGTCACGAGCAT
CATCACAAGTTAAATAATGAACCCGAGGTTAAATGGTTCGAGCAACAACGCGAGAGGCGT
AGAATGAAACGTGATTACAGCCCTTATGAGAGCACATTATGGTCGCAGTTGTCTCGAAGG
CTGCCCTCCCATAGAACCCGTCACCGCGCCATTACACCCTCGCCTTTCTTCTCCGATCCG
TTGTTTAAAGAGCAATGGTATTTGAATGGCGGTGCGAAAGATGGACTTGACATGAATGTA
ATGCCGGCGTGGCAAAGAGGTTACACTGGAAAGGGTGTAGTTGTGTCAATCCTTGATGAC
GGTATACAAACCAACCATCCCGACCTCGCGCAGAACTATGATCCTCTCGCTTCCACTGAC
ATAAATGGGAACGACGATGATCCAATGCCTCAAGACAACGGCGATAACAAACATGGAACA
CGTTGTGCTGGGGAGGTTGCCGCTGTAGCATATAATCAGTACTGTGGCGTCGGTATAGCA
TACAATGCTAGTATAGGAGGAGTCCGTATGTTGGACGGTGTAGTGAATGACGCGGTGGAA
GCCAGAGCTCTTGGTCTTAACCCCGATCACATTGACATATACAGTGCCTCGTGGGGTCCT
GAAGATGATGGGAAGACGGTAGACGGGCCGGGCCCGCTTGCTAGAAGAGCTTTTATTTAT
GGAGTTACAAGTGGTAGGCGCGGTAAAGGAAGTATATTCGTGTGGGCTTCGGGAAACGGT
GGTCGCCATACAGACTCCTGTAATTGTGATGGATATACAAATAGTATATTTACTTTATCA
ATATCGAGTGCGACACAAGGGGGATTTAAACCTTGGTATCTAGAAGAATGTTCATCGACT
CTAGCCTCCACATACAGCTCGGGTACTCCGGGTCATGATAAGAGTGTTGCTACTGTTGAT
ATGGACGGCAGATTAAGATCAGATCATATTTGTACAGTGGAACATACAGGAACGTCCGCA
TCTGCACCTTTAGCAGCCGGTATTTGTGCCCTTGCGCTGGAAGCTAATCCAAATTTGACC
TGGAGAGATATGCAGTATTTAGTAGTGTTAACATCACGTCCACAACCCCTCGAAAAAGAA
ACTGGGTGGATTGTGAACGGTGTGAAGAGAAAAGTTAGTCACAAGTTTGGCTATGGTTTA
ATGGATGCATCGGAAATGGTGAATTTGGCGGAACAATGGGTATCAGTACCACCGCAACAT
ATATGTAAATCGCAGGAAATTAATGAGGACAAAGCTATTGAATCCTCATTTGGTTATACA
CTAAAAGTACATATGGATGTTAATGGTTGCAGTGGAACAGTTAATGAAGTGAGATATCTA
GAACATGTCCAGTGCAAAATATCGTTGAGGTTTTTCCCTAGAGGTAATCTCCGCATACTT
CTTACTTCACCGATGGGAACAACGTCCTCTTTATTATTTGAAAGACCTAGAGATGTTATC
AGTTCCAACTTTGATGATTGGCCCTTCTTAAGTGTTCATTTCTGGGGTGAGAGAGCCGAA
GGTAGATGGACTTTGCAGATCGTCAATGCTGGTAACAGGCATGTTAACCAACCAGGCATT
CTTAAAAAATGGCAGTTGATATTTTATGGCACATCAACAGACCCTATACGGCTAAGGTCG
AAAAGACCTGCACAAGCAGCGCCAGCCTTTGCTTTTCCAACTGCCGCTGATGGTTACGAA
GCTGCCGGGGATTCTTTTTACAATACTGACGCGTTTACAAATTACCAGAACTTTCCTTCA
TTATTCGCCGCTGGGTCAAACCCCGAAAAGGCGATAGCACGTCTCGACGGACACAATGTC
CCTTCACCGCATGGGGAAAATGTCCTCGCTGATAGTAATGATAAGCGCGTCATGCACGAT
TGTGATCCCGAATGCGATTCTCAAGGTTGCTATGAAAAAGGACCCACACAATGTATAGCC
TGTAAGCATTACAGACTAGATGATGCCTGTGTATCTCGATGCCCTCCGAGAAGTTTTGCC
AATCAAGGTGGTGTTTGTTGGCCCTGTCATGAAACATGTGAAACATGCGTGGGCCCAGGA
CAAGATTCATGTTTGACATGTTCGCCAGCACATTTATTAGTGGCCGATTTGGGTTTGTGT
ATACAACAATGTCCTGATGGATATTGGGAAAATAGCGAAGCGTCAGCTTGTCGGCCGTGT
GCTGCACACTGTTCCACCTGCTCAGAGAGAGCTGATGCATGTACGTCATGTGAACATCAT
TTAGTACTATACAACGGAACTTGTGCCACATCCTGCCCACCTTCAACGTATGAAACGGAA
GACTATAGCTGTGCTAAATGTCATGAAAGTTGCAACACTTGTCACGGACCTGGAGAGCAA
CATTGTGTCACATGTCCTGCTTCTAGTTATGTGCTTGATGGCCGTTGTCTGAGCACGTGT
CCAAGTGGTTATTACGCAGATAAGAAAAGGAAAGAATGCATGAAATGTCCCATTGGTTGT
GCAACTTGTTTGGCTTCTTTGTGCCAATCTTGTAACTCAAATTGGGAATTGAACAGAAAA
GGGAAATGTGTGGCTGCTGGAAGTGACAGGTGTAATGCTGGTGAATTTTCGGATGGTAGC
CAATGTCAGCTGTGTCACAATGACTGCGATTCGTGTTACGGTGAAACTGAGGGCAACTGT
CTAACGTGTCCATCGCCCAACCTTTTACAAAATCACAAATGTGTACCAGAATGTAGTCGT
GGGTACTACTCTGAAGCCGGTCGCTGCACTCGTTGTATCCACGGTTGCAGCGAGTGCGCA
TCGAGACTAAACTGCACTTTCTGCACTGGGTCTCTCAGACTTCAGTCTGGTACTTGTAGA
ACAGCCTGTGCAGAAGGTTACTACGCTGATCGTGGTACATGTTCCAAGTGCTACTTATCG
TGTGCTACTTGCATTGGTCCACGTCGTGATCAGTGCGCCTCGTGTCCCCGTGGCTGGAGG
CTGGCAGCTGGTGAATGTCACCCTGAATGTCCACAGGGTTTCTATAAGACCGCCGACGGT
TGCCGCCACTGTCACCACTACTGCCGCGAGTGTGACGGCTCCGGGCCGTTACACTGCACG
TCGTGTCCTCAACGCTTCATGTTAGACGGCGGGCTGTGTATGGAGTGTTTGAGCTCTCAA
TACTATGAAAGCAGCAGTGGATTATGTCGATCGTGTCACGAATCGTGTAGGATTTGCTCT
GGACCCGGACAGTACAGCTGTACGGCGTGTTCGAGACCATTGCGGTTGGATAGGTTGAAC
AACCAATGTGTTCAGTGTTGTTCGGAGCGAGCTAACAACGCTACCTCAGACTGTTGTCAC
TGTGATTCTGACACAGGTGAGTGTATTAACTCGTCGGGCGCTGTTCGTCGTATCGCGGAG
TGGGGCGCGCTACACACCGACGAGAACCACCCAGAACTGGCGACCACTGTGATCGTGTTG
TGTGCGGCGGCCGGGCTCGTGTTGGTAGCTGTGGCAGTCGTGTTGCATAAGCGGTCACAG
AAGCCGCAGGCACGATCTAAAGGACTAACTTACGCGGCCTTATCCTCCGAGGACGCGGAT
GTGCTGGTGGTCGGGCGTAATCGGTTGGTCGAGCACGTGCTCGAAGACGAGCACGCGCGG
CCCGAGCACGTGCTCGGCTCCGACGATCTAGAGCACGCGCCGCTAATGAAACATTCCACA
TAG
Protein sequence:
MVLSWRALALLAALQMCSCLPKALYHNHFAIHVPAGEKHADDIATRHGFINHGQIGALKG
YYLLSHHGVHKRSTEPSHEHHHKLNNEPEVKWFEQQRERRRMKRDYSPYESTLWSQLSRR
LPSHRTRHRAITPSPFFSDPLFKEQWYLNGGAKDGLDMNVMPAWQRGYTGKGVVVSILDD
GIQTNHPDLAQNYDPLASTDINGNDDDPMPQDNGDNKHGTRCAGEVAAVAYNQYCGVGIA
YNASIGGVRMLDGVVNDAVEARALGLNPDHIDIYSASWGPEDDGKTVDGPGPLARRAFIY
GVTSGRRGKGSIFVWASGNGGRHTDSCNCDGYTNSIFTLSISSATQGGFKPWYLEECSST
LASTYSSGTPGHDKSVATVDMDGRLRSDHICTVEHTGTSASAPLAAGICALALEANPNLT
WRDMQYLVVLTSRPQPLEKETGWIVNGVKRKVSHKFGYGLMDASEMVNLAEQWVSVPPQH
ICKSQEINEDKAIESSFGYTLKVHMDVNGCSGTVNEVRYLEHVQCKISLRFFPRGNLRIL
LTSPMGTTSSLLFERPRDVISSNFDDWPFLSVHFWGERAEGRWTLQIVNAGNRHVNQPGI
LKKWQLIFYGTSTDPIRLRSKRPAQAAPAFAFPTAADGYEAAGDSFYNTDAFTNYQNFPS
LFAAGSNPEKAIARLDGHNVPSPHGENVLADSNDKRVMHDCDPECDSQGCYEKGPTQCIA
CKHYRLDDACVSRCPPRSFANQGGVCWPCHETCETCVGPGQDSCLTCSPAHLLVADLGLC
IQQCPDGYWENSEASACRPCAAHCSTCSERADACTSCEHHLVLYNGTCATSCPPSTYETE
DYSCAKCHESCNTCHGPGEQHCVTCPASSYVLDGRCLSTCPSGYYADKKRKECMKCPIGC
ATCLASLCQSCNSNWELNRKGKCVAAGSDRCNAGEFSDGSQCQLCHNDCDSCYGETEGNC
LTCPSPNLLQNHKCVPECSRGYYSEAGRCTRCIHGCSECASRLNCTFCTGSLRLQSGTCR
TACAEGYYADRGTCSKCYLSCATCIGPRRDQCASCPRGWRLAAGECHPECPQGFYKTADG
CRHCHHYCRECDGSGPLHCTSCPQRFMLDGGLCMECLSSQYYESSSGLCRSCHESCRICS
GPGQYSCTACSRPLRLDRLNNQCVQCCSERANNATSDCCHCDSDTGECINSSGAVRRIAE
WGALHTDENHPELATTVIVLCAAAGLVLVAVAVVLHKRSQKPQARSKGLTYAALSSEDAD
VLVVGRNRLVEHVLEDEHARPEHVLGSDDLEHAPLMKHST