DPGLEAN04197 in OGS1.0

New model in OGS2.0DPOGS209563 
Genomic Positionscaffold1027:- 21462-48386
See gene structure
CDS Length3810
Paired RNAseq reads  213
Single RNAseq reads  505
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA002365 (0.0)
Best Drosophila hit  furin 2, isoform I (0.0)
Best Human hitproprotein convertase subtilisin/kexin type 5 isoform 1 preproprotein (0.0)
Best NR hit (blastp)  Endoprotease FURIN [Spodoptera frugiperda] (0.0)
Best NR hit (blastx)  Endoprotease FURIN [Spodoptera frugiperda] (0.0)
GeneOntology terms





  
GO:0004252 serine-type endopeptidase activity
GO:0005886 plasma membrane
GO:0006508 proteolysis
GO:0006468 protein amino acid phosphorylation
GO:0004714 transmembrane receptor protein tyrosine kinase activity
GO:0007169 transmembrane receptor protein tyrosine kinase signaling pathway
GO:0005524 ATP binding
InterPro families







  
IPR000209 Peptidase S8/S53, subtilisin/kexin/sedolisin
IPR008979 Galactose-binding domain-like
IPR009030 Growth factor, receptor
IPR009020 Proteinase inhibitor, propeptide
IPR002884 Proprotein convertase, P
IPR006211 Furin-like cysteine-rich domain
IPR006212 Furin-like repeat
IPR015500 Peptidase S8, subtilisin-related
IPR022398 Peptidase S8/S53, subtilisin, active site
Orthology groupMCL10159

Nucleotide sequence:

ATGCCACCTATTCACCTTGTTGTGGTATTGACAGTGTTGGACTTGTGTGTGCCACTGTAC
CACGATCAGTTCGCGCTGCTCGTGCCAGATGGCCGTGCAGATGAACTGGCTCACAGACAT
GGGTTCATAAACCATGGCCAGATCGGCAGCCTGAACCATTATTACTTATTTTCACATCCT
CACATAAGCAAGAGATCAGTGAACGCAAGCAAAGAATATGAAATTCGTCTCAAGAATGAC
CCACAGGTGAGATGGGTGATGCAGCAGCGCGAGCTGAAGAGAAGCAAGCGGGATCTGCAT
CCACGTCACGTCATGAGACAGTCCATGCCGGAGTTCCCAGATCCATTGTTCAAAGAACAG
TGGTATCTTAATGGTGGTGCGGCTGAAGGTTTAGATATGAACGTCGGGGTCGCATGGAGA
AAAGGCTACACAGGAAAGGGGGTCGTAATAACAATACTCGATGACGGAATCCAGCCTAAC
CATCCAGACCTGTTGCAGAATTATGATCCCGCGGCATCGACTGACATAAACGGAAATGAC
ACCGATCCAACGCCGCAGGATAATGGCGACAATAAACATGGTACCCGGTGTGCGGGGGAA
GTCGCCGCAGTGGCGTATAATAAATATTGTGGCGTAGGAATAGCATACAATGCTAGCATT
GGTGGCGTGAGAATGCTAGACGGTCTAGTTAATGACGCAGTGGAAGCTAAAGCACTCGGC
TTCAACACTCACCACATAGACATTTATAGCGCTTCCTGGGGACCGGAGGACGACGGAAAA
ACGGTCGACGGACCTGGCCCCTTAGCTAGAAGAGCTTTTATCAACGGCGTAACAAACGGC
AGAGGGGGAAAAGGTTCTATTTTTATATGGGCATCAGGAAATGGAGGGAGGCATACTGAT
TCCTGTAATTGTGATGGCTACGCAAATAGTATATTCACAATTTCAATTTCTAGTGCTACA
CAGGGAGGTTACAAGCCGTGGTATTTAGAAGAATGTTCCTCTACTTTAGCATCCACATAT
AGTTCAGGGACACCAGGTAGGGACAAAAGTGTTGCGACCGTTGATATGGACGTCCAATTG
CGGCCTGATCATATTTGTACCGTAGATCATACGGGTACCTCGGCTTCTGCACCCTTAGCA
GCGGGAATTTGTGCACTAGCATTGGAAGCAAATTCATTATTAACATGGCGAGATATGCAA
CATCTAATCGTTATGACCTCCAGGTCACAACCTTTAGATAAAGAAGAAGGATGGATCGTA
AATGGTGTTAAAAGAAAAGTAAGTCATAAATTTGGTTACGGTCTTATGGATGCCGGACAA
ATGGTATCTTTAGCTGAACAATGGATAAATGTTCCACCGCAACACATATGTAAGTCACAA
GAAATAAACGAGGACAGGGCAATTGAAACTTCTTTCGGATATACTATATCCGTTCATATG
GACGTAAATGGTTGTAGTGGTACAATGAATGAAGTTAGGTTTTTGGAACACGTTCAATGT
AAGATTTCGTTAAGCTTTTTTCCAAGAGGTAATTTACGGATATTGCTGACGTCGCCAATG
GGCACTACGTCAACTTTATTATTTGAAAGAACTCACGATGCTGCTAGTTCTAATTTTGAC
GATTGGCCTTTCTTAAGTGTTCATTTTTGGGGTGAAAACGCCGAAGGACGATGGACACTT
CAAATAATAAACGCCGGCAATAACCATGTTACTCAACCGGGAGTATTAAAAAAGTGGCAG
CTTATTTTCTACGGTACGGCGGCAAATCCGATGCGTTTACGAAATAAAAGTTACTTCAAT
TCTGATAATATTCGACAAGATGAGAAGACCTATCATATTAACGACGTTTATGATGCTAAT
GAGTATTCGCAATTTCTCAATGAAATAGAACTTGGGATTTCAGATAGACGTAATTATCCT
AAGAATATTCCTTCAGCTCAAAGAAAAAACGTTTTGGCAGATGCTAACGATAAGCAAGTC
CAAAGACTGTGCGATCCCGAATGTGATTCACAAGGTTGTTATGGTAAGGGTCCCACCCAA
TGTGTTGCTTGCAAACATTACCGCCTCGATAACTCCTGTGTGTCCAGGTGTCCACCGAGA
AGCTTTGTTAACCAAGGAGGTGTTTGTTGGCCTTGCCATGAGTCTTGCGAAACTTGTGCT
GGAGCTGGACAGGATTCTTGTCTTACTTGTGCACCAGCACATTTACTTGTTGTCGATTTA
GCTGTATGTCTGCAACAGTGTCCAGATGGTTATTATGAAGATCCTGACGCAAACGCTTGT
TTTCCGTGTGCAGAACACTGTGACACCTGTTCGGATAAAGCTGATTTGTGTTCTTCATGT
GCTCATAATTACGAATTGTATAATGGGTCTTGTTTAGCCACTTGCCCTCCTGGAACATAC
AAAAAAGAGGATTTTGGTTGTATGCGGTGTCATGAAACGTGTGAGTCCTGTAGCGGCCCG
AATGAATCTGAATGTGTTACTTGTAAAATTGGAGAGTACGCGCTAGAAGGTCGCTGTGTA
TCTAACTGTCTCATCGGGAATTATGCAGATGTTCAAAAAAAAGAATGCATATCGTGTCCC
ATTGGATGTTCAATTTGCACATATGCAGTTTGTTCCGCTTGTCAAGAGAAATGGGTTCTA
ACGAAAAAAGGAACATGTCAGCCTGAAGGAAACGACAAGTGTGATACTAATGAGTATTAC
GAAGGAGGGCGTTGCAAAAATTGCCATTCCACTTGTGAGAAATGTAGTGGTCCTAATGAA
TGGGACTGTTTATCTTGTTCAAGTCCTCTGTTATTGCAGGGATCAAGGTGCGTTGCGGAA
TGTGGACAAGGCTTTTACCAGACAGCTGGGAGATGTTCGTTATGCCCGCACACATGCAAA
ACATGCGTGTCGAGGTTAAATTGTACAACTTGTGCTAATAGTCTTAGATTACAATCCGGT
ACTTGTCGTTCTACATGCGCAGCTGGTTACTATCCTGATGAAGGAACATGTTCCAAGTGC
TACTTATCTTGTGAGACCTGTACTGGCCCGAGAAGAGATCAATGCGCATCATGTCCTCCA
GATTGGAGGCTAGCAGCGGGAGAATGTAGACCGGAATGTCCTCAAAACTTCTTTACATGG
GGAGACAGTTGTCGTAGATGTCATCATTATTGCCAGGATTGCCATGGAGCTGGTCCCCAA
AGGTGTACATCCTGTCCTCAGCATTTTTCCTTAGAAAATGGTTTATGTGTTGAGTGCCTT
AGTTCCCAATATTATGAAATTAGGACGAGGACTTGCCGTCCATGCCATGACTCTTGCAGG
TCATGTTCTGGACCTGGGCCTACTAGTTGTGTGACGTGTGCACATCCTCTTCGTTTAGAT
AGGGTGAATCACAAATGTTTGCCATGCTGTACAGAGAATTTAGTTTCGTTTTATTTAAGC
ACTAATCAATCAACAGATTGCTGCCACTGTGATAAAGATATGGGCGGTTGTCTGAACGGT
TCATCGGCGGGTAAGAGACGCATTGCGGAGAACATTGGGGCGCATATGACGCCATCATTT
TTTGTCGACGACGCAAAAGAGCAGAATATCTTAGACCGTGATTTATTGCTCCTTTTGAGT
GCTGGAGTGGCGGTCCTTGTAATATCCATTGCAATGATTGTACTGAGGTTCAAGTCTAAA
AAGTGCAAACATCTTTCACCGTTTCCAAGGACAGGATATTCACAATTGACTTCCATAGAC
GAGGATTTCACAGCAGTGAGCTTATCGCATACTACATTGAAAGTCATTCAAAGTGACATA
AACACAAACCATCTGGAGGAACCAACATAA

Protein sequence:

MPPIHLVVVLTVLDLCVPLYHDQFALLVPDGRADELAHRHGFINHGQIGSLNHYYLFSHP
HISKRSVNASKEYEIRLKNDPQVRWVMQQRELKRSKRDLHPRHVMRQSMPEFPDPLFKEQ
WYLNGGAAEGLDMNVGVAWRKGYTGKGVVITILDDGIQPNHPDLLQNYDPAASTDINGND
TDPTPQDNGDNKHGTRCAGEVAAVAYNKYCGVGIAYNASIGGVRMLDGLVNDAVEAKALG
FNTHHIDIYSASWGPEDDGKTVDGPGPLARRAFINGVTNGRGGKGSIFIWASGNGGRHTD
SCNCDGYANSIFTISISSATQGGYKPWYLEECSSTLASTYSSGTPGRDKSVATVDMDVQL
RPDHICTVDHTGTSASAPLAAGICALALEANSLLTWRDMQHLIVMTSRSQPLDKEEGWIV
NGVKRKVSHKFGYGLMDAGQMVSLAEQWINVPPQHICKSQEINEDRAIETSFGYTISVHM
DVNGCSGTMNEVRFLEHVQCKISLSFFPRGNLRILLTSPMGTTSTLLFERTHDAASSNFD
DWPFLSVHFWGENAEGRWTLQIINAGNNHVTQPGVLKKWQLIFYGTAANPMRLRNKSYFN
SDNIRQDEKTYHINDVYDANEYSQFLNEIELGISDRRNYPKNIPSAQRKNVLADANDKQV
QRLCDPECDSQGCYGKGPTQCVACKHYRLDNSCVSRCPPRSFVNQGGVCWPCHESCETCA
GAGQDSCLTCAPAHLLVVDLAVCLQQCPDGYYEDPDANACFPCAEHCDTCSDKADLCSSC
AHNYELYNGSCLATCPPGTYKKEDFGCMRCHETCESCSGPNESECVTCKIGEYALEGRCV
SNCLIGNYADVQKKECISCPIGCSICTYAVCSACQEKWVLTKKGTCQPEGNDKCDTNEYY
EGGRCKNCHSTCEKCSGPNEWDCLSCSSPLLLQGSRCVAECGQGFYQTAGRCSLCPHTCK
TCVSRLNCTTCANSLRLQSGTCRSTCAAGYYPDEGTCSKCYLSCETCTGPRRDQCASCPP
DWRLAAGECRPECPQNFFTWGDSCRRCHHYCQDCHGAGPQRCTSCPQHFSLENGLCVECL
SSQYYEIRTRTCRPCHDSCRSCSGPGPTSCVTCAHPLRLDRVNHKCLPCCTENLVSFYLS
TNQSTDCCHCDKDMGGCLNGSSAGKRRIAENIGAHMTPSFFVDDAKEQNILDRDLLLLLS
AGVAVLVISIAMIVLRFKSKKCKHLSPFPRTGYSQLTSIDEDFTAVSLSHTTLKVIQSDI
NTNHLEEPT