DPGLEAN14881 in OGS1.0

New model in OGS2.0  DPOGS215392
Genomic Position  scaffold2795:- 12697-25275
CDS Length  1992
Paired RNAseq reads  24
Single RNAseq reads  81
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA012439 (2e-130)
Best Drosophila hit  furin 2, isoform I (3e-117)
Best Human hit  neuroendocrine convertase 1 isoform 1 preproprotein (3e-145)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC004402 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC004402 [Tribolium castaneum] (2e-173)
Gene Ontology terms
GO:0006508 proteolysis
GO:0004252 serine-type endopeptidase activity
GO:0008233 peptidase activity
GO:0008236 serine-type peptidase activity
GO:0016787 hydrolase activity
InterPro families
IPR000209 Peptidase S8/S53, subtilisin/kexin/sedolisin
IPR015500 Peptidase S8, subtilisin-related
IPR008979 Galactose-binding domain-like
IPR009020 Proteinase inhibitor, propeptide
IPR022398 Peptidase S8/S53, subtilisin, active site
IPR002884 Proprotein convertase, P
Orthology group  MCL17641

Nucleotide sequence:

ATGCACGATGGAAGAGAGACCGCGAACAGTTATGATGGTGAGTGGATCGTTGAAGTGGTA
GGTGGTGAGGAGGTGGCGCAGTTGGTGGCGCTGGAACACGGATATAAATACGAAGGACCG
GTGCTGGGTTTGGCAAACATGTACGCGTTCCACGCACACGAGCGCAAGGAGCGTCGCACC
CCGAGCAAGCACACATCCACACTGCGCAAGGACAGGAGGATTCGATGGGCGGAACAACTC
TTTGCAAAAAGTCGCGTGAAGCGATATCCGTACCCTGACCTCGACGGCACATTAAAACGA
GTAAAAAGAATAGATGAATACACCAGGGATGCAGACTTTACGAGGAGTTCAACCGTGGAA
CACGGACGGAGGGAGGTCTTCAATGACGAGCTCTGGGCCTACGAATGGTATTTGCAAGAC
ACTCGTGACAATCCAAACGTACCTCGCCTGGACCTCAATGTGTTATCGGTGTATAATATG
GGCTACAACGGACGTGGTGTTCGCGTGTCTATACTCGACGACGGAGTCGAACACAATCAC
ACGGACTTACAGAACAACTACGATCCGGAAATCAGTTGGGATTGCAATGATGGAGACTCG
GATCCATATCCGAGGCATGACGATAAAAACCGGAATTCTCACGGCACGAGATGTGCCGGT
GAGATAGCGATGACGGCTAACAATAAGAAGTGCGGAGTGGGCGTGGCCTGGGGCGCCAAA
GTGGGTGGAGTCAGAATGCTCGATGGACGAATCACTGATCATGTTGAAGGCGAAGCAATA
GGATTCGCGTGGGACAAAGTGGACATATACAGCGCTTCATGGGGCCCCAACGATGACGGA
GAGACCGTGGAGGGTCCAGGGCGACTCGCCATGGAGGCCTTCAAGAGAGGAGTGCAAATG
GGCCGGAACGGTAAAGGGAATATATTCGTGTGGGCCAACGGCAATGGTGGAACACACGAC
GATAACTGTAACTGCGACGGCTACTCTTCCAGTATGTACACGATATCTATTGCTAGCGCT
TCCCAACAAGGCCTGTTTCCTTGGTACGGAGAGATCTGCTCCTCGACTCTAGCAACCGCA
TACTCCTCTGGTGCTTACAGTGATCAGAAAATTGCCACTACAGACGTAAACGACTCGTGT
ACACTTGGGCACACGGGCACCTCTGCAGCGGCGCCATTGGCGGCCGGTATTATTGCTTTA
ATGCTAGATGCCAACCCAAATTTAACTTGGAGAGATGTCCAACATCTGATTGTATGGACT
TCGGAATATACACCGCTATCTGATAACCCCGGTTGGCAAGTCAACGGCGCGGGTCTTTAT
TTCGACGTACGTTTCGGCTTTGGTCTTTTGAACGCCGGATCTCTTGTCAACGCCGCACTC
AACTGGACTACAGTACCAAGTGCACTATCGTGTAGAATCGATGCTTCTCCGATCAAAGGC
AAAGTCGCCATTTCAGCAATGGAAACTGTAGATATAACAGTAAAAGTATCGGACTGTGAA
GTAAATTACTTAGAACACGTCGAACTGTATGTTAATATCGAGTATACGCGAAGAGGTGCT
TTGGAAATACACCTAATTTCTCCTCAAGGTACGATGGTTCAACTACTCAGTCCTCGTCCG
AGAGATACGTCCAAGGTCGGCTTTGTTAACTGGCCTTTAACCTCAGTAGCGACGTGGGGA
GAGAGAGCTAATGGACTTTGGAGGGTCATCGTACAAGACAAGGGGAATAAATGGAACACG
GGTTATGTCGGTGAACTGGTTCTCATAGTCCACGGTACAAAGGAAATGCCCGCTCACATG
AGGAGTGGTCCGAGGAGATACGACGACACCTTCAGTCGGTACGAGATCGAGTCGTATGAG
GATGAGCCGGCGGTACCAGGAGACCATGAGCACGGAGGAGTCGCCAGCGCGCTACTGGAC
CAGGCGGACACCGAGCTACAGAGGAACTACCACAGCAGGGGGCAGCAGGCTGGCGAGCGA
CACCGCGATTGA

Protein sequence:

MHDGRETANSYDGEWIVEVVGGEEVAQLVALEHGYKYEGPVLGLANMYAFHAHERKERRT
PSKHTSTLRKDRRIRWAEQLFAKSRVKRYPYPDLDGTLKRVKRIDEYTRDADFTRSSTVE
HGRREVFNDELWAYEWYLQDTRDNPNVPRLDLNVLSVYNMGYNGRGVRVSILDDGVEHNH
TDLQNNYDPEISWDCNDGDSDPYPRHDDKNRNSHGTRCAGEIAMTANNKKCGVGVAWGAK
VGGVRMLDGRITDHVEGEAIGFAWDKVDIYSASWGPNDDGETVEGPGRLAMEAFKRGVQM
GRNGKGNIFVWANGNGGTHDDNCNCDGYSSSMYTISIASASQQGLFPWYGEICSSTLATA
YSSGAYSDQKIATTDVNDSCTLGHTGTSAAAPLAAGIIALMLDANPNLTWRDVQHLIVWT
SEYTPLSDNPGWQVNGAGLYFDVRFGFGLLNAGSLVNAALNWTTVPSALSCRIDASPIKG
KVAISAMETVDITVKVSDCEVNYLEHVELYVNIEYTRRGALEIHLISPQGTMVQLLSPRP
RDTSKVGFVNWPLTSVATWGERANGLWRVIVQDKGNKWNTGYVGELVLIVHGTKEMPAHM
RSGPRRYDDTFSRYEIESYEDEPAVPGDHEHGGVASALLDQADTELQRNYHSRGQQAGER
HRD
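
The record's stated CDS length of 1992 and the two sequence listings can be cross-checked against each other. The following is a minimal sketch, not part of the original record: the helper names `cds_summary` and `expected_protein_length` are hypothetical, and it assumes the standard genetic code stop codons (TAA, TAG, TGA) and that the CDS includes its terminal stop codon, as the listing above appears to (it ends in TGA).

```python
# Hypothetical helpers for sanity-checking a CDS listing; not from the source record.
# Assumption: standard genetic code stop codons.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_summary(cds: str) -> dict:
    """Report basic structural properties of a coding sequence.

    Whitespace and line breaks (as in a wrapped listing) are ignored.
    """
    seq = "".join(cds.split()).upper()
    # Split into complete codons only; a trailing partial codon is dropped.
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    return {
        "length": len(seq),
        "in_frame": len(seq) % 3 == 0,
        "starts_with_atg": seq.startswith("ATG"),
        "ends_with_stop": bool(codons) and codons[-1] in STOP_CODONS,
        "internal_stops": sum(c in STOP_CODONS for c in codons[:-1]),
    }


def expected_protein_length(cds_length: int) -> int:
    """Residue count implied by a CDS length that includes the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    # First and last codons of the listing above, joined as a toy example.
    print(cds_summary("ATGCACGAT\nCACCGCGATTGA"))
    print(expected_protein_length(1992))
```

Under these assumptions, a CDS length of 1992 corresponds to 664 codons, that is, 663 residues plus the stop codon. Pasting the full nucleotide listing into `cds_summary` would show whether it is in frame and free of internal stops.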