New model in OGS2.0 | DPOGS215392  |
---|---|
Genomic Position | scaffold2795:- 12697-25275 |
See gene structure | |
CDS Length | 1992 |
Paired RNAseq reads   | 24 |
Single RNAseq reads   | 81 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012439 (2e-130) |
Best Drosophila hit   | furin 2, isoform I (3e-117) |
Best Human hit | neuroendocrine convertase 1 isoform 1 preproprotein (3e-145) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC004402 [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC004402 [Tribolium castaneum] (2e-173) |
GeneOntology terms    | GO:0006508 proteolysis GO:0004252 serine-type endopeptidase activity GO:0008233 peptidase activity GO:0008236 serine-type peptidase activity GO:0016787 hydrolase activity |
InterPro families    | IPR000209 Peptidase S8/S53, subtilisin/kexin/sedolisin IPR015500 Peptidase S8, subtilisin-related IPR008979 Galactose-binding domain-like IPR009020 Proteinase inhibitor, propeptide IPR022398 Peptidase S8/S53, subtilisin, active site IPR002884 Proprotein convertase, P |
Orthology group | MCL17641 |
Nucleotide sequence:
ATGCACGATGGAAGAGAGACCGCGAACAGTTATGATGGTGAGTGGATCGTTGAAGTGGTA
GGTGGTGAGGAGGTGGCGCAGTTGGTGGCGCTGGAACACGGATATAAATACGAAGGACCG
GTGCTGGGTTTGGCAAACATGTACGCGTTCCACGCACACGAGCGCAAGGAGCGTCGCACC
CCGAGCAAGCACACATCCACACTGCGCAAGGACAGGAGGATTCGATGGGCGGAACAACTC
TTTGCAAAAAGTCGCGTGAAGCGATATCCGTACCCTGACCTCGACGGCACATTAAAACGA
GTAAAAAGAATAGATGAATACACCAGGGATGCAGACTTTACGAGGAGTTCAACCGTGGAA
CACGGACGGAGGGAGGTCTTCAATGACGAGCTCTGGGCCTACGAATGGTATTTGCAAGAC
ACTCGTGACAATCCAAACGTACCTCGCCTGGACCTCAATGTGTTATCGGTGTATAATATG
GGCTACAACGGACGTGGTGTTCGCGTGTCTATACTCGACGACGGAGTCGAACACAATCAC
ACGGACTTACAGAACAACTACGATCCGGAAATCAGTTGGGATTGCAATGATGGAGACTCG
GATCCATATCCGAGGCATGACGATAAAAACCGGAATTCTCACGGCACGAGATGTGCCGGT
GAGATAGCGATGACGGCTAACAATAAGAAGTGCGGAGTGGGCGTGGCCTGGGGCGCCAAA
GTGGGTGGAGTCAGAATGCTCGATGGACGAATCACTGATCATGTTGAAGGCGAAGCAATA
GGATTCGCGTGGGACAAAGTGGACATATACAGCGCTTCATGGGGCCCCAACGATGACGGA
GAGACCGTGGAGGGTCCAGGGCGACTCGCCATGGAGGCCTTCAAGAGAGGAGTGCAAATG
GGCCGGAACGGTAAAGGGAATATATTCGTGTGGGCCAACGGCAATGGTGGAACACACGAC
GATAACTGTAACTGCGACGGCTACTCTTCCAGTATGTACACGATATCTATTGCTAGCGCT
TCCCAACAAGGCCTGTTTCCTTGGTACGGAGAGATCTGCTCCTCGACTCTAGCAACCGCA
TACTCCTCTGGTGCTTACAGTGATCAGAAAATTGCCACTACAGACGTAAACGACTCGTGT
ACACTTGGGCACACGGGCACCTCTGCAGCGGCGCCATTGGCGGCCGGTATTATTGCTTTA
ATGCTAGATGCCAACCCAAATTTAACTTGGAGAGATGTCCAACATCTGATTGTATGGACT
TCGGAATATACACCGCTATCTGATAACCCCGGTTGGCAAGTCAACGGCGCGGGTCTTTAT
TTCGACGTACGTTTCGGCTTTGGTCTTTTGAACGCCGGATCTCTTGTCAACGCCGCACTC
AACTGGACTACAGTACCAAGTGCACTATCGTGTAGAATCGATGCTTCTCCGATCAAAGGC
AAAGTCGCCATTTCAGCAATGGAAACTGTAGATATAACAGTAAAAGTATCGGACTGTGAA
GTAAATTACTTAGAACACGTCGAACTGTATGTTAATATCGAGTATACGCGAAGAGGTGCT
TTGGAAATACACCTAATTTCTCCTCAAGGTACGATGGTTCAACTACTCAGTCCTCGTCCG
AGAGATACGTCCAAGGTCGGCTTTGTTAACTGGCCTTTAACCTCAGTAGCGACGTGGGGA
GAGAGAGCTAATGGACTTTGGAGGGTCATCGTACAAGACAAGGGGAATAAATGGAACACG
GGTTATGTCGGTGAACTGGTTCTCATAGTCCACGGTACAAAGGAAATGCCCGCTCACATG
AGGAGTGGTCCGAGGAGATACGACGACACCTTCAGTCGGTACGAGATCGAGTCGTATGAG
GATGAGCCGGCGGTACCAGGAGACCATGAGCACGGAGGAGTCGCCAGCGCGCTACTGGAC
CAGGCGGACACCGAGCTACAGAGGAACTACCACAGCAGGGGGCAGCAGGCTGGCGAGCGA
CACCGCGATTGA
Protein sequence:
MHDGRETANSYDGEWIVEVVGGEEVAQLVALEHGYKYEGPVLGLANMYAFHAHERKERRT
PSKHTSTLRKDRRIRWAEQLFAKSRVKRYPYPDLDGTLKRVKRIDEYTRDADFTRSSTVE
HGRREVFNDELWAYEWYLQDTRDNPNVPRLDLNVLSVYNMGYNGRGVRVSILDDGVEHNH
TDLQNNYDPEISWDCNDGDSDPYPRHDDKNRNSHGTRCAGEIAMTANNKKCGVGVAWGAK
VGGVRMLDGRITDHVEGEAIGFAWDKVDIYSASWGPNDDGETVEGPGRLAMEAFKRGVQM
GRNGKGNIFVWANGNGGTHDDNCNCDGYSSSMYTISIASASQQGLFPWYGEICSSTLATA
YSSGAYSDQKIATTDVNDSCTLGHTGTSAAAPLAAGIIALMLDANPNLTWRDVQHLIVWT
SEYTPLSDNPGWQVNGAGLYFDVRFGFGLLNAGSLVNAALNWTTVPSALSCRIDASPIKG
KVAISAMETVDITVKVSDCEVNYLEHVELYVNIEYTRRGALEIHLISPQGTMVQLLSPRP
RDTSKVGFVNWPLTSVATWGERANGLWRVIVQDKGNKWNTGYVGELVLIVHGTKEMPAHM
RSGPRRYDDTFSRYEIESYEDEPAVPGDHEHGGVASALLDQADTELQRNYHSRGQQAGER
HRD