DPGLEAN11046 in OGS1.0

New model in OGS2.0DPOGS203664 
Genomic Positionscaffold184:+ 131792-135302
See gene structure
CDS Length1419
Paired RNAseq reads  1448
Single RNAseq reads  3436
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003461 (5e-34)
Best Drosophila hit  CG31728 (6e-96)
Best Human hittransmembrane protease serine 6 (3e-39)
Best NR hit (blastp)  PREDICTED: similar to CG31728-PA [Apis mellifera] (8e-136)
Best NR hit (blastx)  CLIP-domain serine protease subfamily D (AGAP008183-PA) [Anopheles gambiae str. PEST] (6e-137)
GeneOntology terms
  
GO:0004252 serine-type endopeptidase activity
GO:0006508 proteolysis
InterPro families


  
IPR009003 Peptidase cysteine/serine, trypsin-like
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR001314 Peptidase S1A, chymotrypsin-type
IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site
Orthology groupMCL16236

Nucleotide sequence:

ATGAAACGTGAATCAAACAGAACAAGGGATGACAAACAACTATTATTTTTGCAAGCTAGA
CAAGCAAATGATGAAACCTTTGGAAGAGATTGTGAAACAACTATAGGGAAAAAGGGAATT
TGCAAGAGCTTTCGCGACTGCTACCCATTATTTAAGATTGTGGATTTATCCGGTTACGAT
GGTTGGGTCATGGGTCACTATGACACATGTAGTTTTGTAAACAGGGAAAATTCAGAGCTA
TTCGGAGTGTGTTGTACTGAGCCCGTTGGCACACCGCCGCAGCAGGAACCAGACGTGCAA
CGACTTGGTGTTTTTAGACCTCCTTATCCGATTTCAATGAACAATTACCAACATCCATCA
CCTCTCTTACCTAAATGGATGAACATGAACGAGCCTCTTCATCGCCAGTTCTTTTCTCAA
TGGCCGCCAACTATACCACCACTACCCACACATCCACCGGACCACACAGCTCCAACACAT
CCGCCATCTATCGTTGCCGGTATTCCGACGACAACTAAACCGTCAAACGGCTTACCATCT
ACAACTTGGGGTACGAAACCTCCAGCAACGACAAAACAGACCTGGTCTCCTGCATATCCA
ACACAGCCAACAAAGCCAACCGGCCAACCGGGCGTGGATTCGTCGTGCGGAATTAAGAAC
GGACCACAGACCTACGGAAGTACGTATGAATCTCTTGACGAGGAGCGTATAGTGGGGGGT
CATAACGCGGAGCTAAACGAGTGGCCATGGATAGTAGCGCTGTTCAATAATGGAAGACAA
TTCTGCGGAGGATCCCTCATAGACGATAGACATGTTTTAACAGCAGCTCATTGTGTAGCT
CATATGACATCGTTGGATGTCGCTCGACTCACGGCGAGACTGGGAGACTACAACATACGG
ACGAACACAGAGACACAACACGTTGAGAGAAGAATCAAGAGAGTTGTCAGACATCGCGGT
TTCGACATGAGGACATTATACAACGACGTAGCTGTTCTAACTTTAGACCAACCTGTGACT
TTCACAAAAAACATTCGACCGGTTTGTCTTCCCGGAGGAGCCAGAGCTTATTCAGGACTA
ATAGCGACGGTAATAGGATGGGGAAGCTTGAGAGAAAGTGGTCCTCAACCGTCTATTCTA
CAAGAAGTGTCAATACCAATTTGGACTAACAACGAGTGTCGTCTCAAGTACGGCTCCGCG
GCCCCTGGTGGGATCGTTGACCACATGCTGTGCGCTGGTAAAGCCAGTATGGATTCATGC
AGTGGCGACAGCGGTGGACCTTTGATGGTGAATGAAGGCGGTCGTTGGACTCAAGTCGGC
GTCGTGTCATGGGGTATCGGATGTGGTAAGGGTCAGTACCCTGGGGTCTACACACGAATC
ACCTCTTTCCTCCCCTGGATACAAAAGAACGCTAAGTGA

Protein sequence:

MKRESNRTRDDKQLLFLQARQANDETFGRDCETTIGKKGICKSFRDCYPLFKIVDLSGYD
GWVMGHYDTCSFVNRENSELFGVCCTEPVGTPPQQEPDVQRLGVFRPPYPISMNNYQHPS
PLLPKWMNMNEPLHRQFFSQWPPTIPPLPTHPPDHTAPTHPPSIVAGIPTTTKPSNGLPS
TTWGTKPPATTKQTWSPAYPTQPTKPTGQPGVDSSCGIKNGPQTYGSTYESLDEERIVGG
HNAELNEWPWIVALFNNGRQFCGGSLIDDRHVLTAAHCVAHMTSLDVARLTARLGDYNIR
TNTETQHVERRIKRVVRHRGFDMRTLYNDVAVLTLDQPVTFTKNIRPVCLPGGARAYSGL
IATVIGWGSLRESGPQPSILQEVSIPIWTNNECRLKYGSAAPGGIVDHMLCAGKASMDSC
SGDSGGPLMVNEGGRWTQVGVVSWGIGCGKGQYPGVYTRITSFLPWIQKNAK