New model in OGS2.0 | DPOGS203664  |
---|---|
Genomic Position | scaffold184:+ 131792-135302 |
CDS Length | 1419 |
Paired RNAseq reads   | 1448 |
Single RNAseq reads   | 3436 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA003461 (5e-34) |
Best Drosophila hit   | CG31728 (6e-96) |
Best Human hit | transmembrane protease serine 6 (3e-39) |
Best NR hit (blastp)   | PREDICTED: similar to CG31728-PA [Apis mellifera] (8e-136) |
Best NR hit (blastx)   | CLIP-domain serine protease subfamily D (AGAP008183-PA) [Anopheles gambiae str. PEST] (6e-137) |
GeneOntology terms    | GO:0004252 serine-type endopeptidase activity; GO:0006508 proteolysis |
InterPro families    | IPR009003 Peptidase cysteine/serine, trypsin-like; IPR001254 Peptidase S1/S6, chymotrypsin/Hap; IPR001314 Peptidase S1A, chymotrypsin-type; IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site |
Orthology group | MCL16236 |
Nucleotide sequence:
ATGAAACGTGAATCAAACAGAACAAGGGATGACAAACAACTATTATTTTTGCAAGCTAGA
CAAGCAAATGATGAAACCTTTGGAAGAGATTGTGAAACAACTATAGGGAAAAAGGGAATT
TGCAAGAGCTTTCGCGACTGCTACCCATTATTTAAGATTGTGGATTTATCCGGTTACGAT
GGTTGGGTCATGGGTCACTATGACACATGTAGTTTTGTAAACAGGGAAAATTCAGAGCTA
TTCGGAGTGTGTTGTACTGAGCCCGTTGGCACACCGCCGCAGCAGGAACCAGACGTGCAA
CGACTTGGTGTTTTTAGACCTCCTTATCCGATTTCAATGAACAATTACCAACATCCATCA
CCTCTCTTACCTAAATGGATGAACATGAACGAGCCTCTTCATCGCCAGTTCTTTTCTCAA
TGGCCGCCAACTATACCACCACTACCCACACATCCACCGGACCACACAGCTCCAACACAT
CCGCCATCTATCGTTGCCGGTATTCCGACGACAACTAAACCGTCAAACGGCTTACCATCT
ACAACTTGGGGTACGAAACCTCCAGCAACGACAAAACAGACCTGGTCTCCTGCATATCCA
ACACAGCCAACAAAGCCAACCGGCCAACCGGGCGTGGATTCGTCGTGCGGAATTAAGAAC
GGACCACAGACCTACGGAAGTACGTATGAATCTCTTGACGAGGAGCGTATAGTGGGGGGT
CATAACGCGGAGCTAAACGAGTGGCCATGGATAGTAGCGCTGTTCAATAATGGAAGACAA
TTCTGCGGAGGATCCCTCATAGACGATAGACATGTTTTAACAGCAGCTCATTGTGTAGCT
CATATGACATCGTTGGATGTCGCTCGACTCACGGCGAGACTGGGAGACTACAACATACGG
ACGAACACAGAGACACAACACGTTGAGAGAAGAATCAAGAGAGTTGTCAGACATCGCGGT
TTCGACATGAGGACATTATACAACGACGTAGCTGTTCTAACTTTAGACCAACCTGTGACT
TTCACAAAAAACATTCGACCGGTTTGTCTTCCCGGAGGAGCCAGAGCTTATTCAGGACTA
ATAGCGACGGTAATAGGATGGGGAAGCTTGAGAGAAAGTGGTCCTCAACCGTCTATTCTA
CAAGAAGTGTCAATACCAATTTGGACTAACAACGAGTGTCGTCTCAAGTACGGCTCCGCG
GCCCCTGGTGGGATCGTTGACCACATGCTGTGCGCTGGTAAAGCCAGTATGGATTCATGC
AGTGGCGACAGCGGTGGACCTTTGATGGTGAATGAAGGCGGTCGTTGGACTCAAGTCGGC
GTCGTGTCATGGGGTATCGGATGTGGTAAGGGTCAGTACCCTGGGGTCTACACACGAATC
ACCTCTTTCCTCCCCTGGATACAAAAGAACGCTAAGTGA
Protein sequence:
MKRESNRTRDDKQLLFLQARQANDETFGRDCETTIGKKGICKSFRDCYPLFKIVDLSGYD
GWVMGHYDTCSFVNRENSELFGVCCTEPVGTPPQQEPDVQRLGVFRPPYPISMNNYQHPS
PLLPKWMNMNEPLHRQFFSQWPPTIPPLPTHPPDHTAPTHPPSIVAGIPTTTKPSNGLPS
TTWGTKPPATTKQTWSPAYPTQPTKPTGQPGVDSSCGIKNGPQTYGSTYESLDEERIVGG
HNAELNEWPWIVALFNNGRQFCGGSLIDDRHVLTAAHCVAHMTSLDVARLTARLGDYNIR
TNTETQHVERRIKRVVRHRGFDMRTLYNDVAVLTLDQPVTFTKNIRPVCLPGGARAYSGL
IATVIGWGSLRESGPQPSILQEVSIPIWTNNECRLKYGSAAPGGIVDHMLCAGKASMDSC
SGDSGGPLMVNEGGRWTQVGVVSWGIGCGKGQYPGVYTRITSFLPWIQKNAK
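
As a quick consistency check on this record, the CDS can be translated and compared with the listed protein. The following is a minimal Python sketch, not part of the original annotation pipeline: the `translate` helper is hypothetical, the standard genetic code is assumed, and only the first 30 and last 15 nucleotides of the CDS are used for brevity.

```python
# Standard genetic code (assumed), with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS listed above.
cds_start = "ATGAAACGTGAATCAAACAGAACAAGGGAT"
cds_end = "AAGAACGCTAAGTGA"

# A 1419-nt CDS holds 473 codons; dropping the stop codon leaves 472 residues.
expected_protein_length = 1419 // 3 - 1

print(translate(cds_start))       # MKRESNRTRD
print(translate(cds_end))         # KNAK
print(expected_protein_length)    # 472
```

The two translated fragments match the start (`MKRESNRTRD`) and end (`KNAK`) of the protein sequence, and the expected length of 472 residues agrees with the listed protein.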