New model in OGS2.0 | DPOGS200623  |
---|---|
Genomic Position | scaffold173:- 1157-6782 |
See gene structure | |
CDS Length | 1569 |
Paired RNAseq reads   | 1166 |
Single RNAseq reads   | 2935 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA008910 (8e-173) |
Best Drosophila hit   | CG3108 (3e-98) |
Best Human hit | carboxypeptidase B2 isoform a preproprotein (2e-57) |
Best NR hit (blastp)   | molting fluid carboxypeptidase A [Bombyx mori] (0.0) |
Best NR hit (blastx)   | molting fluid carboxypeptidase A [Bombyx mori] (5e-180) |
GeneOntology terms    | GO:0004181 metallocarboxypeptidase activity GO:0008270 zinc ion binding GO:0006508 proteolysis |
InterPro families    | IPR000834 Peptidase M14, carboxypeptidase A IPR003146 Proteinase inhibitor, carboxypeptidase propeptide IPR009020 Proteinase inhibitor, propeptide |
Orthology group | MCL12092 |
Nucleotide sequence:
ATGGCAAAGCTCTGGTGGACGGTCGTGTGCCTCCTCGCATCCTTGGAGCTCTGCACTCCG
CTTATAAACGAATTGCAACCGGGTAAAGAATGGCCCAAACGTCAATCAGTAAGACAACCT
CAGGACGAGCTAGACAACCCTGATGTCACAACGGTTATAGCTGATACAACGGTAGGAGGT
TCAGTAGATATACCCGAAGATGTACAAGAAGATGTTGAAGAGAACATTCAAACAAAAGCT
ATAGATGTAGAAGACTCCAAAGTAGATTACTCCGGAGCACAACTGTGGAAAGTGGCGACT
GATAAGAACGGAGTAAGAGTACTTTTAGGTCGATTGCGTCGTAGAAATCTCATTTCGACG
TGGTCGGGGAACCAAACGTACATCGATGTCCTTGTGAAACCCGACGCCGTACAGAACGTT
ACACGGATATTCAAGAGAGAGAACATTACTTTTGACGTCATTATCGAGGACCTACAAAGG
AGAATCAATGAAGAAAACCCTCCGCTCGATGAGAACGAAATTGAGCTACAGGACAGACGA
GCAAGATCTTACAATTTTTGGCGTTACAGGGCCACCCGTTTAGGTGTGATAAAAGCCTTC
ATGGGCTACAGCTCTCTTAAACTTCCAATGGAGTTTTGGCCATATGCCAACTGGGGTCAC
CGTATGACATGGAAACAATATCACAGATTGGAAGACATTCACGGCTTTATGGATTATTTA
GCCAAAACGTATCCCAAGATCGTGAGTGTGAACTCAATAGGAAAATCCTATGAAGGAAGA
GACCTTAAAGTTCTCCGTATATCAGATGGCAAGCCTTCAAATAAGGCGGTTTTTATCGAC
GGTGGTATACACGCTAGGGAATGGATCAGCCCGGCTACGGTTACATACTTCATCAACCAA
ATAGCTGAAAACTTCGACGAAGAATCCGATGACATAAGGGATATTGATTGGTATTTCTTG
CCTGTTGTCAATCCTGATGGATACGAATACACGCATATCAAAGATCGTTTGTGGAGAAAA
AATAGAAAGCCGGCAGTTTACGGTGTGAGACAGTGTGTCGGGACTGATTTGAACAGAAAT
TTCGGTTATCGTTGGGGTGGTAAAGGTTCCTCGAGTAATCCCTGCAGTGAAATATATAGA
GGAAATAGAGCTTTTTCTGAACCAGAATCCAGAGCAGTATCGGAATTCATCAAAACAAGT
GCAGCTAATTTCTCAGCATACCTGACATACCACAGTTATGGTCAATATTTATTATACCCT
TGGGGATATGACAACGCAGTCCCACCAGATCATAAAGAATTAGATCTTGTTGGCAAAAAT
ATAGCAGCGGCTATTCAAGCGACTGGAGGCTCTAAATATTCTGTTGGGTCGTCTAGTGGC
CTCCTTTATCCCGCTTCAGGCGGTTCAGATGACTGGGCCAAAGGCCAGGGCATTAAATAT
GCATACACAATTGAACTTAGCGATACTGGCCGCCATGGATTTGTTTTGCCGACAACCTTC
ATTGAGCCAGTAGCAAGGGAATCATTGTCAGGCTTAAGAGTGCTTGCAGCCCAATTAAGA
AAGAACTAA
Protein sequence:
MAKLWWTVVCLLASLELCTPLINELQPGKEWPKRQSVRQPQDELDNPDVTTVIADTTVGG
SVDIPEDVQEDVEENIQTKAIDVEDSKVDYSGAQLWKVATDKNGVRVLLGRLRRRNLIST
WSGNQTYIDVLVKPDAVQNVTRIFKRENITFDVIIEDLQRRINEENPPLDENEIELQDRR
ARSYNFWRYRATRLGVIKAFMGYSSLKLPMEFWPYANWGHRMTWKQYHRLEDIHGFMDYL
AKTYPKIVSVNSIGKSYEGRDLKVLRISDGKPSNKAVFIDGGIHAREWISPATVTYFINQ
IAENFDEESDDIRDIDWYFLPVVNPDGYEYTHIKDRLWRKNRKPAVYGVRQCVGTDLNRN
FGYRWGGKGSSSNPCSEIYRGNRAFSEPESRAVSEFIKTSAANFSAYLTYHSYGQYLLYP
WGYDNAVPPDHKELDLVGKNIAAAIQATGGSKYSVGSSSGLLYPASGGSDDWAKGQGIKY
AYTIELSDTGRHGFVLPTTFIEPVARESLSGLRVLAAQLRKN