New model in OGS2.0 | DPOGS204784  |
---|---|
Genomic Position | scaffold1728:+ 11322-14994 |
CDS Length | 1263 bp |
Paired RNAseq reads   | 4 |
Single RNAseq reads   | 12 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA009476 (5e-114) |
Best Drosophila hit   | CG8560 (9e-38) |
Best Human hit | carboxypeptidase A1 precursor (3e-31) |
Best NR hit (blastp)   | midgut carboxypeptidase 2 [Trichoplusia ni] (2e-50) |
Best NR hit (blastx)   | midgut carboxypeptidase 2 [Trichoplusia ni] (3e-49) |
GeneOntology terms    | GO:0004180 carboxypeptidase activity; GO:0006508 proteolysis; GO:0008270 zinc ion binding; GO:0004181 metallocarboxypeptidase activity |
InterPro families    | IPR000834 Peptidase M14, carboxypeptidase A; IPR003146 Proteinase inhibitor, carboxypeptidase propeptide; IPR009020 Proteinase inhibitor, propeptide |
Orthology group | MCL40427 |
Nucleotide sequence:
ATGCTATCTGAATATTTGGAGACAGAACCAGATGGGTCCAATTCTAGGAAAGAACTCGTG
AGGTTTAATTCTCAAAAGCTAAGACAGAAGAACCATCGAGTTTATAAAATTAAACTTGAA
ACAACCGAGCAATCGAATAAGTTTTATTTACTCAAAGATGACGTCATAGATTTCTGGCAG
AAACCTTCTTTGAAGCATAAAGTGGAAGGTAAAGCTATGGTTCCTCCTTCACATTTTGGT
TGGTTTGAAAAACAATTGAATAATATGAATATTGAGAAGAATATTTATATAGATGATGTT
TATGAATATTTAAAGGAGCATGACACCCGTCAAGCACGAAGAAACGATGAGGACGATGTA
TTCAGTTTCGACGGCTATCATAGATTTGATCAAGTTTTGAATTATATGAAAGGGTTAAAT
GGGACTTCTTTACCAGGTGTTGATGTAGAATTCGTTGAAGCTGGATATACGGATGAGAAC
AGAACTTTGGCATATTTGAGAATAAATTCCAAAACAAGTGAAGAGGGAACACAAAGGCCC
ATAGTAATTATTGAGGCTGGGGTTAATCCTAGAGAGTGGATAACAATACCGACTGCTCTT
AATATTGCAAATAAACTTATTGAAGGAAATCAAACGAAATTAGCACAAAATTTGGAATGG
ATTATTTTACCAGTTCTCAATCCGGATGGTTATGAATTTACACATAACTCGAATCGACTT
TGGACGAAATCAAGAAGCACCAGAAGTAACTTAGGTTTTATATGCCCAGGAGTGAATATT
AACAGAAACTTTGACATTGACTGGATGTTTTCTGACTCCAGCACTAGTCCGTGTAGTCAT
CTGTATGGTGGGATTGAAAGCTTTTCAGAACCGGAGTCCCAAATAATAAGAAAACTCATC
GAAGAACATGGTAATCGAATCAAACTATACATTTCCCTACAAAACAATGGAGGATTTGTA
TCTTATCCCTGGCAATACGAGAGAGCTGCAAGTGGAATGTTCAGACAACACCATTTATTG
GGATTAGAAATGATTTCAGCCATAGCAGATAATTACAAGTTAGACATAGGCTCCTTAGCT
TTAGGAGATAGGGCTTCTGGAACTAGCAGTGATTATGTTATGAGTAGAAATGTTTTATAC
ACATTCAATATTGATATAAAACAATGCGAGGGTGATGTTCTTGTACCTGAGGCTGAAATA
AGACCAATCGCTGAACGGGTATGGAGAGCAGTCGCTGTAGCCGCTGGAAATATGATAAGT
TGA
Protein sequence:
MLSEYLETEPDGSNSRKELVRFNSQKLRQKNHRVYKIKLETTEQSNKFYLLKDDVIDFWQ
KPSLKHKVEGKAMVPPSHFGWFEKQLNNMNIEKNIYIDDVYEYLKEHDTRQARRNDEDDV
FSFDGYHRFDQVLNYMKGLNGTSLPGVDVEFVEAGYTDENRTLAYLRINSKTSEEGTQRP
IVIIEAGVNPREWITIPTALNIANKLIEGNQTKLAQNLEWIILPVLNPDGYEFTHNSNRL
WTKSRSTRSNLGFICPGVNINRNFDIDWMFSDSSTSPCSHLYGGIESFSEPESQIIRKLI
EEHGNRIKLYISLQNNGGFVSYPWQYERAASGMFRQHHLLGLEMISAIADNYKLDIGSLA
LGDRASGTSSDYVMSRNVLYTFNIDIKQCEGDVLVPEAEIRPIAERVWRAVAVAAGNMIS
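
Consistency check (a minimal sketch, not part of the original record): the CDS length and genomic coordinates listed above can be cross-checked against the protein sequence with simple arithmetic. The helper names `expected_protein_length` and `genomic_span` are hypothetical, and the sketch assumes that the CDS length includes the terminal stop codon and that the genomic coordinates are 1-based and inclusive.

```python
def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose length includes the terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumed to be counted).
    return cds_length // 3 - 1


def genomic_span(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


cds_length = 1263          # value from the "CDS Length" row
start, end = 11322, 14994  # values from the "Genomic Position" row

protein_length = expected_protein_length(cds_length)
span = genomic_span(start, end)

print(f"expected protein length: {protein_length} aa")
print(f"genomic span: {span} bp")
```

Under these assumptions the expected protein length agrees with the seven 60-residue lines of the protein sequence, and the genomic span is longer than the CDS, which would be consistent with a multi-exon gene model.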