New model in OGS2.0 | DPOGS201286  |
---|---|
Genomic Position | scaffold1560:- 12566-17455 |
CDS Length | 1659 |
Paired RNAseq reads   | 25 |
Single RNAseq reads   | 82 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA003037 (2e-162) |
Best Drosophila hit   | CG31019 (3e-111) |
Best Human hit | cytosolic carboxypeptidase 6 (2e-96) |
Best NR hit (blastp)   | GI22112 [Drosophila mojavensis] (6e-135) |
Best NR hit (blastx)   | PREDICTED: similar to CG31019-PA [Apis mellifera] (2e-120) |
GeneOntology terms    | GO:0004181 metallocarboxypeptidase activity GO:0006508 proteolysis GO:0008270 zinc ion binding |
InterPro families   | IPR000834 Peptidase M14, carboxypeptidase A |
Orthology group | MCL13411 |
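
The genomic position field can be split into its parts programmatically. The sketch below assumes the format is `scaffold:strand start-end` with 1-based inclusive coordinates; that reading is an assumption and is not stated in the record. The names `GenomicPosition` and `parse_position` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class GenomicPosition(NamedTuple):
    """Hypothetical container for one 'Genomic Position' field."""
    scaffold: str
    strand: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based inclusive coordinates.
        return self.end - self.start + 1


def parse_position(text: str) -> GenomicPosition:
    """Parse a string such as 'scaffold1560:- 12566-17455'."""
    match = re.fullmatch(r"\s*(\S+?):([+-])\s*(\d+)-(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return GenomicPosition(scaffold, strand, int(start), int(end))


pos = parse_position("scaffold1560:- 12566-17455")
print(pos.scaffold, pos.strand, pos.span)  # scaffold1560 - 4890
```

Under that assumption the model spans 4890 bp on the scaffold, compared with a CDS length of 1659, which would be consistent with a multi-exon gene model.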
Nucleotide sequence:
ATGGAAGCCTTGGTGGAAGATCTTGAGCATGCCAGTATATATCGCCGACGTCCATGTGAC
AGCGAAGAAAGTGACGGGGAAGGCGGCTTGGGCAACCTTAACCGACTAATCGTCCGGCCG
CCGGGGCACAGTGGGAAAGCAAAACGCGGTCATCTCTGTTTCGACGCGGCCTTCGAATGC
GGTAACCTCGGCCGAGCTGATCACATTACGGAGCTGGAGTATGACCTGTTCGTCCGACCC
GACACTTGCAGGCCTCGATCGAGGTTCTGGTTCAATTTTACCGTCGAAAACGTTAAACAG
GAACAACGAGTACTTTTCAATATAGTCAATATGGGTAAAGAATATACACTATATAATGAA
GAAATGACACCTATAGTACGATCAACATCTCGTCCGAAATGGCAACGCATACCACGGCGG
CTTCTATTCTATCATAAGTCTGTGATACATCGTGGTCGTAAAATACTAAGCCTAGCATTT
GCTTTTGATAAGGAGGAAGACGTCTATCAATTTGCAGCTGCCGTTCCATATTCTTATTCA
AAATTGCAAAAGTATTTAGCTATATGGGAAAAAAGAGCCCAAGGTTTTGCCACCAGACGA
TCTATTGCGCAGACTACGCAAAAACGGAGGATCGATCATCTCACTATTGGAGATACTGTT
ATTGGAGCAGAGACTAAAGACGATCTGAGTACAAAAGGTAAAGGTCAAGAAATAAAGAAG
AGGGTAGTTTTAATTTTAGCACGAACGCACGGCGGCGAACCACCATCTTCTATTATATGC
CAAGGTTTTCTAGACTACATTATAGGTTCATCGGAAAAAGCATTGTCGTTACGGAATGGC
ATCCGCATTGAGGTGGTTCCAATGCTCAACCCTGATGGTGTATTTTTGGGGAACCAGAGG
TCGGATCTGTTAGGTTCAGATCTTAATCGGTGTTGGAACCGAGCTACAACTTTCGCACAT
CCAGCTCTTGTTGCTGTAAATGATCTCATTCAAAAGATAAATGCTGAAAAGACCCTCCAA
TTAGACTTCATAATCGATATTCATGCTGATCTTAGTCATGAGGGAGTATTTGTCCGTGGC
AATTCGTATGATGATGTTTACAGATTTGAACGTCACGCAGTCCTACCTAAGTTCTTGGCT
ATGCGAGTCGAAGCCTGGAAGCCGGAGTCATGTCTTTACAACACTGACTCTATCAGCGCG
GGCAGTGCGCGTCGTACTTTACCGACGGGTACTGTGGATGCATACAGCCTCCTAGCATCA
CTTGGTGGAAGACGATTAACACCCAGGGGTCCATACCTACACTACACTGAAGACGCATAC
TCTAAAATAGGCAGATCCTTAGCGAAGGCTCTGTGTGATTACTACAGGCATATCGGGGTT
ATTCCACCACAGGAAGGAACACCAAATAAAAAGAAAGATCCCAGAAAGCGAAGAAGAAAA
CGAAGAGCAGCATATGAGAGAGAAAGATCTCCCAGTTCATCTCCTGATCGTCGTGTGTTG
GTATCCCGTGTCCTGAGCCCCCCTCTGCCTGTGACATCACCCCCTGCAATTCGAGCACTT
CCCCCAAGAACGGCTCGTAGCGTTCGCCAAAGGACGTGTACCAGAGTACATATTGGAGAT
AATCTGTTGCCCGCTATAACTGGTTTGACGTTCCAATGA
Protein sequence:
MEALVEDLEHASIYRRRPCDSEESDGEGGLGNLNRLIVRPPGHSGKAKRGHLCFDAAFEC
GNLGRADHITELEYDLFVRPDTCRPRSRFWFNFTVENVKQEQRVLFNIVNMGKEYTLYNE
EMTPIVRSTSRPKWQRIPRRLLFYHKSVIHRGRKILSLAFAFDKEEDVYQFAAAVPYSYS
KLQKYLAIWEKRAQGFATRRSIAQTTQKRRIDHLTIGDTVIGAETKDDLSTKGKGQEIKK
RVVLILARTHGGEPPSSIICQGFLDYIIGSSEKALSLRNGIRIEVVPMLNPDGVFLGNQR
SDLLGSDLNRCWNRATTFAHPALVAVNDLIQKINAEKTLQLDFIIDIHADLSHEGVFVRG
NSYDDVYRFERHAVLPKFLAMRVEAWKPESCLYNTDSISAGSARRTLPTGTVDAYSLLAS
LGGRRLTPRGPYLHYTEDAYSKIGRSLAKALCDYYRHIGVIPPQEGTPNKKKDPRKRRRK
RRAAYERERSPSSSPDRRVLVSRVLSPPLPVTSPPAIRALPPRTARSVRQRTCTRVHIGD
NLLPAITGLTFQ
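
The nucleotide and protein sequences above can be cross-checked against each other and against the listed CDS length. The following is a minimal sketch in plain Python, assuming the standard genetic code (NCBI translation table 1); `clean` and `translate_cds` are hypothetical helper names, and only the first and last stretches of the CDS are used here for brevity.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(cds: str) -> str:
    """Translate an in-frame CDS, dropping a terminal stop codon if present."""
    cds = clean(cds)
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X = unknown codon
        if residue == "*":
            if pos + 3 != len(cds):
                raise ValueError(f"internal stop codon at nucleotide {pos + 1}")
            break
        protein.append(residue)
    return "".join(protein)


# First 11 codons and the final line of the nucleotide sequence above.
print(translate_cds("ATGGAAGCCTTGGTGGAAGATCTTGAGCATGCC"))        # MEALVEDLEHA
print(translate_cds("AATCTGTTGCCCGCTATAACTGGTTTGACGTTCCAATGA"))  # NLLPAITGLTFQ
```

Applied to the full record, the same check should give a 552-residue protein, since 1659 nucleotides correspond to 553 codons including the terminal TGA stop.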