New model in OGS2.0 | DPOGS208933  |
---|---|
Genomic Position | scaffold31:+ 6234-15247 |
CDS Length | 3141 |
Paired RNAseq reads   | 1596 |
Single RNAseq reads   | 3965 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA002404 (7e-103) |
Best Drosophila hit   | CG32627, isoform A (2e-28) |
Best Human hit | cytosolic carboxypeptidase-like protein 5 isoform 1 (3e-99) |
Best NR hit (blastp)   | PREDICTED: similar to ATP/GTP binding protein-like 5 [Tribolium castaneum] (1e-129) |
Best NR hit (blastx)   | PREDICTED: similar to ATP/GTP binding protein-like 5 [Tribolium castaneum] (3e-125) |
GeneOntology terms    | GO:0006508 proteolysis; GO:0004181 metallocarboxypeptidase activity; GO:0005634 nucleus; GO:0008270 zinc ion binding; GO:0005737 cytoplasm |
InterPro families   | IPR000834 Peptidase M14, carboxypeptidase A |
Orthology group | MCL16556 |
Nucleotide sequence:
ATGGATCGCAATAATATAATAGAGTGCGGCGGATTTTACTTTATACACAATTTTGACTCA
GGGAATTTAGGGCATGTAGAACGAGTGCCCACAGAATTTATTGCTCCAACGTTAAATCCG
AAAACAAATGTTTCGGAGACTCCCGATTATGAGTTTAATTTATGGACGCGACCTGATTGC
GCTGGCACAGAATTCGAGAATGGCAATCGAACTTGGTTTTATTTTGGCATACAAGCCAGT
GAGCCTAATGTACAGGTGCGACTTAACTTGATCAACCTTAACAAACAAGGCAAGATGTAT
AACCAGGGTATGGCTCCAGTGACACGGACCCTTCCAGGGAAGCCACAGTGGGAAAGGATA
AGGGATCGTCCAGTGCATTCAACAGATGACAACACATTTACACTGTCTTTCCGATATAGA
ACATCAGATAATCCGAAAGCTACAACCTTCTTTGCATTCACATACCCATTCTCATTTGCC
GAGCTACAAATAGCTCTGAACTCTATTGATCTTAAAATGTTGCCAGTCCCGCCACCTCAA
TCACCTGATGATATATATTATTGCAGAGAATGTTTAATATATTCATTAGAAGGAAGGCGT
GTAGACTTATTGACAATTTCATCCCACCATGGTATAACAATGGAGCGAGAGGACAGATTA
AAGAATTTGTTCCCAGAAAATCAGGAGAGGCCTTTCAAATTCCAAAATAAGAAGGTCATA
TTTATATCTGCTCGGGTGCATCCAGGAGAAACTCCATCGAGCTTTGTGTTCAACGGATTC
CTAAACTTACTACTGACAAGAAACGATCCAATTGCAATCCAATTGAGGAAACTCTATGTG
TTCAAAATGATTCCGTTTTTAAACCCAGATGGTGTTGCCAGGGGTCATTACAGAACCGAT
ACTAGGGGGGTTAATTTAAACAGGGTCTATTTGAATCCATCACTTCTCTATCATCCCACT
GTGTATGCATCAAGGTCTCTTATAAGATATCACCATTTTGGATTTGAGAAAGATGAAGAT
AATTGTGAGGATATTAAGAGCTTTGCATCCCGCAGCATACAGAACATTAGTGAAAGTGTT
GAGCTGGTTGAAACGAAGAAAAAGAAATCCCCCGGTTCCCCAAACATCAAGGGTGACTTC
AAGCGTGACAAGGCAAAGACACAACCAGCAAAGCCGTCAGCTCTGTACTCAGAAGACCAC
AGCAGGGACGAGTTCAGTGGAGACGCCGCCAACTTAGCCGATCAGGTCCTCGACATGAAG
CTCCAAGAGATGCCATCACAAACAGACAATATATCCTCGAATCCGTCTAACGCTAACCTG
GAGGAGAGCTCGTGTCTCCTCAACGATAACGTGCTGCGGTCGTGTCTGGGGTCTAACGTT
CACCTCAGCACGAGCGAGGAACTCACTATAAACGGCCTCAACCCCCTGAAGCCGCTGAGG
GATACTTTGAAGAACAGCATCAGTCTACTCATGGAGTCCAGCTCGTCCGTCGCCGGTGAG
AGTATTTCGCAGGAACTTCCTGTAGTGAAGATGGGCTATTGTAAAGTGTGCCGAAGGGAC
AGGGAGTCGATGCTGTCCGACCTGCCATCGTATAGGAATATCGAGGAGTACGGAGAACAT
CAGAGACAAAAAATAGACACTAAGGAGCAGAAGGTCGACGACCTTGAAGTTATCGAGGGT
TCGGTGAACGTGTTCTTCTGTACGAACTGCTTCAAGCGATACATTGTGACGGAGGGCAAC
GAGGAAATTGCGACCGCTACGTCTTCAGGTGATTGTGTGGAGGGCCCCCCTCTATCTCCA
AGGCCTCCGCCTCCGGAACGCCCTCAGACGCGTTCCCCCGCCGGCAGCACTGGGGACTCG
TTGCCGCCAGCGACGGTCCGAAAGGTCGACAAACCGAAGTCAGCTCCTAAGTCGTCTAAG
AAGAGGTCCCCGGCCGTGACCGCAACCACGGCCCCCTCCCCCGCCGCAGCACCCACCGTC
TTGAGACCGCACAAGGACGTCGAGTCCGGCCTGTACCTTTATATAGATCTACACGGACAC
GCCTCTAAGAAAGGCATCTTCATGTACGGCAACCACTTTGAGGACCTGGAGAGTTCGGTG
GAGTGCATGTTGCTGCCTCGCATCATGTCGCTCAACAACCTGCACTTCCACTTCTCGTCC
TGCAACTTCACCGAGAGGAACATGTATCTGAAGGACCGTCGCGACGGCATGTCCCGCGAG
GGCTCGGGTCGCGTGGCCGTGCTGAAGGCCACCGGTCTGGTCCGCTCCTACACCCTGGAG
TGCAACTACAACACGGGCCGCCTGGTGAACGTGCTGCCGCCGCCCTGCCGCGAGCCCGCC
GCCACCGCCCAGCCCGCGCCCCCGCCACCCAAGTACACGCCGCACATCTTTGAGGAGGTC
GGGCGATCTCTCGGAGCGTCCATACTTGATCTGACGGGGCAGCATCCTAACTCGCGAATC
CCGTGCTCCGAACATCGTAATCTGGCTGCCGTGCGCGACTGGCTCAGGACGCACTCGAGG
ACCGCGCGCCCTCAGTTGACTATGTCGAGACTGCGGCCGAAGACTTCCTCCCCGACGAGG
ATGCCGTTGTTCGCGCGCTCCAAGGCCAAGGTGACGGACGAGAGGAAAGAGAACGCGTAC
ATAGCGGCAAAGAGCGACACGGAAAGGCGCCGCAGCCCGCCCATACTGGCACCGCGCTCA
GGGCTCGACCTCACAAACCTCAACACCAAGTTCGGCAAGAAAAACGAACCAGCAAAATCG
TCATCACGAACACGCTACCTGGCAGACAGCGAGCCGAAACCTAAGACGCTATCCACCAAG
AGGCGCAACGTCCTCGCTATCAGGAAACCAAATACAAGCAAGACGCAGATGAGCGGCATT
GTGAAGGCGAAGGCGAACCGAAGAGCCGCGGACGATTCAGACGACCGAGCGACATCCGCC
AAGCTCGGCAAGCGAGGGTATGTCCGTCCAGGGAGAGCGAGGCGCCAACCGACATCCACT
TCTTCATCAGAGGCCGCCGGAGGCTCCAGCTCTTGGGAGGCGGGCGGTTCCCACGAGACA
GCCTTGGCCGCTAAGAGGCGGCAGTTCCCGAACCCCGCGCCCTCACACCTCAAGAAGATA
CGCCTCAAGAACGGCTTGTAG
Protein sequence:
MDRNNIIECGGFYFIHNFDSGNLGHVERVPTEFIAPTLNPKTNVSETPDYEFNLWTRPDC
AGTEFENGNRTWFYFGIQASEPNVQVRLNLINLNKQGKMYNQGMAPVTRTLPGKPQWERI
RDRPVHSTDDNTFTLSFRYRTSDNPKATTFFAFTYPFSFAELQIALNSIDLKMLPVPPPQ
SPDDIYYCRECLIYSLEGRRVDLLTISSHHGITMEREDRLKNLFPENQERPFKFQNKKVI
FISARVHPGETPSSFVFNGFLNLLLTRNDPIAIQLRKLYVFKMIPFLNPDGVARGHYRTD
TRGVNLNRVYLNPSLLYHPTVYASRSLIRYHHFGFEKDEDNCEDIKSFASRSIQNISESV
ELVETKKKKSPGSPNIKGDFKRDKAKTQPAKPSALYSEDHSRDEFSGDAANLADQVLDMK
LQEMPSQTDNISSNPSNANLEESSCLLNDNVLRSCLGSNVHLSTSEELTINGLNPLKPLR
DTLKNSISLLMESSSSVAGESISQELPVVKMGYCKVCRRDRESMLSDLPSYRNIEEYGEH
QRQKIDTKEQKVDDLEVIEGSVNVFFCTNCFKRYIVTEGNEEIATATSSGDCVEGPPLSP
RPPPPERPQTRSPAGSTGDSLPPATVRKVDKPKSAPKSSKKRSPAVTATTAPSPAAAPTV
LRPHKDVESGLYLYIDLHGHASKKGIFMYGNHFEDLESSVECMLLPRIMSLNNLHFHFSS
CNFTERNMYLKDRRDGMSREGSGRVAVLKATGLVRSYTLECNYNTGRLVNVLPPPCREPA
ATAQPAPPPPKYTPHIFEEVGRSLGASILDLTGQHPNSRIPCSEHRNLAAVRDWLRTHSR
TARPQLTMSRLRPKTSSPTRMPLFARSKAKVTDERKENAYIAAKSDTERRRSPPILAPRS
GLDLTNLNTKFGKKNEPAKSSSRTRYLADSEPKPKTLSTKRRNVLAIRKPNTSKTQMSGI
VKAKANRRAADDSDDRATSAKLGKRGYVRPGRARRQPTSTSSSEAAGGSSSWEAGGSHET
ALAAKRRQFPNPAPSHLKKIRLKNGL
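
The reported CDS length and the two sequences above can be cross-checked against each other: a complete CDS of 3141 nt corresponds to 1047 codons, i.e. 1046 residues plus one stop codon. The following is a minimal sketch of such a length-only check. The function name `check_cds` and the returned keys are hypothetical and not part of the database. It does not translate the sequence, and it assumes the standard stop codons TAA, TAG and TGA.

```python
# Hypothetical helper: length-only consistency check between a CDS and
# its reported protein. No translation is performed.
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumption: standard genetic code


def check_cds(cds: str, protein: str) -> dict:
    """Compare a CDS with its reported translation by length only."""
    # Drop line breaks and spaces so multi-line sequence dumps can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()

    has_stop = cds[-3:] in STOP_CODONS
    codons = len(cds) // 3
    # A terminal stop codon does not contribute a residue.
    expected_residues = codons - 1 if has_stop else codons

    return {
        "cds_length": len(cds),
        "in_frame": len(cds) % 3 == 0,
        "starts_with_atg": cds.startswith("ATG"),
        "ends_with_stop": has_stop,
        "length_consistent": expected_residues == len(protein),
    }


if __name__ == "__main__":
    # Toy input: the first three codons of this record plus a stop codon.
    print(check_cds("ATGGATCGCTAG", "MDR"))
```

Pasting the full nucleotide and protein blocks of this record into the two arguments should yield a `cds_length` equal to the CDS Length field if the sequences were copied intact.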