New model in OGS2.0 | DPOGS210662  |
---|---|
Genomic Position | scaffold635:+ 71528-73054 |
See gene structure | |
CDS Length | 1527 |
Paired RNAseq reads   | 317 |
Single RNAseq reads   | 799 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA001803 (0.0) |
Best Drosophila hit   | dipeptidase B, isoform B (3e-99) |
Best Human hit | probable aminopeptidase NPEPL1 (5e-21) |
Best NR hit (blastp)   | PREDICTED: similar to Sb:cb283 protein [Tribolium castaneum] (4e-113) |
Best NR hit (blastx)   | PREDICTED: similar to Sb:cb283 protein [Tribolium castaneum] (5e-109) |
GeneOntology terms    | GO:0008240 tripeptidyl-peptidase activity GO:0006508 proteolysis GO:0008239 dipeptidyl-peptidase activity GO:0004177 aminopeptidase activity GO:0005622 intracellular |
InterPro families    | IPR000819 Peptidase M17, leucyl aminopeptidase, C-terminal IPR011356 Peptidase M17 |
Orthology group | MCL16297 |
Nucleotide sequence:
ATGTCGCCGTTTAAATTGTACGAGAATATTTTTATTGAGACGAATCTTTTATCCTCGGAC
TATGACGGCGTTATCCTCATACTGTATCCCAGGGACATGAATGTGGCGTTGCCCAGGCAT
GTGTCGAGCTTCATAGACAAAATCTTTATCCTGGATAAGAGTATTTACAAGACGCCCAGC
GTTTGGAACTGTGATTACGTTTCTGGAGGGCGGTTGGTGCTGTCGCCGGTAGGTAATGTA
ACTCCATACCATGACGTTACCGTGGTGAGAGAAGCCGCGAAGAGGGGAATGCTGCGAGCA
ATGGAAGCCGGTATGACCAAACCGTTGCTGATCGTTGAAAACGTAGTCCATTACCCCGAC
GGGCAATTAGTCTGCATTCTGGGGGCTCTGGAATCCTTATATGTTCCGATACAGATAAGG
GAGATGAAACCCCAGAAACAGGTATACAGAATCGGTCTGCATGCTGAGGAAAAAGCAACT
GAGTCATTTGAAAAGATAGTTAGAAACGCTATCGCCTTGGAGCGAGCTAGGATCGTAGCT
AGAGACATCGCTGGCGGGGATCCCGAGAGAATGGCTCCCGGGAGGATAGCTGATTATGTA
GTCAAAGTGTTCGCCGAAGATCCTTGTGTATCCATCAAAATTATTGACAACGATGATATT
ATAGCGCAGAAATATCCACTGCTGGCAGCTGTATCGCGGGCAGCGAATAACGTGGAGAGA
CACAAGGCTAGAGTTGTTTTACTGGAGTACAATTCATCTAACCCGGTCAGGGTGACAGAA
ACCATAATGTTGGTTGGCAAAGGGGTGACGTACGACACTGGCGGCGCTGATATAAAGATA
TCTGGCAAGATGGCCGGCATGTCCAGGGATAAATGCGGGGCAGCGGCTGTAGCTGGGTTT
TTGAAGGCCTGCTCCATACTGAAACCTCCACATCTGAAGGTCATTGGGGTTATGTGTTTG
TGTCGCAATTCTATCGGCTCAGATTCCTATGTGTCTGATGAATTGCTAACATCCAGTAGC
GGAAAACTGGTCAGGGTTACCAACACGGATGCGGAGGGTAGGCTAGCTATGGCAGATTCT
CTTTACATGCTGGCCAATATGGCGGAAAAAGAGCTCAACCCACATCTCTACACCATAGCG
ACTTTGACCGGACACGCCAGAGCCTGCTACGGTAATTATACAGCAGCTATGGACAATCAC
AGCGCCAAGGGCACCAACCACTCGAGCAAATTGCAGTTCAGCGGGTCAAGACTCGGAGAA
GGATTCGAGATATCTACCGTGAGGGCCGAGGATTTGGCTGTAAATGATGGGAAATGTAGC
GGAGATGATCTCGTTCAATATGACACTGACGCGAAATGCCGCAACCACCAGCTAGCTGCA
GGGTTTCTGATCAGGGTTGCCGGTTTGGAAGACAAGAATATAAAATACACGCATCTCGAT
ATAGCTGGAGCGGCGGGATGTCCTCCGGAAAAGCCCACAGCGACGCCCGTCTTATCTTTG
TGTCACTTACACAAAGTCTTATTGTAA
Protein sequence:
MSPFKLYENIFIETNLLSSDYDGVILILYPRDMNVALPRHVSSFIDKIFILDKSIYKTPS
VWNCDYVSGGRLVLSPVGNVTPYHDVTVVREAAKRGMLRAMEAGMTKPLLIVENVVHYPD
GQLVCILGALESLYVPIQIREMKPQKQVYRIGLHAEEKATESFEKIVRNAIALERARIVA
RDIAGGDPERMAPGRIADYVVKVFAEDPCVSIKIIDNDDIIAQKYPLLAAVSRAANNVER
HKARVVLLEYNSSNPVRVTETIMLVGKGVTYDTGGADIKISGKMAGMSRDKCGAAAVAGF
LKACSILKPPHLKVIGVMCLCRNSIGSDSYVSDELLTSSSGKLVRVTNTDAEGRLAMADS
LYMLANMAEKELNPHLYTIATLTGHARACYGNYTAAMDNHSAKGTNHSSKLQFSGSRLGE
GFEISTVRAEDLAVNDGKCSGDDLVQYDTDAKCRNHQLAAGFLIRVAGLEDKNIKYTHLD
IAGAAGCPPEKPTATPVLSLCHLHKVLL