New model in OGS2.0 | DPOGS207580  |
---|---|
Genomic Position | scaffold4787:- 618-4901 |
See gene structure | |
CDS Length | 1857 |
Paired RNAseq reads   | 3114 |
Single RNAseq reads   | 7407 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004688 (0.0) |
Best Drosophila hit   | dipeptidyl aminopeptidase III, isoform C (0.0) |
Best Human hit | dipeptidyl peptidase 3 (1e-175) |
Best NR hit (blastp)   | PREDICTED: similar to dipeptidyl peptidase iii [Nasonia vitripennis] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to dipeptidyl peptidase iii [Nasonia vitripennis] (0.0) |
GeneOntology terms    | GO:0008239 dipeptidyl-peptidase activity GO:0006508 proteolysis GO:0016020 membrane GO:0005829 cytosol |
InterPro families    | IPR016526 Peptidase M49, dipeptidyl-peptidase 3, eukaryotic IPR005317 Peptidase M49, dipeptidyl-peptidase III |
Orthology group | MCL12893 |
Nucleotide sequence:
ATGGAGGATAAGTCGATCTTTCTTTTGCCAAACAGTCAGAAATTTGTTGAACTAGATAGT
TCACAGGCATTTACAAAATTAACTAAACAAGAAAAGTTGTATGCTCACTATTTGAGTCAG
GCTGCTTGGAATGGTGGTTTAATTGTTCTTCTACAAACAAGTCCAGAATCACCAAGGATT
TTTTCACTTTTGCACAGAATTTTTAAATCAGAAGGATTAGCTGATTTAAAAAAAGTTTCC
CTTGGAGCTGGTGTATCCGAGGATGATTTTCAGGCCTTCTTAGTTTATGCGGGTGGATTA
TTTGCTAACAGCGGTAATTACAAAGGCTTTGGTGATACAAAATTCATTCCTAACTTGCCA
AAAGAATGTTTTGAAGTTATCGTTAAATCATCAAAAGCATTTAAAAATGATGAAGCACAT
ATAAGTAAACTGTGGGAGAACACTAAAAATGCTATGTACAGTACTGCACCCAGATTAGCC
AGCCTTGGTTTAGCCGATAAGGGTATAACAACATATTTCTCAAGTAACTGTACAGAGGCG
GACTCGACTCTTGTAAATGACTGGATGAAAACAAAACGCATTGAAGCATACATTTGTAGA
ACTTTCAAGACAACCGCTGACGATGGATTACCTTTGTATACGATACACCTGGCCAGTGTC
GAGAAAAGCTCAAAGCCGCCCCTTACTATGGATAAAGAAAAATACAAAAATGCGTACTTC
CAAGTGACTCGGGGAGATTATTCGCCATTATTGAGTTTGGTCAACGAAAATCTTGCAAAA
GCTATGGAGTATGCGGCAAATGAGAATGAAAAGAATATGATTAAACATTACATTAACAGT
TTTAAAGAGGGAGATTTAAGTGAACATAAAGAAGGCAGCAGGTTCTGGGTGAAGGACAAA
GGACCGATTATAGAGACATATCAAGGCTTCATAGAGACATACCGCGATCCCAGCGGACAA
AGAGGTGAATTCGAGGGTTTTGTGGCTATGGTCAATAAAGATATGTCAAAAAAGTTTGGG
GAACTCGTCCATGGTGCTGAAAACTTCATAAAGCTGTTACCGTGGGGGGAGGGGCTTGAG
AAGGATTCCTTCCTCCGACCGGACTTCACTAGTCTAGACGTACTGACGTTCTCAGGGAGC
GGTATACCAGCCGGAATTAACATACCTAACTATGATGAGATCCGACAAAATGAAGGCTTT
AAGAACGTGTCCCTGGGTAACGTGTTCCCCGCCGCTTATAAGGAGTCCGTTATACCATTC
CTCTCTGATAGTGATAAAGTTCTTTTAGAAAAATACAGGGTTGCTGCATTTGAGGTTCAA
GTAGGACTTCATGAACTGCTGGGTCATGGCAGCGGGAAGCTTCTCAGACAAAACGCAGAC
GGGACATTCAACTTCGACAAGGAGAAAGTTAAAAATCCTCTAACTGGCAAGGAGATCGAG
TCGTGGTATTCAGAAGGCGAGAATTACGACAGCAAGTTCACCACTTTGGGATCCGCCTTC
GAGGAATGCCGGGCGGAGGCTGTTGGATTGTATCTGTCGTTACGACCTGAGATACTCAAA
ATCTTCGGTTACGAGGGTCAGGAAGCAGAGGACGTGATGTACGTCAACTGGCTCAGTCTA
CTGTGGAACGGAGCCGCCAAGGCCACGGAAATGTACCAGCCGGCTACGAAAACGTGGCTA
CAGGCCCACGCGAGAGCTCGTTTTGTTTTAATGAGACTGTTGGAATTGGAAGGTAACGGA
ATACTAACAGTCACCGAGGTTGATCCCGGCAAGAACCTGTTGCTTACTTTAGACAGGAAA
CGTTTGGCTACTGACGGAAAACGAATTGTCGGTAGGAATGACTTGTTATTAATTTAA
Protein sequence:
MEDKSIFLLPNSQKFVELDSSQAFTKLTKQEKLYAHYLSQAAWNGGLIVLLQTSPESPRI
FSLLHRIFKSEGLADLKKVSLGAGVSEDDFQAFLVYAGGLFANSGNYKGFGDTKFIPNLP
KECFEVIVKSSKAFKNDEAHISKLWENTKNAMYSTAPRLASLGLADKGITTYFSSNCTEA
DSTLVNDWMKTKRIEAYICRTFKTTADDGLPLYTIHLASVEKSSKPPLTMDKEKYKNAYF
QVTRGDYSPLLSLVNENLAKAMEYAANENEKNMIKHYINSFKEGDLSEHKEGSRFWVKDK
GPIIETYQGFIETYRDPSGQRGEFEGFVAMVNKDMSKKFGELVHGAENFIKLLPWGEGLE
KDSFLRPDFTSLDVLTFSGSGIPAGINIPNYDEIRQNEGFKNVSLGNVFPAAYKESVIPF
LSDSDKVLLEKYRVAAFEVQVGLHELLGHGSGKLLRQNADGTFNFDKEKVKNPLTGKEIE
SWYSEGENYDSKFTTLGSAFEECRAEAVGLYLSLRPEILKIFGYEGQEAEDVMYVNWLSL
LWNGAAKATEMYQPATKTWLQAHARARFVLMRLLELEGNGILTVTEVDPGKNLLLTLDRK
RLATDGKRIVGRNDLLLI