New model in OGS2.0 | DPOGS213433  |
---|---|
Genomic Position | scaffold3039:+ 5613-22532 |
See gene structure | |
CDS Length | 2259 |
Paired RNAseq reads   | 107 |
Single RNAseq reads   | 290 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004470 (9e-122) |
Best Drosophila hit   | CG11319 (4e-90) |
Best Human hit | dipeptidyl aminopeptidase-like protein 6 isoform 1 (3e-62) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC010074 [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC010074 [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0008239 dipeptidyl-peptidase activity GO:0008236 serine-type peptidase activity GO:0016020 membrane GO:0006508 proteolysis |
InterPro families    | IPR002469 Peptidase S9B, dipeptidylpeptidase IV N-terminal IPR001375 Peptidase S9, prolyl oligopeptidase, catalytic domain |
Orthology group | MCL11245 |
Nucleotide sequence:
ATGGAATTTCCACAGGGCTCAGGGCTAGGTGACGAGTTCGTGTATAGAGATGCGTGGGGT
GGAATAAGTCTTTTATTTGCTGCCAACCAGACCTCAAAGACTTTAATGCCTAACAGTACA
TTTCGTGTTCTACAACCGGCGTCTTATTCACTGTCAGCAGACCGACGATTCCTGTTGTTG
GCGCACGCACCGCGCAAGTTACATCAGTATTCATATCTAGCGCGATACTCGGTCTATGAT
ATTCTCACAACCGAGTCTTATCCACTGACCCCCCTCCCAGATGACATAGGAGGGGGGGTG
ATCAGCGAAGGACCTCTCCTCTTACTTGCGATGTGGACTCCGAAAGGGCATGGGCTCATC
ACAGTCAAGGATTATGATATTTATTATCGCCCGGCACCACGCTCATCTACCGGGTATAGA
GTAACTGACACCGGTATACCGGGAAGAATAAATAATGGTGTTCCGGATTGGTTGTATGAA
GTTGAAATCCTCAAATCCCGGTCCGCGTTGTGGATGTCAGCGGATGGCCACATGGTTTTA
TATGCCACCTTTAATGATAGCTTGGTTCACGAACAAAAATTTCCGTGGTATGGAGCTGCT
TTGGATACTGATGATCCTGCCAAGACTTATCCGGAAATAAGAAGCGTCAGATACCCCAAG
CCGGGGACTAACAATCCGGTTGTAAAGTTGACGGTGGCCGATATTGCTGACCCTAAACAC
ATACGTTCCCACCATCTTACACCCCCTAAGGTTTATAGAATATTCTCAGGCAAAGAAGCG
TGGGTAGAAGCTGCCCCTGCGCCGTTATGGTCAGCGGGCGGCGCCGCGTTAGTGACACTA
GCCCCTGTCCGAGACGGCCCCGCGGGTCTGTTCCGTCACATAATACGAGCTGAACATAAT
TCTCATGGACCCAGAGCATTGCCACTCACACACGGAAGCTTCGATGTAGTTAAATTGCTA
GCATGGGATCACGCAAACCAACACATTTATTATTTGGGTATACCAGAGGGCAAACCAGGA
CAGCAGCACTTATACAGAGTGTCATCTGAAGCGCCACGACCTGGCACCCCCCAGAAGCTC
CCTTATTGCGTCACTTGTAACTCCCAGCCATCACCATCAATAAACCTGGAGTTCTATGGT
AATTTGGCTTCTTCTGGAGACAGCTCCTGGGACAGTGATTGGGAAGAGATCCCTACATCA
CCATCACCTCCTAAGAAGAAAAAAAAGAAGCAACCAGAACAAGTCCAACAAAACCAGCCG
TGCCTTTATCACGATGCTCATTTTAGTCCATCATCTACATATTTTGTACTGGAGTGCCTG
GGACCAGGAGTCCCCACGTCGAGCTTACACAAGATAGCTTTACCTGAGCCAAGACTTTTG
ATGCATCTAGAAAACAATACAGCTGTTAAGGAAAAGTTAGCAGCAATAGCTCTACCCACG
CCGCGGACATTTTCAGTACAACTGTCAAGTGGACACGCGGCAAGAGTGAGACTTTTACTT
CCTCCAGGACTAAGGGAAGATGAAGTTACTAAATACCCTTTGGTCATGAAAGTACACGGG
GCTCCGGGAACACAGCTGGTTACGGAGCAATGGTCGTTAGACTGGGGTTCTCTGGCAGCT
GGGGCCGGAGCTATACTAGCATCCGTGGATGCAAGAGGAGCTGGAGGAAGAGGATTGGCT
GCCCATCACACTCTGCATAAGAGACTGGGGACTGTTGAACTGAAAGATCAACTGGAGGTG
GCTGAATATCTGCGGGACTCACTTCACTTCATAGATGCACGTCGAGTGGCGGTGTGGGGA
CGAGCGCATGGAGGTTTTCTGGCAGCGTTGGCGCTTGCCTCCCCCTTGAGAGTGTTCCAT
TGCGGGATCACAATTACACCTATTGTGCGCTGGAGATATTACGCCTCGGCCTACGCAGAG
CGGTACATGGGTCTGCCAAACGCGACAGGTAATTACCGAGGCTACGCGGATGCTGATGTC
ACGAAGCAAGCTTCAGCATTACAAGATAAGATGATCCTGGTGGTTCACGGAACTGCTGAT
GATGATGTACATATTCAGCAGACCATGTCGCTAGCCAGGGCGTTGTCAGACCACGGGAGT
ACATTTAGGCAACAGATATATCCGGACGAAGGCCATAGTTTGGAGGGTGTAAAACATCAT
TTGTATCGAACGATGTCTTCGTTCCTTGATGATTGTTTTAAAAAGCAAGTACCACCGGAG
ACGAAGGCGGGGCTTCGGAACGGTGGAAACCTTGACTGA
Protein sequence:
MEFPQGSGLGDEFVYRDAWGGISLLFAANQTSKTLMPNSTFRVLQPASYSLSADRRFLLL
AHAPRKLHQYSYLARYSVYDILTTESYPLTPLPDDIGGGVISEGPLLLLAMWTPKGHGLI
TVKDYDIYYRPAPRSSTGYRVTDTGIPGRINNGVPDWLYEVEILKSRSALWMSADGHMVL
YATFNDSLVHEQKFPWYGAALDTDDPAKTYPEIRSVRYPKPGTNNPVVKLTVADIADPKH
IRSHHLTPPKVYRIFSGKEAWVEAAPAPLWSAGGAALVTLAPVRDGPAGLFRHIIRAEHN
SHGPRALPLTHGSFDVVKLLAWDHANQHIYYLGIPEGKPGQQHLYRVSSEAPRPGTPQKL
PYCVTCNSQPSPSINLEFYGNLASSGDSSWDSDWEEIPTSPSPPKKKKKKQPEQVQQNQP
CLYHDAHFSPSSTYFVLECLGPGVPTSSLHKIALPEPRLLMHLENNTAVKEKLAAIALPT
PRTFSVQLSSGHAARVRLLLPPGLREDEVTKYPLVMKVHGAPGTQLVTEQWSLDWGSLAA
GAGAILASVDARGAGGRGLAAHHTLHKRLGTVELKDQLEVAEYLRDSLHFIDARRVAVWG
RAHGGFLAALALASPLRVFHCGITITPIVRWRYYASAYAERYMGLPNATGNYRGYADADV
TKQASALQDKMILVVHGTADDDVHIQQTMSLARALSDHGSTFRQQIYPDEGHSLEGVKHH
LYRTMSSFLDDCFKKQVPPETKAGLRNGGNLD