New model in OGS2.0 | DPOGS206546  |
---|---|
Genomic Position | scaffold74:+ 70099-81695 |
See gene structure | |
CDS Length | 2649 |
Paired RNAseq reads   | 12 |
Single RNAseq reads   | 30 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA014035 (3e-133) |
Best Drosophila hit   | CG42400 (2e-139) |
Best Human hit | dipeptidase 1 precursor (1e-85) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC014432 [Tribolium castaneum] (2e-166) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC014432 [Tribolium castaneum] (9e-153) |
GeneOntology terms    | GO:0016805 dipeptidase activity GO:0006508 proteolysis GO:0008235 metalloexopeptidase activity GO:0008239 dipeptidyl-peptidase activity |
InterPro families    | IPR000180 Peptidase M19, renal dipeptidase, active site IPR008257 Peptidase M19, renal dipeptidase |
Orthology group | MCL16798 |
Nucleotide sequence:
ATGATTCAAGTGGATGGTTTCAAACGCTCGTCTCCGTCACCATCGCGATGCTCAGTTCGG
AGTGACAGTCGGGTCTTCAGACAGCCTCGCGTCATTCAAGTTAACCGAGACGCAGCTACA
ATGATGGACGATGAAGCCAGTGTCAGCGGCATGGGGAATGTCACTGAACCTCCTTGCTCC
TCAGATTTCGAATCTAACGTTAATCAGTCAACACATTTTCGATCACGGACATTTAGTGAC
GGCTTCATTAGAAACACAAGCCTCGAATTGTTACTTGGTGTTGATTGCGAGGCCTGTCTG
GCGCTCGCTGCACAACAACATTGCATCAATGAACCTATAACGGAACAGGATTTTGATTTG
GTATGCAATTGTAATGTTCGCCAACATAATTACTTGAACTATTACCGTGACTCCGTCAAA
CAGAAGGGGCATACTCTGAATATTTATGATATAGAAAGAATTAAGAGAGACACGGGAAGA
AGTAGTCTAGCTGTCGGTTCACATCGTAGTCTTGATAGAAAACATTACAAAAGTAAAACC
ATAGGCGTTTCAAGCTACATATCCAAGCAGTTAATATCATATGACCCGACTAATACACTT
GTACAAAAACTCAAAAGAAAATTGTCTAATCTGAAAAAAGTGAGGGGTGAAAATATTGTG
AAAAAACCACATAATGGCCTTCAACTTGCTGAAATGGCATCCACTTCTACTAGCTCCCTA
AAAAGGCTATCAATGTTATCACTTGTTGATAAATCGCAAAAAAAAATGGACAATCAAAAT
TGCATTTTTTCTTCAAACGGTGTGCAATCGAAAGATGATTTCAAAAACAATGTTACCACA
TATGAAAATTTTTCTTCCAATATTGACCAATGCTATGACCGGCTTCGTACATCCAACATC
GATACATCACAAAAGAACGCCAAAGGAAGAAATTTTGATATCAACGACGAACTAATTCCG
AGGAACCCGCCTTTGGTGAAATATTCAACACTTGATAATCGAAGCAGGTGTCGAAGTATA
AAGAGACAGCTCTCGTGTAACGATTTTCCCAGCGACAGGTGTAGGGAACGTGTGCCCGAG
ACATGGTCGCAAGACGAGAACTCTTTTGGTTTGAGAAATTCGAGGAATGAAGTAGATGAC
GAACAGACATTCATAGAAGCTCACAAAGTATTAAGAGGTGCTCCAAGAAGTGCTCATGCA
CATTGTACCTGTAACTCCATTCAAGACAATAGAATACCGTCTATATTTTATGACACCCAT
GGTAAAAAGCGTTCAGCCCCCGCCCCTGACGTTACGGATCGCTCCCAGGGAAGATCTTGC
AGCGACTGCAAACCTGCACAGTGGTACCCCCCGAGCACATCCGCGACCAGCCACAGTTCT
ACGGTTACTGCTAATTCATCACGTCTGACAGCATGGCACCGTCGATGGTGTTGCGTGGCT
ATACTTGTGTTGGTAGCGGGCACAGCCTGCGTTGCCGGACCGCTGGCTCTTAGGGCTCCA
CCTGGTGCACCCCTACACGAAAGACTCCGCCTCGCAGAAAGACTACTGCACGACACACCA
CTTATTGATGGACATAATGATTTACCTTGGAATATACGAAAATTTTTACACAATAAAATC
AAAGACTTTAGATTTGATGAAGACCTTCGAACTATATCTCCCTGGGCTACGAGCTCGTGG
AGTCATACAGATTTACTTCGTCTTAAGCATGGAAGAGTAGCCGCTCAGTTTTGGGCCGCA
TACGTGCCTTGCGACGCGCAACATCGGGATGCAGTGCAATTGACCTTCGAACAAATAGAC
CTAATCCAGAGACTCACAGACAAGTACCATCCACAACTAACATTCTGTACCTCTGCCGAC
GATATATTATCGGCTCACGTAAACCACCGGCTGTGCTCACTGGTGGGTGTGGAAGGTGGG
CATGCAATTGGAGGTTCCTTAGGTGTACTAAGGACGTTGTATCAAGTTGGAGTTCGGTAT
CTAACTCTAACTTCGACTTGCGATACGCCTTGGGCTGAATGTGCTTCCACCGATCGACCT
GAATCCGCACAAAGGGGAGGATTAACGCCTTTTGGTAAAGTGGTGGTTAAAGAAATGAAT
AGATTGGGCATGCTGGTTGATCTATCACATGTTTCTGAGCGAACCATGCGGGATGCCCTT
TCGGTTTCACGAGCGCCAGTGCTTTTCTCACATTCCTCGGCCCGAGCGCTTTGTAACGTA
ACTCGAAATGTACCAGACAGCGTGCTTCGACTCTTAGCAGCTAATAAAGGACTGATAATG
GTCAACTTCTACACTTCTTTTCTCACTTGTAGAGATACGGCTACCGTTCAGGATGCTATA
GAACACATAAACCATATCCGCGACATCGCTGGTGTCGACAGCGTTGGCTTAGGAGCAGGA
TACGATGGAATAAATTACACACCTCATGGGTTAGAAGATGTCTCGTCATATCCATTATTA
TTTGCTGAACTGATGGAAGACGGATGGAGCATAGAAGATTTGAAAAAATTGGCTGGCCTG
AATTTATTACGTGTAATGAACGCAGCAGAACGTGTATCTAGAGAATTATCATCAGCCCAT
GTCACTCCTTACGAAGAAGTTGGACCCAGAGTGTTAGACTCGCACAATTGTTCCAGTCAG
GACGTTTAA
Protein sequence:
MIQVDGFKRSSPSPSRCSVRSDSRVFRQPRVIQVNRDAATMMDDEASVSGMGNVTEPPCS
SDFESNVNQSTHFRSRTFSDGFIRNTSLELLLGVDCEACLALAAQQHCINEPITEQDFDL
VCNCNVRQHNYLNYYRDSVKQKGHTLNIYDIERIKRDTGRSSLAVGSHRSLDRKHYKSKT
IGVSSYISKQLISYDPTNTLVQKLKRKLSNLKKVRGENIVKKPHNGLQLAEMASTSTSSL
KRLSMLSLVDKSQKKMDNQNCIFSSNGVQSKDDFKNNVTTYENFSSNIDQCYDRLRTSNI
DTSQKNAKGRNFDINDELIPRNPPLVKYSTLDNRSRCRSIKRQLSCNDFPSDRCRERVPE
TWSQDENSFGLRNSRNEVDDEQTFIEAHKVLRGAPRSAHAHCTCNSIQDNRIPSIFYDTH
GKKRSAPAPDVTDRSQGRSCSDCKPAQWYPPSTSATSHSSTVTANSSRLTAWHRRWCCVA
ILVLVAGTACVAGPLALRAPPGAPLHERLRLAERLLHDTPLIDGHNDLPWNIRKFLHNKI
KDFRFDEDLRTISPWATSSWSHTDLLRLKHGRVAAQFWAAYVPCDAQHRDAVQLTFEQID
LIQRLTDKYHPQLTFCTSADDILSAHVNHRLCSLVGVEGGHAIGGSLGVLRTLYQVGVRY
LTLTSTCDTPWAECASTDRPESAQRGGLTPFGKVVVKEMNRLGMLVDLSHVSERTMRDAL
SVSRAPVLFSHSSARALCNVTRNVPDSVLRLLAANKGLIMVNFYTSFLTCRDTATVQDAI
EHINHIRDIAGVDSVGLGAGYDGINYTPHGLEDVSSYPLLFAELMEDGWSIEDLKKLAGL
NLLRVMNAAERVSRELSSAHVTPYEEVGPRVLDSHNCSSQDV