New model in OGS2.0 | DPOGS208930  |
---|---|
Genomic Position | scaffold3263:5206-10339 (- strand) |
CDS Length | 1869 |
Paired RNAseq reads   | 2345 |
Single RNAseq reads   | 5667 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA002507 (0.0) |
Best Drosophila hit   | CG15817, isoform C (1e-35) |
Best Human hit | ubiquitin carboxyl-terminal hydrolase 1 (1e-16) |
Best NR hit (blastp)   | PREDICTED: similar to ubiquitin specific protease [Tribolium castaneum] (6e-105) |
Best NR hit (blastx)   | PREDICTED: similar to ubiquitin specific protease [Tribolium castaneum] (2e-91) |
GeneOntology terms    | GO:0006511 ubiquitin-dependent protein catabolic process; GO:0004221 ubiquitin thiolesterase activity |
InterPro families    | IPR018200 Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2, conserved site; IPR001394 Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2 |
Orthology group | MCL14536 |
Nucleotide sequence:
ATGCCAGTTAACTCTTTATCAGAAACGACAAAAGGTGTTGCGAGAACTAAATTATCCCTC
TCTCTTAGAAGAAATAATGCAACAAATCCGATCAACGCAAATCAGAAACCAGGAGATTTA
AAGGACAGCTCTTACAAGGGATCAAATGACAATAAAGAGAATAAACCAGTGAAGAGACCT
ATAACAGGCACATATCTTAATACATTGAATGCTGCTAAAAAGCTTAAGCCTTTGACTGAA
ACGTCGAAACCGAAAGAAACAGTTAGTGAGCCACAGCTAGTTGTATTTGAACCATCAATA
AGTAACATTGAAATGTTGAATGGCCACCATCCTTCGGACAACCAGTATGGAGGGAGTACA
CAGTGGAAAGCCCCCATTGCCACTCTGTCTAATCTTGGAAATACTTGTTTCCTCAATAGT
GTACTATATACCCTACGATATGCTCCACAATTTGTGCACAACCTCCATCATCTAGTGTCC
GATCTCACTAGAGTGGAACAGAAATTGGGCAGCATCAGGTTAAAAAGCTCATCTCTGGGA
CGAAGTGCAGCTGGGCTTGCATCTTCTGGTACTAGATCATGGAGCAGTAAAGATTTGTTA
TCTCTGGGACAATCGGATAACACCACAGGAAAAAGTAAAATACAGATAGCCACAGAGAAG
CTTCATGAAACATATCTTAGCCTACGGGCCGCGGAAAGCAAGTGTATAAACAGTGGTGCT
GCTGATGCCAGCCCGGAACCATATGCTGCTGATGCATTTCTAGCGGCATTGAGAGAAGTC
AACTCTACATTTGAAGGTAATCGGCAACAAGATGCACATGAGCTTCTTGTTTGTATCTTG
GACAGCATTAGAGAAACATGCAGAGCTCTAAGTGCAAGAGCATCCCGTCTTCAATTACAT
GAAAATGGCGACAGCAATGGCCTCGGTCGCCAACCCAGCCTTGACGGTGATGGCAGTAAG
ACGTTAGGACATCTCCGTAAGTCGTGGAAAAAACGCAAGGAAACAAAAACCACTGACAAA
AGAAGCTCACCCTCAGAAGAACGTCCGCCTTCACCGGATCCTGAGAAGGATGAACGCTCG
AGGCCTGGCTGGGACTTTGTTGCTGATGATTTTGAAGGTACCATGGTTGTTCGAACTATG
TGCCTGGAGTGTGAGGCGGTGACGGAGAAAGCTCAAGCTGTGTGTGAGCTCTGTGTGCCA
GTAGGTGATGATGATACAAATGAAGAACCATTTAGAGCTGCATGTCTCTCTAGTGAATAT
TTGAGGGATCAGAATAAGTATTGGTGCGAGCGCTGCCTGCGCTACAACGAGGCTAGACGC
AGTGTCGCGTATTCGCGACTGCCACGGCTGTTAGTGTTGCAGCTCAAGCGCTTCAGCGGC
GGCATGGAAAAGATCACAAGACACGCGCCCACGCCACTTCTCATGCCTTGCTTCTGTGAG
CCATGTGCCAAACGGCCACCTGATCATCCACCCACACACAGATACATCCTATGGGCGGTG
ATAATGCACCTTGGTCAGGCGTTGACCGGTGGCCACTATGTAGCGTACGCGAGAGATCGT
TCCAACGGTGACGGCGAGGTGGCCAGCAAATGTGAGAGAACTGGCGGTGGTGACGCAGCA
TCTAACAACAGCGGCTCAAGCTTCATGCGAACTCTATTTAATCGCCCGAGAGCACAACCA
TCTGGCTGCGCTGCCAATGATTGCTGTGTACCGCGCCCTCGGCTAGACACCTGCTGGCTG
GCCTGCGACGACGACCTCGTCAAACCCATATCAAATGAAGAGTTCGAGGATCTATTATCC
GCCGAGCCGAAAATGCGCTCCGCAGCAACACCATACTTACTGTTCTATGTGAAGAGCGAA
GTCGGTTAA
Protein sequence:
MPVNSLSETTKGVARTKLSLSLRRNNATNPINANQKPGDLKDSSYKGSNDNKENKPVKRP
ITGTYLNTLNAAKKLKPLTETSKPKETVSEPQLVVFEPSISNIEMLNGHHPSDNQYGGST
QWKAPIATLSNLGNTCFLNSVLYTLRYAPQFVHNLHHLVSDLTRVEQKLGSIRLKSSSLG
RSAAGLASSGTRSWSSKDLLSLGQSDNTTGKSKIQIATEKLHETYLSLRAAESKCINSGA
ADASPEPYAADAFLAALREVNSTFEGNRQQDAHELLVCILDSIRETCRALSARASRLQLH
ENGDSNGLGRQPSLDGDGSKTLGHLRKSWKKRKETKTTDKRSSPSEERPPSPDPEKDERS
RPGWDFVADDFEGTMVVRTMCLECEAVTEKAQAVCELCVPVGDDDTNEEPFRAACLSSEY
LRDQNKYWCERCLRYNEARRSVAYSRLPRLLVLQLKRFSGGMEKITRHAPTPLLMPCFCE
PCAKRPPDHPPTHRYILWAVIMHLGQALTGGHYVAYARDRSNGDGEVASKCERTGGGDAA
SNNSGSSFMRTLFNRPRAQPSGCAANDCCVPRPRLDTCWLACDDDLVKPISNEEFEDLLS
AEPKMRSAATPYLLFYVKSEVG
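
The record lists a CDS of 1869 nt alongside a 622-residue protein, which is what one would expect for 623 codons with the last one a stop. A minimal sketch of how that consistency could be checked is given below. It assumes the standard genetic code applies, and the names `translate` and `matches_protein` are hypothetical helpers introduced here for illustration; they are not part of the database.

```python
# Standard genetic code, with codons enumerated using the base order T, C, A, G.
# Assumption: the standard (nuclear) code is the right one for this gene model.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; stop codons are rendered as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def matches_protein(cds: str, protein: str) -> bool:
    """Return True if the CDS translates to the protein, ignoring one final stop."""
    translated = translate(cds)
    if translated.endswith("*"):
        translated = translated[:-1]
    return translated == "".join(protein.split()).upper()


# First 60 nt of the nucleotide sequence above and the last three codons.
print(translate("ATGCCAGTTAACTCTTTATCAGAAACGACAAAAGGTGTTGCGAGAACTAAATTATCCCTC"))
# prints "MPVNSLSETTKGVARTKLSL"
print(translate("GTCGGTTAA"))
# prints "VG*"
```

Passing the full nucleotide and protein sequences (with their line breaks) to `matches_protein` would perform the same check on the whole record.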