New model in OGS2.0 | DPOGS209212  |
---|---|
Genomic Position | scaffold331:+ 21670-25632 |
See gene structure | |
CDS Length | 1833 |
Paired RNAseq reads   | 1281 |
Single RNAseq reads   | 3231 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA001982 (2e-131) |
Best Drosophila hit   | PNGase, isoform A (9e-86) |
Best Human hit | peptide-N(4)-(N-acetyl-beta-glucosaminyl)asparagine amidase isoform 1 (7e-84) |
Best NR hit (blastp)   | PREDICTED: similar to peptide-N(4)-(N-acetyl-beta-glucosaminyl)asparagine amidase [Tribolium castaneum] (3e-111) |
Best NR hit (blastx)   | PREDICTED: similar to peptide-N(4)-(N-acetyl-beta-glucosaminyl)asparagine amidase [Tribolium castaneum] (2e-107) |
GeneOntology terms    | GO:0000224 peptide-N4-(N-acetyl-beta-glucosaminyl)asparagine amidase activity GO:0005737 cytoplasm GO:0006516 glycoprotein catabolic process GO:0005515 protein binding GO:0016787 hydrolase activity GO:0046872 metal ion binding |
InterPro families    | IPR002931 Transglutaminase-like IPR018997 PUB domain IPR008979 Galactose-binding domain-like IPR006588 Peptide N glycanase, PAW domain |
Orthology group | MCL15579 |
Nucleotide sequence:
ATGGAGGATATGGCACGTTTAGCGCTGGTTGAGCAGAGCGTCAAAGATGAAAATCAATAT
AGAAAAATACTCTTCGATCTTTTGGACCACATAAGCAATATTTTAGATAATCCCCACGAT
TACGATTTGAGGACTATTAAGAGTGATATCCTAGAAAAAGTGTTAGACTGCGAGGCCTTC
GCCGACTACCTGAAATACATTGGATTTCAAATGGGACAAAAAGAAATTATGTTTCCCAGA
GAACAAACTTTAAGTAAATTGAGAATAGCACAAGCTGCTCTGGAAAGAAAAATAGGTTTT
TGTTGTGGCAATCTTAACAAGACTGTTACTGTAACAGATAACAAGAAGACCAGTAAACCA
AAACTTACAGAAGCCAATATATTGGTAACCAATAATTCATTTTTATTAAGAATCCAGTCA
GTTTTCAATGACATGATAAGGTATGAGGATGAAGAGCTCCAACAGAGAGCCAGGGAACAC
ATCCCACTTGTGACGCTGCAGCTCATGGCGCTGGACAGGATGAGGGAGCATCAGAAGAAA
ATTAAAATGGGTGAGATCAAAGATCAAGACATGTCTTTCGACATGGCCTTGTTAATGGAA
CTGATTGTGTGGTTCAAAAACAAGTTCTTCACATGGGTCGATCAACCAGCCTGTGACAAG
TGTGGCGGAGCGACATCATTAGTCAGTGTTAGCTCCGTCAAGACTGACTTAGAAACTTGT
AGAGCTGAGCTATATAAATGCTCGTGCGGCCGCGACGTGCTCTTCCCGAGGTACAACAAT
CCCATCACCTTACTGAGAACGAGACGCGGCCGCTGCGGGGAGTGGGCTAACTGCTTCACG
TTGATGTGTCGTGCGCTCGGATACGACACAAGATACGTGTACGACACGACCGATCACGTG
TGGTGCGAGGTGTTCGACCAGGACTCCCAGCGCTGGTTGCACGTGGACCCCTGCGAGGGG
TGTCTGAACGCTCCCCTGATGTACGAGCACGGTTGGGGGAAGTCCCTCACATACATCATA
GCGGTCTCACGAGATGACCTCCAGGACGTAACGTGGAGATATTCCAGCCACCATAAAGCG
CTGCTGCAGCGTCGCGATGAAGTGTCCGAAGCGGACCTGGTGCTAGCTATCCTGGCTCTC
CGTGATCATCGTCACGACCAGGTGAGTCCGGCCAGAAGGAGGTACCTCGTCATCAGGACA
TTGAAGGAACTTGTGGAATTGATGGTGGAGAGGAAACCCGGCGAAATGGAATCCCACGGC
CGGATATCCGGTTCGAAGGCGTGGCGGATGGAGCGCGGTGAGACGGGAGCGAGGAAACAC
GCGTTCATACTAACAGAACCCGGCGACCATTGTGTACAATACCGTACCAGCTCAGATACA
TATAGAGTACTACTGAATAACGTACAACGAGACGAGATAAAGAGCTGGAGGGGGGGAGTG
TTCCACAGCGAGAACATGTTCAGGAAGTTGGAGACGGACTGGCAGCAGACCTACCTCGCC
AGGGAGGAGGGAGAAAACACCGGCAGTATATCGTGGAAGCTGATGGTGGAAGGAGATTTG
GTGATATCGAGCGTAGCGATGGACGTCACCACCGCTCAGTACGAGGATGGCAGGATAGAG
TGGACCTACGAGGTGGACCACCAGCCCCCGAGGACCTTCAGCTTGAACGCGGGTCGGTGG
CAGGTAGACGGTAGGTGGTCATCGTGTGAGGTGAGAGCTCGTCTGGTCGGGGGGAAGGGG
GTGGTAGCCTGGCAACACGCGCAGATCGCCCGACAGCACACCTCCGACGACAAACCAGCG
CTCAGCCTGCTAGCCACCGTCGTGCCACGGTGA
Protein sequence:
MEDMARLALVEQSVKDENQYRKILFDLLDHISNILDNPHDYDLRTIKSDILEKVLDCEAF
ADYLKYIGFQMGQKEIMFPREQTLSKLRIAQAALERKIGFCCGNLNKTVTVTDNKKTSKP
KLTEANILVTNNSFLLRIQSVFNDMIRYEDEELQQRAREHIPLVTLQLMALDRMREHQKK
IKMGEIKDQDMSFDMALLMELIVWFKNKFFTWVDQPACDKCGGATSLVSVSSVKTDLETC
RAELYKCSCGRDVLFPRYNNPITLLRTRRGRCGEWANCFTLMCRALGYDTRYVYDTTDHV
WCEVFDQDSQRWLHVDPCEGCLNAPLMYEHGWGKSLTYIIAVSRDDLQDVTWRYSSHHKA
LLQRRDEVSEADLVLAILALRDHRHDQVSPARRRYLVIRTLKELVELMVERKPGEMESHG
RISGSKAWRMERGETGARKHAFILTEPGDHCVQYRTSSDTYRVLLNNVQRDEIKSWRGGV
FHSENMFRKLETDWQQTYLAREEGENTGSISWKLMVEGDLVISSVAMDVTTAQYEDGRIE
WTYEVDHQPPRTFSLNAGRWQVDGRWSSCEVRARLVGGKGVVAWQHAQIARQHTSDDKPA
LSLLATVVPR