DPGLEAN01501 in OGS1.0

New model in OGS2.0  DPOGS209212
Genomic Position  scaffold331:+ 21670-25632
CDS Length  1833
Paired RNAseq reads  1281
Single RNAseq reads  3231
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA001982 (2e-131)
Best Drosophila hit  PNGase, isoform A (9e-86)
Best Human hit  peptide-N(4)-(N-acetyl-beta-glucosaminyl)asparagine amidase isoform 1 (7e-84)
Best NR hit (blastp)  PREDICTED: similar to peptide-N(4)-(N-acetyl-beta-glucosaminyl)asparagine amidase [Tribolium castaneum] (3e-111)
Best NR hit (blastx)  PREDICTED: similar to peptide-N(4)-(N-acetyl-beta-glucosaminyl)asparagine amidase [Tribolium castaneum] (2e-107)
Gene Ontology terms
GO:0000224 peptide-N4-(N-acetyl-beta-glucosaminyl)asparagine amidase activity
GO:0005737 cytoplasm
GO:0006516 glycoprotein catabolic process
GO:0005515 protein binding
GO:0016787 hydrolase activity
GO:0046872 metal ion binding
InterPro families
IPR002931 Transglutaminase-like
IPR018997 PUB domain
IPR008979 Galactose-binding domain-like
IPR006588 Peptide N glycanase, PAW domain
Orthology group  MCL15579

Nucleotide sequence:

ATGGAGGATATGGCACGTTTAGCGCTGGTTGAGCAGAGCGTCAAAGATGAAAATCAATAT
AGAAAAATACTCTTCGATCTTTTGGACCACATAAGCAATATTTTAGATAATCCCCACGAT
TACGATTTGAGGACTATTAAGAGTGATATCCTAGAAAAAGTGTTAGACTGCGAGGCCTTC
GCCGACTACCTGAAATACATTGGATTTCAAATGGGACAAAAAGAAATTATGTTTCCCAGA
GAACAAACTTTAAGTAAATTGAGAATAGCACAAGCTGCTCTGGAAAGAAAAATAGGTTTT
TGTTGTGGCAATCTTAACAAGACTGTTACTGTAACAGATAACAAGAAGACCAGTAAACCA
AAACTTACAGAAGCCAATATATTGGTAACCAATAATTCATTTTTATTAAGAATCCAGTCA
GTTTTCAATGACATGATAAGGTATGAGGATGAAGAGCTCCAACAGAGAGCCAGGGAACAC
ATCCCACTTGTGACGCTGCAGCTCATGGCGCTGGACAGGATGAGGGAGCATCAGAAGAAA
ATTAAAATGGGTGAGATCAAAGATCAAGACATGTCTTTCGACATGGCCTTGTTAATGGAA
CTGATTGTGTGGTTCAAAAACAAGTTCTTCACATGGGTCGATCAACCAGCCTGTGACAAG
TGTGGCGGAGCGACATCATTAGTCAGTGTTAGCTCCGTCAAGACTGACTTAGAAACTTGT
AGAGCTGAGCTATATAAATGCTCGTGCGGCCGCGACGTGCTCTTCCCGAGGTACAACAAT
CCCATCACCTTACTGAGAACGAGACGCGGCCGCTGCGGGGAGTGGGCTAACTGCTTCACG
TTGATGTGTCGTGCGCTCGGATACGACACAAGATACGTGTACGACACGACCGATCACGTG
TGGTGCGAGGTGTTCGACCAGGACTCCCAGCGCTGGTTGCACGTGGACCCCTGCGAGGGG
TGTCTGAACGCTCCCCTGATGTACGAGCACGGTTGGGGGAAGTCCCTCACATACATCATA
GCGGTCTCACGAGATGACCTCCAGGACGTAACGTGGAGATATTCCAGCCACCATAAAGCG
CTGCTGCAGCGTCGCGATGAAGTGTCCGAAGCGGACCTGGTGCTAGCTATCCTGGCTCTC
CGTGATCATCGTCACGACCAGGTGAGTCCGGCCAGAAGGAGGTACCTCGTCATCAGGACA
TTGAAGGAACTTGTGGAATTGATGGTGGAGAGGAAACCCGGCGAAATGGAATCCCACGGC
CGGATATCCGGTTCGAAGGCGTGGCGGATGGAGCGCGGTGAGACGGGAGCGAGGAAACAC
GCGTTCATACTAACAGAACCCGGCGACCATTGTGTACAATACCGTACCAGCTCAGATACA
TATAGAGTACTACTGAATAACGTACAACGAGACGAGATAAAGAGCTGGAGGGGGGGAGTG
TTCCACAGCGAGAACATGTTCAGGAAGTTGGAGACGGACTGGCAGCAGACCTACCTCGCC
AGGGAGGAGGGAGAAAACACCGGCAGTATATCGTGGAAGCTGATGGTGGAAGGAGATTTG
GTGATATCGAGCGTAGCGATGGACGTCACCACCGCTCAGTACGAGGATGGCAGGATAGAG
TGGACCTACGAGGTGGACCACCAGCCCCCGAGGACCTTCAGCTTGAACGCGGGTCGGTGG
CAGGTAGACGGTAGGTGGTCATCGTGTGAGGTGAGAGCTCGTCTGGTCGGGGGGAAGGGG
GTGGTAGCCTGGCAACACGCGCAGATCGCCCGACAGCACACCTCCGACGACAAACCAGCG
CTCAGCCTGCTAGCCACCGTCGTGCCACGGTGA

Protein sequence:

MEDMARLALVEQSVKDENQYRKILFDLLDHISNILDNPHDYDLRTIKSDILEKVLDCEAF
ADYLKYIGFQMGQKEIMFPREQTLSKLRIAQAALERKIGFCCGNLNKTVTVTDNKKTSKP
KLTEANILVTNNSFLLRIQSVFNDMIRYEDEELQQRAREHIPLVTLQLMALDRMREHQKK
IKMGEIKDQDMSFDMALLMELIVWFKNKFFTWVDQPACDKCGGATSLVSVSSVKTDLETC
RAELYKCSCGRDVLFPRYNNPITLLRTRRGRCGEWANCFTLMCRALGYDTRYVYDTTDHV
WCEVFDQDSQRWLHVDPCEGCLNAPLMYEHGWGKSLTYIIAVSRDDLQDVTWRYSSHHKA
LLQRRDEVSEADLVLAILALRDHRHDQVSPARRRYLVIRTLKELVELMVERKPGEMESHG
RISGSKAWRMERGETGARKHAFILTEPGDHCVQYRTSSDTYRVLLNNVQRDEIKSWRGGV
FHSENMFRKLETDWQQTYLAREEGENTGSISWKLMVEGDLVISSVAMDVTTAQYEDGRIE
WTYEVDHQPPRTFSLNAGRWQVDGRWSSCEVRARLVGGKGVVAWQHAQIARQHTSDDKPA
LSLLATVVPR
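
The record can be checked for internal consistency: a CDS of 1833 nt corresponds to 611 codons, that is, 610 residues plus a stop codon, which agrees with the protein listing above (ten full lines of 60 residues and a final line of 10). The sketch below is a minimal illustration, assuming the standard genetic code (NCBI translation table 1). The function name `translate` and the variable names are illustrative only and are not part of the database. It translates just the first and last lines of the nucleotide sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: this gene is nuclear and uses the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGAGGATATGGCACGTTTAGCGCTGGTTGAGCAGAGCGTCAAAGATGAAAATCAATAT"
last_line = "CTCAGCCTGCTAGCCACCGTCGTGCCACGGTGA"

cds_length = 1833                          # "CDS Length" field of the record
expected_residues = cds_length // 3 - 1    # subtract the stop codon

print(translate(first_line))
print(translate(last_line))
print(expected_residues)
```

The two translated fragments should match the first 20 and last 10 residues of the protein sequence, with the trailing `*` corresponding to the terminal TGA stop codon.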