DPGLEAN07253 in OGS1.0

New model in OGS2.0DPOGS215494 
Genomic Positionscaffold841:+ 96799-102771
See gene structure
CDS Length2037
Paired RNAseq reads  5754
Single RNAseq reads  13998
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA007451 (0.0)
Best Drosophila hit  CG33138 (0.0)
Best Human hit1,4-alpha-glucan-branching enzyme (0.0)
Best NR hit (blastp)  1,4-alpha-glucan branching enzyme, putative [Pediculus humanus corporis] (0.0)
Best NR hit (blastx)  GF11944 [Drosophila ananassae] (0.0)
GeneOntology terms


  
GO:0003844 1,4-alpha-glucan branching enzyme activity
GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
GO:0043169 cation binding
GO:0005975 carbohydrate metabolic process
InterPro families








  
IPR017853 Glycoside hydrolase, superfamily
IPR014756 Immunoglobulin E-set
IPR006589 Glycosyl hydrolase, family 13, subfamily, catalytic domain
IPR013783 Immunoglobulin-like fold
IPR013781 Glycoside hydrolase, subgroup, catalytic core
IPR013780 Glycosyl hydrolase, family 13, all-beta
IPR006048 Alpha-amylase, C-terminal all beta
IPR006047 Glycosyl hydrolase, family 13, catalytic domain
IPR004193 Glycoside hydrolase, family 13, N-terminal
IPR015902 Alpha amylase
Orthology groupMCL12168

Nucleotide sequence:

ATGGACCCGATGGACGTACCAGTCCCCGATTTAAAACTTTTATTCCAAAGAGACGGCTAT
TTAAGACCATATGAACGTGAAATTCGACGACGTTTCGCCTGCTTCCAAGATCTGTGGGAT
AAGATAGAGTCATGGGAGGGTGGCGTGGAAGGTTTCACTACCGGTTACCGTTATTATGGA
CCACAGTTCTGCGTCGACGGATCAGTGGTGTGGAGAGAATGGGCTCCTGGAGCACACTCG
CTTCATCTTCAGGGCGACTTCAATGGTTGGAACCCAAAGAGTCATCCGTTCAGGAAGCTG
GAATATGGAAAGTGGGAGCTTTATATACCTGGAAATGAGGATGAATCGTGTCCTATCAAG
CATTTGAGTCGAGTCCAGCTTATTGTTAACGAACACCTGTACCGAGTGTCTCCCTGGGCG
AGTTACGTTAAGCCATACGAAGGATTCACTTACCAACAATTCATTTACAAGCCGGAGCAG
CCGTACCAGTTCAAGCACAGAAAAGTTAAGAGGCCAGCGTCGTTACGCATCTATGAGTGC
CACGTGGGGATCGCTACCAACGAGGGAAGAGTTGGCACTTACCTGGAGTTCAAGGACAAT
GTGCTACCAAGGATTAAAGATTTAGGCTACAACGCTATACAGTTGATGGCTATAATGGAA
CATGCCTACTACGCCTCTTTCGGTTACCAGGTCACAAGCTTTTTTGCAGCCAGCAGTCGA
TATGGAACCCCCTGTGAGTTAAAGCAGTTGATCGACCGTGCCCATGAGCTCGGTATCTAC
GTGCTATTAGACGTCGTCCACTCCCACGCCTCCAAGAACACGTTGGATGGTCTCAACGAG
TTCGACGGCACCAACTCCTGCTACTTCCACGACGGCGCCAGAGGAACCCACTCGCTCTGG
GACAGCAGATTGTTCAACTATTCCGAGACGGAGGTGCTACGTTTCCTACTTTCTAACCTG
AGATGGTATCAAGAGGAATATCAGTTTGACGGATTCAGGTTTGATGGCGTGACGTCGATG
TTGTACCACAGTCGTGGCATTGGGGAAGGCTTCTCTGGAAATTATGACGAGTACTATGGA
TTGAACGTGGACACGGAGGCGCTCGTCTACCTGATGGTGGCCAACGAGCTCGTGCACTCC
ATAGACAGCCAGGCCATAACTATAGCCGAGGATGTATCAGGAATGCCGGCCTCCGGGCGA
CCTGTTCGTGAAGGCGGTACGGGCTTTGACTACCGCCTGGGTATGGCCATCCCGGACATG
TGGATCAAATTGCTGAAGGAGGAACGCGACGAGGACTGGAAGATGGGGCACATCGTCCAC
ACCCTCACCAACAGACGGTGGATGGAGGGGACTGTCGCTTACGCCGAAAGCCATGACCAG
GCTCTGGTCGGGGATAAGACGATCGCGTTCTGGTTGATGGACGCCGCCATGTACACCCAC
ATGAGTACCCTAAGCGAGCCCAACCCGGTCATCGAGCGAGGACTCGCTCTACACTGCATG
ATACGACTCATCACCAACGCGCTCGGAGGAGAGGCCTACCTCAATTTCATTGGTAACGAA
TTCGGACACCCCGAATGGTTGGATTTCCCTCGAGCTGGCAACAATTCCTCGTACCACTAC
GCCAGGAGACAGTGGCATCTCGTTGACGACCAGCTTCTCAAGTACAAATATCTGAACGAA
TTCGATAAAGACATGCACGCCTTGGAGAACAAATACGGATGGCTCGCATCAAATCCGGCG
TACGTGTCGTGTAAGCATGAAGGCGACAAAGTGATAGCGTTCGAGCGCGCGGGCCTGCTG
TTCGTCTTCAACTTCCATCCCAACCAAAGCTTCACGGACTACCGCGTTGGCGTCGACGTC
GCTGGAAAATATCAGGCTGTATTGTGTTCGGATAGTAAGAAATACGGCGGTTTCGGTCGT
GTGGAGCCGGATGGGGAATACCATCTCACTCAGAACATGCCCTGGGGTGACAGAAAGGAT
TCCGTTCAGCTCTACATCCCGTGCCGTACAGCTCTAGTTTACGCTCGATGTGAATGA

Protein sequence:

MDPMDVPVPDLKLLFQRDGYLRPYEREIRRRFACFQDLWDKIESWEGGVEGFTTGYRYYG
PQFCVDGSVVWREWAPGAHSLHLQGDFNGWNPKSHPFRKLEYGKWELYIPGNEDESCPIK
HLSRVQLIVNEHLYRVSPWASYVKPYEGFTYQQFIYKPEQPYQFKHRKVKRPASLRIYEC
HVGIATNEGRVGTYLEFKDNVLPRIKDLGYNAIQLMAIMEHAYYASFGYQVTSFFAASSR
YGTPCELKQLIDRAHELGIYVLLDVVHSHASKNTLDGLNEFDGTNSCYFHDGARGTHSLW
DSRLFNYSETEVLRFLLSNLRWYQEEYQFDGFRFDGVTSMLYHSRGIGEGFSGNYDEYYG
LNVDTEALVYLMVANELVHSIDSQAITIAEDVSGMPASGRPVREGGTGFDYRLGMAIPDM
WIKLLKEERDEDWKMGHIVHTLTNRRWMEGTVAYAESHDQALVGDKTIAFWLMDAAMYTH
MSTLSEPNPVIERGLALHCMIRLITNALGGEAYLNFIGNEFGHPEWLDFPRAGNNSSYHY
ARRQWHLVDDQLLKYKYLNEFDKDMHALENKYGWLASNPAYVSCKHEGDKVIAFERAGLL
FVFNFHPNQSFTDYRVGVDVAGKYQAVLCSDSKKYGGFGRVEPDGEYHLTQNMPWGDRKD
SVQLYIPCRTALVYARCE