DPGLEAN20294 in OGS1.0

New model in OGS2.0  DPOGS210044
Genomic Position  scaffold2351:- 51195-54641
CDS Length  1737
Paired RNAseq reads  10124
Single RNAseq reads  28295
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA003057 (1e-158)
Best Drosophila hit  CG14935, isoform B (1e-144)
Best Human hit  neutral and basic amino acid transport protein rBAT (3e-85)
Best NR hit (blastp)  alpha amylase [Bombyx mori] (2e-170)
Best NR hit (blastx)  alpha amylase [Bombyx mori] (8e-171)
GeneOntology terms
GO:0004558 alpha-glucosidase activity
GO:0043169 cation binding
GO:0005975 carbohydrate metabolic process
InterPro families
IPR006047 Glycosyl hydrolase, family 13, catalytic domain
IPR013781 Glycoside hydrolase, subgroup, catalytic core
IPR006589 Glycosyl hydrolase, family 13, subfamily, catalytic domain
IPR017853 Glycoside hydrolase, superfamily
IPR015902 Alpha amylase
Orthology group  MCL10053

Nucleotide sequence:

ATGCGTTTCCTGATCCTCTCTCTCGCTGTCTTCACGTCAGCGGTTGCCGCTTCTGATACG
GAGTGGTGGAAGACCGCCTTGATCTACCAAATCTATCCGCGATCCTTCAAGGACAGTAAT
GGCGACGGCATCGGCGATCTTAATGGTATCACGGAGAAGCTGGTTTATCTGAATCAGACG
GGAGTTGACGCGATCTGGCTCTCACCGATCTACCTCTCGCCGATGTATGACTTTGGGTAC
GACATTACGGACTACAGGAAAATAGCCCCCGAATACGGTACTATGGACGATTTCAAGACG
CTCATGACAGAAGCACGGAGACTTGGTATCCGTGTAATAATGGACTTGGTCCCCAACCAC
ACGGGCAATGAGAGCGAATGGTTTCAGAAGTCCATCCGACGCGAGCCAGGATACGAGGAT
TACTATATATGGGCGGACGGCATCAAGACCGAAGGATCCAACGACACTAAGCCACCGAGC
AATTGGGTAAGCACTTTCCGGAAGAGTGCGTGGGAATACAATTCTGTGCGCGGTCAATAC
TACCTCCACAAATTTGTAATCGGACAACCAGATCTTAATTATCGCAGTACAAGAGTTCAA
CAGGAAATGAAGGATGTCCAGAAATTTTGGCTCGATTTGGGAGTATCCGGTTTCCGTGTG
GACGCAATCAATCATCTGTACGAATCTAATCCCGCTAATTTCGGTGGTCGCTACCCAGAC
GAGCCTTTATCAGGAAACCCCAACACCAATCCCGACGACTACGAGTACCTGAACCACATT
CATACCGAAAACCTGAACGAAACCTATGAAGTGGTTTACGACTGGAGAGATCTTCTCGAC
GAGTACATAGAACTGCAGGGGGAATACAAGATCATGATGACGGAGGCTTACGCGGACTTG
GACAGCATGATGCGGTACTACGGCACCAGCACCAGGAACGGATCTATTCCCTTCAACTTC
AGCTTTTTGGGAGACATCACCAAGGATTCCGACGCGAGACATATTAAGACTGTCATCGAT
AAATGGATGACGTACATGCCGAGTGGAAGAACTGCCAACTGGGTGAACGGTAACCACGAT
CAAAGCAGGATGGCTAATCGTCAGGGGGTCGACAGAGTTGATGCTATGAACATGATAGCA
CTGTTGTTACCTGGTGTTGCCATCACATACCAGGGTGAGGAAATAGGAATGACAGATGGA
GAGGTCAGCTGGGAAGAGACGAAGGACCCGCAGGCTTGTAACACTGACGACCCCGTGAAC
TACTGGAAGAAGTCGAGAGACCCCAACCGTACGCCCTTCCACTGGGATAACAGCACTAAT
GCTGGATTCTCTACCGGAAAGACTTGGCTACCGGTTGCTAGTAACTACCACAAAGTAAAC
TTGGCTGAACAAATCAACAACACCAAAAGTCACTACCAGTTCTACAAGGATCTCGCAGCA
ATAAGAAAGATGGCAGCTGTGAAATATGGAGATGTAGACACAAGAGCTCTGTCAGAAACG
GTATTAGTCGTCACAAGGTTACTACCGGGCGAGCAGGGAGTATTGGGCATTGTGAACTTA
TCAGATGAGGACCAATATGTTGATCTGACCTCGCTGCGTTTAATACCGAGAGTGATTAAA
GTTAGGGCTGTTGGAGCCAATTGTGATAATGTGAAGGGGACTCTTCTTATCAAGAACAAA
ATACCAGTAAATGCTCACTGCGCCTTAGTTCTACAAACTATCCGACACTGCTGTTGA

Protein sequence:

MRFLILSLAVFTSAVAASDTEWWKTALIYQIYPRSFKDSNGDGIGDLNGITEKLVYLNQT
GVDAIWLSPIYLSPMYDFGYDITDYRKIAPEYGTMDDFKTLMTEARRLGIRVIMDLVPNH
TGNESEWFQKSIRREPGYEDYYIWADGIKTEGSNDTKPPSNWVSTFRKSAWEYNSVRGQY
YLHKFVIGQPDLNYRSTRVQQEMKDVQKFWLDLGVSGFRVDAINHLYESNPANFGGRYPD
EPLSGNPNTNPDDYEYLNHIHTENLNETYEVVYDWRDLLDEYIELQGEYKIMMTEAYADL
DSMMRYYGTSTRNGSIPFNFSFLGDITKDSDARHIKTVIDKWMTYMPSGRTANWVNGNHD
QSRMANRQGVDRVDAMNMIALLLPGVAITYQGEEIGMTDGEVSWEETKDPQACNTDDPVN
YWKKSRDPNRTPFHWDNSTNAGFSTGKTWLPVASNYHKVNLAEQINNTKSHYQFYKDLAA
IRKMAAVKYGDVDTRALSETVLVVTRLLPGEQGVLGIVNLSDEDQYVDLTSLRLIPRVIK
VRAVGANCDNVKGTLLIKNKIPVNAHCALVLQTIRHCC
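
The nucleotide and protein sequences above can be cross-checked against each other and against the stated CDS length: 1737 nt corresponds to 578 codons plus one stop codon. The following is a minimal sketch of such a check, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the names `CODON_TABLE` and `translate` are illustrative and not part of the database record.

```python
from itertools import product

# Standard genetic code (assumed here), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the sequence block
    can be pasted as-is. Ambiguous bases (e.g. N) raise a KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nt of the nucleotide sequence in this record.
first_line = "ATGCGTTTCCTGATCCTCTCTCTCGCTGTCTTCACGTCAGCGGTTGCCGCTTCTGATACG"
print(translate(first_line))  # MRFLILSLAVFTSAVAASDT
```

Passing the full nucleotide block to `translate` and comparing the result with the protein block (after removing line breaks) gives a quick consistency check on the record.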