DPGLEAN12805 in OGS1.0

New model in OGS2.0  DPOGS209371
Genomic Position  scaffold151:+ 158014-160116
CDS Length  1860
Paired RNAseq reads  161
Single RNAseq reads  375
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA005703 (0.0)
Best Drosophila hit  CG9517, isoform A (8e-88)
Best Human hit  choline dehydrogenase, mitochondrial precursor (3e-66)
Best NR hit (blastp)  AGAP003785-PA [Anopheles gambiae str. PEST] (3e-119)
Best NR hit (blastx)  AGAP003785-PA [Anopheles gambiae str. PEST] (9e-107)
Gene Ontology terms

GO:0004344 glucose dehydrogenase activity
GO:0050660 FAD binding
GO:0006066 alcohol metabolic process
InterPro families

IPR000172 Glucose-methanol-choline oxidoreductase, N-terminal
IPR007867 Glucose-methanol-choline oxidoreductase, C-terminal
IPR012132 Glucose-methanol-choline oxidoreductase
Orthology group  MCL39988

Nucleotide sequence:

ATGACGAGTCTAAGTCCATGTGTGCCTGCCACGTCACCGGCGGGAGCTGCTTTCACTGCT
TTAATATCTTATATATCGACCCTCCAGTGTCTCATCACGGAACCCTGGCCGGAAGACCAT
AGCCATCGCGTTAAAGACGGTGATCAATTCGATTTCATTATTATCGGTTCCGGGACAGCT
GGATCAATCTTAGCGAATCGTTTGACACAAGCTGATGATTGGAAGGTTTTACTCCTTGAG
GCCGGCGACAATCCGCCTTTGGAGAGTATTATCCCGAATTTCTCCGGAGCGACACATAGG
AGTGACCAGGTGTGGCAATATTATACGGAGAGAGATGAGATGTCGAATAGGGCCTGCGTT
GATGGACGGTCTTTCTGGCCTCGAGGCAGGATGCTGGGTGGCACGGGATCAATCAATGGA
ATGCTGCACATGACGGGCAGTCCCGGGGACTATCAATCTTGGAACGTCGATGACGGTTGG
GACTATCTTACCATAAAGAAATATTTTAGGAAAAGTGAAAAAATTATCGATCCCTATATT
CTTAATAATCCAGAACTTTTAAATAATCACGGCACGAATGGGGAGTTTGTAGTTGATCAA
TTGAATTTCACACATACGGATATAGCTGATAAACTGACGGAGGCCTACTTGGAAATTGGT
CTCGATTACTTGGATGACCTGAATGGACCAACTCAAATGGGTGTTGGTAAGATAAGGGGC
GGTCATCACAAAGGGAAACGAGTGAGCACTGCAACTGCTTTTTTAAACGTAATCAAAGAA
CGTAAAAATTTATACATTCTCAAAAATACATTTGCTACAAAAATTATTTTTCAAGACTCT
AAAGCAATTGGCGTAAAGGTTTCTTTGCCAGACAAGAAAACAGCGCAGTATTATACAACA
AAAGAGATAATTGTGAGTGCTGGAACAATAAACACTCCAGTTTTACTCATGTCCTCTGGT
ATAGGACCAAAAGAACATTTGGAGAGTTTGGACATCAAAGTCGTTTCTGACTTACCAGTC
GGCAAAAATCTGCAGGATCATGTTAGAATTCCAATACCGGTGAGGATTAATACAGGAGCG
AAGGCAAAATCTCAAGATTATTGGCAAAAAGCCACACTGCAATACTTACTAGAGCAGTCA
GGTCCACACTCAACTAACTATGATCAACCTAATATTAATGCTTTTCTATCAGTCACAGAT
CATAAGCAACTCCCGGATATACAAATCGATCATAATTATTTTGTTCCAAATACTTCCTAC
ATATATTCTATGTGTAAAAATGTCATGAACTACAAGGATGAGATTTGCGAACAATTTGCT
AAAATGAACGTTGAGAGTGAAATGATAATATTTTTTGTATCTCTATGCCGACCATTTTCA
AAGGGTGAGATTTTATTGCGTTCAACTAATCCCTTCGATCATCCACGTATATATCCAAAA
TATTTCAGTGATCGACGAGACATGGATACATTCATAAAGGGTTTAAAAAAAGTTACGGAA
ATTGTGAACACAGAAGCATTAAGAAATGTAGACGCGAAGGTTGAAAGAATCTATTTTAAG
GACTGTGATGATTTTAAATTTAAATCTGATGATTATTGGGAGTGTATGGCCAGGGCTTTG
ACGTACAATGTATATCATCCTGTGGGCACCTCGAAGATGGGCAAGCCTGGAGACGCTAGC
AGTGTAGTGGATAGTAGGTTGAGGGTGTTAGGAGTGAAAAACTTGAGAGTCGTCGACGCT
AGTATAATGCCAACTATAACAAGCGTTAATACTAACGCTCCGACCATGATGATCGCAGAA
AGAGCTTCTGCGTTCATAAAACTGCAATATAAAAGCAAATACGCGAATGACGAGTTATAA

Protein sequence:

MTSLSPCVPATSPAGAAFTALISYISTLQCLITEPWPEDHSHRVKDGDQFDFIIIGSGTA
GSILANRLTQADDWKVLLLEAGDNPPLESIIPNFSGATHRSDQVWQYYTERDEMSNRACV
DGRSFWPRGRMLGGTGSINGMLHMTGSPGDYQSWNVDDGWDYLTIKKYFRKSEKIIDPYI
LNNPELLNNHGTNGEFVVDQLNFTHTDIADKLTEAYLEIGLDYLDDLNGPTQMGVGKIRG
GHHKGKRVSTATAFLNVIKERKNLYILKNTFATKIIFQDSKAIGVKVSLPDKKTAQYYTT
KEIIVSAGTINTPVLLMSSGIGPKEHLESLDIKVVSDLPVGKNLQDHVRIPIPVRINTGA
KAKSQDYWQKATLQYLLEQSGPHSTNYDQPNINAFLSVTDHKQLPDIQIDHNYFVPNTSY
IYSMCKNVMNYKDEICEQFAKMNVESEMIIFFVSLCRPFSKGEILLRSTNPFDHPRIYPK
YFSDRRDMDTFIKGLKKVTEIVNTEALRNVDAKVERIYFKDCDDFKFKSDDYWECMARAL
TYNVYHPVGTSKMGKPGDASSVVDSRLRVLGVKNLRVVDASIMPTITSVNTNAPTMMIAE
RASAFIKLQYKSKYANDEL
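
Consistency check (sketch):

The numbers in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the annotation pipeline. It assumes the CDS ends with a single stop codon (the nucleotide sequence ends in TAA) and that the genomic coordinates are 1-based and inclusive, which is an assumption about this database's convention. The function names are hypothetical.

```python
# Values copied from the record above.
CDS_LENGTH = 1860
GENOMIC_START, GENOMIC_END = 158014, 160116


def expected_protein_length(cds_length: int) -> int:
    """Residue count implied by a CDS that ends with exactly one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length // 3 - 1


def genomic_span(start: int, end: int) -> int:
    """Length of a coordinate range, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


protein_length = expected_protein_length(CDS_LENGTH)
span = genomic_span(GENOMIC_START, GENOMIC_END)

print("residues implied by CDS:", protein_length)
print("genomic span (bp):", span)
print("span not covered by CDS (bp):", span - CDS_LENGTH)
```

Under these assumptions the CDS implies a 619-residue protein, which matches the protein sequence listed above (ten rows of 60 residues plus a final row of 19). The genomic span is longer than the CDS; the difference is most likely intronic sequence, but that is an inference from the coordinates and not something the record states.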