DPGLEAN05919 in OGS1.0

New model in OGS2.0DPOGS205716 
Genomic Positionscaffold283:+ 496-4518
See gene structure
CDS Length1875
Paired RNAseq reads  8
Single RNAseq reads  18
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA009925 (8e-164)
Best Drosophila hit  CG9517, isoform B (3e-71)
Best Human hitcholine dehydrogenase, mitochondrial precursor (1e-40)
Best NR hit (blastp)  AGAP003782-PA [Anopheles gambiae str. PEST] (3e-87)
Best NR hit (blastx)  AGAP003782-PA [Anopheles gambiae str. PEST] (7e-80)
GeneOntology terms

  
GO:0004344 glucose dehydrogenase activity
GO:0050660 FAD binding
GO:0006066 alcohol metabolic process
InterPro families

  
IPR000172 Glucose-methanol-choline oxidoreductase, N-terminal
IPR007867 Glucose-methanol-choline oxidoreductase, C-terminal
IPR012132 Glucose-methanol-choline oxidoreductase
Orthology groupMCL23563

Nucleotide sequence:

ATGTTGATGTTAGTCGGTACATCTCCTGTGCGGTATCATTATCCTCCTGGGACACTGAGG
ATCCAGCACTTGTACCGCTGGTTCACACACAGCGCTAAGGAACAAGATCTCAAACCAAGG
CAGCTCCTGCGATGGGATCTGGTCTTTTATGACGGCGACGAGTTTGACTACGTGGTGATT
GGCGCCGGCGCAGCTGGGAGTGCGGTGGCAGCGAGACTGGCGCTGGCTGGACATAGCGTG
CTGTTGGTTGAAGCAGGCGGAGATCCCAACATCCTCACAAGAATACCTGGAGCAACTTTG
GCTTTGACTGGTTCAAATCTGGATTGGTACTATGATACGATACCGAATAACAAGTCGTGT
CTTTCTTCTAAAGGAGGGAAATGTCGTTTAAGTCGAGGTCGATGTCTAGGAGGATCGACT
AGCCTTAACTACATGATGTATACTCGAGGAAATAAGCAGGATTACGACTTTAATGTTACC
GGCTGGAATTGGGAAGACATTAAACCGTATTTTCTTAGATTTGAAGGACTACAGGAACCT
TCTAGACTTCCAAAATCGTCTGGAGCGTATCATAATACTTCTGGTATAACGCCGATAGGA
TACTTTGGTGATTCCGGCAATCCATGGCACCAGAGGATTGTCGAGGGCCTGACTTCCGTG
AATTTTCCATATAATCCAGACGTAAATTCCAAGTCTCAGATAGGTGTTTCTAAAATTCTG
GGTTTTACTTCCGGCGGAGAACGAGTTAGCACTGCAACTGCTTATTTAGGTACAAAAAAT
GTGAAGGAATCCTTAAAAATTATTAAAAATACAAAGTGTACAGGAGTAATTATTGATACT
GAAAATATAGCTAGAGGGGTAACTATAGCGAGAGGTTTTAATGATACTATAAATATATTT
ACAAAAAAAGAAGTAATTTTAAGTGCTGGAGCTTTTAACACTCCTCAATTACTAATGCTG
TCAGGAATTGGACCAAAAGAACATTTAGAGGAATTTAACATTCCTGTCAAAGCAAATTTG
CCCGTAGGTCACGGAATGTCTGACCATGTTTTGCCCATAATAAACGTAAGAGTCGATCAT
GATTCTATGCCATCATCAAATATTTTATCTATTGGATCCAAGCTCTGGCAGGGTCTCAGT
TGGCTACTAATGCGTAGCGGACCATTAGCGTCCAATAGTATAACTGACCTGACTGCTTTT
GCGAACACCGAATGCTACGACTTTAAACTTAGGCGATTACTGAATGATAGGCCTGAATGT
GAATTGCCAAATTTACAATTAATTTATGCTTACATTGACAAGGGGTTACTTAGTATGGTT
AAATCGTTATATGAAATTGCCGCTCCGCACTCTCCTGAAGTTATGAATCAAGTGGTGTCA
GCCAACGAAGAAAGCTCTTTCATTGTGGTGTCACCGGTAGTGCTAAAGCCAAAGTCTCGG
GGCTGGGTGAAGCTAGCTAGTTCCGATCCATTCGAACAACCGGCAATTATTCCCAACTAC
TTGAGTGACAAAAGAGATGTCGAAGAAATGGTGCGTGCAATAAAATTACTGGAGCAAGTG
GTTGAGACGCCTGCATTTAAAAACTTTAATGCATCCATTTTGAAGCTTCATATTTCCGAA
TGTCCTGCCTTTGATGAAGAAGGTTACTGGGAATGTTATTCAAGACATATGACGCATTCA
GTACAACACGCGGTCGGAACAGCCGCACTCGGGCAAGTGGTTGACGAAAGATTAAGAGTT
AAGGGTGTTAAAAATCTTCGCATTGCCGACGCCTCGGTACTTCCACACTTGCCACGTGGC
AATACGGCCGCTGCTATAATCGCTATTGGGGAACGTTTATCAGATTTCCTTTTACAAGAT
CGAGGATTAGAATGA

Protein sequence:

MLMLVGTSPVRYHYPPGTLRIQHLYRWFTHSAKEQDLKPRQLLRWDLVFYDGDEFDYVVI
GAGAAGSAVAARLALAGHSVLLVEAGGDPNILTRIPGATLALTGSNLDWYYDTIPNNKSC
LSSKGGKCRLSRGRCLGGSTSLNYMMYTRGNKQDYDFNVTGWNWEDIKPYFLRFEGLQEP
SRLPKSSGAYHNTSGITPIGYFGDSGNPWHQRIVEGLTSVNFPYNPDVNSKSQIGVSKIL
GFTSGGERVSTATAYLGTKNVKESLKIIKNTKCTGVIIDTENIARGVTIARGFNDTINIF
TKKEVILSAGAFNTPQLLMLSGIGPKEHLEEFNIPVKANLPVGHGMSDHVLPIINVRVDH
DSMPSSNILSIGSKLWQGLSWLLMRSGPLASNSITDLTAFANTECYDFKLRRLLNDRPEC
ELPNLQLIYAYIDKGLLSMVKSLYEIAAPHSPEVMNQVVSANEESSFIVVSPVVLKPKSR
GWVKLASSDPFEQPAIIPNYLSDKRDVEEMVRAIKLLEQVVETPAFKNFNASILKLHISE
CPAFDEEGYWECYSRHMTHSVQHAVGTAALGQVVDERLRVKGVKNLRIADASVLPHLPRG
NTAAAIIAIGERLSDFLLQDRGLE