DPGLEAN15610 in OGS1.0

New model in OGS2.0DPOGS207058 
Genomic Positionscaffold1:+ 1807475-1826592
See gene structure
CDS Length3684
Paired RNAseq reads  175
Single RNAseq reads  466
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA013003 (9e-156)
Best Drosophila hit  CG9518 (0.0)
Best Human hitcholine dehydrogenase, mitochondrial precursor (3e-74)
Best NR hit (blastp)  glucose dehydrogenase [Aedes aegypti] (0.0)
Best NR hit (blastx)  glucose dehydrogenase [Culex quinquefasciatus] (0.0)
GeneOntology terms

  
GO:0008812 choline dehydrogenase activity
GO:0006066 alcohol metabolic process
GO:0050660 FAD binding
InterPro families
  
IPR000172 Glucose-methanol-choline oxidoreductase, N-terminal
IPR007867 Glucose-methanol-choline oxidoreductase, C-terminal
Orthology groupMCL10046

Nucleotide sequence:

ATGGCTATCCAAGTCCTTTTAGCATCGACAGCTTTAAAATCAGTCAGCGTTACTGGTCTG
TGGTTAATACCACTTCTGCTTGGGGCCTTCACGTATCATAATTATAACTCCTACGATCCA
GAATCGAAGGTACTAGAAAAAGAACCTAAGAGGGAGTACGATTTCGTTGTAGTTGGCGGA
GGCTCTGCTGGTGCAGTCGTTGCAAATCGTCTAACCGAAATCAAAGATTGGAATTTACTT
TTATTAGAAAGTGGACCAGACGAGAACGAGATTACTGATGTCCCCTCTTTAGCCGCTTAT
TTGCAACTAACGAAGTTGGATTGGCAATACAAGACTGAACCGACACCTTACGCTTGTTTG
GGTTTTAAGAACAACAGGTGCAGCTGGCCGAGAGGAAAGCTTCTCGGCGGTTCCAGCGTT
TTAAACTATATGATTTACGTAAGAGGTAATAAATACGACTACGACCAATGGGAATCTTTT
GGCAATCCAGGATGGGGATATCGAGATGTTCTTAAATATTTTATTAAATCCGAAGATAAC
AGAAACCCTTATTTGGCCAAAAATCAGTATCATGGTCAAGGCGGTTATTTGACTGTGCAG
GAAGCACCATGGAAAACACCCCTTGTAGCAGCTTTCGTTGAAGCTGGGGTCGAAATTGGC
TATGACAACAGAGATATAAATGGTGCCATCCAAACCGGGTTCATGATGGCCCAAGGGACG
ATAAGACGTGGTTCTAGATGCAGCACAGCTAAAGCATTTTTAAGACCAGTGAGAACCCGT
AAAAATTTAGATATTTCACTGCATTCACACGTTACTAAAATACTCATTAATCCTATGACA
ATGAAAGCTTACGGAGTAGAATATGTAAAACATGGTATTAAGAAAGTGGTTTATGCTAGA
AAGGAAGTTATATTGTCGGCAGGAGCCATTAACAGTCCACAATTATTAATGCTTTCTGGT
ATTGGTCCAAAAGATCACTTACAGAGCGTTGGCATAAAAGTCCTAAAAGATTTACCAGTA
GGAGAAAATTTAATGGATCATGTGGGAGTAGGAGGACTGACATTTCTAGTCGATAAACCA
GTCGGAATTGTCCAAAATAGACTTCAGGCATTTCCTGTTACAATGAATTACGTATTAAAC
GAAAGAGGCCCCATGACCACATTAGGAGGACTTGAAGGTATTGCTTTTGTAAATACAAAA
TATGCTAATAGCTCCGGATTATGGCCTGATATTCAATTCCATATGGCTCCTGCAACATTT
GCTTCAGATAATGGACAAACTGTGAAAAAAGTGTTAGGTCTGAAAGATGAAATTTATGAC
ACTGTTTTTAAACCTATAGCAAATAAAGATGGGTGGACTATTATGCCACTGTTGTTACGG
CCTAATACTAGAGGTTACGTTCGATTAAAAAGTTCCAATCCTTTTGAGTATCCTATAATG
AATCCACGCTATCATGAAGATCCTCTAGATGTAAGTCGCCTTGTTGAAGGGATAAAAATT
GCCTTAAAAGTTGCGAACGCTTCCCCATTTAAGCAATTTGGATCAAGATTATATATGAAA
CCATTACCAAACTGTAAACAACACAAATTTATGTCCGATGAATATATTGAATGTCAAGTT
AGATCAATAAGTATGACCATATATCACCAATGTGGGACGGCTAAAATGGGACCATCTTGG
GATAAGGGTGCTGTTGTTGACCCTAGATTGAGGGTGTTTGGTATTGAAGGACTAAGAGTT
ATAGACGCTAGCATAATGCCGACTATTGTGAGTGGAAACACAAATGCACCAGTAATCATG
ATAGGAGAAAAGGGTTCTGACATGATAAAAGAAGATTGGTTGAATAGCCTCTGCTACTTC
CCTCTCGCAACTTTTGGAAGGGATACTATCCTCGATGGGATAGCCGGCTTTCTCCGTGAC
GCAGCGGAGATACATAACGGTGAGCCAGCCGAGACTGACTTCATCTTACCCAAGTACGAC
TTCATCATCGTCGGTGCTGGCACAGCTGGTTGTATACTTAGCAACAGATTGACCGAAGTC
GATAAGTTTAAGGTCCTTTTAATAGAGGCAGGTGGAGCAGAGCAAGTATTTATGGACATC
CCCGTTCTGGCTACAATGCTGCAATTCACTGAAGCAAATTGGAAGTATCGCACAGAACCT
CAAAAGGCCGGATGTATGGGTATGCGTGATAAACGGTGCGCATGGCCAAGAGGAAAAGTC
GTAGGAGGGTCTTCCGTGCTCCATTCAATGATGCACACGAGGGGAAATAAACGAGATTAT
GATACATGGGCAGCTAGTGGAAATCCAGGTTGGGATTATGATAGCGTATTGAAATATTTT
AAAAAATCAGAAAATATTGAAATTCCACATTTGGTAAATGACAAAAAATATCATTCAACT
CAGGGGCCGATGACAATACAAGAGCCAAGATGGCGAACTCCACTATCAGATGCCTTCCTT
GATGCCGGAGTCGAAATCGGTGGAAATATTAATGATTATAATGGTAAAACACAGATTGGA
TATTCCATTATTCAATTTACTATGAAGAATGGAACTAGAATGAGTGTCAGTCGAGCTTTC
TTACATCCTATAAAAAAACGACGTAATTTTCATATCATTAAGAATGCTTTAGTGACCAAA
GTTCTCATAGATCACAAAAAAAAACGCGCTTATGGCGTACAATTTGAAAAAGATGGTAAA
CAAATTGTAGTAAGAGCAAAACGAGAAGTGATTTTATCCGCCGGATCTGTGAACTCTCCA
CAGTTATTGATGCTGAGCGGAATAGGACCAAGGGACGATCTCATAAAAATAAATATTACA
ACAGTGTCAGACTTACCGGTAGGATACAATTTGCAAGATCACTATGCGTTGGGTGGTCTA
ACTTTCATAATCAATACAACAGACTCTCTTAGATTTGAAAGAATTGCAACCTTGAATAAC
ATCATTGAATACTTTTGTCATCACACCGGTCCTCTAACAGTTCCGACCGGTGCGGAAGCA
CTTGCTTTCATTGATACCAAAAATCCAAATAATAGAGATGGTTATCCTGATTTAGAACTA
TTATTTGTGGGCGGTTCAATTGTTTCCCAAAATGCTTACCGGTACGCATTTGACATCGAT
GACATTTTGTATGACACAGTTTATAGACCAATTGCCAATAGTGATACCTGGATGGTATTT
CCGATGCTGTTACTCCCTAAATCGAGAGGCTACATAAAACTAAGGAGTAATAAACCACAC
GACAAACCAATTATCAATCCAAACTATTTTACTGACGGAGGACACGACGATCATGTTATC
TTGTATGGTATTAGGAAAGTGTTACAGTTATCCCAAACAAAAGCTTTTCAAAAATATGGG
AGTAAACTTCACGATATTCCTATTCCTAATTGCGCTCAACACAAATTCGATTCAGATAGT
TATTGGTTATGCGCTATGAGGGCACTAACGAATACTATATACCATCCTTGCTGCACAGCA
AAAATGGGACCAAGTAATGACCCTGAAGCAGTCGTCGATTCACGTTTGAAAGTCCACGGT
ATGGAAGGTCTAAGGGTTGTGGATGCTAGTATAATGCCAAATATTCCTGCGGCCCACACA
AATGCACCCACAATGATGATCGCTGAAAAGGCCGCCGACATGATAAAAGAAGACTGGGGT
ATACCCATACCAATATCAAACTGA

Protein sequence:

MAIQVLLASTALKSVSVTGLWLIPLLLGAFTYHNYNSYDPESKVLEKEPKREYDFVVVGG
GSAGAVVANRLTEIKDWNLLLLESGPDENEITDVPSLAAYLQLTKLDWQYKTEPTPYACL
GFKNNRCSWPRGKLLGGSSVLNYMIYVRGNKYDYDQWESFGNPGWGYRDVLKYFIKSEDN
RNPYLAKNQYHGQGGYLTVQEAPWKTPLVAAFVEAGVEIGYDNRDINGAIQTGFMMAQGT
IRRGSRCSTAKAFLRPVRTRKNLDISLHSHVTKILINPMTMKAYGVEYVKHGIKKVVYAR
KEVILSAGAINSPQLLMLSGIGPKDHLQSVGIKVLKDLPVGENLMDHVGVGGLTFLVDKP
VGIVQNRLQAFPVTMNYVLNERGPMTTLGGLEGIAFVNTKYANSSGLWPDIQFHMAPATF
ASDNGQTVKKVLGLKDEIYDTVFKPIANKDGWTIMPLLLRPNTRGYVRLKSSNPFEYPIM
NPRYHEDPLDVSRLVEGIKIALKVANASPFKQFGSRLYMKPLPNCKQHKFMSDEYIECQV
RSISMTIYHQCGTAKMGPSWDKGAVVDPRLRVFGIEGLRVIDASIMPTIVSGNTNAPVIM
IGEKGSDMIKEDWLNSLCYFPLATFGRDTILDGIAGFLRDAAEIHNGEPAETDFILPKYD
FIIVGAGTAGCILSNRLTEVDKFKVLLIEAGGAEQVFMDIPVLATMLQFTEANWKYRTEP
QKAGCMGMRDKRCAWPRGKVVGGSSVLHSMMHTRGNKRDYDTWAASGNPGWDYDSVLKYF
KKSENIEIPHLVNDKKYHSTQGPMTIQEPRWRTPLSDAFLDAGVEIGGNINDYNGKTQIG
YSIIQFTMKNGTRMSVSRAFLHPIKKRRNFHIIKNALVTKVLIDHKKKRAYGVQFEKDGK
QIVVRAKREVILSAGSVNSPQLLMLSGIGPRDDLIKINITTVSDLPVGYNLQDHYALGGL
TFIINTTDSLRFERIATLNNIIEYFCHHTGPLTVPTGAEALAFIDTKNPNNRDGYPDLEL
LFVGGSIVSQNAYRYAFDIDDILYDTVYRPIANSDTWMVFPMLLLPKSRGYIKLRSNKPH
DKPIINPNYFTDGGHDDHVILYGIRKVLQLSQTKAFQKYGSKLHDIPIPNCAQHKFDSDS
YWLCAMRALTNTIYHPCCTAKMGPSNDPEAVVDSRLKVHGMEGLRVVDASIMPNIPAAHT
NAPTMMIAEKAADMIKEDWGIPIPISN