DPGLEAN15611 in OGS1.0

New model in OGS2.0DPOGS207059 
Genomic Positionscaffold1:+ 1833870-1835903
See gene structure
CDS Length1914
Paired RNAseq reads  572
Single RNAseq reads  1380
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA013005 (0.0)
Best Drosophila hit  CG9521 (4e-142)
Best Human hitcholine dehydrogenase, mitochondrial precursor (8e-74)
Best NR hit (blastp)  PREDICTED: similar to glucose dehydrogenase [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to glucose dehydrogenase [Tribolium castaneum] (0.0)
GeneOntology terms

  
GO:0008812 choline dehydrogenase activity
GO:0050660 FAD binding
GO:0006066 alcohol metabolic process
InterPro families

  
IPR000172 Glucose-methanol-choline oxidoreductase, N-terminal
IPR007867 Glucose-methanol-choline oxidoreductase, C-terminal
IPR012132 Glucose-methanol-choline oxidoreductase
Orthology groupMCL10197

Nucleotide sequence:

ATGATCTCAAATAAGAATGGTGAAAGTATCATGATGTGGAGCCATTGGATTCTATGTTTC
AAGGTTTTGTTTGGAGTTTTACTATTTCCATCACCGAGTAAACTTCAGTCCGTCAACCCG
ATAACATCGTTTATGAATTTTTTACAAGAAGGTACGAATCAACGTGACAATGAACCACCC
GACCAAGTTAATTTGTTGACGGAGTACGACTTCATTGTTGTTGGTGCGGGAACAGCTGGG
TGCGTTGTGGCTAACCGATTAACAGAATTAAAGGACGTGAAAGTTCTACTCTTAGAAGCT
GGAGTTAATGAGAACTACGTTATGGACATACCAATTCTAGCAAATTATCTGCAGTTCACT
GAAGCGAACTGGGGATACAAGACGAAACCCTCGAAAAAATATTGTGCAGGTTTCGAAAAT
CAGCAATGTAATTGGCCACGCGGAAAAGTTGTCGGTGGATCAAGTGTCCTAAATTATATG
ATATACACACGAGGGGCTGCAGATGATTATAACAATTGGGCATCAAAAGGTAATGAAGGC
TGGGGATGGGACGATGTACTGGATTATTTCAAAAAAATTGAAAATTACAACATACCAGCC
TTTGACGATCCTAAATATCACGGCCATGACGGCCATGTTAATGTAGAGTATGCACCATTT
CGTACAACAAAAGGAAAAGCTTGGGTTAAAGGGGCCCAAGAATTAGGCTTTAAGTATAAT
GATTACAATGGACAAAATCCAAGTGGTGTCTCTTTCCTACAACTGTCTATGAAGAACGGA
ACAAGGCACAGTTCCAGTCGAGCATATCTTCATCCTATAAAGAAAAGAAATAATTTACAC
GTATCTAAAGTGAGCATGGCTACGAGATTACTGTTCGATACAACAAAAACTCGTGTAATT
GGAGTCGAATTCGAGAAACGAGGAAAGCGCTATAAAATATTAGCAAAAAAAGAGATCATT
GTATCGGCTGGTGCAATCAATTCACCTCAACTCCTCATGTTATCAGGAATAGGCCCTAAA
AAGCATTTAGAGTCACTAAATATTCCAGTTGTAAAAGATTTACCTGTAGGATATAATCTA
ATGGACCACATTGCCGCCGGTGGACTCCAATTTATTGTTCAACAACAAAACCTCAGTCTG
TCTACTGGTTATATTTTAAACCATTTAGAATTGGTATTTAAGTGGATGCGGAATCATAAA
GGACCGTTGTCTGTGCCTGGTGGTTGCGAAGCATTAGTATTTTTGGATTTAAAAGATAGA
TTTAACGTGAGCGGCTGGCCGGACTTAGAACTGCTTTTTATAAGTGGGGGATTAAATTCA
GATCCTTTGTTAAGAAGAAATTTTGGTTTCGATGAACAAATATTCACAGACACCTATACA
GCTCTAGGTAATAATGAAGTTTTTATGGTTTTTCCAATGTTGATGAGACCAAAATCAAGA
GGCAGGGTAATGTTACAAAACAGAAATCCAAAGTCACATCCGATATTAATCCCAAATTAC
TTTGATGATCCAGAAGATTTGCAAAAAATTGTGGAAGGCATCAAAGTGGCAATTGAGATA
ACTCGTCAACCGTCAATGAAAAAGATACAAACGAAATTATATGACGTTCCTATCGCTGAC
TGTCTGAAGTATGGGCCTTTCGGCAGTGACGAGTACTTCGCGTGTCAAGCACAAATGTTC
ACTTTTACAATTTACCATCAAAGTGGGAGTTGTAAAATGGGTGTCAAAAGTGATCCTACA
GCGGTTGTAGATCCTAGACTAAGAGTACATGGTATAGAAAATCTAAGAGTAATCGATGCT
AGTATAATGCCAGAAATTGTTTCAAGTCATACAAATGCCCCAACATTCATGATAGCAGAA
AAGGGCGCAGACATGATTAAAGAAGACTGGGGGAGAAAATCGCAGAACATGTAA

Protein sequence:

MISNKNGESIMMWSHWILCFKVLFGVLLFPSPSKLQSVNPITSFMNFLQEGTNQRDNEPP
DQVNLLTEYDFIVVGAGTAGCVVANRLTELKDVKVLLLEAGVNENYVMDIPILANYLQFT
EANWGYKTKPSKKYCAGFENQQCNWPRGKVVGGSSVLNYMIYTRGAADDYNNWASKGNEG
WGWDDVLDYFKKIENYNIPAFDDPKYHGHDGHVNVEYAPFRTTKGKAWVKGAQELGFKYN
DYNGQNPSGVSFLQLSMKNGTRHSSSRAYLHPIKKRNNLHVSKVSMATRLLFDTTKTRVI
GVEFEKRGKRYKILAKKEIIVSAGAINSPQLLMLSGIGPKKHLESLNIPVVKDLPVGYNL
MDHIAAGGLQFIVQQQNLSLSTGYILNHLELVFKWMRNHKGPLSVPGGCEALVFLDLKDR
FNVSGWPDLELLFISGGLNSDPLLRRNFGFDEQIFTDTYTALGNNEVFMVFPMLMRPKSR
GRVMLQNRNPKSHPILIPNYFDDPEDLQKIVEGIKVAIEITRQPSMKKIQTKLYDVPIAD
CLKYGPFGSDEYFACQAQMFTFTIYHQSGSCKMGVKSDPTAVVDPRLRVHGIENLRVIDA
SIMPEIVSSHTNAPTFMIAEKGADMIKEDWGRKSQNM