New model in OGS2.0 | DPOGS205716  |
---|---|
Genomic Position | scaffold283:+ 496-4518 |
CDS Length | 1875 |
Paired RNAseq reads   | 8 |
Single RNAseq reads   | 18 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA009925 (8e-164) |
Best Drosophila hit   | CG9517, isoform B (3e-71) |
Best Human hit | choline dehydrogenase, mitochondrial precursor (1e-40) |
Best NR hit (blastp)   | AGAP003782-PA [Anopheles gambiae str. PEST] (3e-87) |
Best NR hit (blastx)   | AGAP003782-PA [Anopheles gambiae str. PEST] (7e-80) |
GeneOntology terms    | GO:0004344 glucose dehydrogenase activity; GO:0050660 FAD binding; GO:0006066 alcohol metabolic process |
InterPro families    | IPR000172 Glucose-methanol-choline oxidoreductase, N-terminal; IPR007867 Glucose-methanol-choline oxidoreductase, C-terminal; IPR012132 Glucose-methanol-choline oxidoreductase |
Orthology group | MCL23563 |
Nucleotide sequence:
ATGTTGATGTTAGTCGGTACATCTCCTGTGCGGTATCATTATCCTCCTGGGACACTGAGG
ATCCAGCACTTGTACCGCTGGTTCACACACAGCGCTAAGGAACAAGATCTCAAACCAAGG
CAGCTCCTGCGATGGGATCTGGTCTTTTATGACGGCGACGAGTTTGACTACGTGGTGATT
GGCGCCGGCGCAGCTGGGAGTGCGGTGGCAGCGAGACTGGCGCTGGCTGGACATAGCGTG
CTGTTGGTTGAAGCAGGCGGAGATCCCAACATCCTCACAAGAATACCTGGAGCAACTTTG
GCTTTGACTGGTTCAAATCTGGATTGGTACTATGATACGATACCGAATAACAAGTCGTGT
CTTTCTTCTAAAGGAGGGAAATGTCGTTTAAGTCGAGGTCGATGTCTAGGAGGATCGACT
AGCCTTAACTACATGATGTATACTCGAGGAAATAAGCAGGATTACGACTTTAATGTTACC
GGCTGGAATTGGGAAGACATTAAACCGTATTTTCTTAGATTTGAAGGACTACAGGAACCT
TCTAGACTTCCAAAATCGTCTGGAGCGTATCATAATACTTCTGGTATAACGCCGATAGGA
TACTTTGGTGATTCCGGCAATCCATGGCACCAGAGGATTGTCGAGGGCCTGACTTCCGTG
AATTTTCCATATAATCCAGACGTAAATTCCAAGTCTCAGATAGGTGTTTCTAAAATTCTG
GGTTTTACTTCCGGCGGAGAACGAGTTAGCACTGCAACTGCTTATTTAGGTACAAAAAAT
GTGAAGGAATCCTTAAAAATTATTAAAAATACAAAGTGTACAGGAGTAATTATTGATACT
GAAAATATAGCTAGAGGGGTAACTATAGCGAGAGGTTTTAATGATACTATAAATATATTT
ACAAAAAAAGAAGTAATTTTAAGTGCTGGAGCTTTTAACACTCCTCAATTACTAATGCTG
TCAGGAATTGGACCAAAAGAACATTTAGAGGAATTTAACATTCCTGTCAAAGCAAATTTG
CCCGTAGGTCACGGAATGTCTGACCATGTTTTGCCCATAATAAACGTAAGAGTCGATCAT
GATTCTATGCCATCATCAAATATTTTATCTATTGGATCCAAGCTCTGGCAGGGTCTCAGT
TGGCTACTAATGCGTAGCGGACCATTAGCGTCCAATAGTATAACTGACCTGACTGCTTTT
GCGAACACCGAATGCTACGACTTTAAACTTAGGCGATTACTGAATGATAGGCCTGAATGT
GAATTGCCAAATTTACAATTAATTTATGCTTACATTGACAAGGGGTTACTTAGTATGGTT
AAATCGTTATATGAAATTGCCGCTCCGCACTCTCCTGAAGTTATGAATCAAGTGGTGTCA
GCCAACGAAGAAAGCTCTTTCATTGTGGTGTCACCGGTAGTGCTAAAGCCAAAGTCTCGG
GGCTGGGTGAAGCTAGCTAGTTCCGATCCATTCGAACAACCGGCAATTATTCCCAACTAC
TTGAGTGACAAAAGAGATGTCGAAGAAATGGTGCGTGCAATAAAATTACTGGAGCAAGTG
GTTGAGACGCCTGCATTTAAAAACTTTAATGCATCCATTTTGAAGCTTCATATTTCCGAA
TGTCCTGCCTTTGATGAAGAAGGTTACTGGGAATGTTATTCAAGACATATGACGCATTCA
GTACAACACGCGGTCGGAACAGCCGCACTCGGGCAAGTGGTTGACGAAAGATTAAGAGTT
AAGGGTGTTAAAAATCTTCGCATTGCCGACGCCTCGGTACTTCCACACTTGCCACGTGGC
AATACGGCCGCTGCTATAATCGCTATTGGGGAACGTTTATCAGATTTCCTTTTACAAGAT
CGAGGATTAGAATGA
Protein sequence:
MLMLVGTSPVRYHYPPGTLRIQHLYRWFTHSAKEQDLKPRQLLRWDLVFYDGDEFDYVVI
GAGAAGSAVAARLALAGHSVLLVEAGGDPNILTRIPGATLALTGSNLDWYYDTIPNNKSC
LSSKGGKCRLSRGRCLGGSTSLNYMMYTRGNKQDYDFNVTGWNWEDIKPYFLRFEGLQEP
SRLPKSSGAYHNTSGITPIGYFGDSGNPWHQRIVEGLTSVNFPYNPDVNSKSQIGVSKIL
GFTSGGERVSTATAYLGTKNVKESLKIIKNTKCTGVIIDTENIARGVTIARGFNDTINIF
TKKEVILSAGAFNTPQLLMLSGIGPKEHLEEFNIPVKANLPVGHGMSDHVLPIINVRVDH
DSMPSSNILSIGSKLWQGLSWLLMRSGPLASNSITDLTAFANTECYDFKLRRLLNDRPEC
ELPNLQLIYAYIDKGLLSMVKSLYEIAAPHSPEVMNQVVSANEESSFIVVSPVVLKPKSR
GWVKLASSDPFEQPAIIPNYLSDKRDVEEMVRAIKLLEQVVETPAFKNFNASILKLHISE
CPAFDEEGYWECYSRHMTHSVQHAVGTAALGQVVDERLRVKGVKNLRIADASVLPHLPRG
NTAAAIIAIGERLSDFLLQDRGLE
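
The CDS length (1875 nt) and the protein above are consistent: 1875 / 3 = 625 codons, that is 624 residues plus one stop codon. A minimal sketch of how this could be checked is given below. It assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical helpers and not part of the record. Only the first and last codons of the nucleotide sequence are used in the example.

```python
from itertools import product

# Standard genetic code (NCBI table 1), listed in TCAG order.
# Assumption: this record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last five codons of the nucleotide sequence above.
print(translate("ATGTTGATGTTAGTCGGTACATCTCCTGTG"))  # MLMLVGTSPV
print(translate("CGAGGATTAGAATGA"))                 # RGLE

# Length bookkeeping: 1875 nt = 625 codons = 624 residues + 1 stop.
print(1875 // 3 - 1)                                # 624
```

Pasting the full nucleotide sequence into `translate` should reproduce the 624-residue protein sequence, provided the standard-code assumption holds.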