New model in OGS2.0 | DPOGS209371  |
---|---|
Genomic Position | scaffold151:+ 158014-160116 |
CDS Length | 1860 |
Paired RNAseq reads   | 161 |
Single RNAseq reads   | 375 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA005703 (0.0) |
Best Drosophila hit   | CG9517, isoform A (8e-88) |
Best Human hit | choline dehydrogenase, mitochondrial precursor (3e-66) |
Best NR hit (blastp)   | AGAP003785-PA [Anopheles gambiae str. PEST] (3e-119) |
Best NR hit (blastx)   | AGAP003785-PA [Anopheles gambiae str. PEST] (9e-107) |
Gene Ontology terms | GO:0004344 glucose dehydrogenase activity; GO:0050660 FAD binding; GO:0006066 alcohol metabolic process |
InterPro families | IPR000172 Glucose-methanol-choline oxidoreductase, N-terminal; IPR007867 Glucose-methanol-choline oxidoreductase, C-terminal; IPR012132 Glucose-methanol-choline oxidoreductase |
Orthology group | MCL39988 |
Nucleotide sequence:
ATGACGAGTCTAAGTCCATGTGTGCCTGCCACGTCACCGGCGGGAGCTGCTTTCACTGCT
TTAATATCTTATATATCGACCCTCCAGTGTCTCATCACGGAACCCTGGCCGGAAGACCAT
AGCCATCGCGTTAAAGACGGTGATCAATTCGATTTCATTATTATCGGTTCCGGGACAGCT
GGATCAATCTTAGCGAATCGTTTGACACAAGCTGATGATTGGAAGGTTTTACTCCTTGAG
GCCGGCGACAATCCGCCTTTGGAGAGTATTATCCCGAATTTCTCCGGAGCGACACATAGG
AGTGACCAGGTGTGGCAATATTATACGGAGAGAGATGAGATGTCGAATAGGGCCTGCGTT
GATGGACGGTCTTTCTGGCCTCGAGGCAGGATGCTGGGTGGCACGGGATCAATCAATGGA
ATGCTGCACATGACGGGCAGTCCCGGGGACTATCAATCTTGGAACGTCGATGACGGTTGG
GACTATCTTACCATAAAGAAATATTTTAGGAAAAGTGAAAAAATTATCGATCCCTATATT
CTTAATAATCCAGAACTTTTAAATAATCACGGCACGAATGGGGAGTTTGTAGTTGATCAA
TTGAATTTCACACATACGGATATAGCTGATAAACTGACGGAGGCCTACTTGGAAATTGGT
CTCGATTACTTGGATGACCTGAATGGACCAACTCAAATGGGTGTTGGTAAGATAAGGGGC
GGTCATCACAAAGGGAAACGAGTGAGCACTGCAACTGCTTTTTTAAACGTAATCAAAGAA
CGTAAAAATTTATACATTCTCAAAAATACATTTGCTACAAAAATTATTTTTCAAGACTCT
AAAGCAATTGGCGTAAAGGTTTCTTTGCCAGACAAGAAAACAGCGCAGTATTATACAACA
AAAGAGATAATTGTGAGTGCTGGAACAATAAACACTCCAGTTTTACTCATGTCCTCTGGT
ATAGGACCAAAAGAACATTTGGAGAGTTTGGACATCAAAGTCGTTTCTGACTTACCAGTC
GGCAAAAATCTGCAGGATCATGTTAGAATTCCAATACCGGTGAGGATTAATACAGGAGCG
AAGGCAAAATCTCAAGATTATTGGCAAAAAGCCACACTGCAATACTTACTAGAGCAGTCA
GGTCCACACTCAACTAACTATGATCAACCTAATATTAATGCTTTTCTATCAGTCACAGAT
CATAAGCAACTCCCGGATATACAAATCGATCATAATTATTTTGTTCCAAATACTTCCTAC
ATATATTCTATGTGTAAAAATGTCATGAACTACAAGGATGAGATTTGCGAACAATTTGCT
AAAATGAACGTTGAGAGTGAAATGATAATATTTTTTGTATCTCTATGCCGACCATTTTCA
AAGGGTGAGATTTTATTGCGTTCAACTAATCCCTTCGATCATCCACGTATATATCCAAAA
TATTTCAGTGATCGACGAGACATGGATACATTCATAAAGGGTTTAAAAAAAGTTACGGAA
ATTGTGAACACAGAAGCATTAAGAAATGTAGACGCGAAGGTTGAAAGAATCTATTTTAAG
GACTGTGATGATTTTAAATTTAAATCTGATGATTATTGGGAGTGTATGGCCAGGGCTTTG
ACGTACAATGTATATCATCCTGTGGGCACCTCGAAGATGGGCAAGCCTGGAGACGCTAGC
AGTGTAGTGGATAGTAGGTTGAGGGTGTTAGGAGTGAAAAACTTGAGAGTCGTCGACGCT
AGTATAATGCCAACTATAACAAGCGTTAATACTAACGCTCCGACCATGATGATCGCAGAA
AGAGCTTCTGCGTTCATAAAACTGCAATATAAAAGCAAATACGCGAATGACGAGTTATAA
Protein sequence:
MTSLSPCVPATSPAGAAFTALISYISTLQCLITEPWPEDHSHRVKDGDQFDFIIIGSGTA
GSILANRLTQADDWKVLLLEAGDNPPLESIIPNFSGATHRSDQVWQYYTERDEMSNRACV
DGRSFWPRGRMLGGTGSINGMLHMTGSPGDYQSWNVDDGWDYLTIKKYFRKSEKIIDPYI
LNNPELLNNHGTNGEFVVDQLNFTHTDIADKLTEAYLEIGLDYLDDLNGPTQMGVGKIRG
GHHKGKRVSTATAFLNVIKERKNLYILKNTFATKIIFQDSKAIGVKVSLPDKKTAQYYTT
KEIIVSAGTINTPVLLMSSGIGPKEHLESLDIKVVSDLPVGKNLQDHVRIPIPVRINTGA
KAKSQDYWQKATLQYLLEQSGPHSTNYDQPNINAFLSVTDHKQLPDIQIDHNYFVPNTSY
IYSMCKNVMNYKDEICEQFAKMNVESEMIIFFVSLCRPFSKGEILLRSTNPFDHPRIYPK
YFSDRRDMDTFIKGLKKVTEIVNTEALRNVDAKVERIYFKDCDDFKFKSDDYWECMARAL
TYNVYHPVGTSKMGKPGDASSVVDSRLRVLGVKNLRVVDASIMPTITSVNTNAPTMMIAE
RASAFIKLQYKSKYANDEL
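
Consistency check (illustrative): the record can be sanity-checked by translating the coding sequence and comparing it with the listed protein. The sketch below is a minimal example, not part of the database record. It assumes the standard genetic code and uses only the first and last 60-nt lines of the nucleotide sequence above. The names `translate`, `FIRST_CDS_LINE`, and `LAST_CDS_LINE` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Standard genetic code (assumed), built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# First and last lines of the nucleotide sequence in this record.
FIRST_CDS_LINE = "ATGACGAGTCTAAGTCCATGTGTGCCTGCCACGTCACCGGCGGGAGCTGCTTTCACTGCT"
LAST_CDS_LINE = "AGAGCTTCTGCGTTCATAAAACTGCAATATAAAAGCAAATACGCGAATGACGAGTTATAA"

CDS_LENGTH = 1860       # "CDS Length" field of the record
PROTEIN_LENGTH = 619    # residues in the protein sequence listed above


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# The CDS length counts the stop codon; the protein sequence does not.
length_is_consistent = CDS_LENGTH % 3 == 0 and CDS_LENGTH // 3 - 1 == PROTEIN_LENGTH

print(translate(FIRST_CDS_LINE))   # start of the protein
print(translate(LAST_CDS_LINE))    # end of the protein plus the stop codon
print(length_is_consistent)
```

To check the full record, pass the whole nucleotide sequence (with line breaks removed) to `translate` and compare the result, minus the trailing `*`, with the protein sequence.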