New model in OGS2.0 | DPOGS206520  |
---|---|
Genomic Position | scaffold1301:- 93704-97909 |
See gene structure | |
CDS Length | 1857 |
Paired RNAseq reads   | 152 |
Single RNAseq reads   | 397 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA013951 (0.0) |
Best Drosophila hit   | CG9518 (5e-110) |
Best Human hit | choline dehydrogenase, mitochondrial precursor (5e-65) |
Best NR hit (blastp)   | PREDICTED: similar to AGAP003782-PA [Tribolium castaneum] (1e-150) |
Best NR hit (blastx)   | PREDICTED: similar to AGAP003782-PA [Tribolium castaneum] (7e-141) |
GeneOntology terms    | GO:0008812 choline dehydrogenase activity GO:0006066 alcohol metabolic process GO:0050660 FAD binding |
InterPro families    | IPR012132 Glucose-methanol-choline oxidoreductase IPR000172 Glucose-methanol-choline oxidoreductase, N-terminal IPR007867 Glucose-methanol-choline oxidoreductase, C-terminal |
Orthology group | MCL40924 |
Nucleotide sequence:
ATGTTGTGGCAACCCTTAAACCTGTCGGACGTATGTCCACCCAATGCGCATATGGACTCC
TGTACATTGTTCGGATACGTGTATCTGAACCTCTTGGTCAAGTTGTACGGTGGAAGTCGT
GACAAGGTCTCCCCCGAGACCTCTCGCCAGGAGTACGACTTCATAGTGGTGGGCGCCGGC
TCCGCTGGCTGTGTGGTAGCCAATAGACTGACGGAAAACCCCAATTGGAAGGTGCTGTTG
TTGGAGGCGGGTGGTCGTCAACCGGATGTGACTCTGTCACCAGCACTCTCCACGGCTCTA
CTCGGCTCTAATATAGATTGGAATTACTCCACGGAACCCAACGGCAAGAGTTGTCTCGCT
CACCGCAATCAAAGATGCCCTATGCCCAGAGGTAAAGTGTTGGGTGGATCAAGCACTATC
AACTCCATGTCATACGTCCGCGGGAACAGAGTTGATTATAACCTCTGGCATGACCTTGGA
AACCCTGGGTGGAGTTATCATGATGTTCTTCCATTCTTCAAGAAATCTGAAAGGAATGTT
AACATCGAGGCTCTGGATGCGGTCTATCACGGTGTCCAAGGCGAGCAATTCGTGGCTCGA
TACCCGTACATAGACACACCACCCCTCATGCTGACGGAGGGGTACACTGAAGGAGGCGCC
CCGCTGAGGGACTTCAACGGAGCCTTCCAGGAAGGAAACAACCAGGCACAAGCGTTCAGT
GTTCAAGGAGAGAGAGTTTCAACCAACACGGCCTTCCTACAACCCATCATTGAAAAGAGA
CCAAATCTCGTGGTTAAAATCGAATCGGAGGTAGTTAAAATTCTCATAGACGATAAGAAT
AGAGCTTACGGGGTTGATTATATACAAAATGGCAAAAAATATACTGTTTATGCGAAAAGG
GAAGTGATTGTTAGCGCGGGATCTATAAATACACCAAAATTAATGATGTTATCCGGCATA
GGACCGAAAGAACATTTGCAAGACTTGGGTATACCGGTCAAAAAAGACTTACCCGTGGGC
AGAAACCTTCACGATCACGTGACATTTAACGGAATGTTGCTTGCGTTACCGAACAGAACA
TCGACTCTGGTCAGTAACGAGGAGATTCTCCAGGCTGTGGTAGACTACCACGACATGGAT
ATCAAGGGAGGACCGATGTCAGCTAACGGTCCCGTTAACTCTATATGCTTCATTAAAAGC
CAGCCTGACTTGATAGCACCAGATCTACAATTCCAAGTAAATAACATCCACAACTGGAGG
CAGTATATTGAAGATCCGATACTTTATGAGGAGGTGGCGTTCCTGCCGACGGCATTCTAT
GACGCCGTGGTTATACGGCCCATGAACTTGGTACCTAAAAGTAGAGGATATGTTTTGCTC
AACGCGACCGACCCCCACGGAGCTCCTCTCATACAACCGAACTATTTCGCTGATCGTCGC
GATCTAATACCATTACTGTACGCAGTCGAATTTCTTCTGAGTCTCGAAAAAACACCAGCG
TACAGAGCCAGAGGCGCGTACTACGTCCGTGAGCCTCTGCCCGCTTGTCGTGACTATGAA
TGGGGAACAGAAGGGTATTATATTTGTCTGGCTAAAGAGTACACGTCTACCACCTATCAT
CCTGTGGGTACTTGCAAAATGGGTCCAAAAGAGGATGCAGAAGCCGTCGTGGACCCCGAG
CTGAGGGTCTACGGTGTGAAATATTTAAGAGTCATAGATGCCTCCATAATGCCGGTCATA
ATTCGAGGGAATACCAACGCTCCTACAATGATGATAGCAGAAAGGGGAGTGGACTTTGTC
ATACGACATTGGAATAAAATACTTTCGAAACAAAATGATGAGGATAAATCTCCATAA
Protein sequence:
MLWQPLNLSDVCPPNAHMDSCTLFGYVYLNLLVKLYGGSRDKVSPETSRQEYDFIVVGAG
SAGCVVANRLTENPNWKVLLLEAGGRQPDVTLSPALSTALLGSNIDWNYSTEPNGKSCLA
HRNQRCPMPRGKVLGGSSTINSMSYVRGNRVDYNLWHDLGNPGWSYHDVLPFFKKSERNV
NIEALDAVYHGVQGEQFVARYPYIDTPPLMLTEGYTEGGAPLRDFNGAFQEGNNQAQAFS
VQGERVSTNTAFLQPIIEKRPNLVVKIESEVVKILIDDKNRAYGVDYIQNGKKYTVYAKR
EVIVSAGSINTPKLMMLSGIGPKEHLQDLGIPVKKDLPVGRNLHDHVTFNGMLLALPNRT
STLVSNEEILQAVVDYHDMDIKGGPMSANGPVNSICFIKSQPDLIAPDLQFQVNNIHNWR
QYIEDPILYEEVAFLPTAFYDAVVIRPMNLVPKSRGYVLLNATDPHGAPLIQPNYFADRR
DLIPLLYAVEFLLSLEKTPAYRARGAYYVREPLPACRDYEWGTEGYYICLAKEYTSTTYH
PVGTCKMGPKEDAEAVVDPELRVYGVKYLRVIDASIMPVIIRGNTNAPTMMIAERGVDFV
IRHWNKILSKQNDEDKSP