New model in OGS2.0 | DPOGS207059  |
---|---|
Genomic Position | scaffold1:+ 1833870-1835903 |
See gene structure | |
CDS Length | 1914 |
Paired RNAseq reads   | 572 |
Single RNAseq reads   | 1380 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA013005 (0.0) |
Best Drosophila hit   | CG9521 (4e-142) |
Best Human hit | choline dehydrogenase, mitochondrial precursor (8e-74) |
Best NR hit (blastp)   | PREDICTED: similar to glucose dehydrogenase [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to glucose dehydrogenase [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0008812 choline dehydrogenase activity GO:0050660 FAD binding GO:0006066 alcohol metabolic process |
InterPro families    | IPR000172 Glucose-methanol-choline oxidoreductase, N-terminal IPR007867 Glucose-methanol-choline oxidoreductase, C-terminal IPR012132 Glucose-methanol-choline oxidoreductase |
Orthology group | MCL10197 |
Nucleotide sequence:
ATGATCTCAAATAAGAATGGTGAAAGTATCATGATGTGGAGCCATTGGATTCTATGTTTC
AAGGTTTTGTTTGGAGTTTTACTATTTCCATCACCGAGTAAACTTCAGTCCGTCAACCCG
ATAACATCGTTTATGAATTTTTTACAAGAAGGTACGAATCAACGTGACAATGAACCACCC
GACCAAGTTAATTTGTTGACGGAGTACGACTTCATTGTTGTTGGTGCGGGAACAGCTGGG
TGCGTTGTGGCTAACCGATTAACAGAATTAAAGGACGTGAAAGTTCTACTCTTAGAAGCT
GGAGTTAATGAGAACTACGTTATGGACATACCAATTCTAGCAAATTATCTGCAGTTCACT
GAAGCGAACTGGGGATACAAGACGAAACCCTCGAAAAAATATTGTGCAGGTTTCGAAAAT
CAGCAATGTAATTGGCCACGCGGAAAAGTTGTCGGTGGATCAAGTGTCCTAAATTATATG
ATATACACACGAGGGGCTGCAGATGATTATAACAATTGGGCATCAAAAGGTAATGAAGGC
TGGGGATGGGACGATGTACTGGATTATTTCAAAAAAATTGAAAATTACAACATACCAGCC
TTTGACGATCCTAAATATCACGGCCATGACGGCCATGTTAATGTAGAGTATGCACCATTT
CGTACAACAAAAGGAAAAGCTTGGGTTAAAGGGGCCCAAGAATTAGGCTTTAAGTATAAT
GATTACAATGGACAAAATCCAAGTGGTGTCTCTTTCCTACAACTGTCTATGAAGAACGGA
ACAAGGCACAGTTCCAGTCGAGCATATCTTCATCCTATAAAGAAAAGAAATAATTTACAC
GTATCTAAAGTGAGCATGGCTACGAGATTACTGTTCGATACAACAAAAACTCGTGTAATT
GGAGTCGAATTCGAGAAACGAGGAAAGCGCTATAAAATATTAGCAAAAAAAGAGATCATT
GTATCGGCTGGTGCAATCAATTCACCTCAACTCCTCATGTTATCAGGAATAGGCCCTAAA
AAGCATTTAGAGTCACTAAATATTCCAGTTGTAAAAGATTTACCTGTAGGATATAATCTA
ATGGACCACATTGCCGCCGGTGGACTCCAATTTATTGTTCAACAACAAAACCTCAGTCTG
TCTACTGGTTATATTTTAAACCATTTAGAATTGGTATTTAAGTGGATGCGGAATCATAAA
GGACCGTTGTCTGTGCCTGGTGGTTGCGAAGCATTAGTATTTTTGGATTTAAAAGATAGA
TTTAACGTGAGCGGCTGGCCGGACTTAGAACTGCTTTTTATAAGTGGGGGATTAAATTCA
GATCCTTTGTTAAGAAGAAATTTTGGTTTCGATGAACAAATATTCACAGACACCTATACA
GCTCTAGGTAATAATGAAGTTTTTATGGTTTTTCCAATGTTGATGAGACCAAAATCAAGA
GGCAGGGTAATGTTACAAAACAGAAATCCAAAGTCACATCCGATATTAATCCCAAATTAC
TTTGATGATCCAGAAGATTTGCAAAAAATTGTGGAAGGCATCAAAGTGGCAATTGAGATA
ACTCGTCAACCGTCAATGAAAAAGATACAAACGAAATTATATGACGTTCCTATCGCTGAC
TGTCTGAAGTATGGGCCTTTCGGCAGTGACGAGTACTTCGCGTGTCAAGCACAAATGTTC
ACTTTTACAATTTACCATCAAAGTGGGAGTTGTAAAATGGGTGTCAAAAGTGATCCTACA
GCGGTTGTAGATCCTAGACTAAGAGTACATGGTATAGAAAATCTAAGAGTAATCGATGCT
AGTATAATGCCAGAAATTGTTTCAAGTCATACAAATGCCCCAACATTCATGATAGCAGAA
AAGGGCGCAGACATGATTAAAGAAGACTGGGGGAGAAAATCGCAGAACATGTAA
Protein sequence:
MISNKNGESIMMWSHWILCFKVLFGVLLFPSPSKLQSVNPITSFMNFLQEGTNQRDNEPP
DQVNLLTEYDFIVVGAGTAGCVVANRLTELKDVKVLLLEAGVNENYVMDIPILANYLQFT
EANWGYKTKPSKKYCAGFENQQCNWPRGKVVGGSSVLNYMIYTRGAADDYNNWASKGNEG
WGWDDVLDYFKKIENYNIPAFDDPKYHGHDGHVNVEYAPFRTTKGKAWVKGAQELGFKYN
DYNGQNPSGVSFLQLSMKNGTRHSSSRAYLHPIKKRNNLHVSKVSMATRLLFDTTKTRVI
GVEFEKRGKRYKILAKKEIIVSAGAINSPQLLMLSGIGPKKHLESLNIPVVKDLPVGYNL
MDHIAAGGLQFIVQQQNLSLSTGYILNHLELVFKWMRNHKGPLSVPGGCEALVFLDLKDR
FNVSGWPDLELLFISGGLNSDPLLRRNFGFDEQIFTDTYTALGNNEVFMVFPMLMRPKSR
GRVMLQNRNPKSHPILIPNYFDDPEDLQKIVEGIKVAIEITRQPSMKKIQTKLYDVPIAD
CLKYGPFGSDEYFACQAQMFTFTIYHQSGSCKMGVKSDPTAVVDPRLRVHGIENLRVIDA
SIMPEIVSSHTNAPTFMIAEKGADMIKEDWGRKSQNM