New model in OGS2.0 | DPOGS207058  |
---|---|
Genomic Position | scaffold1:+ 1807475-1826592 |
CDS Length | 3684 |
Paired RNAseq reads   | 175 |
Single RNAseq reads   | 466 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA013003 (9e-156) |
Best Drosophila hit   | CG9518 (0.0) |
Best Human hit | choline dehydrogenase, mitochondrial precursor (3e-74) |
Best NR hit (blastp)   | glucose dehydrogenase [Aedes aegypti] (0.0) |
Best NR hit (blastx)   | glucose dehydrogenase [Culex quinquefasciatus] (0.0) |
GeneOntology terms    | GO:0008812 choline dehydrogenase activity; GO:0006066 alcohol metabolic process; GO:0050660 FAD binding |
InterPro families    | IPR000172 Glucose-methanol-choline oxidoreductase, N-terminal; IPR007867 Glucose-methanol-choline oxidoreductase, C-terminal |
Orthology group | MCL10046 |
Nucleotide sequence:
ATGGCTATCCAAGTCCTTTTAGCATCGACAGCTTTAAAATCAGTCAGCGTTACTGGTCTG
TGGTTAATACCACTTCTGCTTGGGGCCTTCACGTATCATAATTATAACTCCTACGATCCA
GAATCGAAGGTACTAGAAAAAGAACCTAAGAGGGAGTACGATTTCGTTGTAGTTGGCGGA
GGCTCTGCTGGTGCAGTCGTTGCAAATCGTCTAACCGAAATCAAAGATTGGAATTTACTT
TTATTAGAAAGTGGACCAGACGAGAACGAGATTACTGATGTCCCCTCTTTAGCCGCTTAT
TTGCAACTAACGAAGTTGGATTGGCAATACAAGACTGAACCGACACCTTACGCTTGTTTG
GGTTTTAAGAACAACAGGTGCAGCTGGCCGAGAGGAAAGCTTCTCGGCGGTTCCAGCGTT
TTAAACTATATGATTTACGTAAGAGGTAATAAATACGACTACGACCAATGGGAATCTTTT
GGCAATCCAGGATGGGGATATCGAGATGTTCTTAAATATTTTATTAAATCCGAAGATAAC
AGAAACCCTTATTTGGCCAAAAATCAGTATCATGGTCAAGGCGGTTATTTGACTGTGCAG
GAAGCACCATGGAAAACACCCCTTGTAGCAGCTTTCGTTGAAGCTGGGGTCGAAATTGGC
TATGACAACAGAGATATAAATGGTGCCATCCAAACCGGGTTCATGATGGCCCAAGGGACG
ATAAGACGTGGTTCTAGATGCAGCACAGCTAAAGCATTTTTAAGACCAGTGAGAACCCGT
AAAAATTTAGATATTTCACTGCATTCACACGTTACTAAAATACTCATTAATCCTATGACA
ATGAAAGCTTACGGAGTAGAATATGTAAAACATGGTATTAAGAAAGTGGTTTATGCTAGA
AAGGAAGTTATATTGTCGGCAGGAGCCATTAACAGTCCACAATTATTAATGCTTTCTGGT
ATTGGTCCAAAAGATCACTTACAGAGCGTTGGCATAAAAGTCCTAAAAGATTTACCAGTA
GGAGAAAATTTAATGGATCATGTGGGAGTAGGAGGACTGACATTTCTAGTCGATAAACCA
GTCGGAATTGTCCAAAATAGACTTCAGGCATTTCCTGTTACAATGAATTACGTATTAAAC
GAAAGAGGCCCCATGACCACATTAGGAGGACTTGAAGGTATTGCTTTTGTAAATACAAAA
TATGCTAATAGCTCCGGATTATGGCCTGATATTCAATTCCATATGGCTCCTGCAACATTT
GCTTCAGATAATGGACAAACTGTGAAAAAAGTGTTAGGTCTGAAAGATGAAATTTATGAC
ACTGTTTTTAAACCTATAGCAAATAAAGATGGGTGGACTATTATGCCACTGTTGTTACGG
CCTAATACTAGAGGTTACGTTCGATTAAAAAGTTCCAATCCTTTTGAGTATCCTATAATG
AATCCACGCTATCATGAAGATCCTCTAGATGTAAGTCGCCTTGTTGAAGGGATAAAAATT
GCCTTAAAAGTTGCGAACGCTTCCCCATTTAAGCAATTTGGATCAAGATTATATATGAAA
CCATTACCAAACTGTAAACAACACAAATTTATGTCCGATGAATATATTGAATGTCAAGTT
AGATCAATAAGTATGACCATATATCACCAATGTGGGACGGCTAAAATGGGACCATCTTGG
GATAAGGGTGCTGTTGTTGACCCTAGATTGAGGGTGTTTGGTATTGAAGGACTAAGAGTT
ATAGACGCTAGCATAATGCCGACTATTGTGAGTGGAAACACAAATGCACCAGTAATCATG
ATAGGAGAAAAGGGTTCTGACATGATAAAAGAAGATTGGTTGAATAGCCTCTGCTACTTC
CCTCTCGCAACTTTTGGAAGGGATACTATCCTCGATGGGATAGCCGGCTTTCTCCGTGAC
GCAGCGGAGATACATAACGGTGAGCCAGCCGAGACTGACTTCATCTTACCCAAGTACGAC
TTCATCATCGTCGGTGCTGGCACAGCTGGTTGTATACTTAGCAACAGATTGACCGAAGTC
GATAAGTTTAAGGTCCTTTTAATAGAGGCAGGTGGAGCAGAGCAAGTATTTATGGACATC
CCCGTTCTGGCTACAATGCTGCAATTCACTGAAGCAAATTGGAAGTATCGCACAGAACCT
CAAAAGGCCGGATGTATGGGTATGCGTGATAAACGGTGCGCATGGCCAAGAGGAAAAGTC
GTAGGAGGGTCTTCCGTGCTCCATTCAATGATGCACACGAGGGGAAATAAACGAGATTAT
GATACATGGGCAGCTAGTGGAAATCCAGGTTGGGATTATGATAGCGTATTGAAATATTTT
AAAAAATCAGAAAATATTGAAATTCCACATTTGGTAAATGACAAAAAATATCATTCAACT
CAGGGGCCGATGACAATACAAGAGCCAAGATGGCGAACTCCACTATCAGATGCCTTCCTT
GATGCCGGAGTCGAAATCGGTGGAAATATTAATGATTATAATGGTAAAACACAGATTGGA
TATTCCATTATTCAATTTACTATGAAGAATGGAACTAGAATGAGTGTCAGTCGAGCTTTC
TTACATCCTATAAAAAAACGACGTAATTTTCATATCATTAAGAATGCTTTAGTGACCAAA
GTTCTCATAGATCACAAAAAAAAACGCGCTTATGGCGTACAATTTGAAAAAGATGGTAAA
CAAATTGTAGTAAGAGCAAAACGAGAAGTGATTTTATCCGCCGGATCTGTGAACTCTCCA
CAGTTATTGATGCTGAGCGGAATAGGACCAAGGGACGATCTCATAAAAATAAATATTACA
ACAGTGTCAGACTTACCGGTAGGATACAATTTGCAAGATCACTATGCGTTGGGTGGTCTA
ACTTTCATAATCAATACAACAGACTCTCTTAGATTTGAAAGAATTGCAACCTTGAATAAC
ATCATTGAATACTTTTGTCATCACACCGGTCCTCTAACAGTTCCGACCGGTGCGGAAGCA
CTTGCTTTCATTGATACCAAAAATCCAAATAATAGAGATGGTTATCCTGATTTAGAACTA
TTATTTGTGGGCGGTTCAATTGTTTCCCAAAATGCTTACCGGTACGCATTTGACATCGAT
GACATTTTGTATGACACAGTTTATAGACCAATTGCCAATAGTGATACCTGGATGGTATTT
CCGATGCTGTTACTCCCTAAATCGAGAGGCTACATAAAACTAAGGAGTAATAAACCACAC
GACAAACCAATTATCAATCCAAACTATTTTACTGACGGAGGACACGACGATCATGTTATC
TTGTATGGTATTAGGAAAGTGTTACAGTTATCCCAAACAAAAGCTTTTCAAAAATATGGG
AGTAAACTTCACGATATTCCTATTCCTAATTGCGCTCAACACAAATTCGATTCAGATAGT
TATTGGTTATGCGCTATGAGGGCACTAACGAATACTATATACCATCCTTGCTGCACAGCA
AAAATGGGACCAAGTAATGACCCTGAAGCAGTCGTCGATTCACGTTTGAAAGTCCACGGT
ATGGAAGGTCTAAGGGTTGTGGATGCTAGTATAATGCCAAATATTCCTGCGGCCCACACA
AATGCACCCACAATGATGATCGCTGAAAAGGCCGCCGACATGATAAAAGAAGACTGGGGT
ATACCCATACCAATATCAAACTGA
Protein sequence:
MAIQVLLASTALKSVSVTGLWLIPLLLGAFTYHNYNSYDPESKVLEKEPKREYDFVVVGG
GSAGAVVANRLTEIKDWNLLLLESGPDENEITDVPSLAAYLQLTKLDWQYKTEPTPYACL
GFKNNRCSWPRGKLLGGSSVLNYMIYVRGNKYDYDQWESFGNPGWGYRDVLKYFIKSEDN
RNPYLAKNQYHGQGGYLTVQEAPWKTPLVAAFVEAGVEIGYDNRDINGAIQTGFMMAQGT
IRRGSRCSTAKAFLRPVRTRKNLDISLHSHVTKILINPMTMKAYGVEYVKHGIKKVVYAR
KEVILSAGAINSPQLLMLSGIGPKDHLQSVGIKVLKDLPVGENLMDHVGVGGLTFLVDKP
VGIVQNRLQAFPVTMNYVLNERGPMTTLGGLEGIAFVNTKYANSSGLWPDIQFHMAPATF
ASDNGQTVKKVLGLKDEIYDTVFKPIANKDGWTIMPLLLRPNTRGYVRLKSSNPFEYPIM
NPRYHEDPLDVSRLVEGIKIALKVANASPFKQFGSRLYMKPLPNCKQHKFMSDEYIECQV
RSISMTIYHQCGTAKMGPSWDKGAVVDPRLRVFGIEGLRVIDASIMPTIVSGNTNAPVIM
IGEKGSDMIKEDWLNSLCYFPLATFGRDTILDGIAGFLRDAAEIHNGEPAETDFILPKYD
FIIVGAGTAGCILSNRLTEVDKFKVLLIEAGGAEQVFMDIPVLATMLQFTEANWKYRTEP
QKAGCMGMRDKRCAWPRGKVVGGSSVLHSMMHTRGNKRDYDTWAASGNPGWDYDSVLKYF
KKSENIEIPHLVNDKKYHSTQGPMTIQEPRWRTPLSDAFLDAGVEIGGNINDYNGKTQIG
YSIIQFTMKNGTRMSVSRAFLHPIKKRRNFHIIKNALVTKVLIDHKKKRAYGVQFEKDGK
QIVVRAKREVILSAGSVNSPQLLMLSGIGPRDDLIKINITTVSDLPVGYNLQDHYALGGL
TFIINTTDSLRFERIATLNNIIEYFCHHTGPLTVPTGAEALAFIDTKNPNNRDGYPDLEL
LFVGGSIVSQNAYRYAFDIDDILYDTVYRPIANSDTWMVFPMLLLPKSRGYIKLRSNKPH
DKPIINPNYFTDGGHDDHVILYGIRKVLQLSQTKAFQKYGSKLHDIPIPNCAQHKFDSDS
YWLCAMRALTNTIYHPCCTAKMGPSNDPEAVVDSRLKVHGMEGLRVVDASIMPNIPAAHT
NAPTMMIAEKAADMIKEDWGIPIPISN