New model in OGS2.0 | DPOGS207057  |
---|---|
Genomic Position | scaffold1:+ 1792801-1794920 |
See gene structure | |
CDS Length | 1851 |
Paired RNAseq reads   | 812 |
Single RNAseq reads   | 2044 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA013002 (0.0) |
Best Drosophila hit   | CG9517, isoform B (0.0) |
Best Human hit | choline dehydrogenase, mitochondrial precursor (2e-79) |
Best NR hit (blastp)   | PREDICTED: similar to ENSANGP00000024305 [Nasonia vitripennis] (0.0) |
Best NR hit (blastx)   | glucose dehydrogenase precursor, putative [Pediculus humanus corporis] (0.0) |
GeneOntology terms    | GO:0004344 glucose dehydrogenase activity GO:0050660 FAD binding GO:0006066 alcohol metabolic process |
InterPro families    | IPR000172 Glucose-methanol-choline oxidoreductase, N-terminal IPR007867 Glucose-methanol-choline oxidoreductase, C-terminal IPR012132 Glucose-methanol-choline oxidoreductase |
Orthology group | MCL10046 |
Nucleotide sequence:
ATGGAGGCGGCTGGCGCATTGGCGAGTCTAGCTCCATCGCCGATTACCGTGCTGGGACTG
ATACCACTCTTAGCACTTGGGATCACCTACTTCAGATATCAGCAATATGATCCGGAATCT
TATATCACAGACACAAACATTATATTACCAATCTATGACTTCGTGGTGGTCGGGGGAGGC
TCTGCAGGTGCGGTTATGGCATCAAGACTCTCTGAGATTGGTAATTGGACTGTCCTGCTC
CTTGAAGCCGGTCAAGATGAAAACGAGATTTCTGATATCCCTGCATTAGCCGGATACACC
CAATTGTCGGATATGGATTGGAAGTTCCAAACAACGCCATCTAAAAACCGTTCTTATTGC
CTCGCTATGAACGGTGACCGATGCAATTGGCCTAGAGGAAAAGTCCTTGGCGGAAGCAGT
GTCCTGAACGCAATGGTTTACGTCAGAGGCAATCGCAACGACTACGATTTGTGGGAGGCT
CTAGGCAATCCGGGCTGGTCGTACGATCAAGTGTTACCCTACTTTTTGAAATCTGAAGAT
AATCGAAATCCTTATTTGGCCTCAACACCGTATCATTCAGCAGGTGGATATTTGACGGTT
CAAGAAGCGCCGTGGCGGACACCGTTATCCATTACATTTTTAAAAGGCGGAATGGAACTA
GGTTATGATTTTCGCGATATAAACGGCGAGAAACAAACGGGTTTTATGTTGACCCAAGCA
ACTATGCGTCGTGGGAGCAGATGTAGTACGGCCAAAGCTTTTCTTAGACCAATACGTAAT
AGAGATAATTTGCACATAGCTCTGGGAGCGCAAGTCACTCGTATATTAATAAACTCGGTC
AAGAAACAAGCCTATGGTGTAGAATTTTATCGTAACGGCCAAAGACACAAAGTCAGAATA
AAACGAGAAGTAATCATGTCCGCAGGAGCATTAGCAACGCCCCAAATAATGATGTTGAGT
GGAATTGGACCCGCAGATCATCTCAGAGAGCACGGTATACCACTTGTTGCAAATCTTAAA
GTCGGTCACAACTTGCAAGATCACGTGGGCCTAGGTGGTCTTACATTTGTCGTTAACAAA
CCGGTCACATTTAAAAAGGACCGGTTCCAATCATTCTCAGTTGCAATGAACTACATTTTA
TATGAGAATGGACCGATGACGACACAAGGCGTTGAAGGTTTGGCATTTGTCAACACTAAA
TACGCTCCCACTTCTGGTAACTGGCCCGATATTCAATTTCACTTTGCACCTAGTTCAGTA
AATTCTGATGGAGGGGAGCAGATACGAAAAATTTTGAATTTACGTGACAGAGTTTACAAT
ACCGTATACAAACCTATGGAAAACGCTGAAACTTGGACCATACTACCTTTGTTATTGCGA
CCCAAAAGTTCCGGCTGGATAAAATTAAAAAGTCGAAACCCGTTCCAAGCGCCATCGATT
GAGCCCAATTACTTCGCATACAAAGAGGACATTAAAGTGCTAACGGAAGGTATAAAGATC
GCTTTCGCCTTATCAAATACCACTGCGTTCCAGAGATACGGGTCGAGACCTCTTAACATT
CCATTGCCAGGTTGCCAGCAGCATGTACTATTCAGTGATGAATATTGGGAATGCAGCCTT
AAACACTTCACGTTCACAATTTACCATCCAACTGGCACATGTAAGATGGGTCCCAATCAT
GACCAGGATGCTGTTGTCGATCCAAGATTACGAGTTCACGGGGTTGCCAACCTTCGAGTT
GTGGATGCAAGCATCATGCCCACGATCATCAGCGGCAACCCTAATGCTCCAGTAATTATG
ATAGCCGAAAAAGCCGCCGACATGATCAAAGAAGACTGGCTCGTATTATGA
Protein sequence:
MEAAGALASLAPSPITVLGLIPLLALGITYFRYQQYDPESYITDTNIILPIYDFVVVGGG
SAGAVMASRLSEIGNWTVLLLEAGQDENEISDIPALAGYTQLSDMDWKFQTTPSKNRSYC
LAMNGDRCNWPRGKVLGGSSVLNAMVYVRGNRNDYDLWEALGNPGWSYDQVLPYFLKSED
NRNPYLASTPYHSAGGYLTVQEAPWRTPLSITFLKGGMELGYDFRDINGEKQTGFMLTQA
TMRRGSRCSTAKAFLRPIRNRDNLHIALGAQVTRILINSVKKQAYGVEFYRNGQRHKVRI
KREVIMSAGALATPQIMMLSGIGPADHLREHGIPLVANLKVGHNLQDHVGLGGLTFVVNK
PVTFKKDRFQSFSVAMNYILYENGPMTTQGVEGLAFVNTKYAPTSGNWPDIQFHFAPSSV
NSDGGEQIRKILNLRDRVYNTVYKPMENAETWTILPLLLRPKSSGWIKLKSRNPFQAPSI
EPNYFAYKEDIKVLTEGIKIAFALSNTTAFQRYGSRPLNIPLPGCQQHVLFSDEYWECSL
KHFTFTIYHPTGTCKMGPNHDQDAVVDPRLRVHGVANLRVVDASIMPTIISGNPNAPVIM
IAEKAADMIKEDWLVL