New model in OGS2.0 | DPOGS207080  |
---|---|
Genomic Position | scaffold1:+ 2184651-2187719 |
See gene structure | |
CDS Length | 1848 |
Paired RNAseq reads   | 1377 |
Single RNAseq reads   | 3654 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA000068 (3e-109) |
Best Drosophila hit   | CG9519 (2e-70) |
Best Human hit | choline dehydrogenase, mitochondrial precursor (2e-58) |
Best NR hit (blastp)   | putative ecdysone oxidase [Heliconius melpomene] (1e-121) |
Best NR hit (blastx)   | putative ecdysone oxidase [Heliconius melpomene] (1e-114) |
GeneOntology terms    | GO:0008812 choline dehydrogenase activity GO:0006066 alcohol metabolic process GO:0050660 FAD binding |
InterPro families    | IPR000172 Glucose-methanol-choline oxidoreductase, N-terminal IPR007867 Glucose-methanol-choline oxidoreductase, C-terminal IPR012132 Glucose-methanol-choline oxidoreductase |
Orthology group | MCL23522 |
Nucleotide sequence:
ATGCAGGCAGCACCAGAGTGTTATATAAGGATGAGAGAGTATAACATTGGGTGCAGTTTG
TTGTTACTGGTCGAGTCGAGAGGATTCTCATACGCAACGCAGCTATCGAGGGCGATGGGC
AACTTTTATAGGATCAATGCCTTGATTTTATTGTCTGCTCTAGGACTTACCGCAAATAAA
TGGCCTCCTGATACCTTTATTCCAAATAACGGAGAATTCACTGCCGATTATGTAGTGGTA
GGAGCTGGTACGGCAGGAAGCATAATTGGCTTTCGTCTAACAGAGGATCCTAATGTCGAT
GTCGTGATGGTTGAAGCTGGCGATGATCCCCCAACAGATGCGGAATTACCAGGGTTATTC
TTTTCATTGCCAAAAACTAAAATTGATTGGAATTATACATCAGAAGACGATGGCTACAGT
GCTCAGTATCATAGAAATAAATTTGTTGATTTACCATCGGGAAAAGTACTCGGTGGAAGC
AGCAGCCTTCATCACTTCTATTACCTCAGGGGAGATGCCGCTGACTTTGAAGACTGGGTG
AAAGCTAGTGGCAATGAATCGTGGTCTTTAGAAAACCTCTTACCTTATTTTAAGAAGAGT
GAACGTCTCGAGGACAAGGACATAAGCGATTCAGAAACTGGTAATTTACATGGATACAGC
GGAGAGGTCGGAATCACGAGACGTGTAACAGAATTGCCAGAAAAATATTTACAAGCATTC
CAAGAAGTTGGACATCCAGTTGTTCTTGATATTAACGGCCATCATGTCAAAGGATTTACA
CAACCTTTGTTTTTTATTGCTGAAAAGAAGCGACAAAGTAGTGCCGAAGGTTATTTAACT
AGAGCAAAGTCTCGAGATAATCTTCATCTAGTAAAGAATACAATAGCTAACAGAATTTTG
TTTGATTCCAATAATAATGCTATCGGTGTTGAATGCGCTTCATTAGACGGAAGAGTGTTC
AAAGTTTTCGCTCGAAAGGAAGTCGTCATATCTGCTGGGGCTTTCAATACGCCTAAATTG
TTAAAACTATCGGGCATAGGTCCTCGAGCTGAACTCGAAAGTTTTGGCATTAAGGTTATT
TCAGATCTACCAGTGGGAGAAAATTTACAAGACCATTTGGCTGTCGTTCTTGCTCATGGA
CTAGAAAAAACTAATGACACTCCATCGGCTCCAATTCTGAATGATTTTCCTCTAGACACT
TTTGTAGGTTTAGAATCTATTGACCCAAATCAGGAAAAACCAGATTATCTGACGTTAAAC
CTAATTTGTAGAAATAATCCAGAGTGTTTGAGTCAACTTTGTTCCGTTGTGTTTGGTTTA
AACCAAGACGTATGTAATCAGATAATGAAAGCTGGTGAAGGTAGAGAGATTTTAGTCTCT
ATACTTACTGTCTGTCGTCCAGTATCCACTGGAAGAGTTTTACTGAAGAGTTCAGACCCT
AAAGACCCGCCTGTGATCTATACCGGTTTCCTTTCTAACAAAACTGATCTGGAAAACAGC
GCTCGTTATATCGAAGACTTCATAAGAGTTGTAGAGTCAAAATACTTTAAGAGTGTCGGA
GGAGAGACTTTACAACCACATTTACCGAATTGTTCGCACTTACAGTGGAACACGAGAGAA
TATTGGAAGTGTTATGTTCTCAACATGATGGACACTACATTCCACTACAGTAGTACATGT
CCAATGGGTTCCGTATTAGATTCTCAATTGAGAGTGCGAGGTGTGGGGAGACTGCGAGTA
GGCGATGCCAGTGCTATGCCGAATATAGTCTCAAGTAACATAAACGCTGCTGTCATGGTA
CTTGCTGAAAAGCTTGCTGACCTTCTTAAGGAGTCAGGTAAACAATGA
Protein sequence:
MQAAPECYIRMREYNIGCSLLLLVESRGFSYATQLSRAMGNFYRINALILLSALGLTANK
WPPDTFIPNNGEFTADYVVVGAGTAGSIIGFRLTEDPNVDVVMVEAGDDPPTDAELPGLF
FSLPKTKIDWNYTSEDDGYSAQYHRNKFVDLPSGKVLGGSSSLHHFYYLRGDAADFEDWV
KASGNESWSLENLLPYFKKSERLEDKDISDSETGNLHGYSGEVGITRRVTELPEKYLQAF
QEVGHPVVLDINGHHVKGFTQPLFFIAEKKRQSSAEGYLTRAKSRDNLHLVKNTIANRIL
FDSNNNAIGVECASLDGRVFKVFARKEVVISAGAFNTPKLLKLSGIGPRAELESFGIKVI
SDLPVGENLQDHLAVVLAHGLEKTNDTPSAPILNDFPLDTFVGLESIDPNQEKPDYLTLN
LICRNNPECLSQLCSVVFGLNQDVCNQIMKAGEGREILVSILTVCRPVSTGRVLLKSSDP
KDPPVIYTGFLSNKTDLENSARYIEDFIRVVESKYFKSVGGETLQPHLPNCSHLQWNTRE
YWKCYVLNMMDTTFHYSSTCPMGSVLDSQLRVRGVGRLRVGDASAMPNIVSSNINAAVMV
LAEKLADLLKESGKQ