New model in OGS2.0 | DPOGS207061 |
---|---|
Genomic Position | scaffold1:+ 1851287-1855029 |
See gene structure | |
CDS Length | 1806 |
Paired RNAseq reads | 33 |
Single RNAseq reads | 74 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA013009 (0.0) |
Best Drosophila hit | CG9522 (1e-133) |
Best Human hit | choline dehydrogenase, mitochondrial precursor (9e-61) |
Best NR hit (blastp) | hypothetical protein TcasGA2_TC015725 [Tribolium castaneum] (7e-166) |
Best NR hit (blastx) | hypothetical protein TcasGA2_TC015725 [Tribolium castaneum] (1e-162) |
GeneOntology terms | GO:0008812 choline dehydrogenase activity GO:0050660 FAD binding GO:0006066 alcohol metabolic process |
InterPro families | IPR000172 Glucose-methanol-choline oxidoreductase, N-terminal IPR007867 Glucose-methanol-choline oxidoreductase, C-terminal IPR012132 Glucose-methanol-choline oxidoreductase |
Orthology group | MCL10046 |
Nucleotide sequence:
ATGCTGAAAACTCCAATGTATCAGTTCGAAAAGACCACGCCAAATATATTTGCATCGTTC
AAAGATAACTACGAACTGCCAAAAGAATTCAAAGGCCCTTTGAAGGAATACGACTTCATT
GTAGTAGGAGCAGGATCTGCAGGGAGTGTACTGGCTTCGAGACTTAGTGAAGGAAAACAA
GCCTCAGTACTACTTTTAGAGGCTGGCCAAGGAGAAGCTATCCTTACAGGAGTGCCCATT
CTGGCACCAATGTTACAACGAACTAATTACGTATGGCCTTACCTCATGGAGTATCAACCA
GGAGTATGCATGGGTATGGAAAACGGGCGTTGTTTCTGGCCGCGAGGGAAAGCAGTCGGT
GGCACAAGCGTCGTCAACTATATGATTTACACAAGAGGATTCAAGGAAGACTGGGACAGA
ATAGCCGCTAAAGGCAATTATGGATGGTCATACGACGACGTTATCCCGTACTACATAAAA
TCCGAGAGAGCAAAACTTCGTGGATTAAACAAATCCCCGTGGCACGGGAAAGATGGCGAG
TTGAGCGTAGAGGATGTACCTTTTAGATCGAAACTATCAAAAGCATTTATGGATGCTGCA
AAATTATTAGGACAGAGACAAGTCGACTATAACAGCCCAGACAGCTTTGGCTCGAGTTAC
ATTCAAGCAACAATAAGTAAAGGAATACGAGCGAGTAGCGCGAGAGCATTTCTTCACAAC
AATAAGAAAAGAAAGAACCTCCACATCTTGACAAACAGTAGGGTGACAAGAATTATTATA
GATCCATACACAAAAACAGCCATCGGTGTGGAGTTCCAAAGGGAGGGGAAAATGTACAAT
ATTACAGCTAAAAAGGAAGTCATACTTAGTGCTGGACCCATCGAATCGCCACATTTGCTC
ATGTTATCAGGGATAGGACCCAGGGAGCATCTTCAAAGCATGGGAATTAATGTGATACAA
GATCTTAGAGTTGGAGAGACTCTATATGACCATATATCTTTCCCGGCTTTAGCATTTACT
TTAAACGCGACGAGATTGACTTTAGTAGAAAGAAAACTTGCCACGTTGGATAATGTTGTC
CAGTACACACAGTATGGAGACGGACCGATGTCTTCTTTGGCTGGAGTAGAAACTTTAGGA
TATATTAAAACAGAACTATCTGATGAACCTGGTGATTATCCTGACATTGAACTCTTAGGT
AGCTGCGCCTCTCTGGCGTCAGACGAAGGCGATGTAGTAGCTCGGGGAATAAGAATCGCT
GATTGGCTATACAATGACGTCTACAGACCTATAGAAAATGTCGAAAGTTTCACAATACTG
TTTATGCTTTTACATCCGAAATCTAAAGGGCACTTAAAGTTAAAATCGAAAAATCCATTT
GAACAACCAAATCTCTATGGCAACTATTTAACACACCCTAAAGATGTAGCGACCATGATT
GCAGCTATTCGATACATATTACGATTAGTAGACACCCCGCCATATCAAAAATATGGCGCT
ACATTACATACTAAAAAATTCCCTAATTGTATGTCATACCAATTTAACAGTGACGCTTAT
TGGGAGTGTGCTATTAGAACGGTGACGTCAACACTTCACCACCAAATCGCGACATGTAAA
ATGGGCCCCCCGCAAGACCCCGAAGCAGTTGTGGACCCCGAATTGCGAGTTTATGGAATA
AAAAAATTACGAGTTATAGACTCAGGGGTTATACCTCAGACAATAGTAGCACACACTAAC
GCACCCGCTATTATGATAGGGGAGAAGGGTGCGGATTTAATAAAACGTACATGGGGTCTG
CTCTAG
Protein sequence:
MLKTPMYQFEKTTPNIFASFKDNYELPKEFKGPLKEYDFIVVGAGSAGSVLASRLSEGKQ
ASVLLLEAGQGEAILTGVPILAPMLQRTNYVWPYLMEYQPGVCMGMENGRCFWPRGKAVG
GTSVVNYMIYTRGFKEDWDRIAAKGNYGWSYDDVIPYYIKSERAKLRGLNKSPWHGKDGE
LSVEDVPFRSKLSKAFMDAAKLLGQRQVDYNSPDSFGSSYIQATISKGIRASSARAFLHN
NKKRKNLHILTNSRVTRIIIDPYTKTAIGVEFQREGKMYNITAKKEVILSAGPIESPHLL
MLSGIGPREHLQSMGINVIQDLRVGETLYDHISFPALAFTLNATRLTLVERKLATLDNVV
QYTQYGDGPMSSLAGVETLGYIKTELSDEPGDYPDIELLGSCASLASDEGDVVARGIRIA
DWLYNDVYRPIENVESFTILFMLLHPKSKGHLKLKSKNPFEQPNLYGNYLTHPKDVATMI
AAIRYILRLVDTPPYQKYGATLHTKKFPNCMSYQFNSDAYWECAIRTVTSTLHHQIATCK
MGPPQDPEAVVDPELRVYGIKKLRVIDSGVIPQTIVAHTNAPAIMIGEKGADLIKRTWGL
L