DPGLEAN15603 in OGS1.0

New model in OGS2.0  DPOGS207053
Genomic Position  scaffold1:+ 1749805-1751942
CDS Length  1827
Paired RNAseq reads  611
Single RNAseq reads  1430
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA013000 (1e-96)
Best Drosophila hit  CG9509 (1e-89)
Best Human hit  choline dehydrogenase, mitochondrial precursor (9e-42)
Best NR hit (blastp)  PREDICTED: similar to ENSANGP00000015052 [Nasonia vitripennis] (4e-122)
Best NR hit (blastx)  glucose dehydrogenase [Aedes aegypti] (2e-104)
Gene Ontology terms
GO:0008812 choline dehydrogenase activity
GO:0007498 mesoderm development
GO:0006066 alcohol metabolic process
GO:0050660 FAD binding
InterPro families
IPR012132 Glucose-methanol-choline oxidoreductase
IPR000172 Glucose-methanol-choline oxidoreductase, N-terminal
IPR007867 Glucose-methanol-choline oxidoreductase, C-terminal
Orthology group  MCL10316

Nucleotide sequence:

ATGGAATCTCTGGCGGCAAATATAACTGCAACATGCCCGTTATCGTTTGGTGGCACAGCT
GGAGAACTTTTTTTAAAAGCAGTTACAACGGTGATTACCGCACATTGTGGAATCATGGAT
GACTATAAATGGCCCCCAGACGATGCTTATGATATCATCAATAAAGGATCTGGAATATCT
TTTGATTTCATAGTCGTTGGCGCAGGAACTGCTGGATCTTTAATTGCCAGCAGACTTTCA
AAGCAATATCCGTCTTGGAATATACTTCTGATTGAAGCTGGTGATGATCCCGGAATTGAT
AGTGAGATCCCAGCATTTTTATTTTTAAATCAAAACTCAAGCAATGACTGGTCATATACA
ACAGAGGGACGTGGGGAGAGTTGTTTGGGTTTCAATAATGAAAGATGCATTTGGAGTAAA
GGAAAAGGACTCGGCGGATCAAGTTCTATTAATGCGATGATTTATTTAAGAGGGCACCCT
AAAGACTATAACACATGGGAAAAGTTAGGCAACCCGGGATGGGGATACAAGGAAATGTCT
AAATATTTCGATAAAATAGAAAATATTTTTAATATTACTGACCCTCACTTCAGCGGATAC
GAAAACCAATGGTATAAAATTTTAGATAATGCATGGAAAGAATTATCTTTTGCAAATTAT
AATTACGAAAATCATGAAGCCCTAACCGGGACCAAGAAAACGAGACTGCTAACAAGAAAT
GGGAAACGTATGAACACAGCTAAAGCATTTTTTAACCAGGCAGGAAAAATGACTGTAATG
AAAAATACGCAGGTAGAGAAGGTTATAATTAACCCAAAAACTAAACGAGCTACTGGTGTC
AAAATACACCACAAAGATGGAACCATCATGGAAATTGATGTTAGCAAAGAGATATTATTG
GCAGCTGGTTCGATTGCAACTCCACAAATTCTTATGCTATCAGGAATCGGACCTAAAGAT
CACCTTAAAGTTATGGGCATCGATATCATCTTAAATTCACCCGTAGGAAAAAACTTACAA
GATCATATTATTCTTCCATTATTTCTTAAAACCAATATAAAAATGGAACTGCCTTCTTCT
GTTATTCAAATGTTTTTGTTACAGTACATGTTAACGAAATCGGGACCAATATCAAACATC
GGTCTAACAGATTACATGGGTTTTATAGATACGAAAAACGTATCAGATTATCCAGATATA
CAATTTCACTACACATATTTCACTAAGAACGACAATTTTGTTTTAAGGCCATACCTAGAA
GGCATTGGTTATAAAAGAAAAATCATTGAAGCCATAGAGGCGTTGAACTACAAAAACGAT
ATTCTAGGCATTTATCCGACATTATTGCATCCTAAGGCTAGGGGTGAGATATTTCTTTCA
GAACGTGATTTATCAAAACCTATTATAAATGCTAATTATTTTCAACATTCTGACGACATG
CTAGCAATGATAGAGGCTATTGATTTTATTCACACACTCGAAAAAACCTCCACGTTCGAG
AAATACAATATAAAATTGTTACATATTAATATTTCTGAATGCGATATATATCCATTTGAC
ACTGAGAAATATTGGGAATGTTATATAAAATATATGGCGACGACGATTTATCATCCCGTC
GGTACTACCAAGATGGGACCACCAGAAGATGCGTCTGCTGTTGTAAATTCTGAATTAATT
GTTCATGGAACACCAAACATCAGAGTTGTTGACGCTAGCATAATGCCTAACATACCGGGA
GGTAACACTATGGCAGCGACTTTGGCGATCGCCGAAAAAGCATTCGACATTGTCAAAAAG
AAATATGTCTTAAAAAATGAATTGTAA

Protein sequence:

MESLAANITATCPLSFGGTAGELFLKAVTTVITAHCGIMDDYKWPPDDAYDIINKGSGIS
FDFIVVGAGTAGSLIASRLSKQYPSWNILLIEAGDDPGIDSEIPAFLFLNQNSSNDWSYT
TEGRGESCLGFNNERCIWSKGKGLGGSSSINAMIYLRGHPKDYNTWEKLGNPGWGYKEMS
KYFDKIENIFNITDPHFSGYENQWYKILDNAWKELSFANYNYENHEALTGTKKTRLLTRN
GKRMNTAKAFFNQAGKMTVMKNTQVEKVIINPKTKRATGVKIHHKDGTIMEIDVSKEILL
AAGSIATPQILMLSGIGPKDHLKVMGIDIILNSPVGKNLQDHIILPLFLKTNIKMELPSS
VIQMFLLQYMLTKSGPISNIGLTDYMGFIDTKNVSDYPDIQFHYTYFTKNDNFVLRPYLE
GIGYKRKIIEAIEALNYKNDILGIYPTLLHPKARGEIFLSERDLSKPIINANYFQHSDDM
LAMIEAIDFIHTLEKTSTFEKYNIKLLHINISECDIYPFDTEKYWECYIKYMATTIYHPV
GTTKMGPPEDASAVVNSELIVHGTPNIRVVDASIMPNIPGGNTMAATLAIAEKAFDIVKK
KYVLKNEL
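
The nucleotide and protein sequences above can be cross-checked against each other and against the stated CDS length. The sketch below is illustrative only: it assumes the standard genetic code (NCBI translation table 1) and that each full line of the nucleotide sequence holds 60 nt, so that every line starts in frame. The helper name `translate` and the variable names are hypothetical and not part of the record. It translates only the first and last nucleotide lines and shows that 1827 nt corresponds to 609 codons (608 residues plus a stop).

```python
# Standard genetic code (NCBI translation table 1), built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS fragment in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGAATCTCTGGCGGCAAATATAACTGCAACATGCCCGTTATCGTTTGGTGGCACAGCT"
last_line = "AAATATGTCTTAAAAAATGAATTGTAA"

cds_length = 1827            # "CDS Length" field of the record
n_codons = cds_length // 3   # includes the terminal stop codon

print(translate(first_line))  # start of the protein sequence
print(translate(last_line))   # end of the protein sequence, then '*'
print(n_codons)
```

The translated fragments should match the start and end of the protein sequence listed above; a mismatch would suggest a frame shift or a different genetic code.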