DPGLEAN21959 in OGS1.0

New model in OGS2.0DPOGS200776 
Genomic Positionscaffold5937:+ 1431-2411
See gene structure
CDS Length981
Paired RNAseq reads  226
Single RNAseq reads  783
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012123 (8e-129)
Best Drosophila hit  CG1753, isoform B (8e-26)
Best Human hitcystathionine beta-synthase (5e-30)
Best NR hit (blastp)  pyridoxal-5'-phosphate-dependent protein beta subunit [Methylobacterium radiotolerans JCM 2831] (3e-87)
Best NR hit (blastx)  pyridoxal-5'-phosphate-dependent protein beta subunit [Methylobacterium radiotolerans JCM 2831] (2e-81)
GeneOntology terms




  
GO:0004124 cysteine synthase activity
GO:0005829 cytosol
GO:0006535 cysteine biosynthetic process from serine
GO:0008652 cellular amino acid biosynthetic process
GO:0016740 transferase activity
GO:0030170 pyridoxal phosphate binding
InterPro families
  
IPR001926 Pyridoxal phosphate-dependent enzyme, beta subunit
IPR001216 Cysteine synthase/cystathionine beta-synthase P-phosphate-binding site
Orthology groupMCL22739

Nucleotide sequence:

ATGTCTCAGACAAGTGGGAGCACGGACATCAAAGCATCGGCCTTAGAGCTGATCGGCAAC
ACTCCACTCGTAGCGCTGGACAGGCTCTGGCCTGGACCTGGAAGGATCCTGGCCAAATGT
GAGTTTATGAATCCTGGAGCGTCCATCAAATGCCGGTCCTCGCTTTACATGATCACTAAG
GCCTTGGAGTCTGGAGCTTTGAAGCCGGGGGAACCGGTCCTGGAAATCACCTCCGGGAAT
CAGGGATGTGGTCTGGCGGTGGTGTGCGCGGTGCTGGGACATCCTTTGACCGTGACCATG
TCTGCTGGGAACAGCGTGCAGAGAGCGATACACATGGAGGCTCTCGGAGCCAGGTGTGTC
AGGGTTCCGCAAGTAGAAGGCACATACGGTAATGTGACATTATCTGATGTCAAAGCAGCG
GAGGAGAAAGGTTTGAAGCTGGTGGAAGAGACTGGCGCGTACTACGTCAACCAGTTCAAC
AATGATATGAACTCTGAAGCTCATTACGAAACAACGGGCCCAGAGATCTGGAGACAAACT
GGCCAGCGAGTCGACGCGTTCGTCGCCACAGTTGGCACAGCAGGAACCTTCGCTGGGACC
TCCAGATATCTCAAGGAAAAGGATCCGAGTATTGTTTGTGTGGTCGTGGAACCAGAAGGG
TCTGAGCCCATCAAAGGCTGCGAAGTAACGAAGCCATTGCATTTACTCCAGGGTTCGGGT
TATGGATGTGTTCCAAATTTGTTCAACTATGAACACCTGGACGACACCATAAGCGTCAGT
GACGAGGAAGTCCTGGAATACAAGAGGCTGATCGGAGAAAAAGAAGGTCTGTTCGTGGGC
TACACGAGTGCAGCAAATGTCCTGGCTGCGGCGAAGCTGCTGAAGGCTGGAAAGTTAAAG
GAAGACGCCTGGGTGGTGACCGTGCTCTGTGACACAGGCCTGAAGTACACCCCGGTGCCA
TCAGAGATCACCAAAGCGTAA

Protein sequence:

MSQTSGSTDIKASALELIGNTPLVALDRLWPGPGRILAKCEFMNPGASIKCRSSLYMITK
ALESGALKPGEPVLEITSGNQGCGLAVVCAVLGHPLTVTMSAGNSVQRAIHMEALGARCV
RVPQVEGTYGNVTLSDVKAAEEKGLKLVEETGAYYVNQFNNDMNSEAHYETTGPEIWRQT
GQRVDAFVATVGTAGTFAGTSRYLKEKDPSIVCVVVEPEGSEPIKGCEVTKPLHLLQGSG
YGCVPNLFNYEHLDDTISVSDEEVLEYKRLIGEKEGLFVGYTSAANVLAAAKLLKAGKLK
EDAWVVTVLCDTGLKYTPVPSEITKA