New model in OGS2.0 | DPOGS200776  |
---|---|
Genomic Position | scaffold5937:+ 1431-2411 |
See gene structure | |
CDS Length | 981 |
Paired RNAseq reads   | 226 |
Single RNAseq reads   | 783 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012123 (8e-129) |
Best Drosophila hit   | CG1753, isoform B (8e-26) |
Best Human hit | cystathionine beta-synthase (5e-30) |
Best NR hit (blastp)   | pyridoxal-5'-phosphate-dependent protein beta subunit [Methylobacterium radiotolerans JCM 2831] (3e-87) |
Best NR hit (blastx)   | pyridoxal-5'-phosphate-dependent protein beta subunit [Methylobacterium radiotolerans JCM 2831] (2e-81) |
GeneOntology terms    | GO:0004124 cysteine synthase activity GO:0005829 cytosol GO:0006535 cysteine biosynthetic process from serine GO:0008652 cellular amino acid biosynthetic process GO:0016740 transferase activity GO:0030170 pyridoxal phosphate binding |
InterPro families    | IPR001926 Pyridoxal phosphate-dependent enzyme, beta subunit IPR001216 Cysteine synthase/cystathionine beta-synthase P-phosphate-binding site |
Orthology group | MCL22739 |
Nucleotide sequence:
ATGTCTCAGACAAGTGGGAGCACGGACATCAAAGCATCGGCCTTAGAGCTGATCGGCAAC
ACTCCACTCGTAGCGCTGGACAGGCTCTGGCCTGGACCTGGAAGGATCCTGGCCAAATGT
GAGTTTATGAATCCTGGAGCGTCCATCAAATGCCGGTCCTCGCTTTACATGATCACTAAG
GCCTTGGAGTCTGGAGCTTTGAAGCCGGGGGAACCGGTCCTGGAAATCACCTCCGGGAAT
CAGGGATGTGGTCTGGCGGTGGTGTGCGCGGTGCTGGGACATCCTTTGACCGTGACCATG
TCTGCTGGGAACAGCGTGCAGAGAGCGATACACATGGAGGCTCTCGGAGCCAGGTGTGTC
AGGGTTCCGCAAGTAGAAGGCACATACGGTAATGTGACATTATCTGATGTCAAAGCAGCG
GAGGAGAAAGGTTTGAAGCTGGTGGAAGAGACTGGCGCGTACTACGTCAACCAGTTCAAC
AATGATATGAACTCTGAAGCTCATTACGAAACAACGGGCCCAGAGATCTGGAGACAAACT
GGCCAGCGAGTCGACGCGTTCGTCGCCACAGTTGGCACAGCAGGAACCTTCGCTGGGACC
TCCAGATATCTCAAGGAAAAGGATCCGAGTATTGTTTGTGTGGTCGTGGAACCAGAAGGG
TCTGAGCCCATCAAAGGCTGCGAAGTAACGAAGCCATTGCATTTACTCCAGGGTTCGGGT
TATGGATGTGTTCCAAATTTGTTCAACTATGAACACCTGGACGACACCATAAGCGTCAGT
GACGAGGAAGTCCTGGAATACAAGAGGCTGATCGGAGAAAAAGAAGGTCTGTTCGTGGGC
TACACGAGTGCAGCAAATGTCCTGGCTGCGGCGAAGCTGCTGAAGGCTGGAAAGTTAAAG
GAAGACGCCTGGGTGGTGACCGTGCTCTGTGACACAGGCCTGAAGTACACCCCGGTGCCA
TCAGAGATCACCAAAGCGTAA
Protein sequence:
MSQTSGSTDIKASALELIGNTPLVALDRLWPGPGRILAKCEFMNPGASIKCRSSLYMITK
ALESGALKPGEPVLEITSGNQGCGLAVVCAVLGHPLTVTMSAGNSVQRAIHMEALGARCV
RVPQVEGTYGNVTLSDVKAAEEKGLKLVEETGAYYVNQFNNDMNSEAHYETTGPEIWRQT
GQRVDAFVATVGTAGTFAGTSRYLKEKDPSIVCVVVEPEGSEPIKGCEVTKPLHLLQGSG
YGCVPNLFNYEHLDDTISVSDEEVLEYKRLIGEKEGLFVGYTSAANVLAAAKLLKAGKLK
EDAWVVTVLCDTGLKYTPVPSEITKA