New model in OGS2.0 | DPOGS210836 |
---|---|
Genomic Position | scaffold77:- 100005-109451 |
See gene structure | |
CDS Length | 2136 |
Paired RNAseq reads | 1783 |
Single RNAseq reads | 4102 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003910 (0.0) |
Best Drosophila hit | CG1486, isoform B (2e-76) |
Best Human hit | pyridoxal-dependent decarboxylase domain-containing protein 1 (3e-51) |
Best NR hit (blastp) | PREDICTED: similar to CG1486 CG1486-PA [Tribolium castaneum] (4e-154) |
Best NR hit (blastx) | PREDICTED: similar to CG1486 CG1486-PA [Tribolium castaneum] (3e-123) |
GeneOntology terms | GO:0019752 carboxylic acid metabolic process GO:0016831 carboxy-lyase activity GO:0030170 pyridoxal phosphate binding |
InterPro families | IPR015424 Pyridoxal phosphate-dependent transferase, major domain IPR015421 Pyridoxal phosphate-dependent transferase, major region, subdomain 1 IPR002129 Pyridoxal phosphate-dependent decarboxylase |
Orthology group | MCL13871 |
Nucleotide sequence:
ATGGGAGATGCCCCGACTTCCGAAATGGACTCAAATAAAGTTAGCCTTGGAAGCGAAAAT
CCATCCGATGTGGACCGGCGACCGTTTGGAGGACTTGAATTCCAAGTTTCCGAAGTAGTA
GAGAGGTTGGAAGCTGGTGTGAATGCTCAAGATGCCATGGAAGAAGAAAAAAAGCCAGAA
GAACGAAAAATAAGCACGGGATTCTTCGAGCCAGAAAAAATGGATATGGATGAAATTTTG
AAGGTCTTAGAACAACTAGTACTTCAAACTGATCCAAGTTGTGAAAGTTTGGAACCACCC
TTATTGCCAACGGATTCTGTGACGCGAGCCGCAATACTTTCTCATAGTATTTCGGCATTA
TTCTCGAGGTTGGAGAGGAGCCACGCTGCCCGGCTAGGAACGCACATAGCTACTGAAACC
ACGCGATGGATGGCGCATTTATTTAGGTTGTCCGATTACGACGCGTTTTATCACCAAGAG
CAGCTCGAGGGTCTGGTCAGAGTCACTCGGATGCTGTTACACCACAAGTACCCGAGATAT
CTCGAAGATGGAGCTCTAGCTTTCTCGAACCGTCTCCCCTCCATCTACAGCTGTGTGGCG
AGTCCTCTGGGCGTGGTCCAACACCTGTGCCGGCAGCTGGGTCTGCCGCTGGCCTGCGTC
AGACCGGTGCCAGTAGATTCATCTGGTAAGGGTATGGATCTGAATGCTCTGGATCGTCTG
TGCGAGGAGGACTCGGCTGGTCGTACTCCGCTGCTGGTGTTAGGCGAGGCGGGCGAGCCT
CCCCTCGGCGGGGGATCCCCGCTGAAAGCGCTGGCTGAACTATGTGGACGTAGAGGGGTC
CACTTACATGTGAGGGGACACGCCCTCGCCCTCCCCGCCGCCGGGGGATTTGAACAGACG
TACAGTATAGCGGACTCGCTGACACTACAACCGGGTCCGTGGTTCGGAATACCGGGGCTG
CCGACTGTTACGTTTTACAAAATACCGGAACCGCTGACGGCGAACGATCACTCCAAGGTT
GTAAATTCGGCGAGTAGTCGCGAGGGTGCTCTGGCCGCACTGGGCGGTCTGACCGCTGGC
GCGGCGCGGCTGGCAGCTCTGCCGCTGTGGACGGCGACGAGGGCGGCCGGCGCTAAGAGG
CTCGCAAGACGGATAGACGCCGCCTTCCGCTCCGCCCGTACAGCGCGGGCCTTAATAGCC
AGCACTGAGCTGAGATTGCTGAGCGATAGACCCGGCGGTGATGAACCTCCTAACATGGAT
ATAGTCGATGCCATAAGTGAATCCTCAGCGTGCGTGTCCTTCCAATTCGCGCCAGCAGGG
TGCGCTGACCGGCCACCCCCCTACTACGATAAACTCAACTCGTGGTTGGGGCAAGTGTTG
CAACGAGAGGCTGATATGATCAATATAGAAATCTGCGAGACGGAGAGTTACGGCGTGGTG
CTCCGCTACTGTCCGCTCGAGGGTATCTTTCTGGAGGAGGACCGTCTGTCGGAGTGGGCG
GCCGTGTTAGACGCTCAGCTGCACGTGCTCACCGCTACGGTCGCGCTACGAGAACCCTTC
CAGAAGACGCTACAGACACATCCCTGTCTACGACTTGTACATGTACCGGGATGGGCTGGT
CTGGGAGGAGTTCGTTACGTGCCACCCGGTTGGGAGAACGCTCCTCTTGAGGAATTGAAC
TCCTTGAATAGACAGCTAGTGGAGACATTGAGGGCTACCGACGGAGCCTTCTCGTGTGGG
GACGGAGAAGACGGTATGGCATGTGTCAGGTTCGGTATGGTCACCGCTGACACAGACGTG
GATGAATTGTTGGATCTGGTGTTGTCAGCGGGCAAGGACGTGGAGGAGAACTCCAAGGCT
CTCACTGATATGACCGAGGTGTTGAAAAAAGGTATATCAGCGGCTCAAGAAGAACTGAAT
CGTTCTGCGTGGCAGGAGGGGCTGCTGCGTCGTGTGCCGGTAGTGGGTCGGGTCGTGTCG
TGGTGGGCGCCGCCTCAGCCCTGCCCCGGCCGCCGGCTACTGTTGACCCACGGCACCCTG
CAGGCGACTGATGATATCTACCGATTCGTTCAGAAGAAAGACAAAGAGGAACCAGCCCGC
GCTCACTCCCCAACGAGACAGAACACGGTTCCATAA
Protein sequence:
MGDAPTSEMDSNKVSLGSENPSDVDRRPFGGLEFQVSEVVERLEAGVNAQDAMEEEKKPE
ERKISTGFFEPEKMDMDEILKVLEQLVLQTDPSCESLEPPLLPTDSVTRAAILSHSISAL
FSRLERSHAARLGTHIATETTRWMAHLFRLSDYDAFYHQEQLEGLVRVTRMLLHHKYPRY
LEDGALAFSNRLPSIYSCVASPLGVVQHLCRQLGLPLACVRPVPVDSSGKGMDLNALDRL
CEEDSAGRTPLLVLGEAGEPPLGGGSPLKALAELCGRRGVHLHVRGHALALPAAGGFEQT
YSIADSLTLQPGPWFGIPGLPTVTFYKIPEPLTANDHSKVVNSASSREGALAALGGLTAG
AARLAALPLWTATRAAGAKRLARRIDAAFRSARTARALIASTELRLLSDRPGGDEPPNMD
IVDAISESSACVSFQFAPAGCADRPPPYYDKLNSWLGQVLQREADMINIEICETESYGVV
LRYCPLEGIFLEEDRLSEWAAVLDAQLHVLTATVALREPFQKTLQTHPCLRLVHVPGWAG
LGGVRYVPPGWENAPLEELNSLNRQLVETLRATDGAFSCGDGEDGMACVRFGMVTADTDV
DELLDLVLSAGKDVEENSKALTDMTEVLKKGISAAQEELNRSAWQEGLLRRVPVVGRVVS
WWAPPQPCPGRRLLLTHGTLQATDDIYRFVQKKDKEEPARAHSPTRQNTVP