New model in OGS2.0 | DPOGS206082 |
---|---|
Genomic Position | scaffold231:+ 78666-83576 |
CDS Length | 1224 |
Paired RNAseq reads | 149 |
Single RNAseq reads | 394 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA006820 (3e-74) |
Best Drosophila hit | CG2061, isoform B (7e-49) |
Best Human hit | lanC-like protein 2 (5e-74) |
Best NR hit (blastp) | hypothetical protein BRAFLDRAFT_93787 [Branchiostoma floridae] (1e-89) |
Best NR hit (blastx) | unnamed protein product [Tetraodon nigroviridis] (1e-78) |
GeneOntology terms | GO:0003824 catalytic activity; GO:0008150 biological_process; GO:0005575 cellular_component |
InterPro families | IPR008928 Six-hairpin glycosidase-like; IPR007822 Lanthionine synthetase C-like; IPR020464 LanC-like protein, eukaryotic |
Orthology group | MCL10256 |
Nucleotide sequence:
ATGACTACAAAAGGCTCATTTGAAAATGATTTCGATGATTACTCCCCAACTAATTTAACT
CCTTTACTCAATGAAAATAGAGATGGAATATCAGAAAAATTTCAAGCAAAGCTACAGAAT
TATAAAACGGCAAAATTTGCGTTTTTAATAACAAAACTAGAAAAGGAATTATTTTGCGAT
GGAACTGTTTACACCGGCTCTACGGGACTAGCATTATATTATTTAATGTTAGGTCTCGGG
AACCATGATTCTCTACAAGATAATCTGCAGAAAGCCCTGGACTATTTAGATCTGGATAAA
TTAAAAGGAAGGAGGATAAGTTTTTTGTGTGGTGATGCTGGTCCACTTGCAATTGCAACT
GTTATTTCACACAAGTTAGGTACAAGACGTCCAAATTATTTGCCTGATTACAGGGAACTA
TCAGTAAGGCTGTTAAACCTTGGATCACTTTTAAATGACTCACCAGATGAACTGCTGTAT
GGGAAAGCAGGATATTTGTATTCATTGCTGTTTGTTAATAAATATGTCCATGGCAGAAAT
GTTATTTCCGATGATCATATTGAAAAGGTGGCTTCTTTAATCTTGAAGTCGGGTAAAGAG
TTTTCGCAGCATACAAAATCGGAGAGTCCACTTCTGTGGCAATGGCATGACAAAGTATAT
TTAGGAGCAGCTCATGGAATGGCTGGAATTTTATATATATTATTGCAGGCTCGTGCTTAC
ATAAAGTCCCATGACATCAGAGGTTTTGTTAGGCCTACCATAGATTGGTTAATGAAACAA
CAATTTCCTAGTGGGAATTTCCCTTCCTCATTACACAGTAGCTCCGGTGATAGATTGGTA
CAGTGGTGTCATGGTGCACCAGGTTTTATACCTTTATGCATATTGGCTTATCAGGTCTTT
GAAGAAGAGAGATATTTAAAAATAGCTATTCAATGTGGAGATTTGATATGGCAGAGAGGA
TTGTGTGCTAAAGGCTATAGTTTATGTCATGGTGTTAGTGGCAATGCTTATGCATTTCTT
CAGCTGTATCAAGTATTAAAGAAACCTGTCTACCTTCACCGCGCTGGGTGCTTCATGGAG
TGGTGCGCGGTGGAGAGACAAGGCACTGAGCTACACCGACCTGATCGGCCGGCCTCGCTC
TTCGAAGGGTTACTCGGCAGAATATACTTGGTCGAAGACATTATTAACCCACAGACGGCT
TTATTCCCTGGACTATGCTTATAA
Protein sequence:
MTTKGSFENDFDDYSPTNLTPLLNENRDGISEKFQAKLQNYKTAKFAFLITKLEKELFCD
GTVYTGSTGLALYYLMLGLGNHDSLQDNLQKALDYLDLDKLKGRRISFLCGDAGPLAIAT
VISHKLGTRRPNYLPDYRELSVRLLNLGSLLNDSPDELLYGKAGYLYSLLFVNKYVHGRN
VISDDHIEKVASLILKSGKEFSQHTKSESPLLWQWHDKVYLGAAHGMAGILYILLQARAY
IKSHDIRGFVRPTIDWLMKQQFPSGNFPSSLHSSSGDRLVQWCHGAPGFIPLCILAYQVF
EEERYLKIAIQCGDLIWQRGLCAKGYSLCHGVSGNAYAFLQLYQVLKKPVYLHRAGCFME
WCAVERQGTELHRPDRPASLFEGLLGRIYLVEDIINPQTALFPGLCL