New model in OGS2.0 | DPOGS213793  |
---|---|
Genomic Position | scaffold3517:+ 6182-14146 |
See gene structure | |
CDS Length | 1242 |
Paired RNAseq reads   | 1112 |
Single RNAseq reads   | 2903 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA009276 (1e-174) |
Best Drosophila hit   | vermilion (2e-134) |
Best Human hit | tryptophan 2,3-dioxygenase (1e-92) |
Best NR hit (blastp)   | tryptophan oxygenase [Plodia interpunctella] (0.0) |
Best NR hit (blastx)   | tryptophan oxygenase [Plodia interpunctella] (1e-179) |
GeneOntology terms    | GO:0004833 tryptophan 2,3-dioxygenase activity GO:0005575 cellular_component GO:0019441 tryptophan catabolic process to kynurenine GO:0020037 heme binding |
InterPro families   | IPR004981 Tryptophan 2,3-dioxygenase |
Orthology group | MCL12401 |
Nucleotide sequence:
ATGGCCTGTCCTATGCGGTCAGCGTACGATGACAGCCAAGATGGCCCACAGCTGGGAAAC
GAGGCTGGGATGCTTTATGGAGAGTACCTGATGCTGGACAAACTCCTGAGCGCTCAGAGA
ATGCTCAGCGCTGAGTCCTCCAAGCCAGTACATGATGAACATCTGTTTATAGTAACACAT
CAAGCGTACGAGCTGTGGTTCAAACAGATTATATTCGAGGTCGATTCAGTACGAGCATTG
CTGGATGTAGAAGGTCTAGATGAAAGTCACACTATGGAGATCTTAAAGCGGCTGAATAGA
GTAGTGCTGATACTTAAGCTGCTTGTAGACCAAGTAATGATACTTGAGACAATGACGCCG
CTTGATTTCATGGACTTCAGGAATTACTTGCGACCGGCGTCCGGCTTCCAAAGCTTGCAA
TTCAGACTTCTAGAAAACAAGCTTGGACTCAAGCAGGCCCTGCGTGTGAAATACAATCAA
AATTACCAAACAGTTTTCGGAGATGACCCCGAGGCTATTAAATCTCTGCACAAATCCGAA
GAGGAGCCGGCTCTGCTCGCGTTGATCGAGCGCTGGTTGGAGCGAACACCTGGGCTAAAC
GCACATGGATTCAACTTCTGGGGCAAATTCCAGGCGGCTGTCAACAACATGATACAAGAG
GATATCGAAGCAGCTATGAGCGAACCAAATGAGATTGTCAGGAACCATAGACTGAGGGAT
GCGGAGAATAGACGAGAGACTTACCGCTCCATCTTCGACGCGGAAGTTCACAACGCACTG
AGATCCCGGGGAGAGAGAAGGTTGTCCCACAAGGCGTTGCAGGGCGCTATCATGATAACG
TTCTACAGGGACGAGCCGCGTTTCTCTCAGCCTCACCAACTTCTGATGCTGCTTATGGAC
ATCGACAGTCTCATCACCAAATGGAGATATAACCACGTGATCATGGTTCAGCGCATGATT
GGCTCGCAGCAGCTAGGAACTGGCGGCTCGTCAGGGTACCAGTACCTGAGATCTACGCTC
AGTGACCGCTACAAAGTATTCCTGGATCTTTTTAATCTGTCCACGTTCCTCCTCCCGCGT
TCCCTGATCCCCCCTCTGGATGACGGGATGAAGAAAGATCTGAACCTCATGTGGGGAGAT
CTCAAGGAAATGGGGGAAAATGGTGAGAACCAATTGAACGGTGAAAATGGTCACCCTTTG
GAGCAATCAATCTCGAATTTAACACTCAAAGATAAATCCTGA
Protein sequence:
MACPMRSAYDDSQDGPQLGNEAGMLYGEYLMLDKLLSAQRMLSAESSKPVHDEHLFIVTH
QAYELWFKQIIFEVDSVRALLDVEGLDESHTMEILKRLNRVVLILKLLVDQVMILETMTP
LDFMDFRNYLRPASGFQSLQFRLLENKLGLKQALRVKYNQNYQTVFGDDPEAIKSLHKSE
EEPALLALIERWLERTPGLNAHGFNFWGKFQAAVNNMIQEDIEAAMSEPNEIVRNHRLRD
AENRRETYRSIFDAEVHNALRSRGERRLSHKALQGAIMITFYRDEPRFSQPHQLLMLLMD
IDSLITKWRYNHVIMVQRMIGSQQLGTGGSSGYQYLRSTLSDRYKVFLDLFNLSTFLLPR
SLIPPLDDGMKKDLNLMWGDLKEMGENGENQLNGENGHPLEQSISNLTLKDKS