New model in OGS2.0 | DPOGS203007  |
---|---|
Genomic Position | scaffold8:+ 94506-97561 |
See gene structure | |
CDS Length | 1359 |
Paired RNAseq reads   | 1211 |
Single RNAseq reads   | 3215 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003866 (0.0) |
Best Drosophila hit   | henna, isoform A (0.0) |
Best Human hit | phenylalanine-4-hydroxylase (5e-141) |
Best NR hit (blastp)   | phenylalanine hydroxylase [Papilio xuthus] (0.0) |
Best NR hit (blastx)   | phenylalanine hydroxylase [Papilio xuthus] (0.0) |
GeneOntology terms    | GO:0004505 phenylalanine 4-monooxygenase activity GO:0006559 L-phenylalanine catabolic process GO:0006726 eye pigment biosynthetic process GO:0004510 tryptophan 5-monooxygenase activity GO:0005506 iron ion binding GO:0055114 oxidation reduction GO:0006911 phagocytosis, engulfment GO:0005811 lipid particle |
InterPro families    | IPR019774 Aromatic amino acid hydroxylase, C-terminal IPR019773 Tyrosine 3-monooxygenase-like IPR001273 Aromatic amino acid hydroxylase IPR005961 Phenylalanine-4-hydroxylase, tetrameric form IPR002912 Amino acid-binding ACT |
Orthology group | MCL15482 |
Nucleotide sequence:
ATGGAGCCGAGTGTAAACATACTATCAACCTCGCCGATGGACAAGCCAAAGTTAATGCAG
GGTGGCAACTACATAGCCGAGGGACGCGATTCTAAAAAGTCAACATGGCTCTTATTTTCT
CCGGAGACTCCGGATCAAGCTGGTTCTTTGGAGAAATTTCTGAGTATCTTTTCATCTCAC
GGGGTCAACTTGAGCCACATCGAATCTCGCTCCTCTGCCAGGAGACCAGGCTATGAATTC
ATGGTCGAGTGTGAACACGAATCCGGGGACTTTGGAGCGGCTTTGGATGAGCTGAAGAAG
AGCACTGGATATCTCAACATTATTTCTAGAAACTACAAGGATAATAGATCTGCGGTGCCT
TGGTTCCCTCGCCGTATTCGTGATCTGGATAGATTCGCTAATCAGATATTGTCTTATGGA
GCCGAGCTCGACTCAGATCATCCAGGTTTCACAGACCCGGAGTACCGCGCGAGAAGAAAG
TATTTTGCTGATATCGCTTACAACTACAAGCACGGCCAGCCGCTGCCTCACGTGAATTAT
ACTAAAGAAGAAATTAACACATGGGGAGTAGTGTTCAGGAAGCTCACGGAACTCTACCCG
ACGCACGCCTGCAAGGAACACAATCATGTTTTTCCGCTTTTGATTGAAAACTGTGGTTAT
AGGGAGGACAATATTCCACAACTCGAAGACGTATCTAACTTTCTCAAAGATTGCACTGGA
TTCACTCTCCGTCCAGTGGCAGGTCTGCTTTCTTCACGAGATTTCCTCGCTGGCTTGGCG
TTCCGTGTATTTCATAGTACTCAGTACATTAGGCACCATTCTCGTCCCCTTTACACTCCT
GAACCTGATGTCTGCCACGAGCTCCTCGGACACGCGCCATTGTTCGCTGATCCCGCGTTC
GCACAGTTCTCTCAGGAAATCGGCCTGGCTTCATTGGGAGCTCCTGACGATTTTATCGAA
AGACTTGCAACGTGCTTTTGGTTTACTGTTGAATTTGGTCTGTGTCGGCAAGAAGGACAG
CTGAAGGCATACGGCGCCGGTTTGCTGTCATCATTCGGTGAACTTCAATATTGTCTCTCA
GATAAGCCACAGCTCCAAGAATTTGAACCAGAAATCACGGGAGAACAGAAGTATCCTATC
ACTGAATACCAACCAATATATTTCGTTGCTAACAGTTTTGAAAGTGCTAAGGAAAAGATG
ATCAAATTCGCCCAAACAATACCCCGTGACTTCGGAGTGAGATACAATCCCTACACCCAA
AGTATTGACCTCCTAGATTCTCCACGGCAGATGAAAGATCTGCTGAAAGGCATCCGCCAA
GAAATGGAACTGCTGGTTGGCACCATGGACAAGTTGTAG
Protein sequence:
MEPSVNILSTSPMDKPKLMQGGNYIAEGRDSKKSTWLLFSPETPDQAGSLEKFLSIFSSH
GVNLSHIESRSSARRPGYEFMVECEHESGDFGAALDELKKSTGYLNIISRNYKDNRSAVP
WFPRRIRDLDRFANQILSYGAELDSDHPGFTDPEYRARRKYFADIAYNYKHGQPLPHVNY
TKEEINTWGVVFRKLTELYPTHACKEHNHVFPLLIENCGYREDNIPQLEDVSNFLKDCTG
FTLRPVAGLLSSRDFLAGLAFRVFHSTQYIRHHSRPLYTPEPDVCHELLGHAPLFADPAF
AQFSQEIGLASLGAPDDFIERLATCFWFTVEFGLCRQEGQLKAYGAGLLSSFGELQYCLS
DKPQLQEFEPEITGEQKYPITEYQPIYFVANSFESAKEKMIKFAQTIPRDFGVRYNPYTQ
SIDLLDSPRQMKDLLKGIRQEMELLVGTMDKL