DPGLEAN21605 in OGS1.0

New model in OGS2.0DPOGS203007 
Genomic Positionscaffold8:+ 94506-97561
See gene structure
CDS Length1359
Paired RNAseq reads  1211
Single RNAseq reads  3215
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003866 (0.0)
Best Drosophila hit  henna, isoform A (0.0)
Best Human hitphenylalanine-4-hydroxylase (5e-141)
Best NR hit (blastp)  phenylalanine hydroxylase [Papilio xuthus] (0.0)
Best NR hit (blastx)  phenylalanine hydroxylase [Papilio xuthus] (0.0)
GeneOntology terms






  
GO:0004505 phenylalanine 4-monooxygenase activity
GO:0006559 L-phenylalanine catabolic process
GO:0006726 eye pigment biosynthetic process
GO:0004510 tryptophan 5-monooxygenase activity
GO:0005506 iron ion binding
GO:0055114 oxidation reduction
GO:0006911 phagocytosis, engulfment
GO:0005811 lipid particle
InterPro families



  
IPR019774 Aromatic amino acid hydroxylase, C-terminal
IPR019773 Tyrosine 3-monooxygenase-like
IPR001273 Aromatic amino acid hydroxylase
IPR005961 Phenylalanine-4-hydroxylase, tetrameric form
IPR002912 Amino acid-binding ACT
Orthology groupMCL15482

Nucleotide sequence:

ATGGAGCCGAGTGTAAACATACTATCAACCTCGCCGATGGACAAGCCAAAGTTAATGCAG
GGTGGCAACTACATAGCCGAGGGACGCGATTCTAAAAAGTCAACATGGCTCTTATTTTCT
CCGGAGACTCCGGATCAAGCTGGTTCTTTGGAGAAATTTCTGAGTATCTTTTCATCTCAC
GGGGTCAACTTGAGCCACATCGAATCTCGCTCCTCTGCCAGGAGACCAGGCTATGAATTC
ATGGTCGAGTGTGAACACGAATCCGGGGACTTTGGAGCGGCTTTGGATGAGCTGAAGAAG
AGCACTGGATATCTCAACATTATTTCTAGAAACTACAAGGATAATAGATCTGCGGTGCCT
TGGTTCCCTCGCCGTATTCGTGATCTGGATAGATTCGCTAATCAGATATTGTCTTATGGA
GCCGAGCTCGACTCAGATCATCCAGGTTTCACAGACCCGGAGTACCGCGCGAGAAGAAAG
TATTTTGCTGATATCGCTTACAACTACAAGCACGGCCAGCCGCTGCCTCACGTGAATTAT
ACTAAAGAAGAAATTAACACATGGGGAGTAGTGTTCAGGAAGCTCACGGAACTCTACCCG
ACGCACGCCTGCAAGGAACACAATCATGTTTTTCCGCTTTTGATTGAAAACTGTGGTTAT
AGGGAGGACAATATTCCACAACTCGAAGACGTATCTAACTTTCTCAAAGATTGCACTGGA
TTCACTCTCCGTCCAGTGGCAGGTCTGCTTTCTTCACGAGATTTCCTCGCTGGCTTGGCG
TTCCGTGTATTTCATAGTACTCAGTACATTAGGCACCATTCTCGTCCCCTTTACACTCCT
GAACCTGATGTCTGCCACGAGCTCCTCGGACACGCGCCATTGTTCGCTGATCCCGCGTTC
GCACAGTTCTCTCAGGAAATCGGCCTGGCTTCATTGGGAGCTCCTGACGATTTTATCGAA
AGACTTGCAACGTGCTTTTGGTTTACTGTTGAATTTGGTCTGTGTCGGCAAGAAGGACAG
CTGAAGGCATACGGCGCCGGTTTGCTGTCATCATTCGGTGAACTTCAATATTGTCTCTCA
GATAAGCCACAGCTCCAAGAATTTGAACCAGAAATCACGGGAGAACAGAAGTATCCTATC
ACTGAATACCAACCAATATATTTCGTTGCTAACAGTTTTGAAAGTGCTAAGGAAAAGATG
ATCAAATTCGCCCAAACAATACCCCGTGACTTCGGAGTGAGATACAATCCCTACACCCAA
AGTATTGACCTCCTAGATTCTCCACGGCAGATGAAAGATCTGCTGAAAGGCATCCGCCAA
GAAATGGAACTGCTGGTTGGCACCATGGACAAGTTGTAG

Protein sequence:

MEPSVNILSTSPMDKPKLMQGGNYIAEGRDSKKSTWLLFSPETPDQAGSLEKFLSIFSSH
GVNLSHIESRSSARRPGYEFMVECEHESGDFGAALDELKKSTGYLNIISRNYKDNRSAVP
WFPRRIRDLDRFANQILSYGAELDSDHPGFTDPEYRARRKYFADIAYNYKHGQPLPHVNY
TKEEINTWGVVFRKLTELYPTHACKEHNHVFPLLIENCGYREDNIPQLEDVSNFLKDCTG
FTLRPVAGLLSSRDFLAGLAFRVFHSTQYIRHHSRPLYTPEPDVCHELLGHAPLFADPAF
AQFSQEIGLASLGAPDDFIERLATCFWFTVEFGLCRQEGQLKAYGAGLLSSFGELQYCLS
DKPQLQEFEPEITGEQKYPITEYQPIYFVANSFESAKEKMIKFAQTIPRDFGVRYNPYTQ
SIDLLDSPRQMKDLLKGIRQEMELLVGTMDKL