Monarch geneset OGS2.0

DPOGS207185
Transcript        DPOGS207185-TA    1677 bp
Protein           DPOGS207185-PA    558 aa
Genomic position  DPSCF300001 + 5182782-5191431
RNAseq coverage   3x (Rank: top 90%)
Annotation
Heliconius       HMEL010385        0.0     84.81%
Bombyx           BGIBMGA000642-TA  0.0     86.04%
Drosophila       Trh-PA            0.0     64.53%
EBI UniRef50     UniRef50_Q8CGV2   1e-149  57.36%  Tryptophan 5-hydroxylase 2 n=104 Tax=Metazoa RepID=TPH2_MOUSE
NCBI RefSeq      XP_316057.3       0.0     68.09%  AGAP006020-PA [Anopheles gambiae str. PEST]
NCBI nr blastp   gi|307180841      0.0     68.69%  Tryptophan 5-hydroxylase 1 [Camponotus floridanus]
NCBI nr blastx   gi|307180841      0.0     68.69%  Tryptophan 5-hydroxylase 1 [Camponotus floridanus]
Group
Gene Ontology    GO:0016714  3.2e-271  oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, reduced pteridine as one donor, and incorporation of one atom of oxygen
                 GO:0055114  3.2e-271  oxidation-reduction process
                 GO:0009072  1.5e-270  aromatic amino acid family metabolic process
                 GO:0005506  1.5e-270  iron ion binding
                 GO:0004497  1.5e-270  monooxygenase activity
                 GO:0016597  3.6e-07   amino acid binding
                 GO:0008152  3.6e-07   metabolic process
KEGG pathway     aga:AgaP_AGAP006020  0.0
                 K00502 (E1.14.16.4, TPH) maps-> Tryptophan metabolism
InterPro domain  [1-511]    IPR019773  3.2e-271  Tyrosine 3-monooxygenase-like
                 [47-511]   IPR001273  1.5e-270  Aromatic amino acid hydroxylase
                 [146-429]  IPR019774  1.8e-149  Aromatic amino acid hydroxylase, C-terminal
                 [49-100]   IPR002912  3.6e-07   Amino acid-binding ACT
Orthology group  MCL11351  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207185-TA
ATGAGTGGATCTGGAAAAGGCCTTCTTGGTTTGTGGTTATACAGAAATGGATCTGATTGGCAAGTCAAGAATGAGGCTCCTCATCATCCGAAATTTGCTGATCTTCATTCTGCAACTCAGAGACAAGCGCAAGATGAGATCATATCTGTGATATTTACCGTAAAAAACCAAGTAGGAGGGTTGGTAAAAGTTTTGTCAGTTTTTCAAGATCTCGGAGTTAATGTTATTCATATAGAATCGAGGAAATCAGCGACGGAATTATCTTCCTCCGATATCTTGGTGGATGTAGAATGTGACCCGCGAAGAATGGAACAGCTGAAGCGAATGTTGAAGCGTGAAGTGCAAGATTTTGAGGTAGTTGCTGCACAATCTGATGAAAAATTTCCACCACCAACACCTCTGTCTGCTGCAGCCAGCTTCGATTTTGGTGAGATGCCATGGTTTCCAAGAAAAATCTCTGACTTGGATCGCGCGCAAAATGTTCTTATGTATGGGTCTGAGCTCGACGCTGATCATCCTGGTTTTAAGGATCCAGTTTACCGTAAGCGACGAGAACAATTTGCTGCGATTGCTAACAATTATAAATACGGACAGCCAATTCCCAAAGTGCAATATACTGAAGTTGAAATTAAAACCTGGGGAGTCGTATTTAGCGAATTGCATAAATTGTATCAGAAACATGCGTGCGCAGAATATTTGGAAAACTGGCCGCAACTCGTCAAATACTGTGGTTACAGAGAAGACAACTTGCCCCAGTTGGAGGACGTAAGTTCTTTTCTGAAACGAAAAACTGGCTTCCAACTGCGTCCTGTGGCTGGTTATTTATCACCTCGAGACTTTCTCTCCGGACTTGCTTTTAGAGTTTTTCATTGTACTCAGTATATACGTCATTCTTCAGACCCGTTTTATACTCCCGAGCCTGATTGCTGTCACGAGTTGCTCGGACACATGCCATTACTGGCAAATCCATCATTTGCGCAGTTCTCTCAGGAACTGGGTCTAGCTTCCCTTGGAGCATCTGATGAAGATATTGATAAATTGGCAACGCTCTACTTTTTTACCGTTGAGTTCGGTTTATGTCGTCAATTGGATGGTAGTTATCGAGTATACGGTGCGGGGCTTCTTTCCTCCGTTGCCGAACTACAGCATGCCCTGTCAACCCCCGAAAAGATTAAACGATTTGACCCAGATATTACCGTCAATGAAGAATGTATTATTACTTCATACCAAAACGCATACTACTATACTGATTCATTTGAGGAAGCCAAGGAAAAAATGAGGCAATATCCGTTTGTTTCCTTTTTGCTTTATGAGCGCATTTTTATCAACATAATATATTTTCATCACATCTGCAGTCTCATAAAATCATTGGCATTTGCGGATAGTATCCAGCGCCCCTTTGGTGTCCGTTACAATCCATACACTCAAAGCGTAGAGGTATTGAGCAATGCCCAGAAAATAACAGCATTGGTACGGGAGCTAAGAGGTGACATCTGTATTGTGTCATCTGCTATAAAGAAAATAAGTGCCCAAGACTCAACACTTGATGTTGAAACTATCGCTAACATGCTGCATACTGGACTACAGGTAAATGAAAGGAGTCCTCAAAGCTTATCCGGAGGTAGTTCGCCAAATTCAGAACGCGGTCTATCTCCCAAACCAGAAGAAACAGCATAA

Protein sequence:

>DPOGS207185-PA
MSGSGKGLLGLWLYRNGSDWQVKNEAPHHPKFADLHSATQRQAQDEIISVIFTVKNQVGGLVKVLSVFQDLGVNVIHIESRKSATELSSSDILVDVECDPRRMEQLKRMLKREVQDFEVVAAQSDEKFPPPTPLSAAASFDFGEMPWFPRKISDLDRAQNVLMYGSELDADHPGFKDPVYRKRREQFAAIANNYKYGQPIPKVQYTEVEIKTWGVVFSELHKLYQKHACAEYLENWPQLVKYCGYREDNLPQLEDVSSFLKRKTGFQLRPVAGYLSPRDFLSGLAFRVFHCTQYIRHSSDPFYTPEPDCCHELLGHMPLLANPSFAQFSQELGLASLGASDEDIDKLATLYFFTVEFGLCRQLDGSYRVYGAGLLSSVAELQHALSTPEKIKRFDPDITVNEECIITSYQNAYYYTDSFEEAKEKMRQYPFVSFLLYERIFINIIYFHHICSLIKSLAFADSIQRPFGVRYNPYTQSVEVLSNAQKITALVRELRGDICIVSSAIKKISAQDSTLDVETIANMLHTGLQVNERSPQSLSGGSSPNSERGLSPKPEETA-
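
Length consistency check:

The transcript and protein lengths listed in this record (1677 bp and 558 aa) agree with each other if the transcript is read as a pure coding sequence that ends in a stop codon, which matches the nucleotide sequence starting with ATG and ending with TAA and the protein sequence ending with "-". The sketch below shows that arithmetic. The function name `expected_protein_length` and its `has_stop_codon` parameter are hypothetical and used only for illustration. The assumption that the transcript contains no untranslated regions is mine and is not stated in the record.

```python
def expected_protein_length(cds_bp: int, has_stop_codon: bool = True) -> int:
    """Return the number of amino acids encoded by a CDS of cds_bp nucleotides.

    Assumes the sequence is coding sequence only (no UTRs) and is read in
    frame from its first base.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bp // 3
    # The terminal stop codon is counted in the nucleotide length
    # but does not contribute an amino acid.
    return codons - 1 if has_stop_codon else codons


transcript_bp = 1677  # length given for DPOGS207185-TA
protein_aa = 558      # length given for DPOGS207185-PA

# 1677 / 3 = 559 codons; 559 - 1 stop codon = 558 residues
assert expected_protein_length(transcript_bp) == protein_aa
```

The same check can be run on any other record by substituting its transcript and protein lengths. A failed assertion would suggest UTR sequence in the transcript or a missing stop codon.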