Monarch geneset OGS2.0

DPOGS206781
TranscriptDPOGS206781-TA1665 bp
ProteinDPOGS206781-PA554 aa
Genomic positionDPSCF300001 - 5700113-5707356
RNAseq coverage852x (Rank: top 15%)
Annotation
HeliconiusHMEL0098220.089.96% 
BombyxBGIBMGA000563-TA0.088.06% 
Drosophilaple-PB0.069.72% 
EBI UniRef50UniRef50_G6CT030.0100.00%Tyrosine hydroxylase n=67 Tax=Bilateria RepID=G6CT03_DANPL
NCBI RefSeqNP_001138794.10.088.06%tyrosine hydroxylase [Bombyx mori]
NCBI nr blastpgi|2960403370.090.04%tyrosine hydroxylase [Papilio polytes]
NCBI nr blastxgi|2960403370.090.04%tyrosine hydroxylase [Papilio polytes]
Group
Gene OntologyGO:00167141.8e-295oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, reduced pteridine as one donor, and incorporation of one atom of oxygen
GO:00551141.8e-295oxidation-reduction process
GO:00090722.4e-246aromatic amino acid family metabolic process
GO:00055062.4e-246iron ion binding
GO:00044972.4e-246monooxygenase activity
GO:00424232e-215catecholamine biosynthetic process
GO:00045112e-215tyrosine 3-monooxygenase activity
KEGG pathwaytca:6549180.0 
 K00501 (TH)maps-> Isoquinoline alkaloid biosynthesis
    Tyrosine metabolism
    Parkinson's disease
InterPro domain[1-551] IPR0197731.8e-295Tyrosine 3-monooxygenase-like
[105-551] IPR0012732.4e-246Aromatic amino acid hydroxylase
[102-548] IPR0059622e-215Tyrosine 3-monooxygenase
[216-547] IPR0197748.5e-175Aromatic amino acid hydroxylase, C-terminal
Orthology groupMCL14257 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206781-TA
ATGGCCGTCGCAGCAGCCCAGAAAAATCGCGAAATGTTCGCCATCAAGAAGTCTTACAGCATTGAGAACGGCTATCCATCCCGCCGTCGTTCACTGGTAGACGATGCCCGTTTCGAAACCCTGGTAGTCAAACAGACCAAACAAAGTGTACTTGAAGAAGCTCGTGCCCGTGCTAATGACTCTGGTTTGGATTCCGAATTCATCCAAGATGTCTCTCAAATTGATGACGCCGAGAAGACCGAAGGTGTCCAAAACGAAGATTGTAAAAACGGTCACCTTGAAGGAGGTAACGAGACTGGTACAAAATCAGATGAAGATTACACTCTTACTGAAGAGGAAGTTATTCTACAAAATGCTGCAAGTGAAAGCCCTGAAGCAGAACAGGTGATCCAAAAAGCTGCTTTACTTTTGCGTATGCGAGATGGAATGGGCTCTCTTGCTCGCATTCTTAAAACAATTGATAACTATAAAGGTTGTGTTGACCATCTTGAAACTCGGCCCTCCCAAATATCAGGAGTCCAATTCGATGCACTCGTAAAGGTCAGCATGACCCGCATCAACCTGCTCCAACTTATCCGGGCACTCCGTCAATCAACCTCATTTGCCGGTGTAAATTTGCTTTCGGATAATATTTCAAACAAAACTCCATGGTTCCCTCGTCATGCTTCCGATCTTGACAACTGTAACCATCTTATGACTAAATTTGAGCCAGAACTTGATATGAATCACCCAGGATTCGCTGATAAGGAATACAGAGAACGTAGGAAACAAATTGCTGCTGTCGCTTTTGCATACAAATATGGTGATCCATTTCCAGCGATTACTTACACTGAAAGCGAGAATGCTACCTGGCAACGAGTATTCAATACTGTACTGGATTTGATGCCAAAACATGCATGCCGTGAATATAAGGCCGCTTTTGGTAAATTACAAGCTGCCGAAATCTTCGTGCCACACCGCATTCCCCAGTTGGAGGATGTAAGCAACTTCCTCCGCAAACATACTGGTTTCACCCTGCGCCCAGCTGCAGGATTACTTACGGCTCGAGACTTTTTGGCTTCTCTCGCTTTTCGTGTATTCCAATCAACACAATACGTGCGCCACGCTAACTCACCCTTCCACACTCCTGAACCGGACTGTATTCATGAACTATTAGGACATATTCCACTTCTAGCTGACCCAAGCTTTGCTCAATTTTCTCAAGAAATTGGTCTTGCTTCACTCGGCGCTTCTGATTCCGAAATCGAAAAGCTTTCTACGGTTTACTGGTTCACGGTCGAATTCGGTCTTTGTAAGGAGAACCAACAACTGAAGGCATACGGAGCAGCTCTTCTATCGTCTATCGGAGAACTGCTTCATGCTTTAAGTGACAAGCCTGAACTGCGACCCTTCGAACCATCTTCTACTTCCATTCAACCTTACCAAGACCAAGAGTACCAACCAATTTATTACGTGGCTGAAAGCTTTGAGGATGCAAAAGATAAATTCAGACGCTGGGTATCAACTATGTCAAGACCATTCGAAGTGCGTTTCAACCCACACACAGAGCGCGTGGAAATCCTCGACTCCGTAGACAAACTTGAAACACTCATATGGCAATTGAATACCGAGATGCTCCACCTCACTAATGCTATCAAAAAACTTAAGGATTCATCCTTTGAGTAA

Protein sequence:

>DPOGS206781-PA
MAVAAAQKNREMFAIKKSYSIENGYPSRRRSLVDDARFETLVVKQTKQSVLEEARARANDSGLDSEFIQDVSQIDDAEKTEGVQNEDCKNGHLEGGNETGTKSDEDYTLTEEEVILQNAASESPEAEQVIQKAALLLRMRDGMGSLARILKTIDNYKGCVDHLETRPSQISGVQFDALVKVSMTRINLLQLIRALRQSTSFAGVNLLSDNISNKTPWFPRHASDLDNCNHLMTKFEPELDMNHPGFADKEYRERRKQIAAVAFAYKYGDPFPAITYTESENATWQRVFNTVLDLMPKHACREYKAAFGKLQAAEIFVPHRIPQLEDVSNFLRKHTGFTLRPAAGLLTARDFLASLAFRVFQSTQYVRHANSPFHTPEPDCIHELLGHIPLLADPSFAQFSQEIGLASLGASDSEIEKLSTVYWFTVEFGLCKENQQLKAYGAALLSSIGELLHALSDKPELRPFEPSSTSIQPYQDQEYQPIYYVAESFEDAKDKFRRWVSTMSRPFEVRFNPHTERVEILDSVDKLETLIWQLNTEMLHLTNAIKKLKDSSFE-