Monarch geneset OGS2.0

DPOGS211220
Transcript: DPOGS211220-TA, 1431 bp
Protein: DPOGS211220-PA, 476 aa
Genomic position: DPSCF300007 + 1186545-1189214
RNAseq coverage: 458x (Rank: top 27%)
Annotation
Heliconius: HMEL012487 (E-value 0.0, identity 87.82%)
Bombyx: BGIBMGA003199-TA (E-value 0.0, identity 88.66%)
Drosophila: Ddc-PC (E-value 0.0, identity 75.00%)
EBI UniRef50: UniRef50_F4X3E6 (E-value 0.0, identity 74.79%) Aromatic-L-amino-acid decarboxylase n=16 Tax=Coelomata RepID=F4X3E6_ACREC
NCBI RefSeq: NP_001037174.1 (E-value 0.0, identity 88.03%) aromatic-L-amino-acid decarboxylase [Bombyx mori]
NCBI nr blastp: gi|38564807 (E-value 0.0, identity 88.66%) dopa-decarboxylase [Antheraea pernyi]
NCBI nr blastx: gi|315493444 (E-value 0.0, identity 90.95%) dopa decarboxylase [Heliconius melpomene malleti]
Group
Gene Ontology:
    GO:0019752  5.3e-302  carboxylic acid metabolic process
    GO:0016831  5.3e-302  carboxy-lyase activity
    GO:0030170  5.3e-302  pyridoxal phosphate binding
    GO:0003824  2.3e-122  catalytic activity
    GO:0006520  7.3e-88   cellular amino acid metabolic process
KEGG pathway: aag:AaeL_AAEL014238 (E-value 0.0)
    K01593 (E4.1.1.28, DDC) maps to:
        Betalain biosynthesis
        Isoquinoline alkaloid biosynthesis
        Tryptophan metabolism
        Tyrosine metabolism
        Histidine metabolism
        Phenylalanine metabolism
        Indole alkaloid biosynthesis
InterPro domain:
    [1-476]    IPR002129  5.3e-302  Pyridoxal phosphate-dependent decarboxylase
    [1-474]    IPR015424  5.6e-153  Pyridoxal phosphate-dependent transferase, major domain
    [84-377]   IPR015421  2.3e-122  Pyridoxal phosphate-dependent transferase, major region, subdomain 1
    [6-25]     IPR010977  7.3e-88   Aromatic-L-amino-acid decarboxylase
    [378-475]  IPR015422  3e-41     Pyridoxal phosphate-dependent transferase, major region, subdomain 2
Orthology group: MCL14338 (single-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211220-TA
ATGGAAGCTAAGGAGTTCAAGGATTTCGCGAAGGCAATGGCTGATTACATCGCCGAGTACTTGGAAAATATACGCGACAGGCAAGTGGTGCCCTCAGTGAAGCCAGGGTATTTAAGACCCTTGGTACCGGAGCAGGCTCCAGAGAAGCCTGAACCCTGGACGGCCGTAATGGCTGATATTGAGAGAGTGGTTATGTCAGGAGTTACCCACTGGCATTCCCCACGTTTCCATGCCTATTTCCCCACTGCCAACTCCTATCCATCGATCGTTGCAGATATGTTAAGCGGAGCCATCGCCTGCATTGGTTTCACCTGGATCGCAAGTCCAGCTTGCACCGAGCTTGAGGTGGTGATGCTGGATTGGCTAGGCCAGATGCTGGGTCTGCCAGAAGAATTCTTGGCTCGCTCTGGTGGCGAGGGCGGTGGCGTCATTCAGGGTACAGCCAGTGAAGCCACATTGGTCGCTCTGTTAGGAGCTAAGGCCCGAGCGATGCAGAGGACTAAGGAACAGCATCCAGACTGGACGGAAGTTGAAATCCTGTCCAAACTTGTTGGATATTGTAATAAACAAGCTCATTCGTCTGTCGAGCGAGCTGGTCTCCTCGGTGGAGTAAAGCTCCGCAGCCTGAAGCACGATGACAAGAGACGCCTGCGCGGAGATACCTTGAAAGAGGCTATCGACGAAGATATCAAGAATGGATTGATACCGTTTTATGTCGTCGCAACCTTAGGCACAACATCATCGTGCGCTTTCGACGCTCTGGACGAGATAGGCGACGTCTGCAAGTCGCATGATGTGTGGCTTCATGTGGACGCAGCCTACGCCGGCTCCGCGTTCATCTGTCCAGAGTACCGCCACCTCATGAAGGGAGTCGAAAAGGCTGATTCGTTCAACTTCAACCCTCACAAGTGGATGCTGGTTAACTTCGACTGTTCCGCCATGTGGCTGAAACAGCCGCGTTGGATCGTTGACGCCTTCAACGTCGATCCTTTATACTTGAAACACGACATGCAAGGATCAGCGCCGGACTACCGTCACTGGCAGATACCTCTCGGAAGACGCTTCCGATCCCTTAAACTATGGTTCGTGTTGAGACTGTATGGAGTTGAGAACATTCAGAACTTCATCCGTAAACATATTGGACTGGCTCACCTTTTCGAAAAACTCTGTCTTGATGACGAAAGATTCGAACTTTTCGAAGAGGTCACTATGGGCTTAGTTTGCTTCAGACTCAAAGGTGATAATGAAACTAATGAGGCTCTCTTGAGACGTATTAATGGACGCGGGAAGATTCATCTTGTACCTTCAAAAGTGGATGACGTTTATTTCCTAAGATTTGCTGTTTGCTCGCGTTTCACTGAAGAAAGTGATATTCAAAGCTCGTGGGAAGAAATAAAGACATCGGCTGATGAAGTCCTAGCAGAAAAATAG

Protein sequence:

>DPOGS211220-PA
MEAKEFKDFAKAMADYIAEYLENIRDRQVVPSVKPGYLRPLVPEQAPEKPEPWTAVMADIERVVMSGVTHWHSPRFHAYFPTANSYPSIVADMLSGAIACIGFTWIASPACTELEVVMLDWLGQMLGLPEEFLARSGGEGGGVIQGTASEATLVALLGAKARAMQRTKEQHPDWTEVEILSKLVGYCNKQAHSSVERAGLLGGVKLRSLKHDDKRRLRGDTLKEAIDEDIKNGLIPFYVVATLGTTSSCAFDALDEIGDVCKSHDVWLHVDAAYAGSAFICPEYRHLMKGVEKADSFNFNPHKWMLVNFDCSAMWLKQPRWIVDAFNVDPLYLKHDMQGSAPDYRHWQIPLGRRFRSLKLWFVLRLYGVENIQNFIRKHIGLAHLFEKLCLDDERFELFEEVTMGLVCFRLKGDNETNEALLRRINGRGKIHLVPSKVDDVYFLRFAVCSRFTEESDIQSSWEEIKTSADEVLAEK-
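
The two sequences above can be cross-checked against each other: 1431 nt is 477 codons, which is 476 residues plus one stop codon. The sketch below translates a coding sequence and is meant as an illustration, not as the pipeline used to build this record. It assumes the standard nuclear genetic code (NCBI translation table 1) and that the trailing "-" in the protein record marks the stop codon. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# Assumption: this nuclear gene uses the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    x + y + z: aa
    for (x, y, z), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol ("-" is assumed to match the
    convention of the protein record above).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = (CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return "".join(stop_symbol if r == "*" else r for r in residues)


# First ten and last seven codons of DPOGS211220-TA, copied from the record.
first_codons = "ATGGAAGCTAAGGAGTTCAAGGATTTCGCG"
last_codons = "GAAGTCCTAGCAGAAAAATAG"

print(translate(first_codons))
print(translate(last_codons))
```

Passing the full DPOGS211220-TA sequence to `translate` should yield a 477-character string (476 residues plus the stop symbol) that can be compared directly with DPOGS211220-PA.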