Monarch geneset OGS2.0

DPOGS211076
TranscriptDPOGS211076-TA1506 bp
ProteinDPOGS211076-PA501 aa
Genomic positionDPSCF300007 - 1365755-1368443
RNAseq coverage317x (Rank: top 36%)
Annotation
HeliconiusHMEL0022170.071.46% 
BombyxBGIBMGA002958-TA0.070.26% 
DrosophilaDdc-PC6e-14852.30% 
EBI UniRef50UniRef50_P227813e-15252.00%Aromatic-L-amino-acid decarboxylase n=33 Tax=Coelomata RepID=DDC_CAVPO
NCBI RefSeqXP_973109.22e-17258.51%PREDICTED: similar to AGAP009091-PA [Tribolium castaneum]
NCBI nr blastpgi|3838583872e-17359.28%PREDICTED: aromatic-L-amino-acid decarboxylase-like [Megachile rotundata]
NCBI nr blastxgi|3838583877e-16959.28%PREDICTED: aromatic-L-amino-acid decarboxylase-like [Megachile rotundata]
Group
Gene OntologyGO:00197529.7e-257carboxylic acid metabolic process
GO:00168319.7e-257carboxy-lyase activity
GO:00301709.7e-257pyridoxal phosphate binding
GO:00038242e-104catalytic activity
GO:00065206.3e-72cellular amino acid metabolic process
KEGG pathwayapi:1001604662e-168 
 K01593 (E4.1.1.28, DDC)maps-> Betalain biosynthesis
    Isoquinoline alkaloid biosynthesis
    Tryptophan metabolism
    Tyrosine metabolism
    Histidine metabolism
    Phenylalanine metabolism
    Indole alkaloid biosynthesis
InterPro domain[1-481] IPR0021299.7e-257Pyridoxal phosphate-dependent decarboxylase
[1-473] IPR0154248.6e-137Pyridoxal phosphate-dependent transferase, major domain
[84-378] IPR0154212e-104Pyridoxal phosphate-dependent transferase, major region, subdomain 1
[6-25] IPR0109776.3e-72Aromatic-L-amino-acid decarboxylase
[380-477] IPR0154222.1e-29Pyridoxal phosphate-dependent transferase, major region, subdomain 2
Orthology groupMCL17698 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211076-TA
ATGAATTCTCAAGAATTTAGAGAAATAGGAAAAGCTACCATTGATCTGATCGCCGATTATCATGATAATATTAGGAATAGAAATGTATTACCGTCAGTTGAACCAGGATATCTTTTGAAACTATTGCCTGAAGACGCTCCAGAAGAACCAGAAGATCACCAAAATGTTCTTAAGGATTTTTGTGAAACGATAATGCCTGGGATAACTCACTGGCAATCACCGCAATTTCACGCATATTTTCCAACTGGACAATCGTTCGCCAGCATGATTGGAAGCATCCTTAGCGATGGATTAGGTGTCATCGGTATAACATGGAATGCAAGTCCTGCCTGTACTGAACTAGAGGTCGTTACTATGAATTGGTTAGGAAAATTATTGGGTTTGCCTGAGGAATTTCTCAACTGCTCTGAAGGACCTGGAGGTGGCATCATACAGGGCTCCGCAAGTGAAGCAACTCTTGTTTGTCTACTAGCGGCAAAGGATAAAAAGATACGACAACTTCTAGAAAACGATCCAACTTTAGATGAAGACCAAACTAAAAATAAGTTTGTTGCATATACATCGGATCAGTGTAATTCTTCTGTTGAAAAAGCTGGTGTACTTGGTTCGATGAAAATGCGGCTCCTAAAAAGTGATAACAACGGCCAGTTACGAGCACAAACATTAAAAGACGCATTTGAAGAAGATAAGGCCAAGGGTCTTATACCATGCTACTTTGTTGCAAATTTGGGGACCACAGGAATATGTGCTTTTGATCTCATTTACGAAATTGGACCAATATGTCAAGAAGAAGGTGTCTGGTTGCACGTTGATGCAGCCTATGCTGGAGCTGCATTTATATGCCCTGAATACAGACATTTAATGAAAGGCATAGAATATGCAGATTCTTTTGATATGAACGCACACAAATGGCTTCTTGTGAATTTTGATTGCTCAGCAATGTGGGTAAAAAACTCGTATGACTTAATAAATGCTTTCGACGTTCAACGTATATATTTAGATGACGTAAAAACAGCTGCTAAAGTTCCGGATTATCGTCACTGGCAAATGCCACTAGGCCGTAGATTTCGCTCTTTGAAACTATGGACTGTGATAAAAATGTATGGAGCAGAAGGTCTGAGAAAACATATCAGAGATCAAATAAGTTTAGCACAGTATTTTGCTAAGTTAGTGCAACGCGATGAAAGGTTTGTAGTAGAACCAGAGCCATCCATGGCCTTGGTGTGCTTCAGACTTGTAAATGGTGATAAAATAACAAGAGACTTATTAGATAATTTAACTAAGAAGAAGGAATTATTTATGGTTGGGTGTACGTACAGAGAGCGATTCGTTATACGATTTGTTATCTGTTCTCGATTTACTAACAAGGAAGATGTGGAAACAAGCTGGAATATTATCAAGGAAGAGGCAGATCAGTTAATTCCAGAAAAAATGAACGCTAAATCACATGCAATTTCAGCATTCGACCAGTTGGGAACTATTGGTATATACGAAAAATCTAAATAA

Protein sequence:

>DPOGS211076-PA
MNSQEFREIGKATIDLIADYHDNIRNRNVLPSVEPGYLLKLLPEDAPEEPEDHQNVLKDFCETIMPGITHWQSPQFHAYFPTGQSFASMIGSILSDGLGVIGITWNASPACTELEVVTMNWLGKLLGLPEEFLNCSEGPGGGIIQGSASEATLVCLLAAKDKKIRQLLENDPTLDEDQTKNKFVAYTSDQCNSSVEKAGVLGSMKMRLLKSDNNGQLRAQTLKDAFEEDKAKGLIPCYFVANLGTTGICAFDLIYEIGPICQEEGVWLHVDAAYAGAAFICPEYRHLMKGIEYADSFDMNAHKWLLVNFDCSAMWVKNSYDLINAFDVQRIYLDDVKTAAKVPDYRHWQMPLGRRFRSLKLWTVIKMYGAEGLRKHIRDQISLAQYFAKLVQRDERFVVEPEPSMALVCFRLVNGDKITRDLLDNLTKKKELFMVGCTYRERFVIRFVICSRFTNKEDVETSWNIIKEEADQLIPEKMNAKSHAISAFDQLGTIGIYEKSK-