Monarch geneset OGS2.0

DPOGS215050
TranscriptDPOGS215050-TA1134 bp
ProteinDPOGS215050-PA377 aa
Genomic positionDPSCF300208 - 6689-10089
RNAseq coverage207x (Rank: top 46%)
Annotation
HeliconiusHMEL0024123e-17377.54% 
BombyxBGIBMGA005681-TA6e-15773.24% 
DrosophilaCG10184-PA1e-11152.79% 
EBI UniRef50UniRef50_Q9VCK62e-10952.79%CG10184 n=18 Tax=Neoptera RepID=Q9VCK6_DROME
NCBI RefSeqXP_001357814.22e-11454.93%GA10138 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastpgi|1984500224e-11354.93%GA10138 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastxgi|1984500222e-10854.93%GA10138 [Drosophila pseudoobscura pseudoobscura]
Group
Gene OntologyGO:00168294.4e-87lyase activity
GO:00065204.4e-87cellular amino acid metabolic process
GO:00038242.4e-74catalytic activity
GO:00301702.4e-74pyridoxal phosphate binding
KEGG pathwaydpo:Dpse_GA101386e-114 
 K01620 (E4.1.2.5, ltaA)maps-> Glycine, serine and threonine metabolism
InterPro domain[4-369] IPR0154242e-99Pyridoxal phosphate-dependent transferase, major domain
[6-290] IPR0015974.4e-87Aromatic amino acid beta-eliminating lyase/threonine aldolase
[21-253] IPR0154212.4e-74Pyridoxal phosphate-dependent transferase, major region, subdomain 1
[256-366] IPR0154229e-12Pyridoxal phosphate-dependent transferase, major region, subdomain 2
Orthology groupMCL15013 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215050-TA
ATGGCGTACATAGTAGATTTACGATCAGATACCGTCACTAAGCCGACGGAGGCTATGAAGCATGCTATGGTGAATTCGGCTCTGGGCGACGACGTGTTCGGCGAGGACCCCACTGTGAACGCGTTGGAGAGCAAAGTAGCCACCCTGCTAGGGAAGCAGGCCGCTCTATTTGTACCAAGTGGTACCATGGCCAACCTTATTGCGATTATGGTCCATTGTAGCAAGCGAGGTGCTGAAGCCATTGTTGGCAATTTATCACATATTTATAAATATGAACAAGGTGGTGCAGCTCATGTGGCTGGAGTATTGTTGAGTACAATACAGAACAAACCAGATGGAACATTTGACCTGGAAGAACTGGAGAAACGGTTCCAGGGGTCAGATATACACTCACCCATCACATCTATGGTTGCCATAGAAAATACTCACAATGTTTGTGGTGGAAAGGTGGTGCCACTTGAGTGGATGGAGCAGCTGTCAGCTGTGTGTGTCCGACGCGGTGTTCCATTACACCTGGACGGAGCTCGTCTCGTGAACGCGGCCACGTATCTCCAGGTGCCGCCCGCGCGTGTCGCCGCGTGCTGCGACAGTGTCGCCATCTGCTTCAGCAAGGGACTGTCGGCGCCGGCCGGCTCGGCCCTGGTTGGATCATACAGCTTCATACAGCAAGCTCGTCGTATGCGTAAGATGCTAGGCGGCGGTATGCGTCAGGCGGGAGTGTTAGCCGCGGCCGCTTTAGTGTCTTTGGACCAAGTGGTACCCCTACTGGCTTTGGACCACAAGCGAGCTGCCATCCTCGCTAAAGTGATCGAAGGTCTCTTCCTGCCTTGTTTCTCTGTGGACGTAGAGGGTCAACACACCAACATAGTGCTAGTCCGAATCTCCCGGGAGACCAGTCTGACGGCGGACCAGGTGCTGCAGAGACTGGCCCAAGTCAGTCTGGCTGAGACACAGGGTGACTGCAAGACGCCGAACGACGAAGGCGTTATACTGAAGGCGATCTGTTTCGACGAGAAGACGATCAGAATGACGTTACACTGTCAAGTGGACGACGAACAGCTGTGGCTGGCTATAATGAAGATAACTTATGTGTTCAAAGAACTTAACGCCTTATACCCCGTTAAAACAACATAA

Protein sequence:

>DPOGS215050-PA
MAYIVDLRSDTVTKPTEAMKHAMVNSALGDDVFGEDPTVNALESKVATLLGKQAALFVPSGTMANLIAIMVHCSKRGAEAIVGNLSHIYKYEQGGAAHVAGVLLSTIQNKPDGTFDLEELEKRFQGSDIHSPITSMVAIENTHNVCGGKVVPLEWMEQLSAVCVRRGVPLHLDGARLVNAATYLQVPPARVAACCDSVAICFSKGLSAPAGSALVGSYSFIQQARRMRKMLGGGMRQAGVLAAAALVSLDQVVPLLALDHKRAAILAKVIEGLFLPCFSVDVEGQHTNIVLVRISRETSLTADQVLQRLAQVSLAETQGDCKTPNDEGVILKAICFDEKTIRMTLHCQVDDEQLWLAIMKITYVFKELNALYPVKTT-