Monarch geneset OGS2.0

DPOGS212105
TranscriptDPOGS212105-TA1560 bp
ProteinDPOGS212105-PA519 aa
Genomic positionDPSCF300038 - 597016-599837
RNAseq coverage653x (Rank: top 20%)
Annotation
HeliconiusHMEL0125380.090.75% 
BombyxBGIBMGA006606-TA0.087.69% 
DrosophilaCG9629-PA2e-17855.71% 
EBI UniRef50UniRef50_P494190.061.84%Alpha-aminoadipic semialdehyde dehydrogenase n=53 Tax=Coelomata RepID=AL7A1_HUMAN
NCBI RefSeqXP_969882.10.067.45%PREDICTED: similar to aldehyde dehydrogenase 7 family, member A1 [Tribolium castaneum]
NCBI nr blastpgi|910951130.067.45%PREDICTED: similar to aldehyde dehydrogenase 7 family, member A1 [Tribolium castaneum]
NCBI nr blastxgi|910951130.067.58%PREDICTED: similar to aldehyde dehydrogenase 7 family, member A1 [Tribolium castaneum]
Group
Gene OntologyGO:00081522.6e-126metabolic process
GO:00551142.6e-126oxidation-reduction process
GO:00164912.6e-126oxidoreductase activity
GO:00166201.9e-51oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor
KEGG pathwaytca:6583950.0 
 K00128 (E1.2.1.3)maps-> 1,2-Dichloroethane degradation
    Arginine and proline metabolism
    Glycolysis / Gluconeogenesis
    Propanoate metabolism
    Limonene and pinene degradation
    Tryptophan metabolism
    Lysine degradation
    Valine, leucine and isoleucine degradation
    Pyruvate metabolism
    beta-Alanine metabolism
    Fatty acid metabolism
    3-Chloroacrylic acid degradation
    Glycerolipid metabolism
    Ascorbate and aldarate metabolism
    Histidine metabolism
InterPro domain[30-504] IPR0161612.6e-126Aldehyde/histidinol dehydrogenase
[40-500] IPR0155901.4e-121Aldehyde dehydrogenase domain
[31-285] IPR0161621.2e-71Aldehyde dehydrogenase, N-terminal
[286-467] IPR0161631.9e-51Aldehyde dehydrogenase, C-terminal
Orthology groupMCL13654 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212105-TA
ATGGCTAGAAACGCGTCCAGTTACCTCATCGAGGATCCAAAATATTCCTTTTTAAAAGATTTGGGGTTGGATAAAAAGAATGTGGGAGTTTTTAACGGAAAATGGGAAGCTAACGGCCCGATGATTCAAACTTTTAGTCCAGCCAACGGTAAAGTAATAGCAGAGGTGCAGGCGGCCAGTGTCGCAGATTATGAATCCTGTGCGAAGGCAGCTCAGGATGCGTGGCATGAATGGGCGGAAATGCCAGCACCAGCCCGGGGGGAAATCGTCAGACAAATAGGAGACGCCCTTAGAGAAAAGTTGCAGCCTTTAGGGCAATTAGTTTCTTTAGAAATGGGTAAAATTCTTCCCGAAGCAATAGGCGAAGTCGTCGAATATATCCACGTATGTGACTTAGCACTTGGTCTATCACGTTCACTCCCTGGGACGATTTTCCCATCGGAGCGGCCCGGTCACGTCCTTATTGAAAAATGGAATCCTCTCGGCGCCATCGGTATCATTACTGCTTTCAATTTTCCTGTTGCTGTTTTTGGATGGAACAGCGCTATCGCAATGGTATGCGGCGACGTCAGCGTGTGGAAGCCATCAGAAACCACGCCACTCATATCAGTGGCAGTGACCAAGATTGTAGAAAGTGTGCTCGTTAAAAACAACATTCCGGGGGCTGTAGCTGCGTTGTGTGTTGGCGGGAAGGATATAGGGCAAACATTAGTGAAAGATCACAGGATGAAGCTTGTCTCCTTCACAGGCAGCACAGCTGTCGGACAAGAGGTAGGTGTGGAAGTCCAAAGACGCTTCGGGCGTCACTTGTTGGAACTAGGAGGAAACAACGCTATCATCGTCAACGAGGACGCCAACCTTCAACTACTGCTGAATGCGGCGCTGTTCGCTTGCGCTGGGACCGCGGGTCAACGCTGCACTACCACAAGAAGACTTCTTATACATAAAAAAGTGTACTCCGAGGTAGTGTCTAAGCTAAAGAAGGCCTATGCTAGTGTTTTGAGTCGCATCGGGGATCCCCTGGAGTCCGAATCGCTAATTGGACCGCTCCACACACCAGCTGCCTTACAAGCCTATAAAGACACCGTCGCGGCTGCTGTTAAACAAGGAGGAACTATTGAATTCGGTGGAAAGGTGATCGAACGTGAGGGCTACTTTGTGGAGCCGACTATAATAACAGGGCTACCGCATGATTCTCCTCTGGTTAAGACTGAATGTTTCGCTCCCATCGTTTATTGTATAGAGATTCCTGATCTAGAAACTGGTATTCAATACAACAATGAAGTGGAGCAGGGTCTGTCATCAAGTCTTTTTACTGAAAATATGGGAAATGTTTTCAAGTGGATTGGTCCTCACGGATCGGATTGCGGCATCGTGAATGTAAATATACCAACCAACGGCGCGGAGGTAGGTGGAGCCTTCGGAGGTGAAAAGGCCACGGGCGGCGGCCGCGAGTGTGGCTCTGACTCCTGGAAGAACTATATGCGTCGCTCAACAGTCACTATCAACTACTCCGGAACCATCAAACTCGCACAGAACATCAAATTCGGCGACGACTAA

Protein sequence:

>DPOGS212105-PA
MARNASSYLIEDPKYSFLKDLGLDKKNVGVFNGKWEANGPMIQTFSPANGKVIAEVQAASVADYESCAKAAQDAWHEWAEMPAPARGEIVRQIGDALREKLQPLGQLVSLEMGKILPEAIGEVVEYIHVCDLALGLSRSLPGTIFPSERPGHVLIEKWNPLGAIGIITAFNFPVAVFGWNSAIAMVCGDVSVWKPSETTPLISVAVTKIVESVLVKNNIPGAVAALCVGGKDIGQTLVKDHRMKLVSFTGSTAVGQEVGVEVQRRFGRHLLELGGNNAIIVNEDANLQLLLNAALFACAGTAGQRCTTTRRLLIHKKVYSEVVSKLKKAYASVLSRIGDPLESESLIGPLHTPAALQAYKDTVAAAVKQGGTIEFGGKVIEREGYFVEPTIITGLPHDSPLVKTECFAPIVYCIEIPDLETGIQYNNEVEQGLSSSLFTENMGNVFKWIGPHGSDCGIVNVNIPTNGAEVGGAFGGEKATGGGRECGSDSWKNYMRRSTVTINYSGTIKLAQNIKFGDD-