Monarch geneset OGS2.0

DPOGS215931
TranscriptDPOGS215931-TA1848 bp
ProteinDPOGS215931-PA615 aa
Genomic positionDPSCF300308 - 116366-143783
RNAseq coverage1657x (Rank: top 8%)
Annotation
HeliconiusHMEL0041608e-16562.47% 
BombyxBGIBMGA001966-TA0.071.91% 
DrosophilaAldh-III-PO1e-15949.28% 
EBI UniRef50UniRef50_UPI00022CA2F02e-17852.97%UPI00022CA2F0 related cluster n=3 Tax=unknown RepID=UPI00022CA2F0
NCBI RefSeqNP_001166835.10.064.99%aldehyde dehydrogenase isoform 1 [Bombyx mori]
NCBI nr blastpgi|2905606530.064.99%aldehyde dehydrogenase isoform 1 [Bombyx mori]
NCBI nr blastxgi|2905606530.064.64%aldehyde dehydrogenase isoform 1 [Bombyx mori]
Group
Gene OntologyGO:00060815.1e-238cellular aldehyde metabolic process
GO:00551145.1e-238oxidation-reduction process
GO:00040305.1e-238aldehyde dehydrogenase [NAD(P)+] activity
GO:00081524.9e-107metabolic process
GO:00164914.9e-107oxidoreductase activity
GO:00166208.1e-58oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor
KEGG pathwaynve:NEMVE_v1g1700787e-123 
 K00128 (E1.2.1.3)maps-> 1,2-Dichloroethane degradation
    Arginine and proline metabolism
    Glycolysis / Gluconeogenesis
    Propanoate metabolism
    Limonene and pinene degradation
    Tryptophan metabolism
    Lysine degradation
    Valine, leucine and isoleucine degradation
    Pyruvate metabolism
    beta-Alanine metabolism
    Fatty acid metabolism
    3-Chloroacrylic acid degradation
    Glycerolipid metabolism
    Ascorbate and aldarate metabolism
    Histidine metabolism
InterPro domain[13-584] IPR0123945.1e-238Aldehyde dehydrogenase NAD(P)-dependent
[53-460] IPR0161614.9e-107Aldehyde/histidinol dehydrogenase
[36-444] IPR0155902.6e-78Aldehyde dehydrogenase domain
[31-273] IPR0161621.9e-58Aldehyde dehydrogenase, N-terminal
[274-453] IPR0161638.1e-58Aldehyde dehydrogenase, C-terminal
Orthology groupMCL10393 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215931-TA
ATGCCAGCGTCGAGCGAACAACACGCGGCTAGTGACGTTCTTGTCATCGACATCGAGTCGGATAACGAAATTATCGATACACCTATAAATATGGATAACCACAACACCATGAACGGAAACGGAACACTATCAAGCAATAAACCCAAAGCCGTCAACATGTCTGAGGTCGTAAATAAAGCAAGGGATACGTTCGACAGCGGCGTCACCAAGCCCATTGAATGGAGAAGGAAGCAACTGAAAAATCTACTGAGAATGTACGAGGAAAACAGAAACGCGATGGTAGAGGCTCTCGTTAAAGACTTAAGAAGAAGCAAAATGGAAGCTATTCTCCTAGAAGTCGACTATCTCATTAATGACATAAGAAATACTATTTACAACTTGGATAACTGGGTCGCTCCTGTGAAGCCCCCAAAGGGTTTAGTGAATATGCTGGATGATGTAGTCATCTACAACGACCCCTACGGCGTTGTTCTCATCATCGGTGCCTGGAACTATCCTCTCCAACTGCTACTGCTGCCACTAGCTGGTGCTATAGCTGCGGGAAATGCTGTCATCCTCAAGCCCAGTGAGCTGGCGGAGGCCAGCGCTAAGTTCATGGTGGAAACCTTGCCTAAATATGTGGATAGTGACGCAATAATTTTAGTGGAAGGAGGTCCGGAGGAAACCTCCGAATTATTGAAACAAAGATTCGACTACATCTTCTACACAGGCGGGACTAACGTTGGCAGAATAGTTTATGCAGCAGCTACCAAAAACTTGACTCCTGTCACATTGGAACTGGGAGGGAAGAGCCCTGTGTACATAGATAACACAGTGGATATAGAAGTAACAGCGAAGCGTATCCTCTGGGGTAAGTTCATCAACGCCGGTCAGACCTGTATAGCCCCGGACTACATCCTGTGCTCGAGGACCGTTCAGGACAAGTTTGTGGATGCAGCCAAGAATGTTCTGCGGGAGTTTTATGGGGAAGATCCTCAGAAATCACCGGATCTCTGCAGAATCATTAATAACAGACACTTCAGTCGTCTGCAAGCATTGATTGATGCTAGCAAGGACAAAGTCGCTATTGGCGGCCGATACGACTCGCAGGACAAATACATTGCTCCAACGTTACTAGCGAATGTCACTGCCAGTGACGTCATCATGAAGGACGAGATATTTGGACCTATCCTGCCCATTGTGCCTGTGGAGAACGCCTATGAAGCCATAAAATTCATTAACGAAAGGGAACATCCGTTAGTTCTATACGTGTTCAGTGTCCAGAGCAACATCCAACAGCTGTTCACACAGCAGACGCGTTCAGGCAGTCTGTGTATCAATGACACTATAATGTTTTATGGCGTACAGGTGATGGTATTTGTAAATAGTTATGTATATAACGTTATGTTGTATGTAAACGATAATGTGGTGGTGGACGTCTGTAGGGAAAAGCCGTTGGTGTTGTACGCGTTCACTACAGACGAGGAACTCGCTAAACGGATAGCGGAGAACACGAGCAGCGGCGGCATGTGCATTAATGATACTGTCATGCAAATGGGAGTTGATACATTGCCATTTGGCGGTGTCGGTAGTAGTGGCATGGGAGCCTACCACGGTAAGGCCTCATTTGACACCTTCACACATAAAAAGAGCTGCTTAATAAGGAACTTCGCTGCTATTGGTGAAAGACTTGGATCAGGCCGCTACCCTCCCTACACGGACGGTAAGCTGAGCTTCATTACAACCCTGATGAGAAAACGCAACGGACCCTCCCTCAAATACCTCCCACACCTGATTGCCTTTGCTCTTGGAGCCGGTGTGGCATACGGAATAGCCACTTGGCAGAAGATGTCGTCGGAGCACCTATAG

Protein sequence:

>DPOGS215931-PA
MPASSEQHAASDVLVIDIESDNEIIDTPINMDNHNTMNGNGTLSSNKPKAVNMSEVVNKARDTFDSGVTKPIEWRRKQLKNLLRMYEENRNAMVEALVKDLRRSKMEAILLEVDYLINDIRNTIYNLDNWVAPVKPPKGLVNMLDDVVIYNDPYGVVLIIGAWNYPLQLLLLPLAGAIAAGNAVILKPSELAEASAKFMVETLPKYVDSDAIILVEGGPEETSELLKQRFDYIFYTGGTNVGRIVYAAATKNLTPVTLELGGKSPVYIDNTVDIEVTAKRILWGKFINAGQTCIAPDYILCSRTVQDKFVDAAKNVLREFYGEDPQKSPDLCRIINNRHFSRLQALIDASKDKVAIGGRYDSQDKYIAPTLLANVTASDVIMKDEIFGPILPIVPVENAYEAIKFINEREHPLVLYVFSVQSNIQQLFTQQTRSGSLCINDTIMFYGVQVMVFVNSYVYNVMLYVNDNVVVDVCREKPLVLYAFTTDEELAKRIAENTSSGGMCINDTVMQMGVDTLPFGGVGSSGMGAYHGKASFDTFTHKKSCLIRNFAAIGERLGSGRYPPYTDGKLSFITTLMRKRNGPSLKYLPHLIAFALGAGVAYGIATWQKMSSEHL-