Monarch geneset OGS2.0

DPOGS205397
Transcript        DPOGS205397-TA  1719 bp
Protein           DPOGS205397-PA  572 aa
Genomic position  DPSCF300407 - 174797-190766
RNAseq coverage   1945x (Rank: top 6%)
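
The genomic position and transcript length listed above can be compared with a few lines of Python. This is only a sketch: the helper names `parse_position` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the record does not state.

```python
import re


def parse_position(text):
    """Split a 'SCAFFOLD - START-END' string into (scaffold, start, end)."""
    match = re.fullmatch(r"\s*(\S+)\s*-\s*(\d+)\s*-\s*(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record.
scaffold, start, end = parse_position("DPSCF300407 - 174797-190766")
locus_bp = span_length(start, end)
transcript_bp = 1719

print(scaffold, locus_bp, locus_bp - transcript_bp)
```

Under that coordinate assumption the locus spans 15970 bp, about 14 kb more than the 1719 bp transcript. That would be consistent with a multi-exon gene model, although the record itself does not list exon boundaries.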
Annotation
Heliconius      HMEL007157        0.0     67.66%
Bombyx          BGIBMGA002457-TA  1e-132  49.45%
Drosophila      Aldh-PA           0.0     71.40%
EBI UniRef50    UniRef50_G3NJG5   3e-175  56.55%  Uncharacterized protein n=17 Tax=cellular organisms RepID=G3NJG5_GASAC
NCBI RefSeq     NP_001040475.1    0.0     79.44%  mitochondrial aldehyde dehydrogenase [Bombyx mori]
NCBI nr blastp  gi|114052408      0.0     79.44%  mitochondrial aldehyde dehydrogenase [Bombyx mori]
NCBI nr blastx  gi|114052408      0.0     79.44%  mitochondrial aldehyde dehydrogenase [Bombyx mori]
Group
Gene Ontology  GO:0008152  4.6e-167  metabolic process
               GO:0055114  4.6e-167  oxidation-reduction process
               GO:0016491  4.6e-167  oxidoreductase activity
               GO:0016620  8.2e-69   oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor
KEGG pathway  nvi:100120559  0.0
 K00128 (E1.2.1.3) maps->
    1,2-Dichloroethane degradation
    Arginine and proline metabolism
    Glycolysis / Gluconeogenesis
    Propanoate metabolism
    Limonene and pinene degradation
    Tryptophan metabolism
    Lysine degradation
    Valine, leucine and isoleucine degradation
    Pyruvate metabolism
    beta-Alanine metabolism
    Fatty acid metabolism
    3-Chloroacrylic acid degradation
    Glycerolipid metabolism
    Ascorbate and aldarate metabolism
    Histidine metabolism
InterPro domain  [149-563]  IPR015590  4.6e-167  Aldehyde dehydrogenase domain
                 [148-572]  IPR016161  7.5e-163  Aldehyde/histidinol dehydrogenase
                 [148-351]  IPR016162  1.7e-84   Aldehyde dehydrogenase, N-terminal
                 [352-538]  IPR016163  8.2e-69   Aldehyde dehydrogenase, C-terminal
Orthology group  MCL10287  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205397-TA
ATGTTGAGGCGTTTCAGTACAGTTTTTCTTACGAAAAAAGCAAATTATGCAACAGCAGCAGTCCCGGCACCGAATACAAATCCTGAAATATTTTATACCGGTCTCTTCATAAACAACGAATGGGTTAAGTCGGCTGATGGCAAAACTTTCAATACAGAAAATCCTGCTAACGGCAAAGTTATCGCAGAGGTCCAACAAGCTGGGAAGGCAGACGTCGATCTAGCCGTCGCCGCTGCTAAAGACGCTTTCAGATTTGGATCAAAATGGCGAACAATGGACGCCTCACAGCGAGGGTACCTCATCAACAAACTCGCGGACCTCGTCGAAAGGGACAGAGCTTATCTTGCTCGTGTTAATTATGTTGCTTACGGCTTACGCTATCAAAATGTTTTGACTTGTATTCTGTCAGAGTGGCAACGTCGCATAAAATGTAGATTTGGATCAAAATGGCGAACAATGGACGCCTCACAGCGAGGGTACCTCATCAACAAACTCGCGGACCTCGTCGAAAGGGACAGAGCTTATCTTGCTAGCTTAGAGACGCTGGACAACGGGAAGCCATACCTCGCGGCCTACCATGGAGATTTAGCCGGTGTGATCAAGAATCTGAGGTACTACGCGGGCTGGGCCGACAAAAACCACGGAATGGTACTGCCCGCTGATGGACCATACTTCGCGTACACGAGACACGAACCAGTCGGTGTTTGTGGTCAGATCATCCCGTGGAACTTCCCACTGTTGATGGCCGCCTGGAAGCTGGGCCCCGCGCTCGCCGGCGGGAACACGTTGGTGCTGAAACCGGCCGAGCAGACTCCGCTCACCGCACTGTACCTGGCCCAACTCGTTAAAGAGGCCGGTTTCCCTCCAGGAGTAGTGAACGTGTTGCCGGGCTTCGGAGACGCGGGCGCAGCCCTGGTCGACCATCCCGACGTCGACAAAATCGCATTCACGGGCTCGACGGAGGTCGGTAAATTGATTCAGCGCGGCGCGGCCGGTACCGTGAAGCGCATAACCCTGGAGCTCGGAGGGAAATCCCCGAACATCATATTCGCGGACGCCGACCTTCCCAACGCCGTGGAACTCTCACATCACGCGCTCTTCTACAACATGGGACAGTGCTGCTGTGCCGGGTCCCGAACGTTCGTCGAGGAATCAATTTATGACAAGTTCGTGGAAATGTCGGCCGCTCGCGCCGCCAGGAGGAAGGTGGGCGATCCCTTCAAACCCGACACGGAGCAGGGACCGCAGATCGACTCGGAGCAACACTCTAAGATACTGAGTTTGATTGAAAGCGGCAAACGACAAGGAGCCAAGCTGATGACCGGCGGACTCCGGCACGGCGAGCTCGGGCACTTCATACAGCCGACCGTGTTCGCCGACGTGCGAGACGACATGGACATCGCTCGGACAGAGATCTTCGGGCCGGTGCAGCAAATCATCAAATTCTCCAAGATCGACGAGCTACTGGAGCGCGCGAACAACACCGAATACGGCTTAGCCGCGGCCGTCTTCACCAAGGACATCGACAGAGCCAACTACTTGATCCAGGGTCTCCGCGCCGGGACCGTGTGGGTCAATGACTACAACGTCTTCGGCCAGCAAGTGCCGTTCGGCGGATACAAACAATCTGGACTGGGAAGAGAAAACGGACCCTACGGCATCAAGAACTACACGGAAGTGAAGGCCGTGGTGATCAAGATGCAGCAGAAAAACTCATAG

Protein sequence:

>DPOGS205397-PA
MLRRFSTVFLTKKANYATAAVPAPNTNPEIFYTGLFINNEWVKSADGKTFNTENPANGKVIAEVQQAGKADVDLAVAAAKDAFRFGSKWRTMDASQRGYLINKLADLVERDRAYLARVNYVAYGLRYQNVLTCILSEWQRRIKCRFGSKWRTMDASQRGYLINKLADLVERDRAYLASLETLDNGKPYLAAYHGDLAGVIKNLRYYAGWADKNHGMVLPADGPYFAYTRHEPVGVCGQIIPWNFPLLMAAWKLGPALAGGNTLVLKPAEQTPLTALYLAQLVKEAGFPPGVVNVLPGFGDAGAALVDHPDVDKIAFTGSTEVGKLIQRGAAGTVKRITLELGGKSPNIIFADADLPNAVELSHHALFYNMGQCCCAGSRTFVEESIYDKFVEMSAARAARRKVGDPFKPDTEQGPQIDSEQHSKILSLIESGKRQGAKLMTGGLRHGELGHFIQPTVFADVRDDMDIARTEIFGPVQQIIKFSKIDELLERANNTEYGLAAAVFTKDIDRANYLIQGLRAGTVWVNDYNVFGQQVPFGGYKQSGLGRENGPYGIKNYTEVKAVVIKMQQKNS-
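
The transcript and protein records above can be cross-checked by translating the coding sequence. The sketch below assumes the standard genetic code (NCBI translation table 1) and follows this record in writing the stop codon as `-`. The names `translate` and `expected_protein_length` are hypothetical helpers, not part of any database tooling.

```python
BASES = "TCAG"
# Standard genetic code (NCBI table 1), with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, writing stop codons as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if residue == "*" else residue)
    return "".join(residues)


def expected_protein_length(cds_bp):
    """Residue count for a CDS of cds_bp bases that ends in one stop codon."""
    codons, remainder = divmod(cds_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


# First 24 and last 21 bases of DPOGS205397-TA, copied from the record.
print(translate("ATGTTGAGGCGTTTCAGTACAGTT"))
print(translate("ATGCAGCAGAAAAACTCATAG"))
print(expected_protein_length(1719))
```

The two fragments translate to `MLRRFSTV` and `MQQKNS-`, matching the start and end of DPOGS205397-PA. A 1719 bp coding sequence with one stop codon gives 572 residues, which agrees with the protein length listed for this gene.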