Monarch geneset OGS2.0

DPOGS201995
TranscriptDPOGS201995-TA1440 bp
ProteinDPOGS201995-PA479 aa
Genomic positionDPSCF300060 + 204618-209107
RNAseq coverage866x (Rank: top 15%)
Annotation
HeliconiusHMEL0026280.074.00% 
BombyxBGIBMGA010403-TA0.084.55% 
DrosophilaCG31075-PA1e-17262.45% 
EBI UniRef50UniRef50_Q7Q1655e-16957.46%AGAP009944-PA n=3 Tax=Eukaryota RepID=Q7Q165_ANOGA
NCBI RefSeqNP_001040198.10.084.55%mitochondrial aldehyde dehydrogenase [Bombyx mori]
NCBI nr blastpgi|1140519660.084.55%mitochondrial aldehyde dehydrogenase [Bombyx mori]
NCBI nr blastxgi|1140519660.084.55%mitochondrial aldehyde dehydrogenase [Bombyx mori]
Group
Gene OntologyGO:00081521.7e-178metabolic process
GO:00551141.7e-178oxidation-reduction process
GO:00164911.7e-178oxidoreductase activity
GO:00166203.8e-69oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor
KEGG pathwaytca:6594380.0 
 K00128 (E1.2.1.3)maps-> 1,2-Dichloroethane degradation
    Arginine and proline metabolism
    Glycolysis / Gluconeogenesis
    Propanoate metabolism
    Limonene and pinene degradation
    Tryptophan metabolism
    Lysine degradation
    Valine, leucine and isoleucine degradation
    Pyruvate metabolism
    beta-Alanine metabolism
    Fatty acid metabolism
    3-Chloroacrylic acid degradation
    Glycerolipid metabolism
    Ascorbate and aldarate metabolism
    Histidine metabolism
InterPro domain[8-469] IPR0155901.7e-178Aldehyde dehydrogenase domain
[1-478] IPR0161612.1e-176Aldehyde/histidinol dehydrogenase
[1-259] IPR0161625.6e-106Aldehyde dehydrogenase, N-terminal
[260-444] IPR0161633.8e-69Aldehyde dehydrogenase, C-terminal
Orthology groupMCL10890 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201995-TA
CAGTTATTCATTAACAATAAATGGGTAGATGCTGTGAGTAAAAAAACTTTTCCCACCATAAACCCACAAGATGAAACTGTTATCGCCAATGTTGCCGAAGGAGACAAGGCTGACATAGACATAGCAGTGGAAGCAGCTCGTAAAGCATTTCACAGATATTCAAAATGGCGTACTATGGATGCATCCCAAAGAGGTTTGCTTATGTTAAAACTGGCAGAACTTATGGATTCCCAAGCAAAATATTTGGCAGAACTAGAGACTTTGGACTGCGGTAAACCTGTCAAGATAGCTGAAGAAGAGGTCCACTCTTCAGCTGGGGTATTGAGATATTATGCAGGAAAAGCTGACAAAATATTGGGCAACACTATACCGGCCGATGGTGAATGTTTGTCTATGACATTGAAAGAACCTGTTGGCGTGTGTGGACAGATTATTCCATGGAATTATCCCATACCAATGATATCATGGAAACTTGGACCAGCTCTGGCGGCTGGGTGTACAATAGTATTGAAGCCCGCGGAACAAACTCCACTAACTGCGCTAGCTGTGGCGGCGTTAGTGAAGGAGGCTGGCTTCCCGCCGGGCGTTGTAAATGTTGTTCCTGGATACGGTCCGACCGCAGGGGCAGCCCTAACGCACCACCCACACGTCGACAAGATAGCATTCACAGGCTCCACAGAGGTCGGAAAATTAATATTGGGTGCAGCGTCGGTTGCTAACCTTAAAAGAGTGACGTTGGAGTTGGGTGGAAAGAGCCCTCTGGTGGTGTTTAATGATGCTGATGTTGAGAAAGCTGCAAGAATAGCTCATGCAGCGGCCTTTGCTAATGGAGGGCAATGTTGTTGTGCTGGCACAAGGACCTACGTACAATCTGGGATATACGAGGCCTTCGTTAACAAGGCGGCAGAGATCGCCAACCAAAGGTCCGTCGGCAACCCTTATGATGAAGTAGATCAAGGACCTCAGATCGACCAAGAGATGTTCAGCAAAGTTCTTGGTTATATTGATTCCGGAAAGAATTCTGGGGCTAGATGTGTCGCTGGAGGGGATAGAATTGGTGATAAGGGATACTATATTAAACCTACAGTTTTTGCTGACGTTGAAGATGATATGAAAATAGCAAGGGAAGAGATATTTGGTCCAGTCCAGAGCATACTGAAGTTTGATACATTCGAGGAGGTGATTGATAGAGCCAATGACACTAACTATGGTCTGGGAGCTGGTGTTATAACAAACGATATTACTATTGCAATGAGCTTTGCAAGACACGTCCGTGCTGGATCTATATGGATTAATACCTATGACCATGTTACAAGTCAAACTCCGTTTGGTGGTTTCGGTGACTCTGGTATGGGCAGGGAACTAGGTGAAGATGGTATACTGCCTTATCTTGAAACTAAAACTATTACGTTAGCACTGCCTAAGCATCCACAATTCTAA

Protein sequence:

>DPOGS201995-PA
QLFINNKWVDAVSKKTFPTINPQDETVIANVAEGDKADIDIAVEAARKAFHRYSKWRTMDASQRGLLMLKLAELMDSQAKYLAELETLDCGKPVKIAEEEVHSSAGVLRYYAGKADKILGNTIPADGECLSMTLKEPVGVCGQIIPWNYPIPMISWKLGPALAAGCTIVLKPAEQTPLTALAVAALVKEAGFPPGVVNVVPGYGPTAGAALTHHPHVDKIAFTGSTEVGKLILGAASVANLKRVTLELGGKSPLVVFNDADVEKAARIAHAAAFANGGQCCCAGTRTYVQSGIYEAFVNKAAEIANQRSVGNPYDEVDQGPQIDQEMFSKVLGYIDSGKNSGARCVAGGDRIGDKGYYIKPTVFADVEDDMKIAREEIFGPVQSILKFDTFEEVIDRANDTNYGLGAGVITNDITIAMSFARHVRAGSIWINTYDHVTSQTPFGGFGDSGMGRELGEDGILPYLETKTITLALPKHPQF-