Monarch geneset OGS2.0

DPOGS210365
TranscriptDPOGS210365-TA1131 bp
ProteinDPOGS210365-PA376 aa
Genomic positionDPSCF300025 + 471670-472800
RNAseq coverage2311x (Rank: top 5%)
Annotation
HeliconiusHMEL0138340.092.55% 
BombyxBGIBMGA011922-TA0.091.03% 
DrosophilaFdh-PA8e-16872.94% 
EBI UniRef50UniRef50_P804672e-17175.34%Alcohol dehydrogenase class-3 n=114 Tax=cellular organisms RepID=ADHX_UROHA
NCBI RefSeqNP_001040507.10.092.02%alcohol dehydrogenase [Bombyx mori]
NCBI nr blastpgi|1140524880.092.02%alcohol dehydrogenase [Bombyx mori]
NCBI nr blastxgi|1140524880.092.27%alcohol dehydrogenase [Bombyx mori]
Group
Gene OntologyGO:00082707.9e-215zinc ion binding
GO:00551147.9e-215oxidation-reduction process
GO:00164917.9e-215oxidoreductase activity
GO:00060695.9e-195ethanol oxidation
GO:00519035.9e-195S-(hydroxymethyl)glutathione dehydrogenase activity
GO:00054883.3e-36binding
KEGG pathwaydre:1165170.0 
 K00121 (frmA, ADH5, adhC)maps-> Drug metabolism - cytochrome P450
    Glycolysis / Gluconeogenesis
    Fatty acid metabolism
    3-Chloroacrylic acid degradation
    Tyrosine metabolism
    Metabolism of xenobiotics by cytochrome P450
    Methane metabolism
    1- and 2-Methylnaphthalene degradation
    Retinol metabolism
InterPro domain[1-377] IPR0020857.9e-215Alcohol dehydrogenase superfamily, zinc-type
[9-376] IPR0141835.9e-195Alcohol dehydrogenase class III/S-(hydroxymethyl)glutathione dehydrogenase
[2-200] IPR0110323.4e-84GroES-like
[197-316] IPR0160403.3e-36NAD(P)-binding domain
[35-160] IPR0131544.5e-25Alcohol dehydrogenase GroES-like
[204-326] IPR0131494.2e-21Alcohol dehydrogenase, C-terminal
Orthology groupMCL11155 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210365-TA
ATGTCTACTGCTGGAAAAGTTATTAAATGTAAAGCAGCTGTAGCTTGGGAAGCTGGAAAGCCATTATCTATTGAAGAAATCGAAGTTGATCCACCAAAAGCTGGAGAAGTACGGGTAAAAATTCTTGCTACCGGTGTCTGTCACACAGATGCATATACTTTGTCCGGAAAAGATCCAGAAGGAGTCTTTCCAGTCATTTTAGGGCATGAAGGCGGTGGTGTAGTCGAAAGTGTTGGCGAGGGAGTGACGTCCGTGAAGCCCGGCGATCATGTGTTACCTCTGTACGTCCCGCAGTGCAAAACATGCAAATTCTGTCAAAACCCTAAAACCAACCTTTGTCAGAAAGTGAGAATCACGCAGGGACAGGGTGTCATGCCTGATGGCACGAAGAGATTCCGATGCAAAGGTCAAGAGCTGTTCCACTTCATGGGGTGCTCAACGTTCAGTGAATATACTGTAGTTCTAGAAATTTCGTTGTGCAAGATAGCTGACGCCGCGCCTCTTGATAAGGTATGTCTATTGGGATGTGGTGTGACAACCGGCTATGGAGCCGCTCTCAATACAGCAAAGGTGGAACCGGGTTCGAACTGCGCTATATTCGGTCTCGGAGCTGTCGGCCTGGCCGTAGCTCTCGGTTGTAAAGCAGCGGGGGCAAATCGCATCATAGGAGTCGACATTAACCCTGACAAATATGAGATTGCCAAAAAATTTGGCGTCAATGAATTCGTCAATCCCAAGGACTACGACAAACCTATCCAGCAGGTACTTGTTGACTTGACAGATGGAGGATTGGAGTATACATTTGAGTGCATAGGTAATGTTAATACTATGCGTGCTGCCCTGGAGGCATGCCATAAGGGTTGGGGTGAGTCTGTTATCATTGGTGTGGCCGCCGCCGGGGAAGAGATCAGCACCCGTCCATTCCAACTCGTGACCGGCCGAGTATGGAGAGGAACAGCCTTCGGAGGCTACAAGAGCCGTGACAGTGTTCCTAAGTTGGTTGATGACTACCTCAATAAGAAACTTCCTTTGGATGACTTCGTTACTCATAAAGTACCTCTCAGTGAGATTAACGAGGCCTTCCATTTAATGCACGTGGGGAAATCCATCCGTGCCGTTGTTGAGTTTTAA

Protein sequence:

>DPOGS210365-PA
MSTAGKVIKCKAAVAWEAGKPLSIEEIEVDPPKAGEVRVKILATGVCHTDAYTLSGKDPEGVFPVILGHEGGGVVESVGEGVTSVKPGDHVLPLYVPQCKTCKFCQNPKTNLCQKVRITQGQGVMPDGTKRFRCKGQELFHFMGCSTFSEYTVVLEISLCKIADAAPLDKVCLLGCGVTTGYGAALNTAKVEPGSNCAIFGLGAVGLAVALGCKAAGANRIIGVDINPDKYEIAKKFGVNEFVNPKDYDKPIQQVLVDLTDGGLEYTFECIGNVNTMRAALEACHKGWGESVIIGVAAAGEEISTRPFQLVTGRVWRGTAFGGYKSRDSVPKLVDDYLNKKLPLDDFVTHKVPLSEINEAFHLMHVGKSIRAVVEF-