Monarch geneset OGS2.0

DPOGS201127
TranscriptDPOGS201127-TA1572 bp
ProteinDPOGS201127-PA523 aa
Genomic positionDPSCF300137 + 467657-478846
RNAseq coverage83x (Rank: top 64%)
Annotation
HeliconiusHMEL0053211e-12455.50% 
BombyxBGIBMGA013658-TA4e-6860.48% 
DrosophilaCG17221-PA3e-0833.33% 
EBI UniRef50UniRef50_Q17DZ71e-3027.56%Alcohol dehydrogenase n=8 Tax=Culicidae RepID=Q17DZ7_AEDAE
NCBI RefSeqXP_001602224.11e-4130.77%PREDICTED: similar to alcohol dehydrogenase [Nasonia vitripennis]
NCBI nr blastpgi|3227950589e-3827.14%hypothetical protein SINV_13816 [Solenopsis invicta]
NCBI nr blastxgi|3227950581e-3427.37%hypothetical protein SINV_13816 [Solenopsis invicta]
Group
Gene OntologyGO:00082701.8e-33zinc ion binding
GO:00551141.8e-33oxidation-reduction process
GO:00164911.8e-33oxidoreductase activity
KEGG pathwaybca:BCE_35215e-18 
 K00001 (E1.1.1.1, adh)maps-> Drug metabolism - cytochrome P450
    Glycolysis / Gluconeogenesis
    Fatty acid metabolism
    3-Chloroacrylic acid degradation
    Tyrosine metabolism
    Metabolism of xenobiotics by cytochrome P450
    1- and 2-Methylnaphthalene degradation
    Retinol metabolism
InterPro domain[169-524] IPR0020851.8e-33Alcohol dehydrogenase superfamily, zinc-type
[175-325] IPR0110324.7e-24GroES-like
[200-262] IPR0131546.5e-08Alcohol dehydrogenase GroES-like
Orthology groupMCL17355 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201127-TA
ATGGCCATCTTTACTAATGATGGTAGCCAAGAAGATGAAGATGTGGAAAATGATATTGGTAGCGAAGAAGAAACATTGAGTGAATCTGATATCGACCCTCCCCCGTTGTTTGTGGCTGAAATGGATGATTTCAAACTGAGAGCTGGTGAAAAAATTGGGGCTTTGCATGATGCCGCTCTGGGTGTAGCAAATAACAGCAAAATCAAAATCAATGATGCTATTGATAAAACCGTTGAGGCTGCAAAGAAGATCCGAGAGGTTGTGTTAGAGTTGTGGCAGCATGAGTTAGTGTTAGAGGGCCGGGCGCGAGCGGCGGCCTGGGCGGCCGAGGCGGTGCGCAGAATCAGAGAGGGCGCTCTGCCCCTCTCTCCGGCCGTCTTATACCAAGACCTGGTCGACCTGTTCAAGGATAGAGTATGGCGTCGCAGTATGGTGATCTTCGTATGTGGCGTTGCGGTGGGCGGGGGCGCGGGCCTAATGGTCGGGCTCCGGGCCGGGACCCGCGCTCCTCTCGGACCTCACGCTAGGGCGCTGCACACGCAAAGCGATCAGACCGTCCTCCTGGTAGAGGATGCTGTGTCTCCGGGAGTGTCAGCGGGGGAGGTGCTGGTCCGAGTGCAGTCGTTCAGTGTGTGTCCTGTAGACCGCGCCGTGCTGCGCGGCCGAGGTTCCGCCCTGCGAGCTCTGTTGGGAACCGGACCCGTCACTGTGGGACGAGGCTTCGCGGGAGTGGTGCTGGACGTGGGCCAGGGGGTCAACGACCTGGAACTGGGCGACGAGGTTTGGGGAGTCGTCAGCGAATGGAGCGGCGGCGGGGCCAGTGAACTACTGGCCATTAGGAGGAGCCGTGTGAGTCGCTCCCCGCGCGGCGTATCCCCTCCTCACGCGGCCGCCATCCCCTGGGGGGGCACGAGAGCGCTAAAGGCCCTCGGAAAGATTGATATGAAGGGAAAAAGGGTGTGCGTGGTGGGCGGCAACACGTCCGCCTCGTGCCTGTCGATACAGCTGCTGTGCGGGCGGGGAGCGAGGGTGTGCGTCGCTGAACACTCGAAAGCACATCCCACCATGAAAGCACTCGCCCTCGTCACTAACTCCACACACGTGTTTCCTGAAGGCGCGCACGAGTTCATCGACCTGTCGTCGGAGGCGTGGTGGGTGTCTTTGGAGAAGGCTACGTCGCGGGCCGCGCTCTGGGACTCACTCATCCTAGGAGACACGGCGCCCAACATCCCGCACAAGGGACTCGTCAAAGCCACGAGTCGCTTGCGTTCTATATCTCTGCGTCCGAGGCCTCTATTGTCGGACCGTGTCCCCTCCCCCGTGTGGCCGGCCTTCTGTGTAGCCTTCATGATGTACAGGCTGCTCAGGTGGTCAGTTGGTCTCGGCTGGCACACTGATTGGTTGGAAGATGGTAATCCGAGCGAACAGCTGGAGGTGTTGAGGGAAATGGTGGACGAGGGTCAGCTGGCTCCGGTACTAGACAAGGTATACTTACCTCACGACTTCGAAGCGGCGCTGGCACACGCGTGCAGTGATGAAGCGGTCGGGACAACCGTCATACGGTTCCCATGA

Protein sequence:

>DPOGS201127-PA
MAIFTNDGSQEDEDVENDIGSEEETLSESDIDPPPLFVAEMDDFKLRAGEKIGALHDAALGVANNSKIKINDAIDKTVEAAKKIREVVLELWQHELVLEGRARAAAWAAEAVRRIREGALPLSPAVLYQDLVDLFKDRVWRRSMVIFVCGVAVGGGAGLMVGLRAGTRAPLGPHARALHTQSDQTVLLVEDAVSPGVSAGEVLVRVQSFSVCPVDRAVLRGRGSALRALLGTGPVTVGRGFAGVVLDVGQGVNDLELGDEVWGVVSEWSGGGASELLAIRRSRVSRSPRGVSPPHAAAIPWGGTRALKALGKIDMKGKRVCVVGGNTSASCLSIQLLCGRGARVCVAEHSKAHPTMKALALVTNSTHVFPEGAHEFIDLSSEAWWVSLEKATSRAALWDSLILGDTAPNIPHKGLVKATSRLRSISLRPRPLLSDRVPSPVWPAFCVAFMMYRLLRWSVGLGWHTDWLEDGNPSEQLEVLREMVDEGQLAPVLDKVYLPHDFEAALAHACSDEAVGTTVIRFP-