Monarch geneset OGS2.0

DPOGS212359
Transcript: DPOGS212359-TA (1164 bp)
Protein: DPOGS212359-PA (387 aa)
Genomic position: DPSCF300019 + 36118-37594
RNAseq coverage: 269x (Rank: top 40%)
Annotation
Database         Best hit           E-value  Identity  Description
Heliconius       HMEL005321         2e-07    28.44%
Bombyx           BGIBMGA012009-TA   2e-117   60.87%
Drosophila       CG17221-PA         6e-54    34.57%
EBI UniRef50     UniRef50_B0WEB2    1e-65    39.51%    Zinc binding dehydrogenase n=3 Tax=Culicidae RepID=B0WEB2_CULQU
NCBI RefSeq      XP_319942.4        2e-67    40.33%    AGAP009178-PA [Anopheles gambiae str. PEST]
NCBI nr blastp   gi|158299942       4e-66    40.33%    AGAP009178-PA [Anopheles gambiae str. PEST]
NCBI nr blastx   gi|158299942       1e-63    40.33%    AGAP009178-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology
    GO:0008270   1.1e-69   zinc ion binding
    GO:0055114   1.1e-69   oxidation-reduction process
    GO:0016491   1.1e-69   oxidoreductase activity
    GO:0005488   6.8e-08   binding
    GO:0016747   6.1e-06   transferase activity, transferring acyl groups other than amino-acyl groups
KEGG pathway: bat:BAS3306 (2e-23)
    K00001 (E1.1.1.1, adh) maps to:
        Drug metabolism - cytochrome P450
        Glycolysis / Gluconeogenesis
        Fatty acid metabolism
        3-Chloroacrylic acid degradation
        Tyrosine metabolism
        Metabolism of xenobiotics by cytochrome P450
        1- and 2-Methylnaphthalene degradation
        Retinol metabolism
InterPro domain
    [4-367]     IPR002085   1.1e-69   Alcohol dehydrogenase superfamily, zinc-type
    [13-189]    IPR011032   1.7e-29   GroES-like
    [189-246]   IPR016040   6.8e-08   NAD(P)-binding domain
    [41-110]    IPR013154   1.4e-06   Alcohol dehydrogenase GroES-like
    [24-363]    IPR020843   6.1e-06   Polyketide synthase, enoylreductase
Orthology group: MCL14090 (Single-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212359-TA
ATGAATGTCGCGAGGGGAGCAGTGAGCGCGGGTCGCATGCGAGCCTGGCGGGTGCACGCCTACAGCGCCGGAACCGAGGAGTTGCGGCTGGAGAGCGCGCGCGTGCCGCCGCTGAGGGCTCCCGATCAGCTGCTTGTGCGAGTCCACACCGCCTCCATCAACCCACTGGACGTGGCCATGCTCGGCGGGTACGGTTCTCGGATACTGAACACGCTGCGGACGCTGGACGGCACCGACCTCGAGTTCCCGCTAGTGCCAGGGAGGGACTTCGCCGGCGAAGTCGTCGCAGCCGGTGCGAGTTGCCGGCTGCGGGTCGGCGACCGCGTGTGGGGTGTGGTCCCGCCGCACAGGCCGGGCTCGCATGCGGAGTACGTGACGGTGCGCGAGCGCTGGACCGGCCTTGCCCCGCTTGCTCTGTCCGACGAGGAGGCAGGCGGGGCGCTGTACGCGGCTCTGAGCGCGTGCGCGGCGCTCCGGGTTGGAGGCCTTCCGCCAGGGAGACGCGCCCGCCGTCCGCCGCGCGTGTTATTACTGGGACTGGGCGGGGTCGGACACGTGGCCCTTCAGCTGCTCGTGGACGCTGGCGCCGAGGTGATCGTTGGCTGCTCTGCGGACCTGTGTGAGCGCGCGACCTCGCTCGGTGCCGCGGCGGCGCTCGATCGGTCGGCGGCTGACTACGACCGCCTCCTCGAGGAGTCCGGCCCGTACGAGGTGATCGTGGACTGTGCGGGAGTGGGTGGCGCGGAGGCCGGTTCGCGGCGCTGGAGGTTCTCCCGGTTCGTGACCCTGAGCTCGCCGCTGCTCCGGCTTACGGACGCCCGCGGGCTGGTGGGCGGGGGATGTGCGGCGGCGGCCCAGCTAGTCGCCGATGGCCTGTCCGCGGCCCGGAGCGCGCCCGCACCGTCCTCCTGCCCGCCGCACGTCCGCTGGGCCTTCTTCGCTCCGTCCTCGGACGACATCGAGACGCTCCGTCGCCTCGCGGAGAGAGGCAGGCTGTCGGTGTGTGTGGAGCGCGTGTTCCCCTGGTGGGAGGGTGTGGCGGCGTACGAGCGCGCGGCTCGTGGCGGGGCGCGAGGGAAGCTCGTGCTGGACTTCACGCGCTCGCCACCCCCCGCTCTCGCCGCTCCCCCCGCCCCCGCCGACCGCACAGTGTCGTCGCGTTAG

Protein sequence:

>DPOGS212359-PA
MNVARGAVSAGRMRAWRVHAYSAGTEELRLESARVPPLRAPDQLLVRVHTASINPLDVAMLGGYGSRILNTLRTLDGTDLEFPLVPGRDFAGEVVAAGASCRLRVGDRVWGVVPPHRPGSHAEYVTVRERWTGLAPLALSDEEAGGALYAALSACAALRVGGLPPGRRARRPPRVLLLGLGGVGHVALQLLVDAGAEVIVGCSADLCERATSLGAAAALDRSAADYDRLLEESGPYEVIVDCAGVGGAEAGSRRWRFSRFVTLSSPLLRLTDARGLVGGGCAAAAQLVADGLSAARSAPAPSSCPPHVRWAFFAPSSDDIETLRRLAERGRLSVCVERVFPWWEGVAAYERAARGGARGKLVLDFTRSPPPALAAPPAPADRTVSSR-
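
The lengths reported in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on assumptions the record does not state: that the transcript is coding sequence only and ends in a stop codon (the trailing "-" in the protein sequence is read here as the stop), and that the genomic coordinates are 1-based and inclusive. The function names are hypothetical helpers, not part of any database API.

```python
def cds_matches_protein(transcript_bp: int, protein_aa: int, has_stop: bool = True) -> bool:
    """Check that a coding sequence length agrees with a protein length.

    Assumes the transcript is CDS only; has_stop adds one codon for the stop.
    """
    codons = protein_aa + (1 if has_stop else 0)
    return transcript_bp == 3 * codons


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    # Values taken from the record: 1164 bp transcript, 387 aa protein.
    print(cds_matches_protein(1164, 387))   # True: 3 * (387 + 1) == 1164
    # Genomic span on scaffold DPSCF300019.
    print(span_length(36118, 37594))        # 1477
```

Under these assumptions the genomic span (1477 bp) is longer than the transcript (1164 bp), which would be consistent with intronic sequence inside the gene model, though the record itself does not list the exon structure.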