Monarch geneset OGS2.0

DPOGS213463
TranscriptDPOGS213463-TA894 bp
ProteinDPOGS213463-PA297 aa
Genomic positionDPSCF300100 - 404377-407156
RNAseq coverage251x (Rank: top 42%)
Annotation
HeliconiusHMEL0168324e-12578.52% 
BombyxBGIBMGA004489-TA5e-11467.54% 
DrosophilaCG5844-PA1e-4541.00% 
EBI UniRef50UniRef50_G6D1K24e-12491.43%Enoyl-CoA hydratase n=2 Tax=Obtectomera RepID=G6D1K2_DANPL
NCBI RefSeqXP_002053766.14e-4540.28%GJ23164 [Drosophila virilis]
NCBI nr blastpgi|2839931414e-10264.09%enoyl-CoA hydratase [Heliothis virescens]
NCBI nr blastxgi|2839931412e-9664.09%enoyl-CoA hydratase [Heliothis virescens]
Group
Gene OntologyGO:00081526.9e-33metabolic process
GO:00038246.9e-33catalytic activity
KEGG pathwayame:4093241e-41 
 K01692 (E4.2.1.17, paaG)maps-> Benzoate degradation via CoA ligation
    Propanoate metabolism
    Limonene and pinene degradation
    Tryptophan metabolism
    Lysine degradation
    Valine, leucine and isoleucine degradation
    beta-Alanine metabolism
    Geraniol degradation
    Fatty acid metabolism
    Caprolactam degradation
    Butanoate metabolism
InterPro domain[63-219] IPR0017536.9e-33Crotonase, core
Orthology groupMCL16374 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213463-TA
ATGGCTAGTCTTGTTAGAAAATTATTTTCCGTTACCACCGTTTCATTTCGTCGAACAAATATACGTCTGGTTTCTTCTAAAAGCGATGCTCAAAAAAAAGAAGAGGAACAAAAAGTACCGAAAGAGGAGTTAGAAGATCAGATCAAGAAAAACATAGTTGTAGAGAAATTCGGTGGCATCACAACCCTTAATATAGATAGACAGAAATCAAGAAATAGTTTAGATGAGGCCACGTTAAGAGAAATGTCTGAAGCTATTAATGCATTTGACAAAGATACAGATGCAAAAGTTCTTGTGTTTAACGGTGAAGGTGGCAGTTTCTGTTCTGGTTTCGATCTGGACGAAGTTGGTGAAAAGGGATATCAAAATTTCATAGATGCTGGGTCAAGACTCCTTCGTCGGCCGCTCTGTGATAAGCCGACTATAGCCGCGGTTACAGGATATGCTGTGGGTGAGGGTTTTGAACTGGCACTTGCTTGTGATCTGCGCATCATTGAAGATACCGCTGTCCTCGGTTGTCTGGGACGGAGATTCGGGGCTCCACAAAGTCTGTATGGTGGGAGACGTCTAACATCTCTTATAGGTCTTTCTCGGGCATTAGACTTATTGATAACTGGTAGACCGATTTCCGGCACAGAGGCGCATGCTTTGGGACTATCTTCTCTGGGAGAAGCAGCGATAAAATTAGCCAAGTCGTTGACAAAATTCCCTCAAAATGCTCTTATAATGGACAAATTAGCCGCCGTTAATTCCCAGCTGAATCCGAACAGCGAAGAGAGTATGAGAGATGAAGCTGTCATGAACAGTTTGCTCGGATCAGCTCTAGAAGATCTCAATGAGGGAATCAAAAAGTTCAAAGGAGATGTAACCTTGCACCACAATGAGTACGATTAA

Protein sequence:

>DPOGS213463-PA
MASLVRKLFSVTTVSFRRTNIRLVSSKSDAQKKEEEQKVPKEELEDQIKKNIVVEKFGGITTLNIDRQKSRNSLDEATLREMSEAINAFDKDTDAKVLVFNGEGGSFCSGFDLDEVGEKGYQNFIDAGSRLLRRPLCDKPTIAAVTGYAVGEGFELALACDLRIIEDTAVLGCLGRRFGAPQSLYGGRRLTSLIGLSRALDLLITGRPISGTEAHALGLSSLGEAAIKLAKSLTKFPQNALIMDKLAAVNSQLNPNSEESMRDEAVMNSLLGSALEDLNEGIKKFKGDVTLHHNEYD-