Monarch geneset OGS2.0

DPOGS200362
TranscriptDPOGS200362-TA888 bp
ProteinDPOGS200362-PA295 aa
Genomic positionDPSCF300026 + 813408-816300
RNAseq coverage1802x (Rank: top 7%)
Annotation
HeliconiusHMEL0000221e-14482.83% 
BombyxBGIBMGA005656-TA3e-13580.13% 
DrosophilaCG6543-PB6e-9964.12% 
EBI UniRef50UniRef50_Q28XY64e-10064.42%GA19673 n=2 Tax=pseudoobscura subgroup RepID=Q28XY6_DROPS
NCBI RefSeqXP_971757.15e-10664.63%PREDICTED: similar to cyclohex-1-ene-1-carboxyl-CoA hydratase, putative [Tribolium castaneum]
NCBI nr blastpgi|3085127153e-14383.73%enoyl-CoA hydratase [Biston betularia]
NCBI nr blastxgi|3085127158e-13883.73%enoyl-CoA hydratase [Biston betularia]
Group
Gene OntologyGO:00081528.7e-60metabolic process
GO:00038248.7e-60catalytic activity
KEGG pathwaytca:6604331e-105 
 K07511 (ECHS1)maps-> Fatty acid elongation in mitochondria
    Benzoate degradation via CoA ligation
    Propanoate metabolism
    Limonene and pinene degradation
    Tryptophan metabolism
    Lysine degradation
    Valine, leucine and isoleucine degradation
    beta-Alanine metabolism
    Fatty acid metabolism
    Caprolactam degradation
    Butanoate metabolism
InterPro domain[54-219] IPR0017538.7e-60Crotonase, core
Orthology groupMCL13563 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200362-TA
ATGGCCTCTGTAACTTCTATTTCTAGGGTTTTCGGAAAGTTATCCTTGAGTTCGAATTGGACTGCTGCGGCAAATACAAGTTTTATAAAGTTCTATAGTACAGGTGCTCAGTATGAAAATATCAAAGTTGATGTGGTGGGTGCTAAAAAGAATGTGGGTCTCATAAGTCTCAACAGACCTAAAGCTTTAAATGCTTTATGCAAAGACCTCTTTGTTGAACTTGGCAAAGCTGTTAAGGACTTTGATGCTGATGATAAAATTGCCACCATCATAATAACAGGGAACGAGAAGGCCTTTGCTGCGGGAGCTGATATTAAAGAAATGCAAAATAACACATACAGCAGTAATGTCAAAGCGGGATTCTTACAAGAATGGGAAGACATTTCTAACTGTGGAAAACCAATTATTGCAGCTGTAAATGGTTTTGCTTTAGGAGGAGGTTGTGAGTTGGCCATGTTATGTGACATCATATATGCCGGTGAGAAGGCCAAATTTGGTCAACCTGAGATTAACATTGGAACCATCCCTGGAGCTGGAGGTACCCAACGTCTTCCAAGATATGTCGGAAAATCCAAAGCTATGGAGATAGTTTTAAGTGGCAACTTTGTTGATGCTCATGAAGCTGAGAAAATGGGTCTTGTCAGTAAAGTTTTCCCTGTTGAGAAGCTTTTGGAAGAAACAATTAAATTGGCTGAAAGAATCGGCACTCATTCACCGCTTGTTGTGAAATTGGCTAAACAAGCTGTCAACCAGGCTTATGAAACCACACTGAAATCGGGTCTACTTTATGAGAAATCCAGTTTCTACGGAACTTTTGCTACTGAGGATCGTAAAGAAGGCATGACGGCATTTGTCGAGAAAAGAGCGCCTAATTTCAAAAACAATTGA

Protein sequence:

>DPOGS200362-PA
MASVTSISRVFGKLSLSSNWTAAANTSFIKFYSTGAQYENIKVDVVGAKKNVGLISLNRPKALNALCKDLFVELGKAVKDFDADDKIATIIITGNEKAFAAGADIKEMQNNTYSSNVKAGFLQEWEDISNCGKPIIAAVNGFALGGGCELAMLCDIIYAGEKAKFGQPEINIGTIPGAGGTQRLPRYVGKSKAMEIVLSGNFVDAHEAEKMGLVSKVFPVEKLLEETIKLAERIGTHSPLVVKLAKQAVNQAYETTLKSGLLYEKSSFYGTFATEDRKEGMTAFVEKRAPNFKNN-