Monarch geneset OGS2.0

DPOGS200425
TranscriptDPOGS200425-TA687 bp
ProteinDPOGS200425-PA228 aa
Genomic positionDPSCF300236 - 86963-88221
RNAseq coverage263x (Rank: top 40%)
Annotation
HeliconiusHMEL0000221e-2432.21% 
BombyxBGIBMGA008998-TA6e-3552.48% 
DrosophilaCG6984-PA3e-6851.75% 
EBI UniRef50UniRef50_B0WFW84e-7555.70%Cyclohex-1-ene-1-carboxyl-CoA hydratase n=6 Tax=Endopterygota RepID=B0WFW8_CULQU
NCBI RefSeqXP_001661868.12e-8059.65%cyclohex-1-ene-1-carboxyl-CoA hydratase, putative [Aedes aegypti]
NCBI nr blastpgi|1571302803e-7959.65%cyclohex-1-ene-1-carboxyl-CoA hydratase, putative [Aedes aegypti]
NCBI nr blastxgi|1571302801e-7459.65%cyclohex-1-ene-1-carboxyl-CoA hydratase, putative [Aedes aegypti]
Group
Gene OntologyGO:00081524.8e-37metabolic process
GO:00038244.8e-37catalytic activity
KEGG pathwaycpb:Cphamn1_21114e-57 
 K01692 (E4.2.1.17, paaG)maps-> Benzoate degradation via CoA ligation
    Propanoate metabolism
    Limonene and pinene degradation
    Tryptophan metabolism
    Lysine degradation
    Valine, leucine and isoleucine degradation
    beta-Alanine metabolism
    Geraniol degradation
    Fatty acid metabolism
    Caprolactam degradation
    Butanoate metabolism
InterPro domain[1-152] IPR0017534.8e-37Crotonase, core
Orthology groupMCL15107 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200425-TA
ATGATGAACAGCCTTATAGAAGCATTAAATCTCAATAAAGGAGATACCTCATTGCGAGCTATTGTTTTATCTGCAAAGGGTAATGTTTTTTCCGCTGGTCATAATTTAAAGGAATTACAAATTTCATCGGATTTAGAGAAACAAAAATTAATATTTCAAAAAGCCACTGAATTGATGACTTCCATTATTCAAAGTCCAGTTCCAGTTATAGCAAAGGTTAATGGATTTGCAGCAGCGGCTGGGTGTCAGTTAGTGGCTACTTGCGACATTATAATTTGCTCTGACAAAAGCAAATTTTCAACCCCAGGTGCTAACTTTGGTATATTCTGTTCAACACCAGGAATTGCTATCGGCAGAAGTGTCCCTAAGTCGAGGGCTATGTATATGTTGTTAACTGGTGAGCCCTTAAGTGCCCAAGAAGCCTATGAAAGTGGACTCGTCACAAAAGTTGTACCTGCTGAAAAGCTTGATTCTGAAGTTAATGAAACTATTGAACAGATTAAACGTAAAAGTAGAAGTGTAATATCACTTGGAAAAGAGTTTTTCTACAAACAGATCGGTCTCAATGTTCTAGATGCGTACAGACTGGGTGAAGAAATCATGGTCAAGAATATAAACTCACTTGACGGACAAGAGGGAATAAACAGTTTCATAGAAAAACGTAAAGCTGTATGGAATCACAAGTAG

Protein sequence:

>DPOGS200425-PA
MMNSLIEALNLNKGDTSLRAIVLSAKGNVFSAGHNLKELQISSDLEKQKLIFQKATELMTSIIQSPVPVIAKVNGFAAAAGCQLVATCDIIICSDKSKFSTPGANFGIFCSTPGIAIGRSVPKSRAMYMLLTGEPLSAQEAYESGLVTKVVPAEKLDSEVNETIEQIKRKSRSVISLGKEFFYKQIGLNVLDAYRLGEEIMVKNINSLDGQEGINSFIEKRKAVWNHK-