Monarch geneset OGS2.0

DPOGS205286
Transcript: DPOGS205286-TA (1548 bp)
Protein: DPOGS205286-PA (515 aa)
Genomic position: DPSCF300021, + strand, 438297-440654
RNAseq coverage: 1459x (Rank: top 9%)
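As a quick consistency check on the figures above, the following sketch compares the genomic span with the transcript and protein lengths. It assumes the genomic coordinates are 1-based and inclusive, which the record does not state; the variable names are illustrative only.

```python
# Values copied from the record above.
GENOMIC_START = 438297
GENOMIC_END = 440654
TRANSCRIPT_BP = 1548
PROTEIN_AA = 515

# Assumption: coordinates are 1-based and inclusive.
genomic_span = GENOMIC_END - GENOMIC_START + 1

# Bases inside the genomic span that are absent from the transcript
# (presumably introns; the record does not list exon boundaries).
non_transcript_bp = genomic_span - TRANSCRIPT_BP

# A complete coding sequence needs three bases per residue plus one stop codon.
expected_cds_bp = 3 * (PROTEIN_AA + 1)

print(genomic_span)                      # 2358
print(non_transcript_bp)                 # 810
print(expected_cds_bp == TRANSCRIPT_BP)  # True
```

Under that assumption, the transcript length matches a 515-residue protein plus a stop codon, which suggests the transcript as listed consists of coding sequence only.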
Annotation
Heliconius:      HMEL017484        0.0     84.15%
Bombyx:          BGIBMGA011029-TA  0.0     88.15%
Drosophila:      CG10932-PA        1e-167  66.59%
EBI UniRef50:    UniRef50_Q56CY6   1e-177  74.87%  Acetoacetyl-CoA thiolase n=4 Tax=Eukaryota RepID=Q56CY6_9CUCU
NCBI RefSeq:     XP_975008.2       0.0     70.74%  PREDICTED: similar to acetyl-CoA acetyltransferase, mitochondrial [Tribolium castaneum]
NCBI nr blastp:  gi|189234785      0.0     70.74%  PREDICTED: similar to acetyl-CoA acetyltransferase, mitochondrial [Tribolium castaneum]
NCBI nr blastx:  gi|189234785      0.0     70.74%  PREDICTED: similar to acetyl-CoA acetyltransferase, mitochondrial [Tribolium castaneum]
Group
Gene Ontology:
    GO:0008152  3.1e-237  metabolic process
    GO:0016747  3.1e-237  transferase activity, transferring acyl groups other than amino-acyl groups
    GO:0003824  7.7e-78   catalytic activity
KEGG pathway: tca:663885  0.0
K00626 (E2.3.1.9, atoB) maps to:
    Terpenoid backbone biosynthesis
    Benzoate degradation via CoA ligation
    Two-component system
    Propanoate metabolism
    Tryptophan metabolism
    Lysine degradation
    Valine, leucine and isoleucine degradation
    Pyruvate metabolism
    Synthesis and degradation of ketone bodies
    Fatty acid metabolism
    Butanoate metabolism
InterPro domain:
    [26-426]   IPR002155  3.1e-237  Thiolase
    [35-294]   IPR020616  4e-97     Thiolase, N-terminal
    [36-302]   IPR016039  7.7e-78   Thiolase-like
    [226-308]  IPR016038  5.4e-62   Thiolase-like, subgroup
    [303-422]  IPR020617  2e-41     Thiolase, C-terminal
    [430-506]  IPR012336  1.5e-16   Thioredoxin-like fold
    [433-505]  IPR008554  1.5e-12   Glutaredoxin-like
Orthology group: MCL14001  Single-copy universal gene

Nucleotide sequence:

>DPOGS205286-TA
ATGATAATTTTAAAGGGAAGTACTTTAATAATGAACTCCAAAATGCATTTATCAAGAAAGTTGTTAACGGCCATGGCAGCATACTCTACAAAAGTTTCCTTAAACGAAGTGGTAATAGCATCTGCAGTCAGAACTCCTATAGGATCGTTCAGGGGAAGCCTTTCGTCGTTATCAGCCACCGAGCTTGGAGCTGTTGCCGTCAAAGCGGCTATCGAAAGAGCCGGCATTCCTAAGGAAGAAATTAAAGAGGTGTACATGGGTAATGTTTGTGCGGCATATCTTGGACAAGCTCCAGCCAGACAAGCAGTAATATTTGGTGGCATGCCAAAAAGCACAATATGCACTACAGTAAACAAAGTTTGTTCTTCTGGAATGAAGTCAATCATGTTGGCTGCACAAGGCCTACAAACTGGAGCTCAGGAAGTAATGCTTGCGGGTGGCATGGAATCTATGTCAAATGTACCTTTCTACTTAAAAAGAGGTGAAACTTCTTATGGTGGAATGCAGTTAGTTGATGGTATAGTGTATGATGGGCTTACAGATGTCTATAACAAATTTCACATGGGCAATTGTGCTGAAAATACAGCAAAAAAACTTGGTATATCAAGGCAAGAGCAAGATGATTATGCCATATCTAGTTACAAACGAAGTGCAGCTGCTTACGAATCAAAAGCCTTTGTTGATGAATTAATCCCAGTATCAGTACCTCAGAAGAGAGGTGCACCCCCAGTATTATTTTCTGAAGATGAAGAGTACAAAAAGGTTAATTTTGAGAAGTTTAATAAACTGTCAACAGTATTTCAAAAAGAAAATGGCACAGTCACAGCAGGCAATGCTTCAACACTAAATGATGGTGCTGCAGCACTTGTCTTAATGACAGCGGAAGCTGCACATAGGTTGAATGTAAAACCATTAGCTCGTATCATTGGATTTGCTGATGGAGAATGTGATCCAATTGATTTTCCCATTGCACCAGCTGTAGCTATCCCAAAATTGTTGGAAAAAACTGGTGTCAATAAAGATGACATAGCACTTTGGGAAATCAATGAAGCCTTCAGTGTTGTGGCTGTTGCTAATCAGAAGATGTTGAATCTAGACCCTTCTAAATTGAATGTCCATGGTGGTGGAGTTAGTCTTGGTCATCCAATAGGCATGTCTGGAAGCCGTATAGTAGTTCATTTATGCCATGCTTTAAAAAAGGGAGAAAAAGGTGTTGCAGCAATTTGTAATGGAGGCGGCGGAGCTTCATCAATTATGATTGAAAAATTGGCTGAGGCCATTGACGGCCCACCTGTTCTAACCTTCTACACTAAAGATCCATGTCAGCTTTGTGATATTGTAATGGAAGAATTATCAACCTATAAAGATAAGTTAATAATTGAGAAAATTGATATAACTAAAAAAGAAAATGTAAGGTGGCTTAGATTATATAGGCATGATATACCAGTATTATTTTTAAATGGAAAATTTCTTTGTATGCATAGATTAAATCACGGTTTATTAGAAAGACGACTACAAATTATAGAACAAGAAAATATCAAAAACTAG

Protein sequence:

>DPOGS205286-PA
MIILKGSTLIMNSKMHLSRKLLTAMAAYSTKVSLNEVVIASAVRTPIGSFRGSLSSLSATELGAVAVKAAIERAGIPKEEIKEVYMGNVCAAYLGQAPARQAVIFGGMPKSTICTTVNKVCSSGMKSIMLAAQGLQTGAQEVMLAGGMESMSNVPFYLKRGETSYGGMQLVDGIVYDGLTDVYNKFHMGNCAENTAKKLGISRQEQDDYAISSYKRSAAAYESKAFVDELIPVSVPQKRGAPPVLFSEDEEYKKVNFEKFNKLSTVFQKENGTVTAGNASTLNDGAAALVLMTAEAAHRLNVKPLARIIGFADGECDPIDFPIAPAVAIPKLLEKTGVNKDDIALWEINEAFSVVAVANQKMLNLDPSKLNVHGGGVSLGHPIGMSGSRIVVHLCHALKKGEKGVAAICNGGGGASSIMIEKLAEAIDGPPVLTFYTKDPCQLCDIVMEELSTYKDKLIIEKIDITKKENVRWLRLYRHDIPVLFLNGKFLCMHRLNHGLLERRLQIIEQENIKN-
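
The correspondence between the two sequences above can be spot-checked by translating the ends of the transcript. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); `translate` is an illustrative helper, not part of the source database, and only the first 30 and last 12 bases of DPOGS205286-TA are embedded to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, stop_symbol="-"):
    """Translate an in-frame DNA string, writing stops as stop_symbol."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    return "".join(residues).replace("*", stop_symbol)


# First 30 and last 12 bases of DPOGS205286-TA, copied from the record.
head = "ATGATAATTTTAAAGGGAAGTACTTTAATA"
tail = "ATCAAAAACTAG"

print(translate(head))  # MIILKGSTLI
print(translate(tail))  # IKN-
```

Both fragments agree with the start and end of DPOGS205286-PA. The trailing `-` in the protein record corresponds to the TAG stop codon that closes the transcript.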