Monarch geneset OGS2.0

DPOGS206001
Transcript: DPOGS206001-TA (1011 bp)
Protein: DPOGS206001-PA (336 aa)
Genomic position: DPSCF300253 - 169570-172993
RNAseq coverage: 2137x (Rank: top 6%)
Annotation
Heliconius       HMEL014614         5e-173   84.12%
Bombyx           BGIBMGA012660-TA   2e-132   65.27%
Drosophila       yip2-PA            4e-134   65.98%
EBI UniRef50     UniRef50_P42765    5e-120   61.38%   3-ketoacyl-CoA thiolase, mitochondrial n=134 Tax=root RepID=THIM_HUMAN
NCBI RefSeq      NP_001182381.1     3e-170   85.42%   putative acetyl transferase [Bombyx mori]
NCBI nr blastp   gi|218436706       1e-170   84.82%   acetyltransferase 1 [Ostrinia scapulalis]
NCBI nr blastx   gi|306518638       6e-167   85.42%   putative acetyl transferase [Bombyx mori]
Group
Gene Ontology    GO:0008152   5.6e-204   metabolic process
                 GO:0016747   5.6e-204   transferase activity, transferring acyl groups other than amino-acyl groups
                 GO:0003824   3.7e-60    catalytic activity
KEGG pathway     tca:656773   3e-137
                 K07508 (ACAA2) maps->
                     Fatty acid metabolism
                     Fatty acid elongation in mitochondria
                     Benzoate degradation via hydroxylation
                     Valine, leucine and isoleucine degradation
InterPro domain  [5-337]     IPR002155   5.6e-204   Thiolase
                 [5-208]     IPR020616   4.8e-71    Thiolase, N-terminal
                 [221-332]   IPR016038   3.7e-60    Thiolase-like, subgroup
                 [7-215]     IPR016039   3.2e-59    Thiolase-like
                 [213-334]   IPR020617   2.3e-47    Thiolase, C-terminal
Orthology group  MCL11040   Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206001-TA
ATGGCGGCCACGTCGACGGACGGTATATACTGCGCTCGTCACTCATCCTTGCGCGCGGGCATACCCCAGGACCGGCCCGCGCTGGCCGTCAACCGGCTCTGCGGCTCCGGCTTCCAGTCCATCGTGAACAGTGCACAGGACATTCTGTCGGGCGCGGCGGATGTGTCCGTCGCTGGTGGTGTGGAGAATATGTCCCAGGCTCCGTTCGCGGTGAGGAATGTCAGGTTCGGGGCTGTCTTGGGCAGTTCCTACGCCTTCGAGGACACTCTATGGGCAGGACTCACGGACTCCTTCTGCAACATGCCCATGGGCATGACTGCCGAAAAACTTGGAGCACAGTACGGAATCACCAGAGACGAGGTTGATAACTTCGCGCTGAAATCACAACAGAAATGGAAAACCGCGAACGATGCGGGAGTGTTCAAGGCGGAGATCGAACCTGTGACACTGACGATCAAACGTAAAGAGGTCAAAGTTGACACCGACGAGCACCCTCGCCCGCAGACCACCCTCGAGGGTCTCAAGAAACTGCCGCCCGTCTTCAAGAAGGAGGGGCTGGTGACGGCCGGCACCGCCTCCGGTATCAGTGACGGAGCCGGGGCCCTGGTGCTGGCCAGCGAACAGGCCGCCAAGAACCTGAAGCCCTTGGCTCGCCTGGTAGGATGGTCTTACGTGGGAGTGGACCCTAGCATCATGGGGGTGGGACCCGTACCCGCCATAGAGAACTTGCTTAAGGCCACCAAATTAACCCTCAAAGACATCGATCTGGTCGAGATCAACGAGGCTTTCGTGGCCCAGACTCTGTCCTGTGCGAAGGCTCTGAAGTTGGACTTAGAGAAACTCAACGTGAACGGAGGAGCCACTGCCCTCGGACACCCGCTGGCCGCGTCCGGCTCCAGGATCACCGCTCACCTTGTACACGAGCTCCGTCGCCGCGGTCTGAAGCGCGCCATCGGCTCGGCGTGTATCGGCGGCGGGCAGGGCATCGCTGTCATGGTGGAAGCTGTTTGA

Protein sequence:

>DPOGS206001-PA
MAATSTDGIYCARHSSLRAGIPQDRPALAVNRLCGSGFQSIVNSAQDILSGAADVSVAGGVENMSQAPFAVRNVRFGAVLGSSYAFEDTLWAGLTDSFCNMPMGMTAEKLGAQYGITRDEVDNFALKSQQKWKTANDAGVFKAEIEPVTLTIKRKEVKVDTDEHPRPQTTLEGLKKLPPVFKKEGLVTAGTASGISDGAGALVLASEQAAKNLKPLARLVGWSYVGVDPSIMGVGPVPAIENLLKATKLTLKDIDLVEINEAFVAQTLSCAKALKLDLEKLNVNGGATALGHPLAASGSRITAHLVHELRRRGLKRAIGSACIGGGQGIAVMVEAV-
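
Length consistency check:

The transcript and protein lengths listed in this record can be cross-checked with simple arithmetic. The sketch below assumes the transcript is coding sequence only (it starts with ATG and ends with the stop codon TGA) and that the trailing "-" in the protein sequence marks that stop codon. The function name cds_length_matches is hypothetical and is not part of any database tooling.

```python
def cds_length_matches(transcript_bp: int, protein_aa: int, has_stop: bool = True) -> bool:
    """Return True if a coding sequence of transcript_bp bases can encode
    protein_aa residues, optionally followed by one stop codon."""
    codons, remainder = divmod(transcript_bp, 3)
    if remainder != 0:
        # A complete coding sequence must be a whole number of codons.
        return False
    expected_codons = protein_aa + (1 if has_stop else 0)
    return codons == expected_codons


# Values taken from the record: 1011 bp transcript, 336 aa protein.
print(cds_length_matches(1011, 336))
```

With the values in this record, 1011 bp is 337 codons, which corresponds to 336 residues plus one stop codon.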