Monarch geneset OGS2.0

DPOGS216161
TranscriptDPOGS216161-TA1422 bp
ProteinDPOGS216161-PA473 aa
Genomic positionDPSCF300155 - 298046-302803
RNAseq coverage2887x (Rank: top 4%)
Annotation
HeliconiusHMEL0165600.092.34% 
BombyxBGIBMGA014181-TA0.092.11% 
DrosophilaThiolase-PA0.077.06% 
EBI UniRef50UniRef50_Q17IM10.079.69%Trifunctional enzyme beta subunit (Tp-beta) n=18 Tax=cellular organisms RepID=Q17IM1_AEDAE
NCBI RefSeqXP_001661254.10.079.69%trifunctional enzyme beta subunit (tp-beta) [Aedes aegypti]
NCBI nr blastpgi|2839931390.091.08%fatty acid beta-oxidation complex subunit beta [Heliothis virescens]
NCBI nr blastxgi|2839931390.091.08%fatty acid beta-oxidation complex subunit beta [Heliothis virescens]
Group
Gene OntologyGO:00081526.6e-255metabolic process
GO:00167476.6e-255transferase activity, transferring acyl groups other than amino-acyl groups
GO:00038241.5e-62catalytic activity
KEGG pathwayaag:AaeL_AAEL0022960.0 
 K07509 (HADHB)maps-> Fatty acid metabolism
    Fatty acid elongation in mitochondria
    Benzoate degradation via hydroxylation
    Valine, leucine and isoleucine degradation
InterPro domain[35-470] IPR0021556.6e-255Thiolase
[46-318] IPR0206162.2e-85Thiolase, N-terminal
[47-325] IPR0160391.5e-62Thiolase-like
[255-331] IPR0160389.5e-52Thiolase-like, subgroup
[326-466] IPR0206172.9e-43Thiolase, C-terminal
Orthology groupMCL13071 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216161-TA
ATGGCATCGCAAGTTGCAAAGTCCCTAGTAAAAGCCGCTCGTCCCAATTCGACGGTGAAATTTAACACAGCCTGCCGTGCTCTCAGTGTTGGAGCAGCGTTACAGCAGAAAAAGAGTCTCCCTGACCGCACGGGCAAAAATGTTGTGCTCGTTGATGGTGTGCGAACACCTTTCCTGGTGTCGTTCACTGATTACTCCAAGATGATGCCTCACGAACTAGCCCGACATTCACTACTGGGTTTGCTCCAAAAGACTGGCATCCCCAAGGAGGTAGTGGACTATATTGTTTATGGTACCGTCATACAAGAGGTGAAGACATCCAATATTGGTCGTGAGGCAGCCCTGGCTGCAGGGTTCAGTGATCGGACACCGGCCCATACGGTCACCATGGCCTGCATATCATCAAACCAAGCCATCACCACTGGTATCGGCATGATAGCGGCCGGAGCTTATGACGTAATAGTAGCTGGCGGAGTAGAGTTCATGTCTGATGTACCGATCAGACACTCCCGCAAGATGAGGTCGCTCTTACTGAGACTGAACCGCGCCAAGACACCAGCTCAGAGGCTCTCACTGCTGGCGTCAATAAGACCTGACTTCTTCGCTCCGGAGCTCCCAGCTGTAGCGGAGTTCTCGTCAGGTGAGACGATGGGTCATAGTGCAGATCGTCTTGCAGCAGCCTTTGGAGCATCTAGAGAGGAACAAGATCAGTACGCTCTAAGATCCCATTCCTTGGCACACCAAGCACAGCAGAACGGTTACTTCACGGATCTTATACCAGTTAAAGTGGAAGGCAAGGATGGTGTCGTTGATAAGGACAATGGTATTCGAGTGTCCACACCTGAACAACTTGCCAAACTACGTCCAGCCTTCATCAAGCCACATGGCAGCGTTACAGCCGCTAATGCATCCTTCCTGACTGACGGTGCATCTGCATGTCTGGTGATGTCCGAGGCTAAGGCTAAGGAACTGGGGCTCAAGCCGAAGGCTTATCTGAGAGACTTCACCTATGTAGCTCAGGATCCAGTGGACCAACTACTCCTTGGTCCAGCTTATGGCATCCCAAAGATTCTGGACCAAGCCGGCCTTAAACTGAGTGATATCGATACTTGGGAAATTCATGAAGCCTTCGCTGGACAAATTTTGGCCAATCTAAAGGCTTTGGACTCCGATTGGTTTGCACAGACATATCTTGGCCGTCAGTCTAAGGTCGGTTCTCCTGATTTGGATAAATGGAACAAGTGGGGCGGTTCTCTCTCCATTGGACATCCGTTTGCCGCTACTGGGGTTCGTCTCGCGATGCACACCGCTCACCGTCTCGTTCGCGAGGACGGTCAGTTCGGTATGATCAGCGCCTGTGCCGCTGGGGGGCAGGGGGTCGCCATGCTGCTCGAGAGACACCCTGACGCCAAACACGACTAG

Protein sequence:

>DPOGS216161-PA
MASQVAKSLVKAARPNSTVKFNTACRALSVGAALQQKKSLPDRTGKNVVLVDGVRTPFLVSFTDYSKMMPHELARHSLLGLLQKTGIPKEVVDYIVYGTVIQEVKTSNIGREAALAAGFSDRTPAHTVTMACISSNQAITTGIGMIAAGAYDVIVAGGVEFMSDVPIRHSRKMRSLLLRLNRAKTPAQRLSLLASIRPDFFAPELPAVAEFSSGETMGHSADRLAAAFGASREEQDQYALRSHSLAHQAQQNGYFTDLIPVKVEGKDGVVDKDNGIRVSTPEQLAKLRPAFIKPHGSVTAANASFLTDGASACLVMSEAKAKELGLKPKAYLRDFTYVAQDPVDQLLLGPAYGIPKILDQAGLKLSDIDTWEIHEAFAGQILANLKALDSDWFAQTYLGRQSKVGSPDLDKWNKWGGSLSIGHPFAATGVRLAMHTAHRLVREDGQFGMISACAAGGQGVAMLLERHPDAKHD-