Monarch geneset OGS2.0

DPOGS214377
TranscriptDPOGS214377-TA1371 bp
ProteinDPOGS214377-PA456 aa
Genomic positionDPSCF300020 + 888902-890272
RNAseq coverage266x (Rank: top 40%)
Annotation
HeliconiusHMEL0054510.088.21% 
BombyxBGIBMGA004001-TA0.086.15% 
DrosophilaHmgs-PB2e-16961.59% 
EBI UniRef50UniRef50_B4MPF23e-16660.66%GK21611 n=8 Tax=root RepID=B4MPF2_DROWI
NCBI RefSeqNP_001093297.10.086.15%3-hydroxy-3-methylglutaryl-CoA synthase [Bombyx mori]
NCBI nr blastpgi|1537917030.086.15%3-hydroxy-3-methylglutaryl-CoA synthase [Bombyx mori]
NCBI nr blastxgi|1537917030.086.15%3-hydroxy-3-methylglutaryl-CoA synthase [Bombyx mori]
Group
Gene OntologyGO:00082996.2e-216isoprenoid biosynthetic process
GO:00044216.2e-216hydroxymethylglutaryl-CoA synthase activity
GO:00081522.5e-60metabolic process
GO:00038242.5e-60catalytic activity
KEGG pathwaydan:Dana_GF135934e-169 
 K01641 (E2.3.3.10, pksG)maps-> Terpenoid backbone biosynthesis
    Valine, leucine and isoleucine degradation
    Butanoate metabolism
    Synthesis and degradation of ketone bodies
InterPro domain[6-455] IPR0101226.2e-216Hydroxymethylglutaryl-CoA synthase, eukaryotic
[178-455] IPR0137461.4e-108Hydroxymethylglutaryl-coenzyme A synthase C-terminal
[6-177] IPR0135282.9e-93Hydroxymethylglutaryl-coenzyme A synthase, N-terminal
[178-455] IPR0160392.5e-60Thiolase-like
[6-174] IPR0160383.1e-31Thiolase-like, subgroup
Orthology groupMCL10996 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214377-TA
ATGGCTAATAGAGTCGAAAATGTTGGCATTCTGGCTATGGAGATATATATTCCTTCTCAATATGTAGCTCAGGAAGAATTAGAAAAATTCGATGGTGTGGATACTGGTAAATATACAATCGGGTTAGGCCAGAGTAAAATGGGATTTTGTTCAGATAGAGAAGACATAAACTCGATTTGTATGACTGCTTTGCACCGCCTCATTGAGAATAACAACATAAACCTTCATGACATTGGAAGGTTAGAGGTTGGTACAGAAACTATTATTGATAAAAGCAAAAGTGTAAAAACATTTCTCATGACATTATTTGCCAAAGAGGGTGCAACTGATATTGAAGGCATTGACACCACAAATGCTTGTTATGGTGGGACAGCTGCATTGTTCAATACTATTAATTGGGTGGAATCTTCTTCTTGGGATGGCAGGAAGGCTATTGTTGTGGCTGGTGACATTGCTGTATATGGCAAAGGCCCAGCTCGGCCGACTGGAGGTGCAGGAGCAGTTGCTATGCTCATTGGCCCTGATGCACCATTAGTATTTGATTGTGGTGTACGTGCATCTTATATGACTCATGCATATGATTTCTACAAGCCAGATCTTGCATCAGAATTTCCTTATGTGGATGGCAAGCTATCAATTCAGTGTTATCTTAATGCTTTAGACAAATGTTATAATTTGTTTTGTGATAAAATGAAAAAGGTAAACCCGGACTTTAAAGGTCTTTTGAGCCTGGACGGCATGTTATTCCATTCTCCTTATTGTAAGCTCGTTCAAAAATCACTAGCCAGAGTGTCTTTCAATGATTTCTTGAATTGTGCTGAAGATGATAGAGAAAAACAATTCCCGGGACTTTCACAGTTCAGCAAACACCAAAGATCTGAAACATATTTTGATAGAGATCTTGAAAAGGCATTTATGGCTTACAGCAAAGATCTGTTTGAAGAAAAAACTAAGCCGTCTCTGTACATTGCAAGAAACGTCGGCAATATGTACACCGCCTCACTGTATGGTGGTTTAGTTTCATATTTAATCAGCAAGTCACCAGAGCAGTTAATTGGCAAGAAATTTGCCTTGTTCTCTTACGGCTCTGGATTGGCATCGACTATGTACTCTGTCAATATATGCAATGATATGAGCGCTGGTTCCAAACTAGAAAAGCTCATTAATTCTCTTCATAATAATGTAGCTATGTTAGATAAAAGAATTAATGTTGAACCGCAAGCCTTCTCAGATTCCATGCAAATTAGGACAGAGAATTATCACACGGCACCATACGAGCCATCGGGTTCCATTGATATACTTTTCCCTGGAACGTACTATCTGGTGAAGATCGATGACCAAAGAAGACGGACATATGATAGAAAATTATAA

Protein sequence:

>DPOGS214377-PA
MANRVENVGILAMEIYIPSQYVAQEELEKFDGVDTGKYTIGLGQSKMGFCSDREDINSICMTALHRLIENNNINLHDIGRLEVGTETIIDKSKSVKTFLMTLFAKEGATDIEGIDTTNACYGGTAALFNTINWVESSSWDGRKAIVVAGDIAVYGKGPARPTGGAGAVAMLIGPDAPLVFDCGVRASYMTHAYDFYKPDLASEFPYVDGKLSIQCYLNALDKCYNLFCDKMKKVNPDFKGLLSLDGMLFHSPYCKLVQKSLARVSFNDFLNCAEDDREKQFPGLSQFSKHQRSETYFDRDLEKAFMAYSKDLFEEKTKPSLYIARNVGNMYTASLYGGLVSYLISKSPEQLIGKKFALFSYGSGLASTMYSVNICNDMSAGSKLEKLINSLHNNVAMLDKRINVEPQAFSDSMQIRTENYHTAPYEPSGSIDILFPGTYYLVKIDDQRRRTYDRKL-