Monarch geneset OGS2.0

DPOGS201755
TranscriptDPOGS201755-TA1038 bp
ProteinDPOGS201755-PA345 aa
Genomic positionDPSCF300279 + 20556-27082
RNAseq coverage220x (Rank: top 45%)
Annotation
Heliconius% 
BombyxBGIBMGA002650-TA2e-7084.03% 
DrosophilaCG10399-PA2e-10457.53% 
EBI UniRef50UniRef50_Q2F6853e-15581.09%Hydroxymethylglutaryl-CoA lyase isoform 1 n=2 Tax=Bombyx mori RepID=Q2F685_BOMMO
NCBI RefSeqNP_001040133.16e-15681.09%hydroxymethylglutaryl-CoA lyase [Bombyx mori]
NCBI nr blastpgi|1140518641e-15481.09%hydroxymethylglutaryl-CoA lyase [Bombyx mori]
NCBI nr blastxgi|1140518647e-15181.09%hydroxymethylglutaryl-CoA lyase [Bombyx mori]
Group
Gene OntologyGO:00081524.4e-67metabolic process
GO:00038244.4e-67catalytic activity
KEGG pathwaydmo:Dmoj_GI177292e-105 
 K01640 (E4.1.3.4, HMGCL, hmgL)maps-> Peroxisome
    Valine, leucine and isoleucine degradation
    Butanoate metabolism
    Geraniol degradation
    Synthesis and degradation of ketone bodies
InterPro domain[37-313] IPR0137854.4e-67Aldolase-type TIM barrel
[46-293] IPR0008919.8e-52Pyruvate carboxyltransferase
Orthology groupMCL12402 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201755-TA
ATGATTGTTAGTGGAGATCTTGTAATTAAGGGCTTGTACATCTGCATTCCCGAAAATATCAACAACGCTAAAATTGACGATAACACGAAGGACAGTCAAGCGGTTCCAGATATTCGAATATACGAAGTAGGTCCAAGAGACGGTCTTCAAAATGAGTCTAAGTTTGTACCAACTGATATAAAAGTAGAACTTATTCACAAACTCTCTGAAGCAGGAATCAAAGATATTGAATGTGCCAGCTTTGTAAGTCCAAAATGGGTAAAACAAATGAGTGATGGTACTGAAGTTATGAAAACTATCAAAAGGGTACCTGGTGTTAACTATCCAGTATTAATACCCAATCTAAAAGGATATGAGGCAGCTAAACAATGTAATATTGAAGAAATAGCAATATTTCCTGCTGGTTCAGAGGGTTTCTCTCAGAAGAATTTAAATTGTTCTATAGAAGAAGGATTAAAACGGTTCAAACTGGTCGCTGATCAGGCTATTAAAGATGGAATGAGAGTCAGAGGTTACGTTTCATGTGTTGTAGGCTGTCCCTATGATGGTCCAATAAATCCAAAAGGGATTGCCAAGATAACTGAAGAGTTGTTTACAATGGGTTGCTATGAGGTATCACTGGGTGACACTATCGGAGTGGGAACGGCCGGATCGGTGAAGAAATTAATGAATGAGGTTATAAAAGTAGCAACACCTGACAAAATAGCACTTCACTTCCATGATACATATGGACAGGGGCTATCTAACTTACTGGCTGGCTTGGAGTTCGGAATTAAAACTGTGGATTCGTCTGTGTCCGGACTTGGCGGGTGTCCGTATGCCCGTGGTGCGAGCGGGAACCTTGCTACTGAGGACCTTGTATACTTTCTCTACGGGCTAGGAGTGAACACTAACATAGACCTGGTCAAACTCATAGAAGCTGGCCGCTACATATCAAACTTCCTCGCAAAACCGACCGAGTCCAAAGTCAACCGTGCCATCGGGGATAGATTTAAAAATCATAAAGATATTATAAAAATAGCGTCTTGTACTTTATAA

Protein sequence:

>DPOGS201755-PA
MIVSGDLVIKGLYICIPENINNAKIDDNTKDSQAVPDIRIYEVGPRDGLQNESKFVPTDIKVELIHKLSEAGIKDIECASFVSPKWVKQMSDGTEVMKTIKRVPGVNYPVLIPNLKGYEAAKQCNIEEIAIFPAGSEGFSQKNLNCSIEEGLKRFKLVADQAIKDGMRVRGYVSCVVGCPYDGPINPKGIAKITEELFTMGCYEVSLGDTIGVGTAGSVKKLMNEVIKVATPDKIALHFHDTYGQGLSNLLAGLEFGIKTVDSSVSGLGGCPYARGASGNLATEDLVYFLYGLGVNTNIDLVKLIEAGRYISNFLAKPTESKVNRAIGDRFKNHKDIIKIASCTL-