Monarch geneset OGS2.0

DPOGS211492
TranscriptDPOGS211492-TA1014 bp
ProteinDPOGS211492-PA337 aa
Genomic positionDPSCF300113 + 606697-609670
RNAseq coverage4074x (Rank: top 3%)
Annotation
HeliconiusHMEL0116732e-7558.69% 
BombyxBGIBMGA002750-TA7e-16985.41% 
DrosophilaCG11876-PD1e-14672.46% 
EBI UniRef50UniRef50_P111771e-13468.92%Pyruvate dehydrogenase E1 component subunit beta, mitochondrial n=511 Tax=root RepID=ODPB_HUMAN
NCBI RefSeqXP_001648922.12e-15477.30%pyruvate dehydrogenase [Aedes aegypti]
NCBI nr blastpgi|1571055614e-15377.30%pyruvate dehydrogenase [Aedes aegypti]
NCBI nr blastxgi|1571055619e-14877.78%pyruvate dehydrogenase [Aedes aegypti]
Group
Gene OntologyGO:00081521.7e-42metabolic process
GO:00038241.7e-42catalytic activity
KEGG pathwayaag:AaeL_AAEL0043386e-154 
 K00162 (PDHB, pdhB)maps-> Citrate cycle (TCA cycle)
    Glycolysis / Gluconeogenesis
    Valine, leucine and isoleucine biosynthesis
    Butanoate metabolism
    Pyruvate metabolism
InterPro domain[2-177] IPR0054752.7e-57Transketolase-like, pyrimidine-binding domain
[196-326] IPR0159411.7e-42Transketolase-like, C-terminal
[191-328] IPR0090141.2e-40Transketolase, C-terminal/Pyruvate-ferredoxin oxidoreductase, domain II
[195-315] IPR0054761.3e-36Transketolase, C-terminal
Orthology groupMCL11142 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211492-TA
ATGGTCCCTGTTCGTGATGCTTTAAAACAGGCAATTGATGAAGAAATGGAGAGAGACGAAAAAGTGTTCATTCTAGGTGAAGAAGTAGCCCAATACGACGGCGCATACAAAGTTACCAGAGGTCTTTGGAAGAAGTATGGGGATAAGAGGGTAGTAGACACTCCGATCACAGAGATTGGCTTTGCAGGTATCGCAGTCGGAGCTGCCTTCGCTGGTCTCCGGCCTATCTGTGAATTCATGACATTCAATTTCGCCATGCAGGCCATAGATCACATAATAAATTCGGCGGCCAAGACATTCTACATGTCAGCCGGGGCTGTACCCGTTCCGATAGTGTTCAGGGGACCGAACGGAGCGGCGGCGGGAGTGGCGGCGCAGCACTCGCAATGCTTCGCAGCTTGGTACAGCAGTGTACCCGGCCTGAAGGTGCTCATGCCGTACTCCTCGGAAGACGCCAAAGGTCTGTTGAAGGCTGCTATCCGGGACCCCGATCCAGTAGTGTTCCTGGAGGATGAGATTGTCTACGGTGTGCCCTTCCCCATGTCAGATGAGGCCATGTCGCCTGATTTCGTACTACCCATTGGTAAAGCGAAAGTGGAGAGAGCCGGAGATCATATCACTATAGTGTGCGCGGGGAAGGCGACTCACACTGCATTGGATGCAGCCAATGAGCTGGCCGGGAAAGGTATCGAGTGTGAGGTCATCAACCTCCGCTCCATCCGGCCGCTCGACTTCCAGACTATCGCTCAGTCCATCGCCAAGACACATCACCTCATCACGTTGGAGCAAGGTTGGCCGCAGTCTGGTGTGGGAGCTGAGATCTGCGCTCGTGTGATGGAGTCACCGTCGTTCTTCGAGCTGGACGCGCCCGTGTGGCGAGTGACCGGCGCCGACGTGCCAATGCCATACACCAGGAGTCTAGAGACGCTCGCTCTGCCGCAGCGCGGGGACGTGGTCGCCGCCGTCGCCGCTGTACTAGGGAACAGAATCAGCGAGGCGTCGGTGAGTCAGTGA

Protein sequence:

>DPOGS211492-PA
MVPVRDALKQAIDEEMERDEKVFILGEEVAQYDGAYKVTRGLWKKYGDKRVVDTPITEIGFAGIAVGAAFAGLRPICEFMTFNFAMQAIDHIINSAAKTFYMSAGAVPVPIVFRGPNGAAAGVAAQHSQCFAAWYSSVPGLKVLMPYSSEDAKGLLKAAIRDPDPVVFLEDEIVYGVPFPMSDEAMSPDFVLPIGKAKVERAGDHITIVCAGKATHTALDAANELAGKGIECEVINLRSIRPLDFQTIAQSIAKTHHLITLEQGWPQSGVGAEICARVMESPSFFELDAPVWRVTGADVPMPYTRSLETLALPQRGDVVAAVAAVLGNRISEASVSQ-