Monarch geneset OGS2.0

DPOGS213237
TranscriptDPOGS213237-TA1200 bp
ProteinDPOGS213237-PA399 aa
Genomic positionDPSCF300124 - 487557-492253
RNAseq coverage1318x (Rank: top 10%)
Annotation
HeliconiusHMEL0214669e-12773.00% 
BombyxBGIBMGA009515-TA0.081.70% 
Drosophilal(1)G0334-PC2e-14568.39% 
EBI UniRef50UniRef50_E3XDC77e-14061.28%Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3XDC7_ANODA
NCBI RefSeqNP_001093304.10.081.70%pyruvate dehydrogenase [Bombyx mori]
NCBI nr blastpgi|1537923090.081.70%pyruvate dehydrogenase [Bombyx mori]
NCBI nr blastxgi|1537923090.081.70%pyruvate dehydrogenase [Bombyx mori]
Group
Gene OntologyGO:00432311.2e-129intracellular membrane-bounded organelle
GO:00060961.2e-129glycolysis
GO:00047391.2e-129pyruvate dehydrogenase (acetyl-transferring) activity
GO:00551141.2e-129oxidation-reduction process
GO:00081525.7e-117metabolic process
GO:00166245.7e-117oxidoreductase activity, acting on the aldehyde or oxo group of donors, disulfide as acceptor
KEGG pathwayaga:AgaP_AGAP0047864e-154 
 K00161 (PDHA, pdhA)maps-> Citrate cycle (TCA cycle)
    Glycolysis / Gluconeogenesis
    Valine, leucine and isoleucine biosynthesis
    Butanoate metabolism
    Pyruvate metabolism
InterPro domain[63-373] IPR0175971.2e-129Pyruvate dehydrogenase (acetyl-transferring) E1 component, alpha subunit, subgroup y
[71-365] IPR0010175.7e-117Dehydrogenase, E1 component
Orthology groupMCL10246 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213237-TA
ATGAGTAAGCTTATTCCATCCGTTGGTAAACTGCTTGGCGGTGTAACAACTACTACGAAAGTCGCCCCAATCGCTGTCATTACGAACGCAAAATACAGTACAAAAAACGAAGCTACATTTGAAACGAAGCCTTATAAGCTCCACAAACTGGAAAAAGGTCCGGCAACATCAGCAACTTTAACATCCGAAGATGCACTTAGTATGTATGAGAAATTGGCTGTTATTAGAAGAATTGAAACCGCCTCAGGAAATCTGTACAAAGAGAAGAGTGTCAGAGGCTTCTGTCATTTATATTCAGGACAGGAAGCGGTTGCCGTAGGAATGCACTCAGCTATGAGGGATATTGATTCATTGATAACAGCATATCGTTGTCACGGATGGACTTACCTTATGGGTGTCAGTGTCCTCGGTGTCCTTAGTGAGTTAACTGGTCGGAGAACTGGTTGTTCCAGAGGCAAAGGTGGTTCTATGCACTTGTATGCCAAGAATTTCTACGGAGGAAATGGCATAGTCGGTGCTCAGGTCCCGTTGGGCGCGGGTATAGCTTTCGCTCACAAGTACCGTAACGACGGTGGCGTTTGTTTCGCTCTGTATGGTGACGGAGCAGCCAATCAGGGCCAGATCTTCGAGGCTTACAACATGGCCAAATTATGGAATCTACCCTGTATATTTGTTTGCGAAAACAATGGTTATGGTATGGGTACAAGTGTTGAGCGTTCTTCTGCCAGTACGGACTATTACAGCCGGGGGGATTACATACCCGGCCTGTGGGTGGACGGTATGGACGTCGTCACTACTAGGGAAGCGACCAGATTTGCCATTGACTATTGCACTAGCGGAAAAGGTCCTCTAGTAATCGAAATGGAGACATACCGATACTCCGGCCACTCTATGTCCGACCCCGGCACTTCCTACCGGACCCGCGACGAGGTCCAGGCTGTCAGACAGACCAGGGACCCCATCACCTCCTTCAAAGAAAAGATCCTATCAAACGGGCTCGCTACTGCTGATCAGCTCAAGGAAATAGACACCAAGATCCGCAAAGAGGTGGACGAGGCCACTAAGATAGCGAAGTCCGAACCGGAAGTGGGTCCCGAAGAACTCGCTGGGGACATTTATTACAAGAACTTGGAGCCCTTCATCAGGGGTGTCCACCCCAACAGCCCCTTGCAACATATCGAAACCGCTGCCAGGAATTAA

Protein sequence:

>DPOGS213237-PA
MSKLIPSVGKLLGGVTTTTKVAPIAVITNAKYSTKNEATFETKPYKLHKLEKGPATSATLTSEDALSMYEKLAVIRRIETASGNLYKEKSVRGFCHLYSGQEAVAVGMHSAMRDIDSLITAYRCHGWTYLMGVSVLGVLSELTGRRTGCSRGKGGSMHLYAKNFYGGNGIVGAQVPLGAGIAFAHKYRNDGGVCFALYGDGAANQGQIFEAYNMAKLWNLPCIFVCENNGYGMGTSVERSSASTDYYSRGDYIPGLWVDGMDVVTTREATRFAIDYCTSGKGPLVIEMETYRYSGHSMSDPGTSYRTRDEVQAVRQTRDPITSFKEKILSNGLATADQLKEIDTKIRKEVDEATKIAKSEPEVGPEELAGDIYYKNLEPFIRGVHPNSPLQHIETAARN-