Monarch geneset OGS2.0

DPOGS202991
TranscriptDPOGS202991-TA1080 bp
ProteinDPOGS202991-PA359 aa
Genomic positionDPSCF300068 - 297213-302551
RNAseq coverage3739x (Rank: top 3%)
Annotation
HeliconiusHMEL0072798e-16581.63% 
BombyxBGIBMGA012336-TA1e-15677.04% 
DrosophilaImpL3-PA1e-12661.21% 
EBI UniRef50UniRef50_Q950282e-12461.21%L-lactate dehydrogenase n=83 Tax=Coelomata RepID=LDH_DROME
NCBI RefSeqNP_001095933.13e-15577.04%lactate dehydrogenase [Bombyx mori]
NCBI nr blastpgi|1562552107e-15477.04%lactate dehydrogenase [Bombyx mori]
NCBI nr blastxgi|1562552106e-14777.04%lactate dehydrogenase [Bombyx mori]
Group
Gene OntologyGO:00166168.6e-132oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor
GO:00442628.6e-132cellular carbohydrate metabolic process
GO:00551148.6e-132oxidation-reduction process
GO:00060963.9e-116glycolysis
GO:00057373.9e-116cytoplasm
GO:00044593.9e-116L-lactate dehydrogenase activity
GO:00038241.5e-69catalytic activity
GO:00059751.5e-69carbohydrate metabolic process
GO:00054888.1e-55binding
GO:00164916.8e-44oxidoreductase activity
KEGG pathwaycqu:CpipJ_CPIJ0144541e-129 
 K00016 (LDH, ldh)maps-> Glycolysis / Gluconeogenesis
    Propanoate metabolism
    Pyruvate metabolism
    Cysteine and methionine metabolism
InterPro domain[47-359] IPR0015578.6e-132L-lactate/malate dehydrogenase
[52-351] IPR0113043.9e-116L-lactate dehydrogenase
[190-358] IPR0159551.5e-69Lactate dehydrogenase/glycoside hydrolase, family 4, C-terminal
[37-189] IPR0160408.1e-55NAD(P)-binding domain
[48-188] IPR0012366.8e-44Lactate/malate dehydrogenase, N-terminal
[190-352] IPR0223837.6e-31Lactate/malate dehydrogenase, C-terminal
Orthology groupMCL11571 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202991-TA
ATGAGTTTTATGAACGGGAACGCTAACGGGAATGGGACCGCCAATGGCAACGCTTACAATGGCTTCTACGGTTATAGACACAAAATGGCGACTTTGGACCAGCTTTTCCAGCCCATCCAGCAGAAGACGGAAGCCACCGGCAACAAGGTCACTGTGGTGGGATCTGGCCAAGTCGGCATGGCAGCGGTCTTCGCAATGCTAACACAGGGCGTCACTAACAATATAGCGTTGGTGGATTGTATGGCAGACAAATTGAAAGGCGAAATGATGGATCTACAGCACGGCTCATGCTTCATGAAAGATGCTAAGATCCAGTCAAGCACAGATTACGCCGTCTCTGAGGGTTCTAAGATCTGTGTTGTGACCGCTGGTGTGAGACAACGCGTTGGGGAGACCAGGCTCGAATTGGTACAGAGGAACACTGACGTCTTGAAAGTTATTATACCCCAGCTTGTGAAGTACTGCCCGGACGCCATATTCATCATAGCGAGTAACCCAGTGGACGTTCTAACCTACGTGACCTGGAAAATCAGCGGACTGCCCAAGAACAGGATTATAGGATCCGGAACCAACCTTGACTCAGCTCGTTTCAGGTACTTGCTATCCGAGAAGTTGGCTGTAGCCTCTACTTCCTGTCACGGTTATGTCATCGGTGAACACGGGGACAGCAGTGTACCGGTATGGTCCGGCGTGAACGTAGCCGGTGTGCGTCTCAGCGATCTCAACCCAAAAATAGGCTCGGATGACGATCCTGAAAACTGGAAAAAGATCCACGAAGATGTGGTTAAAAGCGCCTACGAAATCATCAAACTCAAAGGCTATACTTCCTGGGCTATTGGACTGTCTCTATCACAACTCTGTCGCGCTATACTTTACAACATGAATAGCGTGCACCCTGTCACGACCTGCGTTAAGGGCGAGCATGGTATAGAGGACGAAGTGTTCCTGTCCCTGCCCTGCGTGCTCGGCAGGAAAGGCATCTATGACATCATTCGACAAACACTCACCGATAGCGAACTGACCCAGCTTCGTAAATCTGCTGAAGTCATGGCGAAATTGCAAGCCGGTATCAAATTTTAG

Protein sequence:

>DPOGS202991-PA
MSFMNGNANGNGTANGNAYNGFYGYRHKMATLDQLFQPIQQKTEATGNKVTVVGSGQVGMAAVFAMLTQGVTNNIALVDCMADKLKGEMMDLQHGSCFMKDAKIQSSTDYAVSEGSKICVVTAGVRQRVGETRLELVQRNTDVLKVIIPQLVKYCPDAIFIIASNPVDVLTYVTWKISGLPKNRIIGSGTNLDSARFRYLLSEKLAVASTSCHGYVIGEHGDSSVPVWSGVNVAGVRLSDLNPKIGSDDDPENWKKIHEDVVKSAYEIIKLKGYTSWAIGLSLSQLCRAILYNMNSVHPVTTCVKGEHGIEDEVFLSLPCVLGRKGIYDIIRQTLTDSELTQLRKSAEVMAKLQAGIKF-