Monarch geneset OGS2.0

DPOGS212589
TranscriptDPOGS212589-TA810 bp
ProteinDPOGS212589-PA269 aa
Genomic positionDPSCF300245 - 358622-360697
RNAseq coverage40x (Rank: top 73%)
Annotation
HeliconiusHMEL0060293e-11675.38% 
BombyxBGIBMGA005192-TA2e-8977.32% 
DrosophilaMdh2-PA3e-2531.58% 
EBI UniRef50UniRef50_E9BQH01e-2432.62%Malate dehydrogenase, putative n=5 Tax=Leishmania RepID=E9BQH0_LEIDB
NCBI RefSeqXP_002402153.19e-2630.36%malate dehydrogenase, putative [Ixodes scapularis]
NCBI nr blastpgi|2412435452e-2430.36%malate dehydrogenase, putative [Ixodes scapularis]
NCBI nr blastxgi|1583014785e-2534.08%AGAP001903-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00166161.5e-19oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor
GO:00059751.5e-19carbohydrate metabolic process
GO:00038241.5e-19catalytic activity
GO:00551141.5e-19oxidation-reduction process
GO:00300602.8e-19L-malate dehydrogenase activity
GO:00061082.8e-19malate metabolic process
GO:00054881.9e-07binding
KEGG pathwayisc:IscW_ISCW0035283e-25 
 K00026 (MDH2)maps-> Citrate cycle (TCA cycle)
    Pyruvate metabolism
    Carbon fixation in photosynthetic organisms
    Glyoxylate and dicarboxylate metabolism
InterPro domain[72-229] IPR0159551.5e-19Lactate dehydrogenase/glycoside hydrolase, family 4, C-terminal
[11-225] IPR0100972.8e-19Malate dehydrogenase, type 1
[74-229] IPR0223833.6e-11Lactate/malate dehydrogenase, C-terminal
[8-68] IPR0160401.9e-07NAD(P)-binding domain
Orthology groupMCL25110 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212589-TA
ATGCCGCCTTGTTGCAATATTGAAGACAGAGATCTCTTTTTCGAGAACCTCCAGCATGTGCGAACAGTGACCATAGCCTGCGCTGAATTTTGTCCCCACGCAATAATAGCTATCCAGACTCCCCCTGTTGATTGCAACTTTGCGCTGTGTATACATACTTTGCAACTAGCTCGAGTTTACGATAAGCGCAAGGTTCTTGGTGTAAACGCTATAAACTCTATGAGAGCCAACCAGCTGTTCTGCTCCATAACGGGATCTGACCCATCGACGAGGTCGACTCCCGTTATCTGTGGAACCGGCCGATGTACCAGAGTCCCTGTGTTCTCCGCTGGTAATGCTAGCAATTTTCCACAGACTCAGGTTGACTGTCTTACACGTCTCGTAAGAGAAGCTGACGACATCATCTGTAAAGTGAAGAGTAACAGTGAACAGGGTCATCTTTCTATCGGCTTCGCCACCGCTAGATTTGTTGTCAATATAATGAAAGGACTCTTCGAAAAACCAACGTTTATCGACAGCGCACTAGTAGAACAAGCGCATCCTGAAAAATGTTACAACATGACGATTTGTGCGACGCCCGTGACTGTTGGTAAGAATGGGATCCAGGATTATGCGGTACCCAATTTGAACGAAGCGGAGACAAGACTTCTAGAGGATAGCAAATGTGATTTGGAAGACATGTTGAATCTTGGTCGCTGCCACGCCGTCGGTGATGAATACTTCGTCCATCCAGCAAAAATATGTCCAGGGTGTTATTGTACTCCCTGTAGAGTTTGCGAGCCTTGCATGAGAAAAAAAACACAAACCTAA

Protein sequence:

>DPOGS212589-PA
MPPCCNIEDRDLFFENLQHVRTVTIACAEFCPHAIIAIQTPPVDCNFALCIHTLQLARVYDKRKVLGVNAINSMRANQLFCSITGSDPSTRSTPVICGTGRCTRVPVFSAGNASNFPQTQVDCLTRLVREADDIICKVKSNSEQGHLSIGFATARFVVNIMKGLFEKPTFIDSALVEQAHPEKCYNMTICATPVTVGKNGIQDYAVPNLNEAETRLLEDSKCDLEDMLNLGRCHAVGDEYFVHPAKICPGCYCTPCRVCEPCMRKKTQT-