Monarch geneset OGS2.0

DPOGS207101
Transcript: DPOGS207101-TA (1029 bp)
Protein: DPOGS207101-PA (342 aa)
Genomic position: DPSCF300001 + 3089654-3092747
RNAseq coverage: 5175x (Rank: top 2%)
Annotation
Heliconius       HMEL013284        2e-167  84.21%
Bombyx           BGIBMGA013063-TA  1e-164  82.75%
Drosophila       Mdh2-PA           4e-128  70.15%
EBI UniRef50     UniRef50_P40926   2e-116  62.17%  Malate dehydrogenase, mitochondrial n=20 Tax=Opisthokonta RepID=MDHM_HUMAN
NCBI RefSeq      XP_001600547.1    2e-132  69.10%  PREDICTED: similar to mitochondrial malate dehydrogenase [Nasonia vitripennis]
NCBI nr blastp   gi|156553655      3e-131  69.10%  PREDICTED: malate dehydrogenase, mitochondrial-like [Nasonia vitripennis]
NCBI nr blastx   gi|170044455      5e-128  71.93%  mitochondrial malate dehydrogenase 2 [Culex quinquefasciatus]
Group
Gene Ontology
  GO:0055114  2.4e-171  oxidation-reduction process
  GO:0006108  2.4e-171  malate metabolic process
  GO:0030060  2.4e-171  L-malate dehydrogenase activity
  GO:0016616  8e-63     oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor
  GO:0003824  8e-63     catalytic activity
  GO:0005975  8e-63     carbohydrate metabolic process
  GO:0005488  4.4e-61   binding
  GO:0044262  2.1e-55   cellular carbohydrate metabolic process
  GO:0016491  7.4e-46   oxidoreductase activity
KEGG pathway
  nvi:100114033  4e-132
  K00026 (MDH2) maps to:
    Citrate cycle (TCA cycle)
    Pyruvate metabolism
    Carbon fixation in photosynthetic organisms
    Glyoxylate and dicarboxylate metabolism
InterPro domain
  [1-342]    IPR010097  2.4e-171  Malate dehydrogenase, type 1
  [175-341]  IPR015955  8e-63     Lactate dehydrogenase/glycoside hydrolase, family 4, C-terminal
  [30-174]   IPR016040  4.4e-61   NAD(P)-binding domain
  [28-342]   IPR001557  2.1e-55   L-lactate/malate dehydrogenase
  [30-172]   IPR001236  7.4e-46   Lactate/malate dehydrogenase, N-terminal
  [175-339]  IPR022383  4.2e-44   Lactate/malate dehydrogenase, C-terminal
Orthology group: MCL14162 (Single-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207101-TA
ATGTTCTCCCGCACGCTAAAGCCTGCCGCTTCTGTGACTTTTCAGAATGGTTTCAAAAATTTCTCCACTTCATCGCAGAACAACTTCAAAGTCGTGGTTGCTGGGGCGGCTGGAGGAATCGGTCAGCCCTTAGGTCTACTGCTGAAGCAGAATAAGCTAGTCACGAATTTAGCACTATATGACATTGCCCCGGTGACCCCGGGAGTGGCTGCCGACTTGTCCCACATGGATACCCCGGCCAGGGTCAGCGGACATAAAGGACCAGAACAGCTAGCTGATGCTATCAAATGTGCAGATGTAGTGGTAATCCCAGCTGGTGTTCCCCGCAAGCCTGGTATGACTCGTGACGACCTGTTCAACACTAATGCATCCATCGTCCGAGACCTGGCCGCTTGTATTGCAGCGAAAGCTCCAAAGGCTCTCATAGCCATCATCACCAACCCTGTCAATTCCATGGTCCCCATCGCCTCCGAAGTTTTGAAGAAGGCGGGTGTATACGACCCTAATCGTGTGTTTGGTGTGACCACTCTGGATGTAGTCCGTGCCGCGGCCTTCATCGGCGAAATCAACGGTGTGGACCCTGGGTGCGTGTCGATCCCCGTTATTGGAGGTCACTCCGGGGTCACCATCATCCCAGTACTGAGTCAGTGCGAACCTGCCGTCAAATTGTCCGACCAAAGCAAAATAGAAGCTCTCACTAAGAGGATCCAGGAAGCCGGCACTGAGGTGGTGAAGGCCAAGGCTGGTGGTGGATCAGCGACGCTGTCGATGGCATACGCTGGTGCAAGACTCGTATGCTCCCTGCTCAGGGGTCTCAATGGAGAATCAAACGTGATTGAGTGCGCCTATGTCAAATCTGATCTGACTGAGGCCACATATTTCGCGAACCCGCTTTTACTTGGGAAGAACGGCTTGGAGAAGAATCTTGGCTTCGGTAACCTTAACGGTTACGAGCAGGGGCTCCTCAAGGCTGCCATCCCAGAACTACTGAAGAACATCAAGACCGGAGAAGATTTCGCCAAGAAGTAA

Protein sequence:

>DPOGS207101-PA
MFSRTLKPAASVTFQNGFKNFSTSSQNNFKVVVAGAAGGIGQPLGLLLKQNKLVTNLALYDIAPVTPGVAADLSHMDTPARVSGHKGPEQLADAIKCADVVVIPAGVPRKPGMTRDDLFNTNASIVRDLAACIAAKAPKALIAIITNPVNSMVPIASEVLKKAGVYDPNRVFGVTTLDVVRAAAFIGEINGVDPGCVSIPVIGGHSGVTIIPVLSQCEPAVKLSDQSKIEALTKRIQEAGTEVVKAKAGGGSATLSMAYAGARLVCSLLRGLNGESNVIECAYVKSDLTEATYFANPLLLGKNGLEKNLGFGNLNGYEQGLLKAAIPELLKNIKTGEDFAKK-
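
Consistency check (illustrative sketch, not part of the original record):

The transcript and protein lengths agree with each other: 342 residues plus one stop codon is 343 codons, or 1029 bp. The Python sketch below shows one way to check a coding sequence against its protein. It assumes the standard genetic code and assumes that the trailing "-" in the protein record marks the stop codon. The names CODON_TABLE and translate are hypothetical helpers introduced here, not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# the standard nuclear code applies to this transcript).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`; "-" is used here on the
    assumption that it matches the trailing "-" in the protein record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE[cds[i:i + 3]].replace("*", stop_symbol)
        for i in range(0, len(cds), 3)
    )


# First seven codons and last three codons of DPOGS207101-TA.
print(translate("ATGTTCTCCCGCACGCTAAAG"))  # MFSRTLK
print(translate("AAGAAGTAA"))              # KK-

# Length check: 342 aa plus one stop codon should account for 1029 bp.
print((342 + 1) * 3 == 1029)               # True
```

Passing the full nucleotide sequence to translate and comparing the result with the DPOGS207101-PA record extends the same check to the whole entry.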