Monarch geneset OGS2.0

DPOGS209706
TranscriptDPOGS209706-TA1668 bp
ProteinDPOGS209706-PA555 aa
Genomic positionDPSCF300105 - 471386-482087
RNAseq coverage135x (Rank: top 56%)
Annotation
HeliconiusHMEL0178711e-11068.54% 
BombyxBGIBMGA008918-TA3e-15057.41% 
DrosophilaMdh2-PA6e-3529.72% 
EBI UniRef50UniRef50_E0VE206e-3530.89%Malate dehydrogenase, putative n=1 Tax=Pediculus humanus corporis RepID=E0VE20_PEDHC
NCBI RefSeqXP_002402153.12e-3632.42%malate dehydrogenase, putative [Ixodes scapularis]
NCBI nr blastpgi|2412435453e-3532.42%malate dehydrogenase, putative [Ixodes scapularis]
NCBI nr blastxgi|1971293072e-3331.65%putative malate dehydrogenase mitochondrial variant 1 [Taeniopygia guttata]
Group
Gene OntologyGO:00551141.7e-36oxidation-reduction process
GO:00061081.7e-36malate metabolic process
GO:00300601.7e-36L-malate dehydrogenase activity
GO:00054886.1e-28binding
GO:00164911.4e-15oxidoreductase activity
GO:00166161.5e-14oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor
GO:00038241.5e-14catalytic activity
GO:00059751.5e-14carbohydrate metabolic process
KEGG pathwayisc:IscW_ISCW0035285e-36 
 K00026 (MDH2)maps-> Citrate cycle (TCA cycle)
    Pyruvate metabolism
    Carbon fixation in photosynthetic organisms
    Glyoxylate and dicarboxylate metabolism
InterPro domain[122-459] IPR0100971.7e-36Malate dehydrogenase, type 1
[147-300] IPR0160406.1e-28NAD(P)-binding domain
[148-298] IPR0012361.4e-15Lactate/malate dehydrogenase, N-terminal
[306-462] IPR0159551.5e-14Lactate dehydrogenase/glycoside hydrolase, family 4, C-terminal
Orthology groupMCL25554 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209706-TA
ATGCCTCAGCGCTCGTCATCCTCAAGTCTGATGCGACCCTTCGTGGTGTTGGCTCTACCCGGAATGTATCTGCTGTATAAGTACAATCAGTACCGACAGCAGCAAATGGAGACCGCTCGTCGACGTGTCACCGAGCGAGAGCTTATGCACCTCAACACCAAAATTAAAAGGGATGATGGTTGCCGAAGAGCCTGTGCCGATATTCCGCCTGAAAGAGACCCTGTTTGTCCCATTCCGTGTGATAGAAGGGGCACAAAGAAAGAAGGCATCCTGCACAGGATCAAACTGAGGTTAATAGATGGTGGAACGAACTTCGGGACGCCCTTTCGTTACCTCCAAGACGACAAGTATGTCTGTAAAGAAATCGAGGTTCCGGAACCGAAGGCGTTACCTGCGCCCGTTGAGGAGAAGCCTTGCATACAGGAGACTCGGTCCGGGGTGCAAGTGGCGGTGCTCGGGGCGGACACGAGTATAGGACAATATGTTTCACTGTTGTTGAAGCAGTGTCCTTGTATAAAGAAATTACGTTTGTACGAAGCGAGGGATACAAGTACAGGATGTAGTCGTGAGCTTTGTCAAGTTGTTCATGACTTGCAGCATATTGATACCAATTGTCTGGTGCAGGCGTATAGCTGTGCGTGTCATGATTTGGATAGATGCTTACAGAATTCTGATATTGTCTTGATGCTAGAAAGCGGCTACCTAAGTATGGATATGCCTATGGAAACTAGATTCAACTATCAAGCACCAATAGTAAAGAAATATGCTGATGCTATCGCCAAAGAATGTCCAAATGCATTTATCTTAGTCTGCGCCTCGCCTATAGACTGCATGGTGCCTTTGGTTGCTGAGACGTTAAAGGAGACCGGTTGGTACAATCCTCGTAAACTACTAGGATCTTTGGCGGTTCCTGAGATGAGAGCCAGTACTCTTGCAGCAAGAGCGCTGAGCTTGGAGCCATGTTATGTAAATGTACCATGTGTTGGTGGCACTGAAGGAGATGCCTTAGTGCCACTTTTCTCAAAAGCCTTAGAATATTTCGAATTCAGTGAGCAAAACGCTGAAATGATGACGAATACAGTTCGATGCGCTCCAGAAGCCGTGGCCAAGTCTGATTGCAATTGTGTTAGATCTGCTGAGCTCAGTGAGGCGCACGCACTCGCCGGCTTGGTCACTAAAGTGGCTTGGGCTTTGCTGTGCAGAGATGTGCCGCAGATGTCTGGCTTTGTTGAGACTGATCCGTGTCAAATTATCTCTCCAGCGAGATATATCGCCAATGTTGTGGAGATAACTGGAACAGGGATCATAAGAAGCGTTGGTCTTCCTAAAATAACGGATTCTGAGATAAGACGCGTGGATATAGCATTGAACGAACTTTATAGGAAACAAAAAATGACTATGGACTGGTTTCTTGGCGTTAACGTGTTACTGAACAACCATGGTATCTGTAAAATAGTCGACTTACCAGGAAACTTCAGCCCGTTAGAATTACATCTCATGGACTACGCTGTTATGAATCTCAAATACAACGAGGATTTAGCAATGAACTGGTACTGCGATTATATGCATGCGGAATGTCAATTGGGAATGATGGGAACGAAAACTCAATTCTTCGACACGAAAACAAAGTATCCAAGGGTGCTAGAATGCATAAAAGATATGGCGTAA

Protein sequence:

>DPOGS209706-PA
MPQRSSSSSLMRPFVVLALPGMYLLYKYNQYRQQQMETARRRVTERELMHLNTKIKRDDGCRRACADIPPERDPVCPIPCDRRGTKKEGILHRIKLRLIDGGTNFGTPFRYLQDDKYVCKEIEVPEPKALPAPVEEKPCIQETRSGVQVAVLGADTSIGQYVSLLLKQCPCIKKLRLYEARDTSTGCSRELCQVVHDLQHIDTNCLVQAYSCACHDLDRCLQNSDIVLMLESGYLSMDMPMETRFNYQAPIVKKYADAIAKECPNAFILVCASPIDCMVPLVAETLKETGWYNPRKLLGSLAVPEMRASTLAARALSLEPCYVNVPCVGGTEGDALVPLFSKALEYFEFSEQNAEMMTNTVRCAPEAVAKSDCNCVRSAELSEAHALAGLVTKVAWALLCRDVPQMSGFVETDPCQIISPARYIANVVEITGTGIIRSVGLPKITDSEIRRVDIALNELYRKQKMTMDWFLGVNVLLNNHGICKIVDLPGNFSPLELHLMDYAVMNLKYNEDLAMNWYCDYMHAECQLGMMGTKTQFFDTKTKYPRVLECIKDMA-