Monarch geneset OGS2.0

DPOGS214357
TranscriptDPOGS214357-TA1074 bp
ProteinDPOGS214357-PA357 aa
Genomic positionDPSCF300020 + 569845-572321
RNAseq coverage23x (Rank: top 78%)
Annotation
HeliconiusHMEL0076053e-11757.30% 
BombyxBGIBMGA004130-TA9e-14165.73% 
DrosophilaCG10512-PA7e-10250.71% 
EBI UniRef50UniRef50_A7UUE07e-10252.12%AGAP006576-PA n=26 Tax=Eumetazoa RepID=A7UUE0_ANOGA
NCBI RefSeqXP_001866932.11e-10352.41%malate dehydrogenase [Culex quinquefasciatus]
NCBI nr blastpgi|1700630442e-10252.41%malate dehydrogenase [Culex quinquefasciatus]
NCBI nr blastxgi|1700630444e-10153.16%malate dehydrogenase [Culex quinquefasciatus]
Group
Gene OntologyGO:00081526.6e-139metabolic process
GO:00551146.6e-139oxidation-reduction process
GO:00164916.6e-139oxidoreductase activity
KEGG pathwaypab:PAB17911e-56 
 K05884 (E1.1.1.272)maps-> Cysteine and methionine metabolism
InterPro domain[4-320] IPR0037676.6e-139Malate/L-lactate dehydrogenase
Orthology groupMCL30226 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214357-TA
ATGCCGGAAGTTAAAATAGAAGAAGCGAAGAGATTTATGGTTGAAGCTTTGCGGGCTGCAGGAGCCCCTCTCAAAGAAGCCGAGGCTCATTCAGAACTACTCCTGCATGCTGATATCGTCGGACACTATAGCCATGGAATGAACAGACTAGAGCTCTACCTAAACGATATGAAGTCGGGTGTTTGTGATGCGAACGCCAAGCAGGTAATACTCAAGGAGACAGCGGCAACAGCCTGGGTCGATGGATGTAGAGCACTCGGGGCCACTGTTGGCAACTTCTGTATGAGCCTCGCCATCAAGAAAGCTAAGGAGTGTGGTGTAGGAATAGTAGCGGTCAAAAACTGTAATCACTATGGCATGGCGGGCTATTGGGCGCTAAGAGCTGAGAAGGAAGGTCTTATCGGAATGTCATTTACAAATTCATCTCCCGTCATGGTACCAACGAGAGCAAAAAAGAGTGCATTAGGCACTAATCCAATAGCCTTCGCGGCGCCTGCGTCAGGAGGCGATTCTCTCGTCGTGGACATGGCTAGCACCACTGCCGCTATGGGAAAGATAGAAATGCAAATTCGTAAAGGAGATCCGATTCCAGAAGGTTGGGCTTTAGGTGCTGACGGTAAAACAACAAGGGATCCGCAAGAGGCCTTCAACACAGGTCATTTGTTGCCTCTCGGAGGTTTTGAGGCAACCAGCGGTTATAAGGGATATGCGCTGAGTGCAATGGTCGAGGTTCTATGCAGCGGACTGTCTGGATCCAACGCCGCATACAACGTCCCTCCGTGGACCCACACTCAGACGAAGTCTCCCAATCTCGGTCAATGCTTCGTGGCGCTCGACCCCTCGTGCTTCGCCCCCGGTTTTGGCGACCGACTTGGCCATTGTTTGAACCACTGGAGGGGGATGGAACCCATGGATCCCGCATCACCAGTTCTAGCACCGGGAGACAAGGAAAAAATAAACGCAAGTACAACATATGAAAGAGGCACCATCGTGTACCCACAGCAGCAGATAGACTCGTACGGAACAGTTGCTGAAAGAATGGGAATACGAGCGATGGAAATTCTTAACAGTTAA

Protein sequence:

>DPOGS214357-PA
MPEVKIEEAKRFMVEALRAAGAPLKEAEAHSELLLHADIVGHYSHGMNRLELYLNDMKSGVCDANAKQVILKETAATAWVDGCRALGATVGNFCMSLAIKKAKECGVGIVAVKNCNHYGMAGYWALRAEKEGLIGMSFTNSSPVMVPTRAKKSALGTNPIAFAAPASGGDSLVVDMASTTAAMGKIEMQIRKGDPIPEGWALGADGKTTRDPQEAFNTGHLLPLGGFEATSGYKGYALSAMVEVLCSGLSGSNAAYNVPPWTHTQTKSPNLGQCFVALDPSCFAPGFGDRLGHCLNHWRGMEPMDPASPVLAPGDKEKINASTTYERGTIVYPQQQIDSYGTVAERMGIRAMEILNS-