Monarch geneset OGS2.0

DPOGS207425
TranscriptDPOGS207425-TA786 bp
ProteinDPOGS207425-PA261 aa
Genomic positionDPSCF300087 + 369013-372221
RNAseq coverage1961x (Rank: top 6%)
Annotation
HeliconiusHMEL0149093e-9977.34% 
BombyxBGIBMGA007144-TA2e-0824.51% 
DrosophilaCG4747-PB7e-8555.34% 
EBI UniRef50UniRef50_Q7Q1618e-9664.43%Putative oxidoreductase GLYR1 homolog n=17 Tax=Arthropoda RepID=GLYR1_ANOGA
NCBI RefSeqXP_002426591.12e-9872.61%2-hydroxy-3-oxopropionate reductase, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3227961222e-9870.36%hypothetical protein SINV_02941 [Solenopsis invicta]
NCBI nr blastxgi|3227961221e-9470.36%hypothetical protein SINV_02941 [Solenopsis invicta]
Group
Gene OntologyGO:00551141.3e-128oxidation-reduction process
GO:00164911.3e-128oxidoreductase activity
GO:00166166.5e-25oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor
GO:00506626.5e-25coenzyme binding
GO:00054885.9e-24binding
GO:00060982.3e-21pentose-phosphate shunt
GO:00046162.3e-21phosphogluconate dehydrogenase (decarboxylating) activity
KEGG pathwaycqu:CpipJ_CPIJ0004004e-97 
 K00020 (E1.1.1.31, mmsB)maps-> Valine, leucine and isoleucine degradation
InterPro domain[9-256] IPR0158151.3e-1283-hydroxyacid dehydrogenase/reductase
[139-256] IPR0133286.5e-25Dehydrogenase, multihelical
[9-136] IPR0160405.9e-24NAD(P)-binding domain
[137-260] IPR0089275.2e-236-phosphogluconate dehydrogenase, C-terminal-like
[9-135] IPR0061152.3e-216-phosphogluconate dehydrogenase, NADP-binding
Orthology groupMCL14560 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207425-TA
ATGTATGTTCCATATTGCCTAACTTGTAAAGACTTCGAGAAAGTCGGTGCTACGATCGCTGTGACTCCCTGCGATGTGGTGGAAGAGGCTGACATCACCTTCTCTTGCGTAGCGGACCCTCAGGCGGCCAAAGAGATGGTGTTCGGCAACTGCGGAGTGCTGCACTGTCCCACCCTGGAGGGCAAGGGCTACGTGGAGATGACCTCCATAGACGCCGACACCTCACACGACATAGTGGAGGCGCTCGGGGGAAAGGGAGGGAGATATCTGGAAGCACAGATCCAAGGCTCCAAGACCCAGGCGGAGGAGGGTACGCTCATCATCCTGGCGGCCGGGGACCGCTCGCTGTTCGACGACTGTCAGTCGTGCTTCAAGGCCATGAGCAAGAACTCCTTTTACCTCGGTAGTGAGATAGGCAACGCGTCCAAGATGAACTCGGTGCTGCAGGTGGTGGGCGGAGTGTCGCTGGGCGCGCTGGCCGAGGGCCTGGCGCTGGCGGACCGCGCAGGCCTCAGCCAGGCCGACCTCCTGGATGTGCTGGCGCTCACGCCGCTCGCCAGCCCGCACCTCATACTCAAGGGACGAGCCATGATCGAGTCGTCGTACTCGACCCACCAGCCGCTGAGCCACATGCAGAAGGACCTGAAGCTGGCGCTGGGGCTGGGAGACGCCCTGGAGCAGTCCCTGCCGCTCACCGCCACCACCAACGAGATCTTCAAGCACGCCAAGCGGCTCGGCTACGCCAACCATGACGTGGCCGCCGTCTACATCCGCGCCAGGTTCTAG

Protein sequence:

>DPOGS207425-PA
MYVPYCLTCKDFEKVGATIAVTPCDVVEEADITFSCVADPQAAKEMVFGNCGVLHCPTLEGKGYVEMTSIDADTSHDIVEALGGKGGRYLEAQIQGSKTQAEEGTLIILAAGDRSLFDDCQSCFKAMSKNSFYLGSEIGNASKMNSVLQVVGGVSLGALAEGLALADRAGLSQADLLDVLALTPLASPHLILKGRAMIESSYSTHQPLSHMQKDLKLALGLGDALEQSLPLTATTNEIFKHAKRLGYANHDVAAVYIRARF-