Monarch geneset OGS2.0

DPOGS205757
Transcript: DPOGS205757-TA  903 bp
Protein: DPOGS205757-PA  300 aa
Genomic position: DPSCF300255 - 105972-107481
RNAseq coverage: 246x (Rank: top 42%)
Annotation
Heliconius: HMEL007747  9e-120  67.47%
Bombyx: BGIBMGA009955-TA  4e-90  68.33%
Drosophila: CG8525-PA  6e-80  56.27%
EBI UniRef50: UniRef50_Q9Y315  6e-87  53.74%  Putative deoxyribose-phosphate aldolase n=121 Tax=cellular organisms RepID=DEOC_HUMAN
NCBI RefSeq: XP_972358.1  1e-92  57.38%  PREDICTED: similar to 2-deoxyribose-5-phosphate aldolase homolog [Tribolium castaneum]
NCBI nr blastp: gi|91093925  2e-91  57.38%  PREDICTED: similar to 2-deoxyribose-5-phosphate aldolase homolog [Tribolium castaneum]
NCBI nr blastx: gi|91093925  3e-89  58.16%  PREDICTED: similar to 2-deoxyribose-5-phosphate aldolase homolog [Tribolium castaneum]
Group
Gene Ontology:
GO:0004139  2.9e-115  deoxyribose-phosphate aldolase activity
GO:0005737  2.9e-115  cytoplasm
GO:0009264  2.9e-115  deoxyribonucleotide catabolic process
GO:0008152  7.9e-79  metabolic process
GO:0003824  7.9e-79  catalytic activity
GO:0016829  1.3e-35  lyase activity
KEGG pathway: tca:661079  3e-92
K01619 (E4.1.2.4, deoC) maps-> Pentose phosphate pathway
InterPro domain:
[39-297]  IPR011343  2.9e-115  Deoxyribose-phosphate aldolase
[43-294]  IPR013785  7.9e-79  Aldolase-type TIM barrel
[49-267]  IPR002915  1.3e-35  Deoxyribose-phosphate aldolase/phospho-2-dehydro-3-deoxyheptonate aldolase
Orthology group: MCL12921 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205757-TA
ATGGTTCATGTTTACCCAAAAACTTTAGATACAAAAATTCTGAATGAAATCTATATTAACAAAAACAGTGTTGATGCACAAGTTGAGAAAATTCTTAACCAGTTTCCAGTATCAAATCCTATAAATAAGGATTGGCTGTACAGAGTTATTTCTATGATAGATTTAACAACCCTTGGTGGTGATGACACCCGTTCAAATGTTGTCAGACTGTGCAATAAAGCTGCCAATCCATTAGGAGTATATCAAAACAATGTAAAAGTTAGAACGGCAGCAGTGTGTGTTTACCCTAACCGAATAAAAGATGCCTATGAAACCATTTCAAGATTGAATCTTAAAAAAGATATTCAAATTGCCTCAGTGGCAACAGGCTTTCCTTCTGGACAATATCCTTTAAGCACACGCTTACAGGAAATAAAGTTTGCTTTAGACAATGGAGCTACTGAGATAGATGTAGTAATAGATAGAAGCCTCGTCCTCATGGGGGAGTGGGAAACATTGTATGATGAGTTACTTCAGATGAGGAGGGTGTGTGGGAGAGCACATTTAAAAGTTATCCTTGGAGTGGGAGAACTCGGTTCTTATGAAAATGTTTATAATGCATCCATGGTGTCAATGATGGCAGGAGCCGACTTTATAAAAACTTCCACAGGAAAGGAGGCTGTGAATGCTACTTTGCCAGTGGGATTAGTGATGTGTCGTGCAATAAGAAACTATTATCTGATGACTGGAACTAAGGTGGGCTTGAAACCCGCTGGAGGTATCAAGACATCCAAAGATGCTGTTAATTGGCTGACATTGGTTTATAATGAATTAGGGCAAGAGTGGCTCTCTCCAAAACTCTTCCGCATCGGTGCGTCCAGTCTACTTGATGTTATAATCAGGGACATACAAAAAGTTCAGTGA

Protein sequence:

>DPOGS205757-PA
MVHVYPKTLDTKILNEIYINKNSVDAQVEKILNQFPVSNPINKDWLYRVISMIDLTTLGGDDTRSNVVRLCNKAANPLGVYQNNVKVRTAAVCVYPNRIKDAYETISRLNLKKDIQIASVATGFPSGQYPLSTRLQEIKFALDNGATEIDVVIDRSLVLMGEWETLYDELLQMRRVCGRAHLKVILGVGELGSYENVYNASMVSMMAGADFIKTSTGKEAVNATLPVGLVMCRAIRNYYLMTGTKVGLKPAGGIKTSKDAVNWLTLVYNELGQEWLSPKLFRIGASSLLDVIIRDIQKVQ-