Monarch geneset OGS2.0

DPOGS209167
Transcript: DPOGS209167-TA; 1557 bp
Protein: DPOGS209167-PA; 518 aa
Genomic position: DPSCF300061 - 98203-100595
RNAseq coverage: 144x (Rank: top 54%)
Annotation
Heliconius: HMEL007157; 9e-81; 37.63%
Bombyx: BGIBMGA011482-TA; 0.0; 73.22%
Drosophila: Ssadh-PE; 7e-150; 52.38%
EBI UniRef50: UniRef50_E0VDZ1; 5e-147; 53.44%; Succinate semialdehyde dehydrogenase, putative n=15 Tax=Pancrustacea RepID=E0VDZ1_PEDHC
NCBI RefSeq: XP_972566.1; 4e-168; 57.47%; PREDICTED: similar to succinate semialdehyde dehydrogenase, mitochondrial [Tribolium castaneum]
NCBI nr blastp: gi|166998040; 3e-167; 59.67%; succinic semialdehyde dehydrogenase [Ctenocephalides felis]
NCBI nr blastx: gi|166998042; 2e-169; 59.67%; succinic semialdehyde dehydrogenase [Ctenocephalides felis]
Group
Gene Ontology:
  GO:0008152; 2.3e-164; metabolic process
  GO:0055114; 2.3e-164; oxidation-reduction process
  GO:0016491; 2.3e-164; oxidoreductase activity
  GO:0016620; 2e-60; oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor
KEGG pathway: dme:Dmel_CG4685; 6e-148
  K00139 (E1.2.1.24) maps to:
    Alanine, aspartate and glutamate metabolism
    Butanoate metabolism
InterPro domain:
  [2-514] IPR016161; 2.3e-164; Aldehyde/histidinol dehydrogenase
  [47-509] IPR015590; 3.6e-160; Aldehyde dehydrogenase domain
  [40-299] IPR016162; 1.5e-101; Aldehyde dehydrogenase, N-terminal
  [300-482] IPR016163; 2e-60; Aldehyde dehydrogenase, C-terminal
Orthology group: MCL15673; Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209167-TA
ATGGACGAAGAATCGATGTGGAAAGGTTTTTATGAAAAACTACAGCAACAGAAATCGGTGGATGAACAGATTCATGACACGAGAATGGATTATAGAAATATGCATTATTTGAGAAAGAGCGCGTATATTAACGGCCAATGGGTCAATGCCGAAAGCAAAGCCGTATTTCCAGTGTTGAACCCGGCTGATGATAAGGTTATCGCTGAGATTCCTGACATGGATGCTGCGGATGCAAAGAGAGCTATTGCTGCTGCTACAGAAAGCTTTCGTACGTGGAAGTATACGACAGCTAAGGACAGAGCTGCAATATTAAGGAGATGGTATGACCTTTGTGTAAAAAACCAGGATCATCTTGCAGAAATTATAACAGCGGAGGCTGGTAAGCCCTTGGCTGAAGCCAAAGGTGAAATTGTTTATGGCAGTTCATTCTTGGAGTTCTTTGCTGATACTGCTCGACATATTAACGGTGAAGTGATTCCAAGTCCCTGGCCAAACAAGCAGATCATGTTAACAAGACAGCCTTTAGGGGTAGTGTCTGTTATTACGCCATGGAACTTCCCATTTGCTATGATCACTCGTAAAGTTGGTGCTTTACTGGCAGCTGGCTGCACTTGTGTCATAAAACCTGCTGAAGACACTCCGCTGACTGCTTTAGCAGCAGTTCAGTTAGCAGAAGAGGCAGGGTTTCCCAAAGGTATAATTAATGTAGTGACAGCAAGCAGAAAAAATGCCCCCATAGTGGGTAAAGTTTTATGTGAAGATGCCAATGTTGGTTGTATTTCATTTACTGGTTCCACTCATGTTGGAAAGATTCTTTATGGAATGGCAGCAAAAGGAGTAAAACGAGTTTCCCTTGAACTAGGCGGTAATTGTCCATTTATAGTTTTTCCAAGTGCAGACATCAATCATGCTGTTGAACAAGCCATGATGGCTAAATTCAGGAATAATGGGCAAGCATGTGTCGGAGCCAATCGATTCTTAATTCATGCCGATATTTTTGATGCATTTGTAGAAGCATTTAAAAATAAGATTATGGAAAAATGTATTATCGGCCCCGGATACAAAGCTGGGGTAACTTGTGGGCCCCTAATAAATCTTGAGCAAGCTACCAAAGTGAAGAGTTTAGTTGAAGATGCTTGTGGAAAAGGTGCGAACGTCGTGATTGGTGGAAAACCTGCTCCCAAACATGGAACCAAATTTTATGAATCTACAATATTGACAGATGTAAAGCCGGAAATGAAAATATATGGTGAAGAAATATTTGGGCCGGTTGCCGTTTGTCATAAATTCAATACAGAGGAGGAAGTTCTAGAGATTGCGAATAGTACCAGAACAGGGCTCGGATCTTATGTTTTTACAAGAGATCTCGGTCAGGCATTTAGGATGTCTCAAAAGTTAGAATTCGGCATGGTGGCCATCAATGATGGTGTATTATCAGCAGCAGAACCGGCATTCGGCGGCATTAAGGAGTCGGGAATCGGCAGAGAAGGCAGCAAACACGGCGTGGAAGAATACACAGATATTAAATATACTCTATTTTCAGCTTTAGACAAATGA

Protein sequence:

>DPOGS209167-PA
MDEESMWKGFYEKLQQQKSVDEQIHDTRMDYRNMHYLRKSAYINGQWVNAESKAVFPVLNPADDKVIAEIPDMDAADAKRAIAAATESFRTWKYTTAKDRAAILRRWYDLCVKNQDHLAEIITAEAGKPLAEAKGEIVYGSSFLEFFADTARHINGEVIPSPWPNKQIMLTRQPLGVVSVITPWNFPFAMITRKVGALLAAGCTCVIKPAEDTPLTALAAVQLAEEAGFPKGIINVVTASRKNAPIVGKVLCEDANVGCISFTGSTHVGKILYGMAAKGVKRVSLELGGNCPFIVFPSADINHAVEQAMMAKFRNNGQACVGANRFLIHADIFDAFVEAFKNKIMEKCIIGPGYKAGVTCGPLINLEQATKVKSLVEDACGKGANVVIGGKPAPKHGTKFYESTILTDVKPEMKIYGEEIFGPVAVCHKFNTEEEVLEIANSTRTGLGSYVFTRDLGQAFRMSQKLEFGMVAINDGVLSAAEPAFGGIKESGIGREGSKHGVEEYTDIKYTLFSALDK-