Monarch geneset OGS2.0

DPOGS213213
Transcript: DPOGS213213-TA (840 bp)
Protein: DPOGS213213-PA (279 aa)
Genomic position: DPSCF300114 + 337525-340294
RNAseq coverage: 1717x (Rank: top 7%)
Annotation
Heliconius: HMEL002989 | 3e-144 | 85.30%
Bombyx: BGIBMGA007360-TA | 3e-109 | 66.19%
Drosophila: CG4389-PA | 7e-36 | 34.16%
EBI UniRef50: UniRef50_E0VKS4 | 6e-106 | 62.45% | Short chain 3-hydroxyacyl-CoA dehydrogenase, putative n=10 Tax=Opisthokonta RepID=E0VKS4_PEDHC
NCBI RefSeq: XP_973042.1 | 1e-116 | 71.22% | PREDICTED: similar to 3-hydroxyacyl-coa dehyrogenase [Tribolium castaneum]
NCBI nr blastp: gi|379698894 | 3e-136 | 82.08% | 3-hydroxyacyl-CoA dehydrogenase [Bombyx mori]
NCBI nr blastx: gi|379698894 | 2e-130 | 82.08% | 3-hydroxyacyl-CoA dehydrogenase [Bombyx mori]
Group
Gene Ontology:
    GO:0006631 | 1.9e-56 | fatty acid metabolic process
    GO:0003857 | 1.9e-56 | 3-hydroxyacyl-CoA dehydrogenase activity
    GO:0055114 | 1.9e-56 | oxidation-reduction process
    GO:0016491 | 1.9e-56 | oxidoreductase activity
    GO:0005488 | 7.2e-53 | binding
    GO:0016616 | 1.8e-36 | oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor
    GO:0050662 | 1.8e-36 | coenzyme binding
KEGG pathway: tca:661810 | 4e-116
    K00022 (HADH) maps to:
    Fatty acid elongation in mitochondria
    Tryptophan metabolism
    Lysine degradation
    Valine, leucine and isoleucine degradation
    Geraniol degradation
    Fatty acid metabolism
    Caprolactam degradation
    Butanoate metabolism
InterPro domain:
    [1-177] | IPR006176 | 1.9e-56 | 3-hydroxyacyl-CoA dehydrogenase, NAD binding
    [1-180] | IPR016040 | 7.2e-53 | NAD(P)-binding domain
    [181-277] | IPR013328 | 1.8e-36 | Dehydrogenase, multihelical
    [179-276] | IPR006108 | 3.6e-35 | 3-hydroxyacyl-CoA dehydrogenase, C-terminal
    [179-277] | IPR008927 | 5.1e-35 | 6-phosphogluconate dehydrogenase, C-terminal-like
Orthology group: MCL17816 | Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213213-TA
ATGGGTTCGGGAATTGCTCAGGTAGCAGCACAAGCTGGTCAAAATGTGACCTTAGTGGACGTTAGCTCTGATGTGTTAGCTAAATCTCAAAAATCAATAGCCACCAATCTGAATAGAGTCGCTAAGAAGATTTATAAAGACAAACCTCAGGATGGTGAGAAGTTTGTTGCTGAGGCTATTGCTAGAATAAAAACCAACACAGATCCTGCAGCAGCCAGCCAAGAAGCTGATTTGATTGTGGAGGCAATCGTTGAAAACATGGATGTAAAACACCAATTGTTCAAAAAGCTTGATGCGGCGGCTCCTGGTCATACCATCTTTGCCTCGAACACATCCTCCCTCTCTATCAATGAGATCTCTTCAGTTGTTAAGAGGACAGACAAATTTGGTGGTCTACACTTCTTCAATCCCGTGCCGGTGATGCGTCTACTTGAAGTGGTCCGTGGTGAGAAAACATCTGACAACACTTACAAGGCCATGATGGAATGGGGCAAGACTGTGGGCAAGACCTGCATAACTTGCAAAGATACTCCTGGATTTGTTGTCAACAGGCTTCTGGTTCCGTATATTGCAGAGGCAATTCGGTTGTTTGAGAGAGGTGATGCTTCAGCTCGGGACATTGATGTGGCCATGAAGTTAGGTGCTGGTTATCCGATGGGACCGCTGGAGTTAGCTGACTATGTAGGTCTGGACACTAACAAGTTTATCTTAGACGGCTGGCATAAAAAATACCCCGAAGAACCCTTATTCAAACCCAGCCCTCTCCTTAATAAACTGGTGTCGGAAGGGAAGCTTGGAGTCAAGAGCGGAGAAGGCTTCTACAAGTATGAAAAGAAATGA

Protein sequence:

>DPOGS213213-PA
MGSGIAQVAAQAGQNVTLVDVSSDVLAKSQKSIATNLNRVAKKIYKDKPQDGEKFVAEAIARIKTNTDPAAASQEADLIVEAIVENMDVKHQLFKKLDAAAPGHTIFASNTSSLSINEISSVVKRTDKFGGLHFFNPVPVMRLLEVVRGEKTSDNTYKAMMEWGKTVGKTCITCKDTPGFVVNRLLVPYIAEAIRLFERGDASARDIDVAMKLGAGYPMGPLELADYVGLDTNKFILDGWHKKYPEEPLFKPSPLLNKLVSEGKLGVKSGEGFYKYEKK-
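
The record lists an 840 bp transcript and a 279 aa protein. The following is a minimal Python sketch of how those two figures can be cross-checked. It assumes that the transcript is a complete coding sequence read in frame from its first base, that it ends in a stop codon, and that the trailing "-" in the protein record marks that stop. The names CODON_TABLE, translate and expected_protein_length are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; stop codons become '*'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS that includes its terminal stop codon."""
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # First 21 and last 15 nucleotides of DPOGS213213-TA, copied from the record.
    print(translate("ATGGGTTCGGGAATTGCTCAG"))
    print(translate("TATGAAAAGAAATGA"))
    print(expected_protein_length(840))
```

Under these assumptions, 840 bp corresponds to 280 codons, one of which is the stop, which agrees with the 279 aa reported for DPOGS213213-PA. Passing the full nucleotide sequence to translate would allow a residue-by-residue comparison with the protein record.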