Monarch geneset OGS2.0

DPOGS213171
TranscriptDPOGS213171-TA918 bp
ProteinDPOGS213171-PA305 aa
Genomic positionDPSCF300114 - 331554-334239
RNAseq coverage649x (Rank: top 20%)
Annotation
HeliconiusHMEL0029891e-12167.96% 
BombyxBGIBMGA007360-TA5e-14480.07% 
DrosophilaCG4389-PA9e-3531.77% 
EBI UniRef50UniRef50_E0VKS43e-10059.12%Short chain 3-hydroxyacyl-CoA dehydrogenase, putative n=10 Tax=Opisthokonta RepID=E0VKS4_PEDHC
NCBI RefSeqNP_001040414.11e-13577.12%3-hydroxyacyl-CoA dehydrogenase [Bombyx mori]
NCBI nr blastpgi|1140509173e-13477.12%3-hydroxyacyl-CoA dehydrogenase [Bombyx mori]
NCBI nr blastxgi|1140509176e-12977.12%3-hydroxyacyl-CoA dehydrogenase [Bombyx mori]
Group
Gene OntologyGO:00066317.3e-57fatty acid metabolic process
GO:00038577.3e-573-hydroxyacyl-CoA dehydrogenase activity
GO:00551147.3e-57oxidation-reduction process
GO:00164917.3e-57oxidoreductase activity
GO:00054881.9e-54binding
GO:00166162.3e-33oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor
GO:00506622.3e-33coenzyme binding
KEGG pathwaytca:6618105e-105 
 K00022 (HADH)maps-> Fatty acid elongation in mitochondria
    Tryptophan metabolism
    Lysine degradation
    Valine, leucine and isoleucine degradation
    Geraniol degradation
    Fatty acid metabolism
    Caprolactam degradation
    Butanoate metabolism
InterPro domain[22-205] IPR0061767.3e-573-hydroxyacyl-CoA dehydrogenase, NAD binding
[19-208] IPR0160401.9e-54NAD(P)-binding domain
[207-303] IPR0061081.4e-343-hydroxyacyl-CoA dehydrogenase, C-terminal
[209-304] IPR0133282.3e-33Dehydrogenase, multihelical
[207-304] IPR0089277.2e-326-phosphogluconate dehydrogenase, C-terminal-like
Orthology groupMCL17816 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213171-TA
ATGAATAATTTGAACGTTATATGTAGAAAGTTTTCCCATTCGGCATCTTTAAATGCTATCAAAACGGTAACAGTTGTCGGCGGCGGGCTTATGGGTTCTGGAATCGCACAGGCAACTCAAGCCGGACAAAATGTAACAATAGTCGATTTGAACTCGGAAATACTTGAAAAAGCTCAGAAATCTATACAAAACAATCTAGGCAGAGTGGCGAGAAAGCTGTATAAAGATGATCCTTTGAAAATGGAGGAATTTGTTAAAGAAGCCAATAGTAGAATTAAGGTTTCAACCAAGATTGAAGATGGTGTGGACGCTGATCTGATCGTAGAGGCCATCGTCGAGTTACTTGAGCCCAAACAGAAGCTGTTCAACAGATTGGATGAGCTGGCTCCAGAACATACAATTTTAGCGAGCAACACATCATCTATATCAATCAATGAGATAGGCAGCGGTATTAAGAGAAAAGATAGGTTTGGTGGGCTGCACTTCTTCAATCCTGTGCCTGTGATGCGTCTCCTGGAGGTTATCAAGAGCGACCGCATGTCCCAAGAGACTTATAACGCCATGATGGAGTGGGGCAAGTCCGTGGGCAAGACCTGCATCACCTGCAAAGATACTCCTGGATTTGTCGTCAACAGGCTACTAGGGCCTTACAGTGCTGAAGCTTTCAGAATGTTTGAGCGAGGTGATGCCAGTAAAGAAGACATCGATATTGCTATGAAGTTGGGTGCGGGATACCCCATGGGTCCGCTAGAGCTGGCCGACTACACCGGACTCGATACTAACAAGTTCGTCCTCGAGGTGTTGTACCAGAAGACAAAGAACCAAGTGTTTAAGCCGATACCGTTACTGAATAAAATGGTGGAAGAAGGCAAACTGGGAATCAAGACCGGGGAGGGGATCTATAAATACAAGAAGTGA

Protein sequence:

>DPOGS213171-PA
MNNLNVICRKFSHSASLNAIKTVTVVGGGLMGSGIAQATQAGQNVTIVDLNSEILEKAQKSIQNNLGRVARKLYKDDPLKMEEFVKEANSRIKVSTKIEDGVDADLIVEAIVELLEPKQKLFNRLDELAPEHTILASNTSSISINEIGSGIKRKDRFGGLHFFNPVPVMRLLEVIKSDRMSQETYNAMMEWGKSVGKTCITCKDTPGFVVNRLLGPYSAEAFRMFERGDASKEDIDIAMKLGAGYPMGPLELADYTGLDTNKFVLEVLYQKTKNQVFKPIPLLNKMVEEGKLGIKTGEGIYKYKK-